Projects

A selection of projects I've worked on. Each one represents a problem I wanted to solve or an idea I wanted to explore.

SfraLM

In Progress

Custom-built LLM (ongoing project). Code for pretraining, mid-training, and post-training (SFT and RL) an LLM on an HPC cluster, with support for running experiments, logging, and evals.

Python, PyTorch, LLM, HPC

Code

SfraTorch

Completed

Implements core PyTorch functionality from scratch, following the philosophy "What I cannot build, I cannot understand". Includes autograd, neural network layers, transformers, and ASR models.

Python, Deep Learning, From Scratch

Code
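To give a flavor of what "autograd from scratch" involves in a project like SfraTorch, here is a minimal sketch of scalar reverse-mode automatic differentiation. It is not taken from the SfraTorch code: the `Value` class and its method names are hypothetical, and the real project presumably works on tensors rather than single floats.

```python
import math


class Value:
    """A scalar that remembers how it was computed so gradients can flow back.

    Hypothetical illustration only; not the SfraTorch API.
    """

    def __init__(self, data, parents=()):
        self.data = float(data)
        self.grad = 0.0
        self._parents = parents
        self._backward = lambda: None  # leaf nodes have nothing to propagate

    def __add__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data + other.data, (self, other))

        def _backward():
            # d(a + b)/da = 1 and d(a + b)/db = 1
            self.grad += out.grad
            other.grad += out.grad

        out._backward = _backward
        return out

    def __mul__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data * other.data, (self, other))

        def _backward():
            # d(a * b)/da = b and d(a * b)/db = a
            self.grad += other.data * out.grad
            other.grad += self.data * out.grad

        out._backward = _backward
        return out

    def tanh(self):
        t = math.tanh(self.data)
        out = Value(t, (self,))

        def _backward():
            # d tanh(a)/da = 1 - tanh(a)^2
            self.grad += (1.0 - t * t) * out.grad

        out._backward = _backward
        return out

    def backward(self):
        # Topologically sort the graph, then apply the chain rule
        # from the output back to the leaves.
        order, seen = [], set()

        def visit(node):
            if id(node) not in seen:
                seen.add(id(node))
                for parent in node._parents:
                    visit(parent)
                order.append(node)

        visit(self)
        self.grad = 1.0
        for node in reversed(order):
            node._backward()


x = Value(2.0)
w = Value(-3.0)
y = (x * w + 1.0).tanh()
y.backward()
```

After `y.backward()`, `x.grad` and `w.grad` hold the derivatives of `y` with respect to each input. Gradients accumulate with `+=` so that a value used more than once in the graph receives the sum of all its contributions.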

How Transformers Learn Grammar: Inside the Hidden Structure of LLMs

Completed

BSc thesis exploring how transformers learn syntax. Trains GPT-2 on formal grammars and probes its hidden representations to uncover where hierarchical structure emerges.

Python, PyTorch, Transformers, NLP

Code Report

StableSFTData

Completed

Sample-efficient LLM fine-tuning for math reasoning: training on hard examples outperforms random selection under fixed compute budgets.

Python, LLM, Fine-tuning, LoRA

Code Report
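The comparison behind StableSFTData (hard examples versus random selection under a fixed budget) can be sketched as below. This is an illustration under assumptions, not the project's method: the description does not say how difficulty is measured, so the sketch assumes a per-example loss score, and the function names `select_hard` and `select_random` are hypothetical.

```python
import random


def select_hard(examples, losses, budget):
    """Keep the `budget` examples with the highest difficulty score.

    Assumption: difficulty is a per-example loss from some reference model.
    """
    ranked = sorted(range(len(examples)), key=lambda i: losses[i], reverse=True)
    return [examples[i] for i in ranked[:budget]]


def select_random(examples, budget, seed=0):
    """Baseline: draw `budget` examples uniformly at random, without replacement."""
    rng = random.Random(seed)
    return rng.sample(examples, min(budget, len(examples)))


# Toy data: four problems with made-up loss values.
examples = ["p1", "p2", "p3", "p4"]
losses = [0.1, 2.0, 0.5, 1.5]

hard_subset = select_hard(examples, losses, budget=2)
random_subset = select_random(examples, budget=2)
```

Both subsets have the same size, so fine-tuning on each costs the same number of training examples; only the selection rule differs.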

xAI Hackathon Project

Completed

AI-powered news-to-podcast converter. Curates articles and generates personalized audio briefings in ~30 seconds, with a morning news mode and a debate mode, using the Grok Voice API.

TypeScript, React, Electron, Grok API

Code Demo

Social Bias in SSMs

Completed

Examines social biases in State Space Models (Mamba) versus transformers across multiple demographics (race, gender, profession, religion), applying debiasing techniques with LoRA.

Python, Mamba, Bias Analysis, LoRA

Code Report

Want to see more? Check out my GitHub for additional projects and contributions.

View GitHub Profile