SfraLM
Custom-built LLM (ongoing project). Code for pretraining, mid-training, and post-training an LLM (SFT & RL) on an HPC cluster. Supports experimentation, logging, and evals.
A selection of projects I've worked on. Each one represents a problem I wanted to solve or an idea I wanted to explore.
Implements key PyTorch functionality from scratch, guided by the philosophy "What I cannot build, I cannot understand". Includes autograd, neural network layers, transformers, and ASR models.
Want to see more? Check out my GitHub for additional projects and contributions.
View GitHub Profile