Show HN: NanoEuler – GPT-2 scale model in pure C/CUDA from scratch (opens in new tab)
GPT-2-style LLM built from scratch in C/CUDA with hand-written backprop, BPE tokenizer, FlashAttention, pretraining, and SFT. - JustVugg/nanoeuler
Read the original article