Bulding my own Diffusion Language Model from scratch was easier than I thought [P]
The author built a diffusion language model from scratch to better understand complex concepts, without the help of AI-generated code. They trained the 7.5M parameter model on the tiny Shakespeare dataset and shared the code on GitHub.