August 14, 2023 I’ve been learning about neural networks recently and came across the blog post: GPT in 60 Lines of NumPy, I found that the structure of GPT-2 is not that complicated, and the blog post was implemented in Python with only 60 lines of code (without comments), however, for beginners, why not implementContinue reading “Write a GPT from scratch using C++ (TinyGPT)”