Andrej Karpathy has released a brand new instructional video, which is like the descent of a Bodhisattva, selflessly teaching people to implement the 124M GPT-2 model from scratch.
This video is a full 4 hours long, with extremely detailed content, it can be said to be a hand-holding tutorial.
Andrej Karpathy stated that even if you have no relevant foundation, you can follow this video step by step to implement the GPT-2 model.