forked from karpathy/nanoGPT

From nanoGPT to modern LLMs. Implementation + training.

SuchitG04/gippity

 
 

gippity

From nanoGPT to modern LLMs: gippity walks you through the little and big innovations in LLMs. Each branch implements a specific technique that made LLMs go more brr, and each subsequent branch builds on the previous ones (in the listed order).

branches

  1. master - trains a vanilla GPT (faithful to nanoGPT, and therefore to GPT-2 and GPT-3) to establish a baseline.
  2. rope - adds RoPE (rotary position embeddings) and RMSNorm for faster training iterations, a lower memory footprint, and faster convergence.
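
To make the two techniques in the rope branch concrete, here is a minimal, dependency-free sketch of RMSNorm and RoPE on plain Python lists. It is not the repository's code: the function names `rms_norm`, `rope`, and `dot` are hypothetical, the interleaved pairing of features (0,1), (2,3), ... is one of two common conventions, and `base=10000.0` and `eps=1e-6` are assumed defaults. A real implementation would operate on batched tensors.

```python
import math

def rms_norm(x, weight=None, eps=1e-6):
    """RMSNorm: scale x by 1 / root-mean-square of its features.

    Unlike LayerNorm, there is no mean subtraction and no bias term.
    """
    rms = math.sqrt(sum(v * v for v in x) / len(x) + eps)
    if weight is None:
        weight = [1.0] * len(x)  # learnable gain, initialised to ones
    return [w * v / rms for w, v in zip(weight, x)]

def rope(x, pos, base=10000.0):
    """RoPE: rotate each feature pair (x[i], x[i+1]) by an angle that
    grows with the token position `pos` and shrinks with the pair index."""
    d = len(x)
    assert d % 2 == 0, "feature dimension must be even"
    out = []
    for i in range(0, d, 2):
        theta = pos * base ** (-i / d)  # per-pair rotation angle
        c, s = math.cos(theta), math.sin(theta)
        out += [x[i] * c - x[i + 1] * s, x[i] * s + x[i + 1] * c]
    return out

def dot(a, b):
    """Plain dot product, used here as an attention score."""
    return sum(p * q for p, q in zip(a, b))

if __name__ == "__main__":
    q = [1.0, 2.0, 3.0, 4.0]
    k = [0.5, -1.0, 2.0, 0.25]
    # Same relative offset (2), different absolute positions:
    print(dot(rope(q, 5), rope(k, 3)))
    print(dot(rope(q, 12), rope(k, 10)))
    print(rms_norm(q))
```

The two printed scores agree up to floating-point error, because rotating the query and key by position-dependent angles makes their dot product depend only on the relative offset between the positions. RoPE is applied to queries and keys inside attention rather than added to the token embeddings, so it needs no learned position table, and RMSNorm does less work per call than LayerNorm by skipping the mean and bias.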

Languages

  • Python 100.0%