A minimal PyTorch approx-implementation of GPT

alhaad/zeptogpt

zeptogpt

A minimal JAX/PyTorch approx-implementation of GPT, based on karpathy's 'Let's build GPT'. The goal here is to learn frameworks (JAX, PyTorch), models (GPT, Llama, Gemma), evals (HellaSwag, MMLU), and more. The vision is to be able to train, fine-tune, and run inference on SOTA small-to-medium models using freely available TPUs.
