# tinyGPT

A tiny decoder-only transformer, based on the architecture described in the "Attention Is All You Need" paper, built for fun and educational purposes.

## LLM Optimizations

- residual connections
- layer normalization

## Credits

Inspired by the work of A. Karpathy: https://github.com/karpathy/minGPT.
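
For readers new to the two items listed under LLM Optimizations, here is a minimal pure-Python sketch of how residual connections and layer normalization typically fit together in one transformer sub-block. It is not taken from the tinyGPT code: the names `layer_norm`, `residual_block`, and `toy_sublayer` are hypothetical, and the pre-norm ordering (normalize, apply the sublayer, then add the input back) is an assumption borrowed from GPT-2-style models. The original paper applies the normalization after the addition instead.

```python
import math


def layer_norm(x, gamma=None, beta=None, eps=1e-5):
    """Normalize one feature vector to roughly zero mean and unit variance.

    gamma and beta are the learnable scale and shift; here they default
    to 1 and 0 so the function works without any trained parameters.
    """
    n = len(x)
    mean = sum(x) / n
    var = sum((v - mean) ** 2 for v in x) / n  # biased variance over features
    gamma = gamma if gamma is not None else [1.0] * n
    beta = beta if beta is not None else [0.0] * n
    scale = 1.0 / math.sqrt(var + eps)
    return [g * (v - mean) * scale + b for v, g, b in zip(x, gamma, beta)]


def residual_block(x, sublayer):
    """Pre-norm residual connection (assumed ordering): x + sublayer(layer_norm(x))."""
    y = sublayer(layer_norm(x))
    return [a + b for a, b in zip(x, y)]


def toy_sublayer(h):
    """Hypothetical stand-in for self-attention or the feed-forward network."""
    return [0.5 * v for v in h]


hidden = [1.0, 2.0, 3.0, 4.0]
out = residual_block(hidden, toy_sublayer)
print(out)
```

The residual path lets the input pass through unchanged when the sublayer contributes nothing, which is a large part of why deep stacks of such blocks remain trainable. The normalization keeps the scale of the activations entering each sublayer stable.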