small-language-models

Here are 137 public repositories matching this topic...

[ICML 2026] Dispersion loss counteracts embedding condensation and improves generalization in small language models

  • Updated May 8, 2026
  • Python

This repository provides a Jupyter Notebook for building a small language model from scratch using the TinyStories dataset. It covers data preprocessing, BPE tokenization, binary storage, GPU memory management, and training a Transformer in PyTorch, and lets you generate sample stories to test the model. Ideal for learning NLP and PyTorch.

  • Updated Jun 7, 2025
  • Jupyter Notebook

A governed local AI build-and-memory system that trains small brains, compares them, protects the better one, archives the worse one, and preserves the evidence of why. v1.0.0/governed-v2.2.0+

  • Updated May 12, 2026
  • Python
