# Small Language Models
This repo contains a from-scratch implementation of a transformer-based language model. It supports training language models on both next-token and previous-token prediction. New features and upgrades are planned.
- Training and inference scripts for forward (next-token prediction) and reverse (previous-token prediction) GPT-2-like language models (see the batching sketch below).
- Dataloaders for the TinyShakespeare and TinyStories datasets (a loading sketch follows this list).
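
As a sketch of how previous-token prediction can be trained with an otherwise unmodified GPT-2-style model, one common trick is to reverse the token stream and then run ordinary next-token prediction on it. The `get_batch` helper below is a hypothetical illustration of that idea, not this repo's actual API:

```python
import torch

def get_batch(tokens: torch.Tensor, block_size: int, batch_size: int, reverse: bool = False):
    """Sample (input, target) pairs for forward or reverse language modeling."""
    if reverse:
        # Next-token prediction on the reversed stream is equivalent to
        # previous-token prediction on the original text.
        tokens = tokens.flip(0)
    # Random starting offsets for each sequence in the batch.
    ix = torch.randint(len(tokens) - block_size - 1, (batch_size,))
    x = torch.stack([tokens[i : i + block_size] for i in ix])
    y = torch.stack([tokens[i + 1 : i + block_size + 1] for i in ix])  # targets shifted by one
    return x, y

# Example usage with a stand-in for an encoded corpus:
tokens = torch.arange(1000)
xb, yb = get_batch(tokens, block_size=8, batch_size=4, reverse=True)
```

Because the reversal happens entirely in the dataloader, the same model and training loop serve both directions.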
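
For TinyShakespeare, a minimal loading sketch might look like the following. `load_tinyshakespeare` and its character-level encoding are illustrative assumptions rather than the repo's actual dataloader; the download URL is Karpathy's public copy of the corpus, while TinyStories is typically fetched from the Hugging Face Hub instead:

```python
import os
import urllib.request

import torch

# Karpathy's public copy of the TinyShakespeare corpus.
URL = "https://raw.githubusercontent.com/karpathy/char-rnn/master/data/tinyshakespeare/input.txt"

def load_tinyshakespeare(path: str = "input.txt") -> torch.Tensor:
    """Download TinyShakespeare if needed and character-encode it."""
    if not os.path.exists(path):
        urllib.request.urlretrieve(URL, path)
    with open(path, "r", encoding="utf-8") as f:
        text = f.read()
    # Minimal character-level vocabulary, as in Karpathy's video series.
    stoi = {ch: i for i, ch in enumerate(sorted(set(text)))}
    return torch.tensor([stoi[ch] for ch in text], dtype=torch.long)
```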
Inspired by and adapted from Andrej Karpathy's video series.