
Loosely follows https://arxiv.org/pdf/1706.03762 ("Attention Is All You Need"), but implements only the decoder part, since GPT is a decoder-only model.
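To illustrate what "decoder only" means in practice, here is a minimal sketch of causal (masked) attention weights in plain Python: each position may attend only to itself and earlier positions. The function name `causal_attention_weights` and the toy scores are assumptions made for this illustration, not code taken from gpt.py.

```python
import math


def causal_attention_weights(scores):
    """Row-wise softmax over scores[i][j], with future positions (j > i) masked out.

    Hypothetical helper for illustration only; not the repository's implementation.
    """
    weights = []
    for i, row in enumerate(scores):
        # Position i may only look at positions 0..i.
        visible = row[: i + 1]
        # Subtract the maximum before exponentiating for numerical stability.
        peak = max(visible)
        exps = [math.exp(s - peak) for s in visible]
        total = sum(exps)
        # Masked (future) positions receive a weight of exactly zero.
        weights.append([e / total for e in exps] + [0.0] * (len(row) - i - 1))
    return weights


# Toy example: three positions with identical raw scores.
scores = [[0.0, 0.0, 0.0] for _ in range(3)]
weights = causal_attention_weights(scores)
for row in weights:
    print(row)
```

With identical scores, the first position puts all of its weight on itself, the second splits evenly over two positions, and the third splits evenly over three. A real implementation would typically do this with tensor operations on batched query-key products rather than Python lists.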

A video that walks through all the steps and answers most questions: https://www.youtube.com/watch?v=kCc8FmEb1nY
Uses a simple character-level encoder.
For learning purposes only, to get more insight into GPT models.
Layer norms and residual connections are done differently in GPT than in the paper (GPT-2 applies layer norm before each sub-layer rather than after it).
Open question: masking is applied in every attention head; should that be needed?
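The character encoder mentioned in the notes above can be sketched roughly as follows: every distinct character in the training text gets an integer id, and decoding reverses the mapping. The names `build_vocab`, `encode` and `decode` are hypothetical and chosen for this example; the code in the tokenizer directory may be organised differently.

```python
def build_vocab(text):
    """Map each distinct character to an integer id, in sorted order (illustrative helper)."""
    chars = sorted(set(text))
    stoi = {ch: i for i, ch in enumerate(chars)}  # character -> id
    itos = {i: ch for ch, i in stoi.items()}      # id -> character
    return stoi, itos


def encode(text, stoi):
    """Turn a string into a list of integer ids; raises KeyError on unseen characters."""
    return [stoi[ch] for ch in text]


def decode(ids, itos):
    """Turn a list of integer ids back into a string."""
    return "".join(itos[i] for i in ids)


stoi, itos = build_vocab("hello world")
ids = encode("hello", stoi)
print(ids)                 # [3, 2, 4, 4, 5]
print(decode(ids, itos))   # hello
```

A character-level vocabulary stays tiny, which keeps the embedding table small, at the cost of longer sequences than a subword tokenizer would produce.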

Results from running on an old gaming laptop: see the training directory.
