This project tackles the task of building and training a large language model (LLM) from scratch. The model is trained on the FNSPID dataset, which aggregates financial news articles, to answer the question of whether a transformer can be trained to complete sentences in the style of financial news. We transformed and tokenized the FNSPID dataset into a form a transformer can ingest, built a transformer architecture in PyTorch, and trained it on a GPU, tuning hyperparameters with random search. The resulting model reaches a perplexity of 13.69 and completes sentences in a way that reads to a human as plausible financial news.
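To make the reported metric concrete, the sketch below shows one common way to derive perplexity from mean token-level cross-entropy loss in PyTorch. The tensor shapes and variable names are illustrative assumptions for this example, not taken from this repository.

```python
import torch
import torch.nn.functional as F

# Hypothetical evaluation snippet: logits produced by a language model and
# the target token ids it was asked to predict (shapes are illustrative).
batch_size, seq_len, vocab_size = 8, 128, 50257
logits = torch.randn(batch_size, seq_len, vocab_size)
targets = torch.randint(0, vocab_size, (batch_size, seq_len))

# Mean cross-entropy over all predicted tokens.
loss = F.cross_entropy(logits.view(-1, vocab_size), targets.view(-1))

# Perplexity is the exponential of that mean loss. A perplexity of 13.69
# corresponds to a mean cross-entropy of ln(13.69) ≈ 2.62 nats per token.
perplexity = torch.exp(loss)
print(perplexity.item())
```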