Some projects I'm using to learn AI since my MS
First follow specific instructions for your operating system listed in subheaders below.
Consider modifying pyproject.toml to handle your necessary cuda version as an optional dependency.
Run uv sync to install all dependencies.
Jupyter is great to work in, but it has several shortcomings:
- A lack of a proper git diff when making a PR
- You can't import jupyter notebooks from other jupyter notebooks
- They can't be linted or formatted by tools like ruff or pyright
So I am using jupytext to make syncronized copies of the jupyter notebooks in plain text .py format.
If you then wrap cells (the code you run) in if __name__ == "__main__" you also now gain the ability to use these notebooks as importable libraries in future work.
I use logseq to manage my ./notes and flashcards. This uses the zettelkasten method to create a knowledge graph, which is perfect for studying.
See .github/pull_request_template.md for the styleguide.
brew install sdl sdl_ttf sdl_image sdl_mixer portmidi
sudo apt-get install libsdl2-dev libsdl2-image-dev libsdl2-mixer-dev libsdl2-ttf-dev libfreetype6-dev libportmidi-dev libjpeg-dev python3-setuptools python3-dev python3-numpy
-
Emoji Meanings
- β Indicates Priority
- π Paper Read
- π Notes Taken
- π» Implementation Completed
-
Reinforcement Learning
- Value Based Methods - I'm pretty much up to date with these methods, but might as well implement them. I may go into less explanation though.
- Policy Based Methods
- πππ» REINFORCE *
- πβ Actor-Critic (A2C, A3C) *
- Trust Region Policy Optimization (TRPO)
- [ ]βProximal Policy Optimization (PPO) *
- Deep Deterministic Policy Gradient (DDPG)
- Model Based Reinforcement Learning
- Exploration in RL
- Multi Agent RL
- Human-Timescale Adaptation in an Open-Ended Task Space
- Distributed RL
-
Transformers
- Tokenization
- Word Embeddings
- πβTransformers
- πβBERT
- [ ]βSentence-BERT
- Fine Tuning
- RLHF
- Direct Preference Optimization
- Multimodality
- Mamba and SSM's
- Sentence Transformers
- Multi token prediction
- Time Series
-
RAG
- π Survey on RAG
- [ ]βREALM
- [ ]βHyde
- [ ]βDPR
- [ ]βRaft
- PRCA
- EAE
- MIPS
- Self reinforce
- Survey on Graph RAG
- π Survey on RAG
-
Diffusion Models
-
[ ]βGraph Neural Networks (GNN)
-
Cognitive Science
-
Techniques
- Profiling
- Debugging Metrics