a lot of my work is made private for various reasons, if you want to collaborate, please reach out on this email address : [email protected]
I also write research summaries and walkthroughs @ narrowfoc.us :
- Using RL to make Databases go Brrr (maybe)
- RL for LLMs (important works -- Research Walkthrough)
- Preference Optimisation Methods
thanks !!