Credit goes to monads.substack.com
m0nads
Reasoning without External Rewards
From self-certainty to Reinforcement Learning from Internal Feedback
Jul 20, 2025 • m0nads
LLMs: a 5 mins trip
What Large Language Models actually do in mathematical terms
Jul 15, 2025 • m0nads
The Kullback-Leibler divergence
A fundamental comparison tool
Jul 3, 2025 • m0nads
Group Relative Policy Optimization
An efficient and effective reinforcement learning algorithm
Feb 25, 2025 • m0nads
Minimal RAG model
Using Cohere and SerpAPI
Jan 26, 2025 • m0nads
Byte Latent Transformer
The Efficiency of Dynamic Byte Patching
Jan 14, 2025 • m0nads
Exploring Florence-2
Multimodal AI with Unified Vision-Language Capabilities
Aug 12, 2024 • m0nads
Covariance and Correlation
Statistical quantities for variables relationships
Jun 17, 2024 • m0nads
m0nads
AI and more