👽
Stars
(Stepwise controlled Understanding for Trajectories) -- “agent that learns to hunt"
A high-performance attention mechanism that computes softmax normalization in a single streaming pass using running accumulators (online softmax).
MagellaX / rllte
Forked from RLE-Foundation/rllteLong-Term Evolution Project of Reinforcement Learning
FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus Agent Tools, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae…