-
Alibaba Inc.
- Hangzhou, China
Stars
Lean Algorithmic Trading Engine by QuantConnect (Python, C#)
Bridge Megatron-Core to Hugging Face/Reinforcement Learning
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Best practices for training DeepSeek, Mixtral, Qwen and other MoE models using Megatron Core.
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
A high-performance algorithmic trading platform and event-driven backtester
The absolute trainer to light up AI agents.
A Survey of Reinforcement Learning for Large Reasoning Models
My learning notes for ML SYS.
verl: Volcano Engine Reinforcement Learning for LLMs
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
No fortress, purely open ground. OpenManus is Coming.
Fully open reproduction of DeepSeek-R1
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Learn how to design systems at scale and prepare for system design interviews
✨✨Latest Advances on Multimodal Large Language Models
Jeff Dean's latency numbers plotted over time
OpenYurt - Extending your native Kubernetes to edge(project under CNCF)
An Open Source Machine Learning Framework for Everyone
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Implementation of triplet loss in TensorFlow
An Efficient Enterprise-class Container Engine