Stars
Baselines and Datasets for Pokémon Showdown RL
A Python interface for training reinforcement learning bots to battle on Pokémon Showdown
We introduce ChatAnime, the first Emotionally Supportive Role-Playing (ESRP) dataset.
Implementation of a Denoising Diffusion Probabilistic Model in PyTorch
Universal memory layer for AI Agents; Announcing OpenMemory MCP - local and secure memory management.
Thai Far‑Field Meeting Corpus for Robust Conversational ASR
Implement a reasoning LLM in PyTorch from scratch, step by step
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
Browser-based viewer and agent for the AIWolf Contest (Natural Language Division)
Persona Vectors: Monitoring and Controlling Character Traits in Language Models
Thammasat School of Engineering CN 101 2568 Semester 1
Wan: Open and Advanced Large-Scale Video Generative Models
AgentSociety: Large-scale Social Simulation to Understand Human Behaviors and Society through LLM-driven Agents
Democratizing AI scientists with ToolUniverse
Kimi K2 is the large language model series developed by the Moonshot AI team
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Machine Learning In Production (MLOps)
🚀 Lightning-fast computer vision models. Fine-tune SOTA models with just a few lines of code. Ready for cloud ☁️ and edge 📱 deployment.
Environments for LLM Reinforcement Learning
Implementing DeepSeek R1's GRPO algorithm from scratch
First-principles implementations of groundbreaking AI algorithms using a wide range of deep learning frameworks, accompanied by supporting research papers and demos.
Open-source Thai text-to-speech (TTS): a tool for generating speech from text using the Flow Matching technique