Stars
A lightweight sandboxing tool for enforcing filesystem and network restrictions on arbitrary processes at the OS level, without requiring a container.
🚀 The fast, Pythonic way to build MCP servers and clients
A LLM-free library for extracting main content from HTML strings via Text Density analysis
Build Real-Time Knowledge Graphs for AI Agents
⏩ Ship faster with Continuous AI. Open-source CLI that can be used in TUI mode as a coding agent or Headless mode to run background agents
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
Fully open reproduction of DeepSeek-R1
verl: Volcano Engine Reinforcement Learning for LLMs
A bibliography and survey of the papers surrounding o1
Entropy Based Sampling and Parallel CoT Decoding
Mirage Persistent Kernel: Compiling LLMs into a MegaKernel
Attribute (or cite) statements generated by LLMs back to in-context information.
Efficient Triton Kernels for LLM Training
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance. Accepted to ACL 2024.
Self-Consistent Decoding for More Factual Open Responses
A blazing fast inference solution for text embeddings models
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
RoleInteract: Evaluating the Social Interaction of Role-Playing Agents
Auto get diffusion nlp papers in Axriv. More papers Information can be found in another repository "Diffusion-LM-Papers".
[ICLR'23] DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models
Easily embed, cluster and semantically label text datasets
[ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues
Large World Model -- Modeling Text and Video with Millions Context
Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"