Chi Tran baochi0212

🌏

ambivalent retriever

60 followers · 334 following

Hanoi, Vietnam
15:53 (UTC -12:00)
in/chi-tran-68127a222

Achievements

Highlights

Pro

Lists (19)

accelerate diffusion model

agent

ar image gen

deployment

diffusion cot

diffusion LM

efficient attn

latent reasoning VL

llm_eval

long context

omni

reasoning with rl

21 repositories

speech

surveys

text embedding frameworks

traing repo

21 repositories

triton

ui agent

web

Stars

inclusionAI / dInfer

dInfer: An Efficient Inference Framework for Diffusion Language Models

Python 421 41 Updated Feb 11, 2026

lasgroup / SDPO

Reinforcement Learning via Self-Distillation (SDPO)

Python 426 40 Updated Feb 18, 2026

DreamLM / Dream-VLX

Dream-VL and Dream-VLA, a diffusion VLM and a diffusion VLA.

Python 102 4 Updated Jan 14, 2026

Noumena-Network / nmoe

MoE training for Me and You and maybe other people

Python 355 29 Updated Feb 7, 2026

microsoft / InfoAgent

Python 33 2 Updated Feb 6, 2026

mindfold-ai / Trellis

All-in-one AI framework & toolkit

Python 2,250 112 Updated Feb 17, 2026

Intelligent-Internet / ii-agent

II-Agent: a new open-source framework to build and deploy intelligent agents

Python 3,160 485 Updated Feb 4, 2026

anthropics / original_performance_takehome

Anthropic's original performance take-home, now open for you to try!

Python 3,478 772 Updated Jan 22, 2026

AQ-MedAI / MrlX

MrlX: A Multi-Agent Reinforcement Learning Framework

Python 190 12 Updated Jan 19, 2026

meta-pytorch / OpenEnv

An interface library for RL post training with environments.

Python 1,157 177 Updated Feb 19, 2026

OpenBMB / AgentCPM

An End-to-End Infrastructure for Training and Evaluating Various LLM Agents

Python 739 62 Updated Feb 9, 2026

MiroMindAI / MiroThinker

MiroThinker is an open-source deep research agent optimized for research and prediction. It achieves an 80.8% Avg@8 score on the challenging GAIA benchmark.

Python 6,301 466 Updated Feb 10, 2026

MiroMindAI / MiroRL

MiroRL is an MCP-first reinforcement learning framework for deep research agents.

Python 231 19 Updated Aug 27, 2025

anomalyco / opencode

The open source coding agent.

TypeScript 106,685 10,459 Updated Feb 19, 2026

z-lab / dflash

DFlash: Block Diffusion for Flash Speculative Decoding

Python 551 34 Updated Feb 18, 2026

QwenLM / Qwen3-VL-Embedding

Python 1,019 75 Updated Feb 2, 2026

masamasa59 / ai-agent-papers

A collection of AI Agents papers (Updated biweekly)

1,067 80 Updated Feb 15, 2026

Agent-RL / ReCall

ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning

Python 1,323 79 Updated May 16, 2025

PRIME-RL / TTRL

[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

Python 992 72 Updated Sep 26, 2025

XuankunRong / SafeGRPO

SafeGRPO: Self-Rewarded Multimodal Safety Alignment via Rule-Governed Policy Optimization

Python 11 1 Updated Feb 19, 2026

sgl-project / mini-sglang

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 3,509 436 Updated Feb 18, 2026

zilliztech / claude-context

Code search MCP for Claude Code. Make the entire codebase the context for any coding agent.

TypeScript 5,364 484 Updated Sep 16, 2025

open-thought / reasoning-gym

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,346 111 Updated Jan 16, 2026

SJTU-DENG-Lab / Mantis

The official implementation of Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight

Python 78 1 Updated Jan 16, 2026

EvolvingLMMs-Lab / LLaVA-OneVision-1.5-RL

Fully Open Framework for Democratized Multimodal Reinforcement Learning.

Python 41 3 Updated Dec 19, 2025

inclusionAI / LLaDA2.X

LLaDA2.0 is the diffusion language model series developed by InclusionAI team, Ant Group.

344 20 Updated Feb 12, 2026

raghavlite / B3

Python 37 Updated Jan 12, 2026

MIT-MI / MEM1

Python 247 18 Updated Jan 3, 2026

JT-Ushio / MHA2MLA

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs

Python 204 21 Updated Dec 4, 2025

vllm-project / vllm-omni

A framework for efficient model inference with omni-modality models

Python 2,766 435 Updated Feb 16, 2026