Starred repositories

My learning notes/codes for ML SYS.

Python 4,154 253 Updated Nov 10, 2025

[NeurIPS 2025] Official Implementation of ViSpec: Accelerating Vision-Language Models with Vision-Aware Speculative Decoding.

Python 31 Updated Oct 31, 2025

A multilingual and multimodal LLM E-Commerce benchmark.

3 Updated Oct 27, 2025

Code from "Exploring optimal transport-based multi-grained alignments for text-molecule retrieval" (IEEE BIBM 2024)

Python 2 Updated Mar 21, 2025

SWE-bench: Can Language Models Resolve Real-world Github Issues?

Python 3,796 688 Updated Oct 11, 2025

Demystifying Reinforcement Learning in Agentic Reasoning

Python 115 21 Updated Oct 14, 2025

The development and future prospects of large multimodal reasoning models.

542 20 Updated Aug 2, 2025

🚀 EvoAgentX: Building a Self-Evolving Ecosystem of AI Agents

Python 2,193 155 Updated Nov 9, 2025

[ACL 2025] Optimal Transport-Based Token Weighting scheme for Enhanced Preference Optimization

Python 7 Updated May 22, 2025

SPEC-RL: Accelerating On-Policy Reinforcement Learning via Speculative Rollouts

Python 43 2 Updated Oct 14, 2025
Rust 2 Updated Nov 14, 2025

DCPO: Dynamic Adaptive Clipping for RL

Python 43 5 Updated Sep 25, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,054 303 Updated Nov 13, 2025
Python 66 5 Updated Jun 28, 2025

Interactive PyTorch forward pass visualization in notebooks

Python 607 24 Updated Nov 1, 2025

Awesome-GraphRAG: A curated list of resources (surveys, papers, benchmarks, and open-source projects) on graph-based retrieval-augmented generation.

1,810 160 Updated Nov 10, 2025

About Awesome things towards foundation agents. Papers / Repos / Blogs / ...

1,831 178 Updated Jul 28, 2025

The true story of the hardship and darkness behind the development of the Noah's Ark Pangu large model.

11,374 1,363 Updated Jul 9, 2025

Fully open reproduction of DeepSeek-R1

Python 25,641 2,399 Updated Sep 8, 2025

A series of math-specific large language models of our Qwen2 series.

Python 1,030 145 Updated Jan 11, 2025

Recipes to train reward model for RLHF.

Python 1,477 103 Updated Apr 24, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,606 2,521 Updated Nov 14, 2025

Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"

Python 176 14 Updated May 20, 2025

An Open Large Reasoning Model for Real-World Solutions

Python 1,527 80 Updated May 30, 2025

InstantIR: Blind Image Restoration with Instant Generative Reference 🔥

Python 529 59 Updated Nov 14, 2024

GenRM-CoT: Data release for verification rationales

67 6 Updated Oct 16, 2024

A compact LLM pretrained in 9 days by using high quality data

Python 333 26 Updated Apr 9, 2025

ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models

Python 189 5 Updated Oct 8, 2024