Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View 0xzhouchenyu's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report 0xzhouchenyu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone

Python 19,699 3,143 Updated Dec 22, 2025

A Latex style and template for paper preprints (based on NIPS style)

TeX 1,423 362 Updated Jan 2, 2024

LLM4OR homepage project.

Python 22 Updated Aug 29, 2025

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Python 389 16 Updated Jan 19, 2025

Muon is an optimizer for hidden layers in neural networks

Python 2,132 101 Updated Nov 23, 2025

Muon is Scalable for LLM Training

1,388 78 Updated Aug 3, 2025

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 88,819 10,217 Updated Dec 27, 2025

NETLIB LP dataset in .mps format, containing 114 feasible and 29 infeasible instances.

JetBrains MPS 3 Updated Sep 5, 2025

基于selenium的SJTU体育场馆预约脚本

Python 13 2 Updated Oct 13, 2024

Recent research papers about Foundation Models for Combinatorial Optimization

439 34 Updated Dec 26, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,334 334 Updated Dec 24, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,939 923 Updated Dec 15, 2025

Fully open reproduction of DeepSeek-R1

Python 25,759 2,407 Updated Nov 24, 2025

Train transformer language models with reinforcement learning.

Python 16,801 2,380 Updated Dec 26, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,838 2,915 Updated Dec 27, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,662 841 Updated Dec 18, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,526 1,536 Updated Apr 24, 2025

Optimization Modeling Using mip Solvers and large language models

Python 231 45 Updated Nov 4, 2025

Let your Claude able to think

TypeScript 16,630 1,966 Updated Nov 4, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,562 7,829 Updated Dec 26, 2025

A library to generate LaTeX expression from Python code.

Python 7,589 394 Updated Feb 13, 2025

A series of math-specific large language models of our Qwen2 series.

Python 1,055 151 Updated Jan 11, 2025

Code for the paper: Why Transformers Need Adam: A Hessian Perspective

Jupyter Notebook 63 8 Updated Mar 11, 2025

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 24,503 5,841 Updated Aug 14, 2024

Production-ready platform for agentic workflow development.

TypeScript 123,789 19,236 Updated Dec 27, 2025

ORLM: Training Large Language Models for Optimization Modeling

Python 224 34 Updated Sep 18, 2025

Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793

Python 446 15 Updated May 13, 2025

The simplest and most practical Node.js backend template, suitable for quickly setting up small-scale backends. | 最爽的方式起个Nodejs小后端

JavaScript 3 1 Updated Feb 9, 2024
Next