Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View Zephyr271828's full-sized avatar

Highlights

  • Pro

Organizations

@NYUSH-AIIG

Block or report Zephyr271828

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
108 results for source starred repositories
Clear filter

如何搭建一个树洞

159 18 Updated Nov 21, 2021

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python 3,401 229 Updated Nov 2, 2025

LeetGPU Challenges

Python 455 31 Updated Nov 11, 2025

Survey of Small Language Models from Penn State, ...

213 17 Updated Nov 6, 2025

TPU inference for vLLM, with unified JAX and PyTorch support.

Python 159 30 Updated Nov 12, 2025

Reinforcement Learning Finetunes Small Subnetworks in Large Language Models

Python 3 Updated Oct 20, 2025

qwen3-base family of models RL on gsm8k using verl, is there an RL power law on downstream tasks?

Python 26 1 Updated Oct 19, 2025

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

TypeScript 42,069 2,777 Updated Nov 12, 2025

MineContext is your proactive context-aware AI partner(Context-Engineering+ChatGPT Pulse)

Python 3,521 222 Updated Nov 11, 2025

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 6,553 375 Updated Jun 2, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 3,900 308 Updated Nov 12, 2025

The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

Python 376 12 Updated Jul 11, 2025

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,091 733 Updated Nov 11, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,091 1,299 Updated Nov 10, 2025

🥢像老乡鸡🐔那样做饭。主要部分于2024年完工,非老乡鸡官方仓库。文字来自《老乡鸡菜品溯源报告》,并做归纳、编辑与整理。CookLikeHOC.

JavaScript 22,018 2,214 Updated Oct 17, 2025

Python tool for converting files and office documents to Markdown.

Python 82,902 4,694 Updated Oct 20, 2025

📰 Must-read papers and blogs on Speculative Decoding ⚡️

1,015 52 Updated Oct 25, 2025

Resources for the Enigmata Project.

Python 73 4 Updated Aug 13, 2025

[NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

Python 183 20 Updated Jul 7, 2025

slime is an LLM post-training framework for RL Scaling.

Python 2,446 248 Updated Nov 11, 2025

TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.

Python 21 1 Updated Sep 24, 2025

🏆 AI Best Paper Awards

HTML 47 4 Updated May 20, 2025

A curated list of neural network pruning resources.

2,481 332 Updated Apr 4, 2024

Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models

Python 253 33 Updated Apr 23, 2024

Awesome List for Agentic RL

HTML 536 15 Updated Nov 9, 2025

[ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"

Python 35 2 Updated Jan 5, 2023

Train transformer language models with reinforcement learning.

Python 16,262 2,288 Updated Nov 12, 2025
Next