Highlights

  • Pro

Organizations

@NYUSH-AIIG


How to build a treehole (an anonymous message board)

159 18 Updated Nov 21, 2021

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python 3,359 224 Updated Oct 12, 2025

LeetGPU Challenges

Python 309 25 Updated Oct 27, 2025

Survey of Small Language Models from Penn State, ...

209 16 Updated Oct 11, 2025

TPU inference for vLLM, with unified JAX and PyTorch support.

Python 130 18 Updated Oct 27, 2025
Python 1 Updated Oct 20, 2025

RL on the qwen3-base family of models on gsm8k using verl: is there an RL power law on downstream tasks?

Python 26 1 Updated Oct 19, 2025

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

TypeScript 40,470 2,605 Updated Oct 26, 2025

MineContext is your proactive context-aware AI partner (Context-Engineering + ChatGPT Pulse)

Python 2,757 162 Updated Oct 27, 2025

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 6,516 374 Updated Jun 2, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 3,735 277 Updated Oct 27, 2025

The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

Python 360 12 Updated Jul 11, 2025

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,032 728 Updated Oct 17, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 16,420 1,241 Updated Oct 18, 2025

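🥢 Cook like Laoxiangji 🐔. The main part was completed in 2024; this is not an official Laoxiangji repository. The text comes from the "Laoxiangji Dish Traceability Report" and has been summarized, edited, and organized. CookLikeHOC.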

JavaScript 21,720 2,189 Updated Oct 17, 2025

Python tool for converting files and office documents to Markdown.

Python 82,151 4,607 Updated Oct 20, 2025

📰 Must-read papers and blogs on Speculative Decoding ⚡️

989 52 Updated Oct 25, 2025

Resources for the Enigmata Project.

Python 72 4 Updated Aug 13, 2025

[NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

Python 171 18 Updated Jul 7, 2025

slime is an LLM post-training framework for RL Scaling.

Python 2,260 229 Updated Oct 27, 2025

TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.

Python 21 1 Updated Sep 24, 2025

🏆 AI Best Paper Awards

HTML 46 4 Updated May 20, 2025

A curated list of neural network pruning resources.

2,478 332 Updated Apr 4, 2024

Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models

Python 249 32 Updated Apr 23, 2024

An Awesome List of Agentic Models trained with Reinforcement Learning

HTML 525 16 Updated Oct 13, 2025

[ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"

Python 35 2 Updated Jan 5, 2023

Train transformer language models with reinforcement learning.

Python 16,025 2,255 Updated Oct 27, 2025
Python 6 Updated Sep 11, 2025