Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View fishcrap's full-sized avatar

Block or report fishcrap

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 95,387 13,114 Updated Jan 29, 2026

A curated list of awesome skills, hooks, slash-commands, agent orchestrators, applications, and plugins for Claude Code by Anthropic

Python 22,234 1,263 Updated Jan 29, 2026

Standardized environment infrastructure for Agentic AI development.

Python 249 25 Updated Jan 29, 2026

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 18,093 1,389 Updated Jan 21, 2026

The absolute trainer to light up AI agents.

Python 11,863 970 Updated Jan 27, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 68,927 12,997 Updated Jan 29, 2026

PyTorch native quantization and sparsity for training and inference

Python 2,649 413 Updated Jan 29, 2026

FlashInfer: Kernel Library for LLM Serving

Python 4,801 672 Updated Jan 28, 2026

A PyTorch native platform for training generative AI models

Python 5,016 681 Updated Jan 29, 2026

An Open-Source Large-Scale Reinforcement Learning Project for Search Agents

Python 545 34 Updated Nov 26, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,498 204 Updated Jan 25, 2026

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,879 328 Updated Nov 13, 2025

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 3,461 276 Updated Jan 29, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,726 207 Updated Jan 29, 2026

slime is an LLM post-training framework for RL Scaling.

Python 3,576 471 Updated Jan 29, 2026

Example models using DeepSpeed

Python 6,777 1,117 Updated Dec 19, 2025
Python 761 49 Updated Dec 23, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 18,794 3,129 Updated Jan 29, 2026

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,514 345 Updated Jan 29, 2026

minimal-cost for training 0.5B R1-Zero

Python 805 103 Updated May 14, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,430 164 Updated Mar 20, 2025

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,211 867 Updated Jul 6, 2024

Democratizing Reinforcement Learning for LLMs

Python 5,052 492 Updated Jan 29, 2026

A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.

Python 249 10 Updated Apr 15, 2025

Official Repo for Open-Reasoner-Zero

Python 2,085 117 Updated Jun 2, 2025

Fully open reproduction of DeepSeek-R1

Python 25,845 2,410 Updated Nov 24, 2025

s1: Simple test-time scaling

Python 6,635 767 Updated Jun 25, 2025

The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention

Python 3,314 316 Updated Jul 7, 2025

LaTeX Plugin for Adobe Illustrator

C++ 295 9 Updated Jan 16, 2026

Common used path planning algorithms with animations.

Python 9,111 1,760 Updated Feb 6, 2023
Next