
raspberryice

Highlights

  • Pro

Organizations

@UIUC-data-mining

Starred repositories

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,841 375 Updated Oct 17, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,277 806 Updated Oct 27, 2025

AllenAI's post-training codebase

Python 3,275 453 Updated Oct 30, 2025

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 269 26 Updated Oct 29, 2025

Benchmarking Chat Assistants on Long-Term Interactive Memory (ICLR 2025)

Python 254 12 Updated Oct 24, 2025

Recursive-Open-Meta-Agent v0.1 (Beta). A meta-agent framework to build high-performance multi-agent systems.

Python 4,425 671 Updated Oct 22, 2025

τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment

Python 377 70 Updated Oct 23, 2025

Python 130 18 Updated Oct 29, 2025

Resources for the Enigmata Project.

Python 72 4 Updated Aug 13, 2025

Python 44 3 Updated Mar 4, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 16,563 1,258 Updated Oct 29, 2025

Simple RL training for reasoning

Python 3,778 278 Updated Aug 3, 2025

Python 334 20 Updated Jul 29, 2025

Python 63 4 Updated Feb 5, 2025

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 17,206 2,830 Updated Dec 18, 2024

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,182 1,753 Updated Oct 13, 2025

LOFT: A 1 Million+ Token Long-Context Benchmark

Python 218 17 Updated Jun 13, 2025

LongBench v2 and LongBench (ACL 25'&24')

Python 1,005 107 Updated Jan 15, 2025

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 2,061 215 Updated Aug 17, 2024

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 47,638 3,894 Updated Oct 29, 2025

Open weights language model from Google DeepMind, based on Griffin.

Python 653 33 Updated Jun 4, 2025

[ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI

Python 441 56 Updated Oct 15, 2025

Friends of OLMo and their links.

349 30 Updated Sep 15, 2025

Helpful tools and examples for working with flex-attention

Python 1,037 64 Updated Oct 23, 2025

A tiny library for coding with large language models.

Python 1,235 74 Updated Jul 10, 2024

Evaluate your LLM's response with Prometheus and GPT4 💯

Python 1,006 64 Updated Apr 25, 2025

[NeurIPS 2024] Knowledge Circuits in Pretrained Transformers

Python 159 11 Updated Feb 20, 2025

🚀 Efficient implementations of state-of-the-art linear attention models

Python 3,591 279 Updated Oct 30, 2025

Train transformer language models with reinforcement learning.

Python 16,066 2,258 Updated Oct 30, 2025

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 16,385 1,277 Updated Oct 6, 2025