Starred repositories
- A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
- An easy-to-use, scalable, and high-performance RLHF framework based on Ray (PPO, GRPO, REINFORCE++, vLLM, dynamic sampling, and async agentic RL).
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.
- Benchmarking Chat Assistants on Long-Term Interactive Memory (ICLR 2025).
- Recursive-Open-Meta-Agent v0.1 (beta): a meta-agent framework for building high-performance multi-agent systems.
- τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment.
- Tongyi Deep Research, the leading open-source deep research agent.
- Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
- Qwen3 is the large language model series developed by the Qwen team at Alibaba Cloud.
- LOFT: A 1 Million+ Token Long-Context Benchmark.
- Simple retrieval from LLMs at various context lengths to measure accuracy (a toy sketch of the idea follows this list).
- Fine-tuning & reinforcement learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, and TTS models 2x faster with 70% less VRAM.
- An open-weights language model from Google DeepMind, based on Griffin.
- [ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI.
- Helpful tools and examples for working with flex-attention.
- A tiny library for coding with large language models.
- Evaluate your LLM's responses with Prometheus and GPT-4 💯
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers.
- 🚀 Efficient implementations of state-of-the-art linear attention models.
- Train transformer language models with reinforcement learning.
- tiktoken is a fast BPE tokeniser for use with OpenAI's models (a minimal usage sketch follows this list).
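For the context-length retrieval item above, here is a toy sketch of the needle-in-a-haystack idea, not the repository's actual code: plant a known fact ("the needle") at varying depths inside filler text of varying lengths, ask the model to retrieve it, and score each (context length, depth) cell. The `query_model` stub, needle text, and grid values are hypothetical placeholders; swap in a real LLM client to measure an actual model.

```python
# Toy needle-in-a-haystack sketch (hypothetical, not the repo's code):
# insert a known fact at a fractional depth in filler text, ask for it
# back, and record accuracy per (context length, depth) cell.

NEEDLE = "The best thing to do in San Francisco is eat a sandwich in Dolores Park."
QUESTION = "What is the best thing to do in San Francisco?"
FILLER = "The grass is green. The sky is blue. " * 4000  # haystack text

def query_model(prompt: str) -> str:
    """Placeholder for a real LLM call. This dummy 'model' just echoes the
    first sentence mentioning the topic so the script runs end to end."""
    for sentence in prompt.split("."):
        if "best thing to do in San Francisco" in sentence:
            return sentence.strip()
    return "I don't know."

def build_haystack(context_chars: int, depth: float) -> str:
    """Insert the needle at fractional `depth` into `context_chars` of filler."""
    haystack = FILLER[:context_chars]
    pos = int(len(haystack) * depth)
    return haystack[:pos] + " " + NEEDLE + " " + haystack[pos:]

def run_grid(context_sizes, depths):
    results = {}
    for size in context_sizes:
        for depth in depths:
            prompt = (build_haystack(size, depth)
                      + f"\n\nAnswer only from the text above. {QUESTION}")
            answer = query_model(prompt)
            # Crude scoring: did the key phrase make it into the answer?
            results[(size, depth)] = "Dolores Park" in answer
    return results

if __name__ == "__main__":
    print(run_grid(context_sizes=[2_000, 20_000], depths=[0.0, 0.5, 1.0]))
```

Substring-match scoring is deliberately crude; a fuller setup would use a stronger judge (e.g. an LLM grader), but the (length × depth) grid is the essential structure.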
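And for the last item, a minimal tiktoken usage sketch (assuming `pip install tiktoken`; `cl100k_base` is one of the library's published encodings):

```python
# Minimal tiktoken round-trip: encode text to BPE token ids and back.
import tiktoken

# Load an encoding by name, or look one up for a specific model.
enc = tiktoken.get_encoding("cl100k_base")

text = "tiktoken is a fast BPE tokeniser."
tokens = enc.encode(text)  # list[int] of token ids
print(f"{len(tokens)} tokens: {tokens}")

# Decoding the ids recovers the original string exactly.
assert enc.decode(tokens) == text

# encoding_for_model maps a model name to its tokeniser.
enc_4o = tiktoken.encoding_for_model("gpt-4o")
print(enc_4o.name)  # o200k_base
```

Counting tokens this way is the usual trick for budgeting a prompt against a model's context window before sending it.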