laoliukan

laoliukan

1 follower · 3 following

Stars

lmarena / arena-hard-auto

Arena-Hard-Auto: An automatic LLM benchmark.

Python 948 130 Updated Jun 21, 2025

open-compass / opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 6,239 677 Updated Oct 24, 2025

PaddlePaddle / PaddleFormers

PaddleFormers is an easy-to-use library of pre-trained large language model zoo based on PaddlePaddle.

Python 12,942 2,127 Updated Oct 30, 2025

RUCAIBox / R1-Searcher

R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Python 649 45 Updated Aug 5, 2025

lechmazur / writing

This benchmark tests how well LLMs incorporate a set of 10 mandatory story elements (characters, objects, core concepts, attributes, motivations, etc.) in a short creative story

Batchfile 313 7 Updated Sep 23, 2025

PeterGriffinJin / Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,425 290 Updated Oct 29, 2025

zjunlp / OmniThink

[EMNLP 2025] OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking

Python 464 61 Updated Aug 23, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,941 2,383 Updated Oct 30, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,585 2,399 Updated Sep 8, 2025

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 12,330 1,518 Updated Apr 24, 2025

agentscope-ai / agentscope

AgentScope: Agent-Oriented Programming for Building LLM Applications

Python 13,472 1,086 Updated Oct 30, 2025

cjyyx / AI_Gen_Novel

基于大语言模型(LLM)和多智能体(Multi-Agent)，探究AI写小说能力的边界

Python 347 66 Updated Sep 4, 2024

richards199999 / Thinking-Claude

Let your Claude able to think

TypeScript 16,280 1,919 Updated Mar 10, 2025

Open-Source-O1 / o1_Reasoning_Patterns_Study

Python 104 7 Updated Dec 6, 2024

Open-Source-O1 / Open-O1

Python 1,349 54 Updated Nov 21, 2024

zhentingqi / rStar

Python 963 110 Updated Jan 23, 2025

thu-coai / ComplexBench

Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)

Python 96 11 Updated Feb 20, 2025

jxzhangjhu / Awesome-LLM-Prompt-Optimization

Awesome-LLM-Prompt-Optimization: a curated list of advanced prompt optimization and tuning methods in Large Language Models

379 16 Updated Mar 27, 2024

confident-ai / deepeval

The LLM Evaluation Framework

Python 11,894 1,042 Updated Oct 30, 2025

prometheus-eval / prometheus-eval

Evaluate your LLM's response with Prometheus and GPT4 💯

Python 1,006 63 Updated Apr 25, 2025

xianshang33 / llm-paper-daily

Daily updated LLM papers. 每日更新 LLM 相关的论文，欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个

1,191 52 Updated Jul 31, 2024

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 61,268 7,418 Updated Oct 30, 2025

linexjlin / GPTs

leaked prompts of GPTs

31,199 4,277 Updated Sep 27, 2024

RUC-AIMind / TikTalk

Python 70 2 Updated Jun 1, 2025

ContextualAI / HALOs

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 892 49 Updated Sep 30, 2025

langgptai / wonderful-prompts

🔥中文 prompt 精选🔥，ChatGPT 使用指南，提升 ChatGPT 可玩性和可用性！🚀

5,037 447 Updated Oct 22, 2025

langgptai / LangGPT

LangGPT: Empowering everyone to become a prompt expert! 🚀 📌 结构化提示词（Structured Prompt）提出者 📌 元提示词（Meta-Prompt）发起者 📌 最流行的提示词落地范式 | Language of GPT The pioneering framework for structured & meta-prompt…

Jupyter Notebook 10,983 868 Updated Oct 24, 2025

LAION-AI / Open-Instruction-Generalist

Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks

Python 209 19 Updated Jan 13, 2024

NicholasCao / Awesome-Chinese-ChatGPT

收录实现中文版ChatGPT的各种技术路线，数据及其他资料

35 2 Updated Jul 12, 2023

h11128 / sequential_hierarchical-explanation

Layer-wise Analysis of Bert Model for Sentiment Analysis

Jupyter Notebook 2 1 Updated Oct 16, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly