The official GitHub repository of the paper "Recent advances in large langauge model benchmarks against data contamination: From static to dynamic evaluation"

400 40 Updated Sep 13, 2025

OpenDCAI / DataFlow

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

Python 2,692 169 Updated Jan 23, 2026

PolyAI-LDN / conversational-datasets

Large datasets for conversational AI

Python 1,381 176 Updated Nov 16, 2019

lumina37 / aiotieba

贴吧接口合集✨可用于工具箱/吧务管理/数据采集

Python 565 82 Updated Jan 20, 2026

thepanacealab / covid19_twitter

Covid-19 Twitter dataset for non-commercial research use and pre-processing scripts - under active development

Jupyter Notebook 480 189 Updated Apr 17, 2023

hiyouga / LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 66,485 8,089 Updated Jan 25, 2026

PKU-Alignment / align-anything

Align Anything: Training All-modality Model with Feedback

Python 4,625 509 Updated Nov 27, 2025

saferlhf-v / saferlhf-v

Python 20 1 Updated Jun 16, 2025

yueliu1999 / GuardReasoner

[ICLR Workshop 2025] An official source code for paper "GuardReasoner: Towards Reasoning-based LLM Safeguards".

Python 164 18 Updated May 19, 2025

xunguangwang / SoK4JailbreakGuardrails

[S&P 2026] SoK: Evaluating Jailbreak Guardrails for Large Language Models

Python 35 4 Updated Dec 17, 2025

AmenRa / GuardBench

A Python library for guardrail models evaluation.

Python 30 6 Updated Oct 9, 2025

thu-coai / ShieldLM

ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors [EMNLP 2024 Findings]

Python 221 10 Updated Sep 29, 2024

togethercomputer / RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,923 369 Updated Dec 7, 2024

justincui03 / or-bench

[ICML 2025] Official repository for paper "OR-Bench: An Over-Refusal Benchmark for Large Language Models"

Python 21 2 Updated Mar 4, 2025

datajuicer / data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 5,780 318 Updated Jan 26, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

zhuxiangyang yangyangyang127

Achievements

Achievements

Block or report yangyangyang127

Stars

lmmlzn / Awesome-LLMs-Datasets

lxltx2025 / LXLTX-nsfc_writer

phenixace / TOMG-Bench

yongkangning / HPD-Kit

eliseu31 / MSDS-Analyser

mims-harvard / TDC

YuyangSunshine / bioprotocolbench

InternScience / Awesome-Scientific-Datasets-and-LLMs

baceolus / BioLP-bench

awesomedata / awesome-public-datasets

centerforaisafety / safetywashing

hq-King / SDEval

FoundationAgents / OpenManus

microsoft / autogen

deep-spin / zsb

SeekingDream / Static-to-Dynamic-LLMEval