Stars
Summarize existing representative LLMs text datasets.
Build Your Own Local AI-Powered NSFC Proposal Writing Assistant
Material Safety Data Sheets - Operator Procedures Prediction (aka MSDS-OPP)
Therapeutics Commons (TDC): Multimodal Foundation for Therapeutic Science
A curated collection of papers, datasets, and resources on Scientific Datasets and Large Language Models (LLMs)
Benchmark for evaluating capabilities of AI models to understand biological lab protocols
A topic-centric list of HQ open datasets.
Measuring correlations between safety benchmarks and general AI capabilities benchmarks.
No fortress, purely open ground. OpenManus is Coming.
The official GitHub repository of the paper "Recent advances in large langauge model benchmarks against data contamination: From static to dynamic evaluation"
Easy Data Preparation with latest LLMs-based Operators and Pipelines.
Large datasets for conversational AI
Covid-19 Twitter dataset for non-commercial research use and pre-processing scripts - under active development
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Align Anything: Training All-modality Model with Feedback
[ICLR Workshop 2025] An official source code for paper "GuardReasoner: Towards Reasoning-based LLM Safeguards".
[S&P 2026] SoK: Evaluating Jailbreak Guardrails for Large Language Models
A Python library for guardrail models evaluation.
ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors [EMNLP 2024 Findings]
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
[ICML 2025] Official repository for paper "OR-Bench: An Over-Refusal Benchmark for Large Language Models"
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷