Stars
Code for the NPJ AI paper "How Large Language Models Encode Theory-of-Mind: A Study on Sparse Parameter Patterns"
EMNLP 2025 Main paper: RW-Steering for LLMs with disproportionate inappropriate context. Includes raw_data (fake_news/hate_speech/non_factual/privacy), derived datasets (Alignment, Awareness, RW-St…
A community-maintained Python framework for creating mathematical animations.
A resource repository for machine unlearning in large language models
Econometrics AI Agent: A specialized LLM-driven agent for automating complex econometric analysis with zero-shot learning, outperforming general AI in expert tasks. 🚀
AuditBench: A Benchmark for Large Language Models in Financial Statement Auditing
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
RM-R1: Unleashing the Reasoning Potential of Reward Models
TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients. Published in Nature.
Code and Slides
Official code repository for the paper "Internal Activation as the Polar Star for Steering Unsafe LLM Behavior"
Measuring Copyright Risks of Large Language Model via Partial Information Probing
This repository is dedicated to summarizing papers related to large language models with the field of law
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
Code Repo for EMNLP paper: Do LLMs Know to Respect Copyright Notice
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights i…
A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
BookNLP, a natural language processing pipeline for books
Reference implementation for DPO (Direct Preference Optimization)
TruthfulQA: Measuring How Models Imitate Human Falsehoods
A quick guide (especially) for trending instruction finetuning datasets
LaTeX samples for NSF Research.gov Proposal Submission. For more information about Research.gov Proposal Submission visit https://www.research.gov/research-web/content/aboutpsm Feedback [email protected]
AAAI'24 Self-Paced Unified Representation Learning for Hierarchical Multi-Label Classification
FinMem: A Performance-Enhanced LLM Trading Agent with Layered Memory and Character Design
🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
Learning to Walk with Dual Agents for Knowledge Graph Reasoning (AAAI'22)
Explanation method for Graph Neural Networks (GNNs)