Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View dahua966's full-sized avatar

Block or report dahua966

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 1 Updated Sep 24, 2025

Repo for ACL2023 Findings paper "Emergent Modularity in Pre-trained Transformers"

Python 25 1 Updated Jun 7, 2023

A resource repository for representation engineering in large language models

140 5 Updated Nov 14, 2024

An LLM agent that conducts deep research (local and web) on any given topic and generates a long report with citations.

Python 23,997 3,167 Updated Oct 25, 2025

DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.

Python 1,492 234 Updated Jul 25, 2025

Open source replication of Anthropic's Crosscoders for Model Diffing

Python 59 23 Updated Oct 27, 2024

AmpleGCG: Learning a Universal and Transferable Generator of Adversarial Attacks on Both Open and Closed LLM

Python 75 8 Updated Nov 3, 2024

Awesome papers involving LLMs in Social Science.

549 40 Updated Sep 20, 2025

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 10,643 1,086 Updated Apr 30, 2025

The LLM Evaluation Framework

Python 11,914 1,041 Updated Oct 31, 2025

Align Anything: Training All-modality Model with Feedback

Jupyter Notebook 4,574 505 Updated Aug 25, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,757 99 Updated Mar 18, 2025

A collection of resources that investigate social agents.

192 18 Updated Apr 22, 2025

Official code for "Goal-Conditioned On-Policy Reinforcement Learning" (NeurIPS 2024).

Jupyter Notebook 21 Updated Dec 9, 2024

Official code for "Iterative Regularized Policy Optimization with Imperfect Demonstrations" (ICML2024).

Jupyter Notebook 28 Updated May 27, 2024

Demonstrations generation and training scripts for fly-craft/VVCGym (ICML2024, ICLR2025, ICML2025).

Jupyter Notebook 44 1 Updated Oct 29, 2025

An efficient goal-conditioned reinforcement learning environment for fixed-wing UAV velocity vector control based on Gymnasium (ICLR2025).

Python 84 1 Updated Jul 2, 2025

欢迎来到 LLM-Dojo,这里是一个开源大模型学习场所,使用简洁且易阅读的代码构建模型训练框架(支持各种主流模型如Qwen、Llama、GLM等等)、RLHF框架(DPO/CPO/KTO/PPO)等各种功能。👩‍🎓👨‍🎓

Python 890 80 Updated Oct 28, 2025

Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, codes, datasets, evaluations, and analyses.

1,006 89 Updated Oct 25, 2025

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

Python 1,037 83 Updated Sep 19, 2024

This is the official repo for "PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization". PromptAgent is a novel automatic prompt optimization method that auton…

Python 333 41 Updated Jul 17, 2025

An Open-Source Package for Textual Adversarial Attack.

Python 754 132 Updated Jul 20, 2023

我读过的书。嘿嘿,分享给你。

1,100 416 Updated Dec 25, 2017

Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks

Python 32 1 Updated Jul 9, 2024

Official github repo for AutoDetect, an automated weakness detection framework for LLMs.

Python 44 1 Updated Jun 25, 2024

Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [ICLR 2025]

Shell 358 38 Updated Jan 23, 2025

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,166 1,660 Updated Sep 24, 2025

Master copies of the DISARM frameworks, with generated files to help you explore the data

Jupyter Notebook 254 42 Updated Mar 26, 2025

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Python 1,378 109 Updated Feb 19, 2025
Next