dahua966

HuaSir dahua966

I'm HuaSir

34 followers · 59 following

Achievements

Highlights

1 security advisory credit

Lists (3)

LLM

LLM Security

Pentest

Stars

Shword07117 / AIPsychoBench

Python 1 Updated Sep 24, 2025

thunlp / Modularity-Analysis

Repo for ACL2023 Findings paper "Emergent Modularity in Pre-trained Transformers"

Python 25 1 Updated Jun 7, 2023

chrisliu298 / awesome-representation-engineering

A resource repository for representation engineering in large language models

140 5 Updated Nov 14, 2024

assafelovic / gpt-researcher

An LLM agent that conducts deep research (local and web) on any given topic and generates a long report with citations.

Python 23,997 3,167 Updated Oct 25, 2025

AlibabaResearch / DAMO-ConvAI

DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.

Python 1,492 234 Updated Jul 25, 2025

ckkissane / crosscoder-model-diff-replication

Open source replication of Anthropic's Crosscoders for Model Diffing

Python 59 23 Updated Oct 27, 2024

OSU-NLP-Group / AmpleGCG

AmpleGCG: Learning a Universal and Transferable Generator of Adversarial Attacks on Both Open and Closed LLM

Python 75 8 Updated Nov 3, 2024

ValueByte-AI / Awesome-LLM-in-Social-Science

Awesome papers involving LLMs in Social Science.

549 40 Updated Sep 20, 2025

wdndev / llm_interview_note

Notes on knowledge and interview questions for large language model (LLM) algorithm (application) engineers.

HTML 10,643 1,086 Updated Apr 30, 2025

confident-ai / deepeval

The LLM Evaluation Framework

Python 11,914 1,041 Updated Oct 31, 2025

PKU-Alignment / align-anything

Align Anything: Training All-modality Model with Feedback

Jupyter Notebook 4,574 505 Updated Aug 25, 2025

PRIME-RL / PRIME

Scalable RL solution for advanced reasoning of language models

Python 1,757 99 Updated Mar 18, 2025

FudanDISC / SocialAgent

A collection of resources that investigate social agents.

192 18 Updated Apr 22, 2025

GongXudong / GCPO

Official code for "Goal-Conditioned On-Policy Reinforcement Learning" (NeurIPS 2024).

Jupyter Notebook 21 Updated Dec 9, 2024

GongXudong / IRPO

Official code for "Iterative Regularized Policy Optimization with Imperfect Demonstrations" (ICML2024).

Jupyter Notebook 28 Updated May 27, 2024

GongXudong / fly-craft-examples

Demonstrations generation and training scripts for fly-craft/VVCGym (ICML2024, ICLR2025, ICML2025).

Jupyter Notebook 44 1 Updated Oct 29, 2025

GongXudong / fly-craft

An efficient goal-conditioned reinforcement learning environment for fixed-wing UAV velocity vector control based on Gymnasium (ICLR2025).

Python 84 1 Updated Jul 2, 2025

mst272 / LLM-Dojo

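Welcome to LLM-Dojo, an open-source place for learning about large models. It uses concise, readable code to build a model training framework (supporting mainstream models such as Qwen, Llama, and GLM), an RLHF framework (DPO/CPO/KTO/PPO), and other features. 👩‍🎓👨‍🎓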

Python 890 80 Updated Oct 28, 2025

yueliu1999 / Awesome-Jailbreak-on-LLMs

Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, codes, datasets, evaluations, and analyses.

1,006 89 Updated Oct 25, 2025

thunlp / OpenDelta

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

Python 1,037 83 Updated Sep 19, 2024

NY1024 / Foundation-Model-Paper-Notes

66 4 Updated May 22, 2025

XinyuanWangCS / PromptAgent

This is the official repo for "PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization". PromptAgent is a novel automatic prompt optimization method that auton…

Python 333 41 Updated Jul 17, 2025

thunlp / OpenAttack

An Open-Source Package for Textual Adversarial Attack.

Python 754 132 Updated Jul 20, 2023

skyformat99 / books-1

Forked from Bcupwater/books

Books I have read. Hehe, sharing them with you.

1,100 416 Updated Dec 25, 2017

thu-coai / SafeUnlearning

Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks

Python 32 1 Updated Jul 9, 2024

thu-coai / AutoDetect

Official GitHub repo for AutoDetect, an automated weakness detection framework for LLMs.

Python 44 1 Updated Jun 25, 2024

tml-epfl / llm-adaptive-attacks

Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [ICLR 2025]

Shell 358 38 Updated Jan 23, 2025

OpenBMB / MiniCPM-V

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,166 1,660 Updated Sep 24, 2025

DISARMFoundation / DISARMframeworks

Master copies of the DISARM frameworks, with generated files to help you explore the data

Jupyter Notebook 254 42 Updated Mar 26, 2025

tencent-ailab / persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Python 1,378 109 Updated Feb 19, 2025