Thanks to visit codestin.com
Credit goes to github.com

Skip to content

timlrx/IGCG

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

I-GCG

The official repository for Improved Techniques for Optimization-Based Jailbreaking on Large Language Models.

Please feel free to contact [email protected] if you have any question.

Quick Start

1. Generate suffix initialization

python attack_llm_core_best_update_our_target.py --behaviors_config=behaviors_ours_config.json

2. Generate new json with the initialization

python generate_our_config.py

3. Conduct jailbreaking attack

python run_multiple_attack_our_target.py --behaviors_config=behaviours_gcss_config_init_v2_continued.json --output_path=gcss --model_path="/home/LLM/Llama-2-7b-chat-hf"

Experiments

Comparison results with SOTA jailbreak methods

Transferable performance of jailbreak suffix

Citation

Kindly include a reference to this paper in your publications if it helps your research:

@article{jia2024improved,
  title={Improved Techniques for Optimization-Based Jailbreaking on Large Language Models}, 
      author={Xiaojun Jia and Tianyu Pang and Chao Du and Yihao Huang and Jindong Gu and Yang Liu and Xiaochun Cao and Min Lin},
      year={2024},
      eprint={2405.21018}
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •