Thanks to visit codestin.com
Credit goes to github.com

Skip to content
Change the repository type filter

All

    Repositories list

    • ArtPrompt

      Public
      [ACL24] Official Repo of Paper `ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs`
      Python
      198900Updated Aug 15, 2025Aug 15, 2025
    • Python
      01000Updated Jul 19, 2025Jul 19, 2025
    • TinyV

      Public
      Your efficient and accurate answer verification system for RL training.
      Python
      24120Updated Jun 23, 2025Jun 23, 2025
    • magpie

      Public
      Python
      69000Updated Apr 8, 2025Apr 8, 2025
    • safechain

      Public
      [ACL 25] SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities
      Python
      32500Updated Apr 2, 2025Apr 2, 2025
    • ChatBug

      Public
      [AAAI25] Official Repo of Paper `ChatBug: A Common Vulnerability of Aligned LLMs Induced by Chat Templates`
      Python
      11000Updated Mar 22, 2025Mar 22, 2025
    • kodcode

      Public
      Generate diverse coding questions and verifiable solutions - all in one framework
      Python
      16000Updated Mar 15, 2025Mar 15, 2025
    • CleanGen

      Public
      [EMNLP 24] Official Implementation of CLEANGEN: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models
      Python
      22010Updated Mar 9, 2025Mar 9, 2025
    • Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding
      Jupyter Notebook
      1414821Updated Jul 19, 2024Jul 19, 2024
    • edc

      Public
      Source Code for "EDC: Effective and Efficient Dialog Comprehension For Dialog State Tracking" (NAACL 2024)
      Python
      0010Updated Jun 18, 2024Jun 18, 2024