Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View ceferisbarov's full-sized avatar

Block or report ceferisbarov

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

REsampling Uncertainty Bounds for Evaluating NLP

Python 3 Updated Dec 22, 2025
Python 3 Updated Sep 6, 2025

alpha-beta-CROWN: An Efficient, Scalable and GPU Accelerated Neural Network Verifier (winner of VNN-COMP 2021, 2022, 2023, 2024, 2025)

Python 341 89 Updated Jan 17, 2026

An Open-Source Package for Textual Adversarial Attack.

Python 765 130 Updated Jul 20, 2023

AmpleGCG: Learning a Universal and Transferable Generator of Adversarial Attacks on Both Open and Closed LLM

Python 83 8 Updated Nov 3, 2024

Research into chain-of-thought monitoring as an AI Control protocol

Python 11 3 Updated Jun 29, 2025

[ICLR 2024] The official implementation of our ICLR2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models".

Python 424 56 Updated Jan 22, 2025

The Python Risk Identification Tool for generative AI (PyRIT) is an open source framework built to empower security professionals and engineers to proactively identify risks in generative AI systems.

Python 3,336 646 Updated Jan 17, 2026

An extension of nanoGCG which allows multi-prompt, dual model optimization

Python 10 Updated Jun 24, 2025

Code for the paper "Defeating Prompt Injections by Design"

Jupyter Notebook 219 32 Updated Jun 20, 2025

Provider-agnostic, open-source evaluation infrastructure for language models

Python 711 94 Updated Dec 24, 2025

[arXiv 2025] Pre-training script for Clinical ModernBERT

Python 29 3 Updated Apr 29, 2025
Jupyter Notebook 2 Updated Jul 31, 2025

A framework for few-shot evaluation of language models.

Python 11,257 2,974 Updated Jan 21, 2026
Jupyter Notebook 2 Updated Aug 3, 2025
Python 75 8 Updated Dec 19, 2024

Auto get diffusion nlp papers in Axriv. More papers Information can be found in another repository "Diffusion-LM-Papers".

Python 255 16 Updated Jan 22, 2026
Python 11 4 Updated Jun 12, 2025

A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources.

318 15 Updated Oct 15, 2025
Python 75 7 Updated Jun 12, 2025

Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, codes, datasets, evaluations, and analyses.

1,177 100 Updated Jan 6, 2026

A curation of awesome tools, documents and projects about LLM Security.

1,512 153 Updated Aug 20, 2025

A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).

1,834 117 Updated Jan 1, 2026

Solve puzzles. Improve your pytorch.

Jupyter Notebook 3,886 353 Updated Jul 15, 2024
TypeScript 6,277 938 Updated Sep 5, 2025

interactive heightmaps from terrain data

JavaScript 480 141 Updated Jul 23, 2024

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,278 415 Updated Jan 21, 2026
Next