

Organizations

@Snapaper @ArtalkJS @Snapodcast @Lune-Lab @StructEval


Starred repositories


GLM-OCR: Accurate × Fast × Comprehensive

Python 1,535 96 Updated Feb 12, 2026

The official implementation of the paper "AgentDyn: A Dynamic Open-Ended Benchmark for Evaluating Prompt Injection Attacks of Real-World Agent Security System".

Python 13 1 Updated Feb 4, 2026

Nano vLLM

Python 11,673 1,565 Updated Nov 3, 2025

Contexts Optical Compression

Python 22,460 2,057 Updated Jan 27, 2026

[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333

Python 1,147 86 Updated Jan 11, 2024

A benchmark for evaluating the robustness of LLMs and defenses to indirect prompt injection attacks.

Python 103 13 Updated Apr 15, 2024

The Granite Guardian models are designed to detect risks in prompts and responses.

Jupyter Notebook 130 13 Updated Oct 8, 2025

A benchmark for prompt injection detection systems.

Jupyter Notebook 159 20 Updated Dec 16, 2025

Provides pre-built flash-attention package wheels for Linux and Windows platforms using GitHub Actions

Python 920 58 Updated Feb 13, 2026

Official codebase for "STAIR: Improving Safety Alignment with Introspective Reasoning"

Python 88 6 Updated Feb 26, 2025

A list of recent papers about adversarial learning

310 17 Updated Feb 12, 2026

slime is an LLM post-training framework for RL Scaling.

Python 4,046 524 Updated Feb 13, 2026

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 8,601 715 Updated Feb 13, 2026

My learning notes for ML SYS.

Python 5,326 345 Updated Jan 30, 2026

A project to improve skills of large language models

Python 820 147 Updated Feb 13, 2026

Official implementation of the WASP web agent security benchmark

Python 67 10 Updated Aug 12, 2025

Official Implementation of implicit reference attack

Python 11 Updated Oct 16, 2024

Emoji Attack [ICML 2025]

Python 41 2 Updated Jul 15, 2025
Python 123 20 Updated Jul 7, 2025

dLLM: Simple Diffusion Language Modeling

Python 1,724 171 Updated Feb 10, 2026

A Python library for guardrail models evaluation.

Python 31 7 Updated Oct 9, 2025

Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Python 107 11 Updated Dec 2, 2024

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Python 5,647 597 Updated Feb 13, 2026

Code for the paper "Defeating Prompt Injections by Design"

Jupyter Notebook 250 37 Updated Jun 20, 2025

Patch Linux executables for compatibility with older glibc

C 417 18 Updated Oct 30, 2024

Patch Linux executables for compatibility with older glibc

C 11 1 Updated Apr 3, 2025

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

TypeScript 21,895 2,156 Updated Feb 13, 2026

Internal Consistency Regularization (CROW) for LLM Backdoor Elimination - Paper accepted to ICML 2025

Python 12 Updated May 6, 2025

Paper link: https://arxiv.org/abs/2510.21910

1 Updated Oct 31, 2025
Jupyter Notebook 8 3 Updated Feb 9, 2026