ttttonyhe

🔭

Tony L. He ttttonyhe

🔭

Any improvements made anywhere besides the bottleneck are an illusion. —— 𝘛𝘩𝘦 𝘛𝘩𝘦𝘰𝘳𝘺 𝘰𝘧 𝘊𝘰𝘯𝘴𝘵𝘳𝘢𝘪𝘯𝘵𝘴

352 followers · 398 following

Achievements

x2 x3 x3

Achievements

x2 x3 x3

Highlights

Developer Program Member
Pro

Organizations

Lists (12)

Sort

Starred repositories

zai-org / GLM-OCR

GLM-OCR: Accurate × Fast × Comprehensive

Python 1,535 96 Updated Feb 12, 2026

leolee99 / AgentDyn

The official implementation of the paper "AgentDyn: A Dynamic Open-Ended Benchmark for Evaluating Prompt Injection Attacks of Real-World Agent Security System".

Python 13 1 Updated Feb 4, 2026

GeeeekExplorer / nano-vllm

Nano vLLM

Python 11,673 1,565 Updated Nov 3, 2025

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 22,460 2,057 Updated Jan 27, 2026

princeton-nlp / MeZO

[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333

Python 1,147 86 Updated Jan 11, 2024

microsoft / BIPIA

A benchmark for evaluating the robustness of LLMs and defenses to indirect prompt injection attacks.

Python 103 13 Updated Apr 15, 2024

ibm-granite / granite-guardian

The Granite Guardian models are designed to detect risks in prompts and responses.

Jupyter Notebook 130 13 Updated Oct 8, 2025

lakeraai / pint-benchmark

A benchmark for prompt injection detection systems.

Jupyter Notebook 159 20 Updated Dec 16, 2025

mjun0812 / flash-attention-prebuild-wheels

Provide with pre-build flash-attention package wheels on Linux and Windows platforms using GitHub Actions

Python 920 58 Updated Feb 13, 2026

thu-ml / STAIR

Official codebase for "STAIR: Improving Safety Alignment with Introspective Reasoning"

Python 88 6 Updated Feb 26, 2025

Trustworthy-AI-Group / Adversarial_Examples_Papers

A list of recent papers about adversarial learning

310 17 Updated Feb 12, 2026

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 4,046 524 Updated Feb 13, 2026

OpenPipe / ART

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 8,601 715 Updated Feb 13, 2026