- Waterloo, ON
-
13:32
(UTC -05:00) - https://lipeng.ac
- https://orcid.org/0009-0002-1802-7394
- @lipeng.ac
- in/~lhe
- @tonyhe_lipeng
Highlights
Lists (12)
Sort Name ascending (A-Z)
Academia
Tools for academic publishingAI Agents
Blockchain
Cananical
Projects optimized for scalability and maintainabilityCryptography
Cryptographic scheme and protocol implementationsDesign
LLMs & Tools
Personal Projects
Tony's past and presently active personal projects.- All languages
- AsciiDoc
- Batchfile
- Blade
- Bru
- C
- C#
- C++
- CMake
- CSS
- Circom
- CoffeeScript
- Cuda
- Dart
- Dockerfile
- FreeMarker
- Go
- HTML
- Java
- JavaScript
- Jupyter Notebook
- Kotlin
- Less
- Lua
- MATLAB
- Markdown
- OCaml
- Objective-C
- Objective-C++
- PHP
- Pascal
- PowerShell
- Python
- Racket
- Rich Text Format
- Rocq Prover
- Roff
- Ruby
- Rust
- SCSS
- Scala
- Shell
- Solidity
- Stylus
- Swift
- TeX
- TypeScript
- Vim Script
- Vue
- WebAssembly
Starred repositories
The official implementation of the paper "AgentDyn: A Dynamic Open-Ended Benchmark for Evaluating Prompt Injection Attacks of Real-World Agent Security System".
[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333
A benchmark for evaluating the robustness of LLMs and defenses to indirect prompt injection attacks.
The Granite Guardian models are designed to detect risks in prompts and responses.
A benchmark for prompt injection detection systems.
Provide with pre-build flash-attention package wheels on Linux and Windows platforms using GitHub Actions
Official codebase for "STAIR: Improving Safety Alignment with Introspective Reasoning"
A list of recent papers about adversarial learning
slime is an LLM post-training framework for RL Scaling.
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
My learning notes for ML SYS.
A project to improve skills of large language models
Official implementation of the WASP web agent security benchmark
Official Implementation of implicit reference attack
A Python library for guardrail models evaluation.
Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
Code for the paper "Defeating Prompt Injections by Design"
Patch Linux executables for compatibility with older glibc
tuxxi / polyfill-glibc
Forked from corsix/polyfill-glibcPatch Linux executables for compatibility with older glibc
πͺ’ Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. πYC W23
Internal Consistency Regularization (CROW) for LLM Backdoor Elimination - Paper accepted to ICML 2025
Paper Link- https://arxiv.org/abs/2510.21910