Stars
[ICML 2024] Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
Specification and documentation for the Model Context Protocol
Text2VLM is a CLI tool to transform textual data used in Large Language Model (LLM) evaluations into a format suitable for Visual Language Models (VLMs).
Recursive-Open-Meta-Agent v0.1 (Beta). A meta-agent framework to build high-performance multi-agent systems.
[NeurIPS 2025] PARCO: Parallel AutoRegressive Combinatorial Optimization
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Model Context Protocol Servers
RePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image Editing
Official inference repo for FLUX.2 models
An open-source AI agent that lives in your terminal.
Official implementation of "Cutting Through Privacy: A Hyperplane-Based Data Reconstruction Attack in Federated Learning" (UAI 2025)
Official code for "Epistemic uncertainty in conformal scores: a unified approach", UAI 2025
Official code for paper "Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning"
The implementation of FedCADO (Classifier-Assisted Diffusion for One-shot Federated learning method)
[ICML 2025] MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding
Code of “Multi-Modal Deep Learning Enables Ultrafast and Accurate Annotation of Enzymatic Active Sites”
Official repository for NeurIPS 2025 paper "Understanding and Improving Adversarial Robustness of Neural Probabilistic Circuits"
Official code for FaCT: Faithful Concept Traces for Explaining Neural Network Decisions. NeurIPS 2025
[NeurIPS 2025] Robustness in Both Domains: CLIP Needs a Robust Text Encoder
MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster]
[NeurIPS 2025] An official source code for paper "Continual Multimodal Contrastive Learning"
[NeurIPS 2025] Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic Segmentation