Lists (1)
Sort Name ascending (A-Z)
Stars
[CVPR 2025] Official Implementation of "Dinomaly: The Less Is More Philosophy in Multi-Class Unsupervised Anomaly Detection". The first multi-class UAD model that can compete with single-class SOTAs
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Enjoy the magic of Diffusion models!
[ICCV ADFM'25] ADer is an open source visual anomaly detection toolbox based on PyTorch, which supports multiple popular AD datasets and approaches.
Official repository for "Human-MME: A Holistic Evaluation Benchmark for Human-Centric Multimodal Large Language Models"
TokenAR: Multiple Subject Generation via Autoregressive Token-level enhancement
Official code repository for Med-CMR : "A Fine-Grained Benchmark Integrating Visual Evidence and Clinical Logic for Medical Complex Multimodal Reasoning"
From Large Angles to Consistent Faces: Identity-Preserving Video Generation via Mixture of Facial Experts
[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
[NeurIPS 2024] Official implementation of MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection.
[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt released! Only 4GB VRAM is enough to run!
LAVIS - A One-stop Library for Language-Vision Intelligence
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning
PPT plugin, supports one-click to add image titles, copy and paste positions, one-click image alignment, and one-click to insert Markdown (including bold, hyperlinks, and other inline styles, as we…