-
Xi'an Jiaotong University
- Shannxi, China
- 微信号:LM18806717807
Lists (1)
Sort Name ascending (A-Z)
Stars
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
LPIPS metric. pip install lpips
[Ultra Fast&Powerful Diffusion RL] Reinforcing Diffusion Models by Direct Group Preference Optimization
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]
[Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation
A one-stop library to standardize the inference and evaluation of all the conditional image generation models. [ICLR 2024]
Hierarchical Reasoning Model Official Release
[CVPR 2025] Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution
A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.
Paper List of Inference/Test Time Scaling/Computing
This is a Next.js, Tailwind CSS blogging starter template. Comes out of the box configured with the latest technologies to make technical writing a breeze. Easily configurable and customizable. Per…
Image-to-image translation with conditional adversarial nets
[NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT
ULMEvalKit: One-Stop Eval ToolKit for Image Generation
🚀 Power Your World with AI - Explore, Extend, Empower.
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
Fast and memory-efficient exact attention
Access OpenAI models programmatically through your ChatGPT subscription.
[ICLR 2025] official implementation of "Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generation of Diffusion Models"
[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
[ICCV 2025] CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation
(NeurIPS 2025 D&B Track) OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps