[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,447 541 Updated May 18, 2025

showlab / Awesome-Unified-Multimodal-Models

📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

726 41 Updated Oct 10, 2025

siso-paper / SISO

Official implementation of "Single Image Iterative Subject-driven Generation and Editing".

Python 101 5 Updated May 30, 2025

wtybest / FreeFlux

[ICCV 2025] FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing

Jupyter Notebook 63 1 Updated Sep 3, 2025

Stability-AI / stable-virtual-camera

Stable Virtual Camera: Generative View Synthesis with Diffusion Models

Python 1,479 105 Updated Jun 5, 2025

bytedance / UI-TARS

Python 7,993 561 Updated Oct 23, 2025

apple / ml-fastvit

This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023

Python 1,962 118 Updated Nov 30, 2023

ostris / ai-toolkit

The ultimate training toolkit for finetuning diffusion models

Python 6,626 789 Updated Oct 23, 2025

Tencent-Hunyuan / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,191 1,108 Updated Aug 27, 2025

CyberAgentAILab / TANGO

[ICLR 2025 Oral] TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation

Python 1,115 145 Updated Aug 24, 2025

mapooon / PetFace

[ECCV 2024 Oral] PetFace: A Large-Scale Dataset and Benchmark for Animal Identification https://arxiv.org/abs/2407.13555

Python 77 4 Updated Jul 27, 2025

huggingface / parler-tts

Inference and training library for high-quality TTS models.

Python 5,451 580 Updated Dec 10, 2024

smthemex / ComfyUI_ParlerTTS

This is a simple ComfyUI custom TTS node based on Parler_tts.

Python 46 5 Updated Jul 2, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Shanshan Zhao sshan-zhao

Achievements

Achievements

Block or report sshan-zhao

Stars

Shopee-MUG / MUG-V

showlab / Paper2Video

zhangmiaosen2000 / Phi-Ground

HJYao00 / MMReason

sunlicai / EmoCapCLIP

bytedance / Dolphin

wdttt / PointSD

OSU-NLP-Group / GUI-Agents-Paper-List

showlab / Awesome-GUI-Agent

AIDC-AI / Agentic-ADK

AIDC-AI / Ovis-U1

lllyasviel / FramePack

facebookresearch / chameleon

Findeton / real-state-10k

jiaosiyu1999 / FlexVAR

ziqipang / RandAR

FoundationVision / Infinity

FoundationVision / VAR