Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View jkhu29's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report jkhu29

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 1 Updated Dec 19, 2025

Depth Anything 3

Python 3,608 308 Updated Dec 12, 2025

🔥 OneThinker: All-in-one Reasoning Model for Image and Video

Python 323 25 Updated Dec 9, 2025

Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

Python 370 12 Updated Nov 25, 2025

Learning Plug-and-play Memory for Guiding Video Diffusion Models

Python 13 2 Updated Dec 1, 2025

Official code for VIST3A: Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator

80 Updated Oct 16, 2025

[NeurIPS 2025 Oral]Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation

Python 658 24 Updated Nov 27, 2025

A tool for matching points annotation

C++ 5 Updated Nov 4, 2025

Universal Image Restoration Pre-training via Masked Degradation Classification

Python 19 1 Updated Oct 15, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,641 53 Updated Nov 15, 2025

Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.

779 86 Updated Aug 27, 2025

[ICLR 2025 Oral] Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias"

Python 472 25 Updated Aug 4, 2025

[ICLR'25 Oral] No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images

Python 906 47 Updated Sep 2, 2025

Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025) and "UltraViCo: Breaking Extrapolation Limits in Video Diffusion Transformers"

Python 768 73 Updated Dec 4, 2025

[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Python 1,667 80 Updated Nov 28, 2025

DUSt3R: Geometric 3D Vision Made Easy

Python 6,829 716 Updated Sep 24, 2025

Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"

Python 1,317 80 Updated Jun 16, 2025

Cameras as Relative Positional Encoding

Python 632 11 Updated Dec 18, 2025

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".

Python 502 34 Updated Aug 8, 2025

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,160 395 Updated Jul 11, 2024

Lumos Project: Frontier video unified model research by Alibaba DAMO Academy.

Python 148 3 Updated Jul 17, 2025

A list of papers about concept bottleneck models (CBMs)

17 1 Updated Nov 12, 2025

[NeurIPS 2025] Radial Attention: O(nlogn) Sparse Attention with Energy Decay for Long Video Generation

Python 565 31 Updated Nov 11, 2025

An unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerful framework.

Python 444 14 Updated Dec 2, 2025

CVPR 2024-Improved Implicit Neural Representation with Fourier Reparameterized Training

Python 92 3 Updated May 23, 2025

[CVPR 2025] Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation

Python 74 4 Updated Nov 29, 2025

(NIPS 2025) OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Real-Time Self-Aware Emotional Speech Synthesis

Python 115 6 Updated Nov 8, 2025

[NeurIPS 2025 spotlight] Official implementation for "FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving"

Python 518 29 Updated Sep 28, 2025

[NeurIPS' 2025] JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent

Python 746 27 Updated Dec 17, 2025
Next