-
RAP Public
Forked from PRBonn/RAP🎤 Register Any Point: Scaling 3D Point Cloud Registration by Flow Matching
Python MIT License UpdatedJan 13, 2026 -
VerseCrafter Public
Forked from TencentARC/VerseCrafterVerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control
Python Other UpdatedJan 9, 2026 -
LabelAny3D Public
Forked from UVA-Computer-Vision-Lab/LabelAny3D[NeurIPS 2025] LabelAny3D: Label Any Object 3D in the Wild
Python Creative Commons Attribution 4.0 International UpdatedJan 6, 2026 -
flow_matching Public
Forked from facebookresearch/flow_matchingA PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
Python Other UpdatedJan 5, 2026 -
NeoVerse Public
Forked from IamCreateAI/NeoVerseNeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
UpdatedJan 5, 2026 -
OpenWorldSAM Public
Forked from GinnyXiao/OpenWorldSAM[Neurips 2025 Spotlight] Official repository for the paper: OpenWorldSAM: Extending SAM2 for Universal Image Segmentation with Language Prompts
Python Apache License 2.0 UpdatedJan 4, 2026 -
3DGen-R1 Public
Forked from Ivan-Tang-3D/3DGen-R1The official implementation of The paper "Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation"
Python UpdatedDec 28, 2025 -
foundry-1 Public
Forked from RosettaCommons/foundryCentral repository for biomolecular foundation models with shared trainers and pipeline components
Python BSD 3-Clause "New" or "Revised" License UpdatedDec 27, 2025 -
Awesome-Continual-learning-of-Vision-Language-Models Public
Forked from YuyangSunshine/Awesome-Continual-learning-of-Vision-Language-ModelsAwsome of VLM-CL. Continual Learning for VLMs: A Survey and Taxonomy Beyond Forgetting
UpdatedDec 23, 2025 -
-
B-CLIP Public
Forked from fzohra/B-CLIPCode for β -CLIP: Text-Conditioned Contrastive Learning for Multi-Granular Vision-Language Alignment
Python UpdatedDec 18, 2025 -
MindDrive Public
Forked from xiaomi-mlab/MindDriveOfficial code of “MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning”
Apache License 2.0 UpdatedDec 17, 2025 -
point-mae-zero Public
Forked from UVA-Computer-Vision-Lab/point-mae-zero[3DV 2026] Official Code Release for Learning 3D Representations from Procedural 3D Programs
Python MIT License UpdatedDec 16, 2025 -
Official repository of "Multi-view Pyramid Transformer: Look Coarser to See Broader"
Python MIT License UpdatedDec 12, 2025 -
Efficient-Diffusion-Model-Survey Public
Forked from AIoT-MLSys-Lab/Efficient-Diffusion-Model-Survey[TMLR 2025] Efficient Diffusion Models: A Survey
UpdatedDec 8, 2025 -
4DLangVGGT Public
Forked from hustvl/4DLangVGGTOfficial implementation of “4D LangVGGT: 4D Language-Visual Geometry Grounded Transformer”
Python MIT License UpdatedDec 6, 2025 -
LiteVGGT-repo Public
Forked from GarlicBa/LiteVGGT-repooffical repository of LiteVGGT: Boosting Vanilla VGGT via Geometry-aware Cached Token Merging
Python MIT License UpdatedDec 5, 2025 -
AnchorFlow Public
Forked from ZhenglinZhou/AnchorFlowA training-free, mask-free framework for 3D shape editing.
Python Apache License 2.0 UpdatedDec 4, 2025 -
-
DynamicVerse Public
Forked from Dynamics-X/DynamicVerse[NeurIPS 2025]"DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling"
UpdatedDec 3, 2025 -
BEVDilation Public
Forked from gwenzhang/BEVDilation[AAAI'26] BEVDilation: LiDAR-Centric Multi-Modal Fusion for 3D Object Detection
Python Apache License 2.0 UpdatedDec 3, 2025 -
Omni-R1 Public
Forked from aim-uofa/Omni-R1[NeurIPS 2025] Official Repo of Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration
Python UpdatedDec 3, 2025 -
CAMEO Public
Forked from cvlab-kaist/CAMEOOfficial implementation of "CAMEO: Correspondence-Attention Alignment for Multi-View Diffusion Models"
Python UpdatedDec 3, 2025 -
CauSight Public
Forked from OpenCausaLab/CauSightCauSight: Learning to Supersense for Visual Causal Discovery
Python Apache License 2.0 UpdatedDec 3, 2025 -
FGTS Public
Forked from hzlsaber/FGTS📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"
Python MIT License UpdatedDec 2, 2025 -
Rectified-Point-Flow Public
Forked from GradientSpaces/Rectified-Point-Flow[NeurIPS 2025, Spotlight] Rectified Point Flow: Generic Point Cloud Pose Estimation
Python Apache License 2.0 UpdatedDec 2, 2025 -
OpenREAD Public
Forked from wyddmw/OpenREADThis is the official implementation of "OpenREAD:Reinforced Open-Ended Reasoning for End-to-End Autonomous Driving with LLM-as-Critic"
Python UpdatedDec 2, 2025 -
TTSnap Public
Forked from TerrysLearning/TTSnapThe official implementation of TTSnap
UpdatedDec 1, 2025 -
CVPR2025_oral_paper_list Public
Forked from yejun688/CVPR2025_oral_paper_list😎 A curated list of CVPR 2025 Oral paper. Total 96
UpdatedDec 1, 2025 -
UniGeoSeg Public
Forked from MiliLab/UniGeoSegOfficial repo for "UniGeoSeg: Towards Unified Open-World Segmentation for Geospatial Scenes"
UpdatedDec 1, 2025