Stars
[arXiv 2025] Official implementation of BiCo: Composing Concepts from Images and Videos via Concept-prompt Binding
The first Interleaved framework for textual reasoning within the visual generation process
Official PyTorch implementation for 'Revisiting Audio-Visual Segmentation with Vision-Centric Transformer'
Official PyTorch implementation for paper`Anchor3DLane: Learning to Regress 3D Anchors for Monocular 3D Lane Detection' accepted by CVPR 2023
[AAAI 2025] AL-Ref-SAM 2: Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
[CVPR 2024] Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training
Official repo for consistency models.
🐍 Geometric Computer Vision Library for Spatial AI
A Keypoint-based Global Association Network for Lane Detection. Accepted by CVPR 2022
Code for Linguistic Structure Guided Context Modeling for Referring Image Segmentation, ECCV2020.
Code for Referring Image Segmentation via Cross-Modal Progressive Comprehension, CVPR2020.
Code for "PPDM: Parallel Point Detection and Matching for Real-time Human-Object Interaction Detection".
OpenMMLab Detection Toolbox and Benchmark
Video Object Segmentation with Re-identification
Pytorch implementation of FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks
🔥 2D and 3D Face alignment library build using pytorch