- Japan
Highlights
- Pro
Stars
[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
[ECCV 2024] This repository represents the official implementation of PARE-Net: Position-Aware Rotation-Equivariant Networks for Robust Point Cloud Registration
[MICCAI 2024] Official dataset release for "EgoSurgery: A Dataset for Surgical Video Understanding from Egocentric Open Surgery Videos"
[CVPR 2024] Code and models for pi-ViT, a video transformer for understanding activities of daily living
Integrating Paperpile with Notion Databases 🔄
Official Repo for CVPR 2024 Paper "FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Fully-Supervised Action Segmentation"
PyTorch code for hierarchical k-means -- a data curation method for self-supervised learning
[IJCV 2021] SportsCap: Monocular 3D Human Motion Capture and Fine-grained Understanding in Challenging Sports Videos
ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation (NeurIPS 2023 Spotlight)
MeshGPT: Generating Triangle Meshes with Decoder-Only Transformers
[CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"
[ CVPR 2024 ] Implementation for "GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation"
A collection of papers on diffusion models for 3D generation.
🏀 Visualization of NBA games from raw SportVU data logs
🕝 Time-warped principal components analysis (twPCA)
[WACV 2024 Oral] Rethinking Visibility in Human Pose Estimation: Occluded Pose Reasoning via Transformers
Official implementation of the IROS2023 paper "DiffuPose: Monocular 3D Human Pose Estimation via Denoising Diffusion Probabilistic Model"
This repo is the code of paper "DiffusionInst: Diffusion Model for Instance Segmentation" (ICASSP'24).
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
Pytorch code for Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning, CVPR2022.
Code for Diffusion Action Segmentation (ICCV 2023)
Non-official implement of Paper:CBAM: Convolutional Block Attention Module
Extension to return old Twitter layout from 2015 / 2018.
[IEEE Access - 2022] LMOT : Efficient Light-Weight Detection and Tracking in Crowds