Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View XDLiuyyy's full-sized avatar
🌴
On vacation
🌴
On vacation

Block or report XDLiuyyy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[BMVC'21] Official PyTorch Implementation of "Grounded Situation Recognition with Transformers"

Python 27 11 Updated Mar 30, 2022

[AAAI 2024] EarthVQA: Towards Queryable Earth via Relational Reasoning-Based Remote Sensing Visual Question Answering

Python 138 3 Updated Oct 15, 2025

A collection of awesome LaTeX Thesis/Dissertation templates and beyond! //(LaTeX / Word / Typst / Markdown 格式的学位论文、演示文稿、报告、项目申请书、简历、书籍等模板收藏)

TeX 547 24 Updated Oct 25, 2025

Data set for the IEEE TGRS paper "Mutual Attention Inception Network for Remote Sensing Visual Question Answering"

22 4 Updated Nov 14, 2022

[CVPR 2022] Visual Abductive Reasoning

Python 123 13 Updated Oct 22, 2024

ClipSitu: Effectively Leveraging CLIP for Conditional Predictions in Situation Recognition, by Debaditya Roy and Dhruv Verma and Basura Fernando, IEEE/CVF Winter Conference on Applications of Compu…

Jupyter Notebook 6 1 Updated Feb 2, 2024

[ICLR 2025] TRACE: Temporal Grounding Video LLM via Casual Event Modeling

Python 132 3 Updated Aug 22, 2025

Code for the paper "PointAttN: You Only Need Attention for Point Cloud Completion"

Jupyter Notebook 127 18 Updated Apr 1, 2024

[ICCV 2021 Oral] PoinTr: Diverse Point Cloud Completion with Geometry-Aware Transformers

Python 769 128 Updated Sep 27, 2024

Papers and Datasets about Point Cloud.

Python 2,834 317 Updated Aug 30, 2024

[MICCAI 2024] TeethDreamer: 3D Teeth Reconstruction from Five Intra-oral Photographs

Python 76 18 Updated Jul 27, 2025

Video Object Segmentation using Space-Time Memory Networks

Python 424 80 Updated Jun 3, 2020

[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding

Python 1,012 83 Updated Jul 6, 2024

OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.

Python 292 23 Updated Apr 29, 2025

Code for paper titled, "Learning to Predict Task Progress by Self-Supervised Video Alignment" by Gerard Donahue and Ehsan Elhamifar, published at CVPR 2024.

Python 15 2 Updated Jul 26, 2024

[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)

Python 60 8 Updated Aug 17, 2021

"Interaction-centric Spatio-Temporal Context Reasoning for Muti-Person Video HOI Recognition" ECCV 2024

Python 4 Updated Oct 2, 2024

Official Implementation of STAIR: Spatial-Temporal Reasoning with Auditable Intermediate Results for Video Question Answering, AAAI 2024

Python 5 Updated Feb 9, 2024

Official repository of ECCV 2024 paper - "HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization"

Python 20 1 Updated Aug 23, 2024

[CVPR 2021] Actor-Context-Actor Relation Network for Spatio-temporal Action Localization

Python 212 39 Updated Oct 8, 2021

Video Evnet Extraction via Tracking Visual States of Arguments (AAAI2023)

Python 11 1 Updated Feb 18, 2024

[ECCV 2024 oral] -C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition

Python 37 6 Updated Dec 7, 2024

[ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities

Python 80 3 Updated Oct 10, 2024

[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"

Python 864 45 Updated Aug 13, 2024
Jupyter Notebook 10 Updated Jun 21, 2024

Code release for Hu et al., Language-Conditioned Graph Networks for Relational Reasoning. in ICCV, 2019

Python 92 18 Updated Aug 9, 2019

[IEEE TMM 2025 & ACL 2024 Findings] LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition

Python 34 Updated Jul 19, 2025

[CVPR 2023] Code for "Improving Visual Grounding by Encouraging Consistent Gradient-based Explanations"

Jupyter Notebook 19 2 Updated Oct 10, 2023

GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)

Jupyter Notebook 72 5 Updated Jan 2, 2024
Next