xzzhang79

😶‍🌫️

Tired

Xiao Zhang xzzhang79

😶‍🌫️

Tired

1 follower · 5 following

Central South University
Changsha

Lists (5)

Sort

Stars

BytedTsinghua-SIA / DAPO

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,707 76 Updated May 11, 2025

zhymma / Tool-MVR

Advancing Tool-Augmented Large Language Models via Meta-Verification and Reflection Learning (KDD '25)

Python 13 Updated May 30, 2025

zhang9302002 / ThinkingWithVideos

The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning"

Python 77 1 Updated Oct 15, 2025

LYL1015 / JarvisEvo

🔥 JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization

Python 266 12 Updated Jan 11, 2026

Dodo-D-Caster / VideoGNN

Python 4 Updated Sep 5, 2025

IntelLabs / GraVi-T

Graph learning framework for long-term video understanding

Python 71 11 Updated Jul 13, 2025

RenShuhuai-Andy / TimeChat

[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

Python 408 39 Updated May 8, 2025

TencentARC / TimeLens

TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs

Python 90 3 Updated Dec 19, 2025

microsoft / VideoX

VideoX: a collection of video cross-modal models

Python 1,056 164 Updated Jun 3, 2024

Lzq5 / UniTime

Universal Video Temporal Grounding with Generative Multi-modal Large Language Models

Python 43 2 Updated Nov 25, 2025

yongliang-wu / NumPro

[CVPR2025] Number it: Temporal Grounding Videos like Flipping Manga

Python 142 6 Updated Jan 4, 2026

THUNLP-MT / MUSEG

Repo for paper "MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding".

Python 38 1 Updated Jun 9, 2025

gyxxyg / VTG-LLM

[AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding

Python 123 3 Updated Dec 10, 2024

OpenGVLab / VideoChat-R1

[NIPS2025] VideoChat-R1 & R1.5: Enhancing Spatio-Temporal Perception and Reasoning via Reinforcement Fine-Tuning

Python 253 9 Updated Oct 18, 2025

xiaomi-research / time-r1

[NeurIPS'25] Time-R1: Post-Training Large Vision Language Model for Temporal Video Grounding

Python 71 4 Updated Dec 14, 2025

CNJianLiu / Awesome-Object-Pose-Estimation

[IJCV 2025] Project Page for "Deep Learning-Based Object Pose Estimation: A Comprehensive Survey".

372 19 Updated Oct 14, 2025

MxLearner / CausalVTG

[NeurIPS 2025] CausalVTG: Towards Robust Video Temporal Grounding via Causal Inference

Python 4 Updated Dec 9, 2025

NVlabs / DoRA

[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation

Python 922 62 Updated Oct 1, 2024

Launch-on-Titania / DynOPETs

[RA-L 2025] This is the official repository for using and downloading the DynOPETs dataset.

Python 10 Updated Nov 30, 2025

gyxxyg / TRACE

[ICLR 2025] TRACE: Temporal Grounding Video LLM via Casual Event Modeling

Python 142 3 Updated Aug 22, 2025

clash-verge-rev / clash-verge-rev

A modern GUI client based on Tauri, designed to run in Windows, macOS and Linux for tailored proxy experience

TypeScript 92,548 6,781 Updated Jan 17, 2026

Yangzhangcst / Mamba-in-CV

A paper list of some recent Mamba-based CV works.

437 23 Updated Nov 10, 2025

AutoLab-SAI-SJTU / AutoPage

This is the official implementation for Human-Agent Collaborative Paper-to-Page Crafting for Under $0.1.

HTML 151 13 Updated Oct 27, 2025

Paper2Poster / Paper2Poster

[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers

Python 3,045 208 Updated Dec 21, 2025

JiamingZang / DailyArxiv

📌 Code sourced from [zezhishao/DailyArXiv](https://github.com/zezhishao/DailyArXiv)

Python 7 Updated Jan 15, 2026

RobvanGastel / dinov3-finetune

Testing adaptation of the DINOv2/3 encoders for vision tasks with Low-Rank Adaptation (LoRA)

Jupyter Notebook 417 31 Updated Oct 24, 2025

youngyangyang04 / leetcode-master

《代码随想录》LeetCode 刷题攻略：200道经典题目刷题顺序，共60w字的详细图解，视频难点剖析，50余张思维导图，支持C++，Java，Python，Go，JavaScript等多语言版本，从此算法学习不再迷茫！🔥🔥 来看看，你会发现相见恨晚！🚀

Shell 60,001 12,315 Updated Nov 7, 2025

Tangkfan / Awesome-Temporal-Video-Grounding

paper list on Video Moment Retrieval (VMR), or Temporal Video Grounding (TVG), Video Grounding (VG), or Temporal Sentence Grounding in Videos (TSGV)

33 1 Updated Dec 27, 2025

ki-lw / Awesome-MLLMs-for-Video-Temporal-Grounding

Latest Papers, Codes and Datasets on VTG-LLMs.

73 2 Updated Nov 17, 2025

Zhuo-Cao / QV-M2

When One Moment Isn’t Enough: Multi-Moment Retrieval with Cross-Moment Interactions (NeurIPS 2025)

Python 4 Updated Nov 28, 2025

Xiao Zhang xzzhang79

Lists (5)

⭐6D Object Pose Estimation

⭐Mamba

⭐MLLM

⭐Vedio Temporal Grounding

⭐Vedio Understanding

Stars