😶‍🌫️
Tired
  • Central South University
  • Changsha

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,707 76 Updated May 11, 2025

Advancing Tool-Augmented Large Language Models via Meta-Verification and Reflection Learning (KDD '25)

Python 13 Updated May 30, 2025

The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning"

Python 77 1 Updated Oct 15, 2025

🔥 JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization

Python 266 12 Updated Jan 11, 2026

Python 4 Updated Sep 5, 2025

Graph learning framework for long-term video understanding

Python 71 11 Updated Jul 13, 2025

[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

Python 408 39 Updated May 8, 2025

TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs

Python 90 3 Updated Dec 19, 2025

VideoX: a collection of video cross-modal models

Python 1,056 164 Updated Jun 3, 2024

Universal Video Temporal Grounding with Generative Multi-modal Large Language Models

Python 43 2 Updated Nov 25, 2025

[CVPR2025] Number it: Temporal Grounding Videos like Flipping Manga

Python 142 6 Updated Jan 4, 2026

Repo for paper "MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding".

Python 38 1 Updated Jun 9, 2025

[AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding

Python 123 3 Updated Dec 10, 2024

[NIPS2025] VideoChat-R1 & R1.5: Enhancing Spatio-Temporal Perception and Reasoning via Reinforcement Fine-Tuning

Python 253 9 Updated Oct 18, 2025

[NeurIPS'25] Time-R1: Post-Training Large Vision Language Model for Temporal Video Grounding

Python 71 4 Updated Dec 14, 2025

[IJCV 2025] Project Page for "Deep Learning-Based Object Pose Estimation: A Comprehensive Survey".

372 19 Updated Oct 14, 2025

[NeurIPS 2025] CausalVTG: Towards Robust Video Temporal Grounding via Causal Inference

Python 4 Updated Dec 9, 2025

[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation

Python 922 62 Updated Oct 1, 2024

[RA-L 2025] This is the official repository for using and downloading the DynOPETs dataset.

Python 10 Updated Nov 30, 2025

[ICLR 2025] TRACE: Temporal Grounding Video LLM via Causal Event Modeling

Python 142 3 Updated Aug 22, 2025

A modern GUI client based on Tauri, designed to run on Windows, macOS and Linux for a tailored proxy experience

TypeScript 92,548 6,781 Updated Jan 17, 2026

A paper list of some recent Mamba-based CV works.

437 23 Updated Nov 10, 2025

This is the official implementation for Human-Agent Collaborative Paper-to-Page Crafting for Under $0.1.

HTML 151 13 Updated Oct 27, 2025

[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers

Python 3,045 208 Updated Dec 21, 2025

📌 Code sourced from [zezhishao/DailyArXiv](https://github.com/zezhishao/DailyArXiv)

Python 7 Updated Jan 15, 2026

Testing adaptation of the DINOv2/3 encoders for vision tasks with Low-Rank Adaptation (LoRA)

Jupyter Notebook 417 31 Updated Oct 24, 2025

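《代码随想录》 (Code Thoughts) LeetCode problem-solving guide: a recommended order for working through 200 classic problems, about 600,000 words of detailed illustrated explanations, video walkthroughs of the difficult points, and more than 50 mind maps, with solutions in C++, Java, Python, Go, JavaScript and other languages. No more feeling lost when studying algorithms! 🔥🔥 Take a look and you will wish you had found it sooner! 🚀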

Shell 60,001 12,315 Updated Nov 7, 2025

Paper list on Video Moment Retrieval (VMR), also called Temporal Video Grounding (TVG), Video Grounding (VG), or Temporal Sentence Grounding in Videos (TSGV)

33 1 Updated Dec 27, 2025

Latest Papers, Codes and Datasets on VTG-LLMs.

73 2 Updated Nov 17, 2025

When One Moment Isn’t Enough: Multi-Moment Retrieval with Cross-Moment Interactions (NeurIPS 2025)

Python 4 Updated Nov 28, 2025