Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View HarborYuan's full-sized avatar

Highlights

  • Pro

Block or report HarborYuan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Python 1,544 71 Updated Oct 23, 2025

Official implementation of "Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence"

Python 45 Updated Oct 24, 2025
Python 1 1 Updated Oct 19, 2025

Detect Anything via Next Point Prediction (Based on Qwen2.5-VL-3B)

Jupyter Notebook 506 26 Updated Oct 17, 2025
Python 7,993 561 Updated Oct 23, 2025

🔥 [EMNLP 2025] Official open-source repo for Boosting Multi-modal Keyphrase Prediction with Dynamic Chain-of-Thought in Vision-Language Models

Python 7 Updated Oct 14, 2025

The official implementation of "Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs"

Python 183 7 Updated Oct 9, 2025

CUDA accelerated rasterization of gaussian splatting

Cuda 3,809 579 Updated Oct 2, 2025

An efficient video loader for deep learning with smart shuffling that's super easy to digest

C++ 17 3 Updated Oct 9, 2025

Efficient Triton Kernels for LLM Training

Python 5,771 418 Updated Oct 25, 2025

[CVPR 2024] RoMa: Robust Dense Feature Matching; RoMa is the robust dense feature matcher capable of estimating pixel-dense warps and reliable certainties for almost any image pair.

Python 1,016 100 Updated Oct 24, 2025

A pytorch CUDA extension implementation of instant-ngp (sdf and nerf), with a GUI.

Python 2,203 287 Updated Nov 10, 2023

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,538 2,076 Updated Jul 17, 2025

Official repo of "Time Reversal Fusion" (ECCV2024)

Python 49 3 Updated May 20, 2025

Code and dataset link for "DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World"

112 2 Updated Oct 2, 2025
Python 1 Updated Oct 13, 2025
Python 4 Updated Oct 17, 2025

Standalone TFRecord reader/writer with PyTorch data loaders

Python 894 110 Updated May 15, 2025

Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO

C++ 733 307 Updated Sep 4, 2025

诺亚盘古大模型研发背后的真正的心酸与黑暗的故事。

11,393 1,370 Updated Jul 9, 2025

An intuitive and low-overhead instrumentation tool for Python

Python 1,159 38 Updated Jul 8, 2025

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 80,366 8,853 Updated Oct 25, 2025

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

TypeScript 51,655 7,504 Updated Oct 25, 2025

Official Jax Implementation of MD4 Masked Diffusion Models

Python 135 15 Updated Feb 27, 2025

Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning".

Python 62 7 Updated Feb 28, 2024

Collection of eclectic utils for python.

Python 242 25 Updated Oct 23, 2025

HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model

Python 68 4 Updated Jul 17, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 3,524 208 Updated Jun 17, 2025

Open source repo for Locate 3D Model, 3D-JEPA and Locate 3D Dataset

Python 377 32 Updated Jun 3, 2025
Next