Jingkang50
🍐
Today's Fruit

Starred repositories

Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition

Python 32 Updated Feb 10, 2026

EgoHandICL: Egocentric 3D Hand Reconstruction with In-Context Learning (ICLR 2026)

Python 13 Updated Jan 29, 2026

Agentic LaTeX Writer - Local-first editor for AI-assisted academic writing

TypeScript 83 9 Updated Feb 18, 2026

Privacy-first AI memory layer - Signal for AI Memory. E2EE, local-first, works with Claude, Cursor, and any MCP-compatible AI.

TypeScript 16 Updated Feb 13, 2026

My linux dev configuration

Lua 8 Updated Feb 8, 2026

A local AI assistant running on your device. It turns your files into actionable memory.

TypeScript 54 6 Updated Feb 15, 2026

Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Python 238 7 Updated Feb 13, 2026

Open-source Autonomous 3D Characters on the Web

TypeScript 199 20 Updated Jan 15, 2026

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 1 Updated Dec 15, 2025

A multimodal framework for analyzing public street space utilization and inclusiveness using vision-language models.

Python 2 Updated Oct 19, 2025

[NeurIPS 2025] Deep Memory Backtracking for Long Video Understanding

Python 64 Updated Feb 10, 2026

Deploying High-Performance Lean 4 Server in One Click

Python 9 Updated Aug 14, 2025

Code for paper "Half-Physics: Enabling Kinematic 3D Human Model with Physical Interactions". Coming soon.

33 Updated Jul 31, 2025

Official code of the paper "EgoExOR: An Ego-Exo-Centric Operating Room Dataset for Surgical Activity Understanding" accepted at NeurIPS 2025

Jupyter Notebook 25 5 Updated Feb 20, 2026
Python 77 6 Updated May 4, 2025

MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.

Python 397 21 Updated Aug 26, 2025

This is the project repo for 'PSG-4D-LLM'.

CSS 12 Updated May 27, 2025

[ICLR 2025] AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation

Python 69 3 Updated Mar 19, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 54,547 9,547 Updated Feb 11, 2026

[CVPR 2025] EgoLife: Towards Egocentric Life Assistant

Python 396 19 Updated Mar 19, 2025

A fork to add multimodal model training to open-r1

Python 1,483 69 Updated Feb 8, 2025

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Python 1,545 254 Updated Jul 31, 2024

[ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.

Python 183 11 Updated Sep 26, 2025

Official Implementation of ReALFRED (ECCV'24)

Python 43 2 Updated Oct 11, 2024

[ECCV 2024 Oral] Code for our paper "A Fair Ranking and New Model for Panoptic Scene Graph Generation"

Python 16 1 Updated Dec 2, 2025

Official Code for "Digital Life Project: Autonomous 3D Characters with Social Intelligence"

44 Updated Sep 9, 2024

Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey [Miyai+, TMLR2025]

98 3 Updated Jun 16, 2025

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Python 1,277 85 Updated Jan 23, 2025

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 20,230 2,148 Updated Feb 19, 2026
Python 19 1 Updated Jul 23, 2024