Jingkang50

🍐

Today's Fruit

Jingkang (Jake) Yang Jingkang50

🍐

Today's Fruit

Co-Founder at Synvo AI | Prev. MMLab@NTU PhD

318 followers · 82 following

Synvo AI
Singapore
jingkangyang.com
@JingkangY

Achievements

x2 x3 x3

Achievements

x2 x3 x3

Lists (1)

Sort

mmlab

14 repositories

Starred repositories

dongyh20 / Demo-ICL

Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition

Python 32 Updated Feb 10, 2026

Nicous20 / EgoHandICL

EgoHandICL: Egocentric 3D Hand Reconstruction with In-Context Learning (ICLR 2026)

Python 13 Updated Jan 29, 2026

EvolvingLMMs-Lab / lmms-lab-writer

Agentic LaTeX Writer - Local-first editor for AI-assisted academic writing

TypeScript 83 9 Updated Feb 18, 2026

EvolvingLMMs-Lab / engram

Privacy-first AI memory layer - Signal for AI Memory. E2EE, local-first, works with Claude, Cursor, and any MCP-compatible AI.

TypeScript 16 Updated Feb 13, 2026

Luodian / lazybuild

My linux dev configuration

Lua 8 Updated Feb 8, 2026

synvo-ai / local-cocoa

A local AI assistant running on your device. It turns your files into actionable memory.

TypeScript 54 6 Updated Feb 15, 2026

EvolvingLMMs-Lab / OneVision-Encoder

Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Python 238 7 Updated Feb 13, 2026

dlp3d-ai / dlp3d.ai

Open-source Autonomous 3D Characters on the Web

TypeScript 199 20 Updated Jan 15, 2026

MMMU-Japanese-Benchmark / lmms-eval_jmmmu-pro_pr

Forked from EvolvingLMMs-Lab/lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 1 Updated Dec 15, 2025

ZhangXiamengwei / EgoCity

A multimodal framework for analyzing public street space utilization and inclusiveness using vision-language models.

Python 2 Updated Oct 19, 2025

worldbench / VideoLucy

[NeurIPS 2025] Deep Memory Backtracking for Long Video Understanding

Python 64 Updated Feb 10, 2026

EvolvingLMMs-Lab / lean-runner

Deploying High-Performance Lean 4 Server in One Click

Python 9 Updated Aug 14, 2025

lisiyao21 / Half-Physics

Code for paper "Half-Physics: Enabling Kinematic 3D Human Model with Physical Interactions". Coming soon.

33 Updated Jul 31, 2025

ardamamur / EgoExOR

Official code of the paper "EgoExOR: EgoExOR: An Ego-Exo-Centric Operating Room Dataset for Surgical Activity Understanding" accepted at NeurIPS 2025

Jupyter Notebook 25 5 Updated Feb 20, 2026

EvolvingLMMs-Lab / Aero-1

Python 77 6 Updated May 4, 2025

EvolvingLMMs-Lab / multimodal-search-r1

MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.

Python 397 21 Updated Aug 26, 2025

ChocoWu / PSG-4D-LLM

This is the project repo for 'PSG-4D-LLM'.

CSS 12 Updated May 27, 2025

yukangcao / AvatarGO

[ICLR' 25] AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation

Python 69 3 Updated Mar 19, 2025

FoundationAgents / OpenManus

No fortress, purely open ground. OpenManus is Coming.

Python 54,547 9,547 Updated Feb 11, 2026

EvolvingLMMs-Lab / EgoLife

[CVPR 2025] EgoLife: Towards Egocentric Life Assistant

Python 396 19 Updated Mar 19, 2025

EvolvingLMMs-Lab / open-r1-multimodal

A fork to add multimodal model training to open-r1

Python 1,483 69 Updated Feb 8, 2025

octo-models / octo

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Python 1,545 254 Updated Jul 31, 2024

EvolvingLMMs-Lab / multimodal-sae

[ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.

Python 183 11 Updated Sep 26, 2025

snumprlab / realfred

Official Implementation of ReALFRED (ECCV'24)

Python 43 2 Updated Oct 11, 2024

lorjul / panoptic-scene-graph-generation

[ECCV 2024 Oral] Code for our paper "A Fair Ranking and New Model for Panoptic Scene Graph Generation"

Python 16 1 Updated Dec 2, 2025

caizhongang / digital_life_project

Official Code for "Digital Life Project: Autonomous 3D Characters with Social Intelligence"

44 Updated Sep 9, 2024

AtsuMiyai / Awesome-OOD-VLM

Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey [Miyai+, TMLR2025]

98 3 Updated Jun 16, 2025

DAMO-NLP-SG / VideoLLaMA2

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Python 1,277 85 Updated Jan 23, 2025

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 20,230 2,148 Updated Feb 19, 2026

ztyang23 / BACON

Python 19 1 Updated Jul 23, 2024

Jingkang (Jake) Yang Jingkang50

Lists (1)

mmlab

Starred repositories

Terminal

Python

LaTeX