Open-source unified multimodal model

Python 5,569 487 Updated Oct 27, 2025

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,519 60 Updated Jun 14, 2025

NVIDIA Isaac GR00T N1.6 - A Foundation Model for Generalist Robots.

Jupyter Notebook 5,949 949 Updated Jan 15, 2026

[IROS 2025 Award Finalist] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 2,736 191 Updated Dec 16, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,664 2,235 Updated Feb 1, 2025

[NeurIPS 2025] Improving Video Generation with Human Feedback

Python 408 11 Updated Sep 24, 2025

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 5,052 401 Updated Jan 16, 2026

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

8,083 524 Updated Jan 6, 2026

FastPillars: A Deployment-friendly Pillar-based 3D Detector

Python 171 11 Updated Jan 14, 2025
Python 30 Updated Jun 24, 2024

Code for RFSR: Improving ISR Diffusion Models via Reward Feedback Learning

Python 19 Updated Dec 8, 2024
Python 100 10 Updated Dec 27, 2024
Python 64 3 Updated Feb 20, 2025

A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability

105 Updated Nov 28, 2024
Python 4,513 440 Updated Sep 14, 2025

OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion

Python 391 30 Updated Mar 12, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 18,312 2,316 Updated Dec 25, 2024
8 Updated Jul 21, 2024
Python 557 44 Updated Jun 8, 2025

A curated list of awesome knowledge-driven autonomous driving (continually updated)

488 23 Updated Jun 7, 2024

UniMD: Towards Unifying Moment retrieval and temporal action Detection

Python 55 1 Updated Jul 5, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,320 2,703 Updated Aug 12, 2024

A curated list of awesome LLM/VLM/VLA for Autonomous Driving (LLM4AD) resources (continually updated)

1,629 95 Updated Jan 15, 2026

InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024

Jupyter Notebook 90 4 Updated Apr 9, 2024

✨✨Latest Advances on Multimodal Large Language Models

17,195 1,103 Updated Dec 26, 2025

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 2,555 196 Updated Feb 16, 2025

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,478 2,153 Updated Jan 16, 2026

[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

Python 2,724 306 Updated Jul 31, 2024

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 13,524 2,596 Updated Jun 26, 2024