Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View ManishGovind's full-sized avatar

Block or report ManishGovind

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code to load DreamZero model checkpoints and run evaluation on DROID-sim and Genie Sim 3.0

Python 688 22 Updated Feb 11, 2026
Python 10 Updated Feb 9, 2026
Python 187 16 Updated Aug 1, 2025
Python 99 13 Updated Dec 4, 2025

Official Repository of "Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads"

Python 17 2 Updated Oct 6, 2025

šŸ¤— LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 21,602 3,764 Updated Feb 13, 2026

Official Code for RVT-2 and RVT

Jupyter Notebook 396 54 Updated Feb 14, 2025

A curated list of large VLM-based VLA models for robotic manipulation.

340 12 Updated Dec 21, 2025

[RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions

Python 988 57 Updated Nov 19, 2025

[ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos

Python 162 5 Updated Oct 1, 2025

Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset

Jupyter Notebook 533 55 Updated Feb 4, 2026

Frontier Multimodal Foundation Models for Image and Video Understanding

Jupyter Notebook 1,102 81 Updated Aug 14, 2025

Official Repository of 'Multi-Scale Temporal Mamba for Efficient Temporal Action Detection'

Python 35 5 Updated Jan 23, 2026

[ICLR'25] LLaRA: Supercharging Robot Learning Data for Vision-Language Policy

Python 227 8 Updated Mar 29, 2025

Theia: Distilling Diverse Vision Foundation Models for Robot Learning

Python 266 11 Updated Nov 6, 2025

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 11,500 1,712 Updated Jan 13, 2026

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 2,195 139 Updated Dec 15, 2025

怐EMNLP 2024šŸ”„ć€‘Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 3,448 248 Updated Dec 3, 2024

[CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".

Python 305 24 Updated Apr 3, 2024

[CVPR 2024] Code and models for pi-ViT, a video transformer for understanding activities of daily living

Python 30 3 Updated Nov 12, 2025

This is the offical repository of LLAVIDAL

Python 23 5 Updated Oct 4, 2025

A one stop repository for generative AI research updates, interview resources, notebooks and much more!

HTML 24,607 5,252 Updated Feb 10, 2026