Stars
Attention mappers and visualisation for transformer-based Physical AI policies
Isaac Lab API, powered by MuJoCo-Warp, for RL and robotics research.
Confidence scores for Neural Networks, made easy!
Enhancing Motion Dynamics of Image-to-Video Models via Adaptive Low-Pass Guidance (arXiv 2025)
Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning
NVIDIA Isaac GR00T N1.5 - A Foundation Model for Generalist Robots.
moojink / openvla-oft
Forked from openvla/openvlaFine-Tuning Vision-Language-Action Models: Optimizing Speed and Success
Evaluating Safety of Autonomous Agents in Mobile Device Control
DiffusionGuard: A Robust Defense Against Malicious Diffusion-based Image Editing (ICLR 2025)
Realtime API for Lucky World simulator with ROS-like interface
Benchmarking Mobile Device Control Agents across Diverse Configurations (ICLR 2024 workshop GenAI4DM spotlight presentation and CoLLAs 2025)
Large World Model -- Modeling Text and Video with Millions Context
A JAX research toolkit for building, editing, and visualizing neural networks.
MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models (NeurIPS 2024)
[ICLR 2024] Confidence-aware Reward Optimization for Fine-tuning Text-to-Image Models
🧠💬 Articles I wrote about machine learning, archived from MachineCurve.com.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
RLHF implementation details of OAI's 2019 codebase
Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)
Can large language models provide useful feedback on research papers? A large-scale empirical analysis.