Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View obtuseanglor's full-sized avatar

Block or report obtuseanglor

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,429 294 Updated Nov 5, 2024

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 6,300 405 Updated Jun 28, 2024

Official EHM Tracking Implementation for GUAVA (ICCV 2025)

Python 15 Updated Oct 8, 2025

Official implementation of the paper "GUAVA: Generalizable Upper Body 3D Gaussian Avatar" [ICCV 2025]

Python 173 26 Updated Oct 8, 2025
Python 39 2 Updated Jul 8, 2025

[ICLR 2024] Official implementation of "TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting"

Python 1,806 215 Updated Oct 5, 2025

[RSS 2025] AMO: Adaptive Motion Optimization for Hyper-Dexterous Humanoid Whole-Body Control

Python 294 13 Updated May 11, 2025

Official implementation of OpenWBT.

Python 774 84 Updated Jul 30, 2025
Python 1,552 308 Updated Jul 23, 2024

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 3,649 305 Updated Oct 20, 2025

Low-level locomotion policy training in Isaac Lab

Python 357 30 Updated Mar 7, 2025

[RSS'25] This repository is the implementation of "NaVILA: Legged Robot Vision-Language-Action Model for Navigation"

Python 379 32 Updated Aug 20, 2025

Official Implementation of "KungfuBot: Physics-Based Humanoid Whole-Body Control for Learning Highly-Dynamic Skills"

Python 582 79 Updated Nov 13, 2025

[IROS 2025] Generalizable Humanoid Manipulation with 3D Diffusion Policies. Part 1: Train & Deploy of iDP3

Python 441 32 Updated Jun 16, 2025

Humanoid robot arms retarget algorithm with VisionPro app

Python 91 7 Updated Oct 19, 2024

Various retargeting optimizers to translate human hand motion to robot hand motion.

Python 654 71 Updated Aug 21, 2025

PyTorch implementation for our paper Learning Character-Agnostic Motion for Motion Retargeting in 2D, SIGGRAPH 2019

Python 473 86 Updated Jun 21, 2022

[RSS 2024]: Expressive Whole-Body Control for Humanoid Robots

Python 462 46 Updated Mar 30, 2025

基于mujoco仿真环境对unitree g1机器操作的研究和学习

Python 61 8 Updated Oct 11, 2025

FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers

Python 480 32 Updated Aug 20, 2025

Text-audio foundation model from Boson AI

Python 7,620 564 Updated Sep 15, 2025

Audio Large Language Models

Python 782 40 Updated Jul 5, 2025

LeetCode Solutions: A Record of My Problem Solving Journey.( leetcode题解,记录自己的leetcode解题之路。)

JavaScript 55,709 9,480 Updated Jul 16, 2025

Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Python 1,544 304 Updated Nov 12, 2025

[ACM MM'2024]"DiffMM: Multi-Modal Diffusion Model for Recommendation"

Python 88 5 Updated Jul 21, 2024

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python 4,995 395 Updated Jul 10, 2024
Next