Stars
[CVPR 2025] The official implementation of 'MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors'
Papers, code and datasets about deep learning for 3D Object Detection.
3D object detection using YOLO and depth estimation
A language server for Bash
SAPIEN Manipulation Skill Framework, an open source GPU parallelized robotics simulator and benchmark, led by Hillbot, Inc.
XLeRobot: Practical Dual-Arm Mobile Home Robot for $660
2025 - This is my deployment environment for real world AI robot policies, and a place to create training data for reinforcement learning and imitation learning.
Aluminium Body for Standard Open Arm (SO-ARM100)
Play with open source, low-cost AI robots with ease
Solana transaction normalizer, parser, and resolver.
Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
A Python frontend and library for ComfyUI
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …
Open Source smart glasses designed to be 1. All day wearable 2. Immediately useful 3. Extendable for makers, startups, and everyone else.
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Prototype of application that extracts pupil size from webcam directly or from pre-recorded video.
[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
Python Simple Object Storage - provides a list and dictionary interface that seamlessly stores data in a file, like a simplified database.
A debugging and profiling tool that can trace and visualize python code execution
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Text-Prompted Generative Audio Model
A versatile WebRTC pre-compiled Android library that reflects the recent WebRTC updates to facilitate real-time video chat for Android and Compose.
Simple UI for LLM Model Finetuning
Omniscient Mozart, being able to transcribe everything in the music, including vocal, drum, chord, beat, instruments, and more.
An unofficial PyTorch implementation of the audio LM VALL-E