Starred repositories
Open source annotation tool for machine learning practitioners.
Multilingual Document Layout Parsing in a Single Vision-Language Model
Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments (VIS 2024)
Offline, privacy-first grammar checker. Fast, open-source, Rust-powered
Open-source deep-learning framework for building, training, and fine-tuning deep learning models using state-of-the-art Physics-ML methods
ByteTrack implementation for person tracking using PyTorch
A simple screen parsing tool towards pure vision based GUI agent
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
Must-read Papers on Knowledge Editing for Large Language Models.
[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box
an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM
Kodu is an autonomous coding agent that lives in your IDE. It is a VSCode extension that can help you build your dream project step by step by leveraging the latest technologies in automated coding…
RTMPose series (RTMPose, DWPose, RTMO, RTMW) without mmcv, mmpose, mmdet etc.
OpenMMLab Pose Estimation Toolbox and Benchmark.
Adding guardrails to large language models.
Port of OpenAI's Whisper model in C/C++
[ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels
SGLang is a fast serving framework for large language models and vision language models.
Official implementation of the paper "CoEdIT: Text Editing by Task-Specific Instruction Tuning" (EMNLP 2023)
Reference PyTorch implementation and models for DINOv3
A course on aligning smol models.
An extremely fast Python package and project manager, written in Rust.
Fast & Simple repository for pre-training and fine-tuning T5-style models
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
A GPU-accelerated TSDF and ESDF library for robots equipped with RGB-D cameras.