Starred repositories
🚀🚀🚀 This repository lists some awesome public CUDA, cuda-python, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR, PTX and High Performance Computing (HPC) projects.
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Video+code lecture on building nanoGPT from scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Implementing modern DL systems from scratch — Transformers, Diffusion, Multimodal LLMs, FlashAttention, RLHF.
Text-audio foundation model from Boson AI
A high-throughput and memory-efficient inference and serving engine for LLMs
This repository contains the implementation of an object-based imitation learning approach that utilizes an RNN-based method to predict future waypoints by leveraging object data, lane data, and IM…
This repository collects research papers of large Vision Language Models in Autonomous driving and Intelligent Transportation System. The repository will be continuously updated to track the lates…
Collection of AWESOME vision-language models for vision tasks
Machine Learning and Computer Vision Engineer - Technical Interview Questions
Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw
Learning materials of Transformer, including my code, XMind, PDF and so on
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
This repository presents an enhanced version of the HKUST-Aerial-Robotics/EPSILON project, migrated from ROS1 to ROS2 for improved performance and compatibility with modern autonomous driving frame…
The AWS Deployment Framework (ADF) is an extensive and flexible framework to manage and deploy resources across multiple AWS accounts and regions based on AWS Organizations.
ADDF is a collection of modules, deployed using the SeedFarmer orchestration tool. ADDF modules enable users to quickly bootstrap environments for the process and analysis of autonomous driving data.
Experiments for designing tree search algorithms for Continuous POMDPs
Hybrid A* Path Planner for the KTH Research Concept Vehicle
MTR: Motion Transformer with Global Intention Localization and Local Movement Refinement, NeurIPS 2022.
[CVPR 2021] "Multimodal Motion Prediction with Stacked Transformers": official code implementation and project page.
Extremely simple yet powerful header-only C++ plotting library built on the popular matplotlib
Master programming by recreating your favorite technologies from scratch.