-
Harvard University
- Boston, US
- https://www.linkedin.com/in/qianru-lao
- https://estherbear.github.io
Highlights
- Pro
Stars
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.
My learning notes for ML SYS.
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
A listing of compiler, language and runtime teams for people looking for jobs in this area
OBS Studio - Free and open source software for live streaming and screen recording
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
A high-throughput and memory-efficient inference and serving engine for LLMs
A composable and fully extensible C++ execution engine library for data management systems.
Tutel MoE: Optimized Mixture-of-Experts Library, Support GptOss/DeepSeek/Kimi-K2/Qwen3 using FP8/NVFP4/MXFP4
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
JuiceFS is a distributed POSIX file system built on top of Redis and S3.
📚 Modern C++ Tutorial: C++11/14/17/20 On the Fly | https://changkun.de/modern-cpp/
Git Source Code Mirror - This is a publish-only repository but pull requests can be turned into patches to the mailing list via GitGitGadget (https://gitgitgadget.github.io/). Please follow Documen…
《Machine Learning Systems: Design and Implementation》- Chinese Version
awesome grounding: A curated list of research papers in visual grounding