Lists (1)
Sort Name ascending (A-Z)
Stars
Fast and memory-efficient exact attention
Several simple examples for popular neural network toolkits calling custom CUDA operators.
Sample codes for my CUDA programming book
Efficient Triton Kernels for LLM Training
XLeRobot: Practical Dual-Arm Mobile Home Robot for $660
how to optimize some algorithm in cuda.
Devops Tutorial for Beginners - Learn Docker, Kubernetes, Terraform, Ansible, Jenkins and Azure Devops
Machine Learning Engineering Open Book
Machine Learning and Computer Vision Engineer - Technical Interview Questions
Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance.
A cheatsheet of modern C++ language and library features.
🟣 Concurrency interview questions and answers to help you prepare for your next software architecture and design patterns interview in 2025.
C++17 limit-order book and TCP-based matching engine with sub-200 µs latency using Boost.Asio and custom memory pools.
system-design-interview-bytebytego
A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.
搞定C++:punch:。C++ Primer 中文版第5版学习仓库,包括笔记和课后练习答案。
AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。
Solve puzzles. Improve your pytorch.
Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.