Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View PuppyQ08's full-sized avatar
😏
Working from home
😏
Working from home

Block or report PuppyQ08

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

LeetGPU Solutions

Python 71 5 Updated Oct 9, 2025

Fast and memory-efficient exact attention

Python 20,361 2,115 Updated Nov 5, 2025

Several simple examples for popular neural network toolkits calling custom CUDA operators.

Python 1,519 201 Updated Apr 29, 2021

Sample codes for my CUDA programming book

Cuda 1,923 375 Updated Feb 15, 2025

Efficient Triton Kernels for LLM Training

Python 5,805 426 Updated Nov 6, 2025

XLeRobot: Practical Dual-Arm Mobile Home Robot for $660

Python 4,043 402 Updated Nov 5, 2025

how to optimize some algorithm in cuda.

Cuda 2,597 235 Updated Oct 30, 2025

Devops Tutorial for Beginners - Learn Docker, Kubernetes, Terraform, Ansible, Jenkins and Azure Devops

Java 2,737 6,591 Updated Sep 10, 2024

Machine Learning Engineering Open Book

Python 15,615 957 Updated Oct 27, 2025
Jupyter Notebook 67 22 Updated Nov 5, 2025

Machine Learning and Computer Vision Engineer - Technical Interview Questions

4,202 675 Updated May 20, 2025

Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance.

Cuda 387 52 Updated Jan 2, 2025

C/C++ Cheat Sheet

Python 206 50 Updated Nov 4, 2025

Modern C++ Cheatsheet

C++ 3,469 727 Updated Dec 15, 2023

A cheatsheet of modern C++ language and library features.

21,208 2,238 Updated Apr 5, 2025

🟣 Concurrency interview questions and answers to help you prepare for your next software architecture and design patterns interview in 2025.

87 13 Updated May 19, 2025

C++17 limit-order book and TCP-based matching engine with sub-200 µs latency using Boost.Asio and custom memory pools.

Makefile 1 Updated Jul 11, 2025

📚 从零开始的大语言模型原理与实践教程

Jupyter Notebook 20,987 1,854 Updated Oct 17, 2025

卢瑟们的作业展示,答案讲解,以及一些C++知识

C++ 742 140 Updated Oct 6, 2025

system-design-interview-bytebytego

7 5 Updated Aug 17, 2024

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python 3,386 229 Updated Nov 2, 2025

Assembler for NVIDIA Maxwell architecture

Sass 1,046 172 Updated Jan 3, 2023

C++ Primer 5 answers

C++ 8,265 2,981 Updated Jun 6, 2024

搞定C++:punch:。C++ Primer 中文版第5版学习仓库,包括笔记和课后练习答案。

C++ 8,521 2,012 Updated Jul 20, 2025

AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。

Jupyter Notebook 4,991 690 Updated Nov 6, 2025

Some CUDA example code with READMEs.

Cuda 176 26 Updated Mar 2, 2025

Solve puzzles. Improve your pytorch.

Jupyter Notebook 3,765 336 Updated Jul 15, 2024

Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.

11,572 1,893 Updated Aug 31, 2023
C++ 267 91 Updated Oct 29, 2025
Next