CongMa13
  • AMD
  • Calgary


A next-generation C++ language server for modern C++, focused on high performance and deep code intelligence

C++ 1,157 67 Updated Feb 4, 2026

Fast and Furious AMD Kernels

C++ 365 53 Updated Feb 15, 2026

Distributed Compiler based on Triton for Parallel Systems

Python 1,359 127 Updated Feb 13, 2026

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 3,169 355 Updated Jan 17, 2026

Playing around "Less Slow" coding practices in C++ 20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception handling, networking and user-space IO

C++ 1,902 81 Updated Dec 23, 2025

A guide for the rest of us on using C++ templates.

C++ 562 34 Updated May 30, 2018

List of materials about functional programming in C++

699 63 Updated Jun 27, 2020

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 31,415 3,911 Updated Feb 22, 2026

Functional programming style pattern-matching library for C++

C++ 1,310 76 Updated Oct 22, 2021

C++17 `std::variant` for C++11/14/17

C++ 707 90 Updated Dec 7, 2022

Eggs.Variant is a C++11/14/17 generic, type-safe, discriminated union.

C++ 141 27 Updated Feb 2, 2022

C++11/C++14 Variant

C++ 379 96 Updated Apr 13, 2023

The best way to write secure and reliable applications. Write nothing; deploy nowhere.

Dockerfile 64,752 4,786 Updated Aug 7, 2024

LLM inference in C/C++

C++ 95,599 15,020 Updated Feb 22, 2026

Neural Networks: Zero to Hero

Jupyter Notebook 20,378 2,901 Updated Aug 18, 2024

Customizable automatic UML diagram generator for C++ based on Clang.

C++ 868 59 Updated Jan 28, 2026

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 55,178 6,024 Updated Feb 9, 2026

📚 [Translation] ApacheCN C/C++ translation collection

JavaScript 360 71 Updated Apr 13, 2025

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

C++ 9,393 1,014 Updated Dec 4, 2025

21 Lessons, Get Started Building with Generative AI

Jupyter Notebook 106,761 57,209 Updated Feb 16, 2026

My continuously updated Machine Learning, Probabilistic Models and Deep Learning notes and demos (2000+ slides), with video links

Jupyter Notebook 9,585 1,767 Updated Jan 11, 2026

A technical explainer by @kognise of how your computer runs programs, from start to finish.

MDX 5,442 190 Updated Jun 15, 2024

A collection of examples for the ROCm software stack

C++ 279 83 Updated Feb 20, 2026

A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators

Python 126 18 Updated Nov 14, 2025