Thanks to visit codestin.com
Credit goes to github.com

amazingyyc

Follow

😞

惊奇漫画 amazingyyc

😞

Follow

你呀你，是志在如风的少年。

34 followers · 2 following

@microsoft
beijing

Achievements

Achievements

Stars

Mellanox / gpu_direct_rdma_access

example code for using DC QP for providing RDMA READ and WRITE operations to remote GPU memory

C 145 36 Updated Jul 30, 2024

DeepLink-org / DLSlime

DLSlime: Flexible & Efficient Heterogeneous Transfer Toolkit

C++ 71 6 Updated Oct 26, 2025

LLMServe / SwiftTransformer

High performance Transformer implementation in C++.

C++ 139 16 Updated Jan 18, 2025

LLMServe / DistServe

Disaggregated serving system for Large Language Models (LLMs).

Jupyter Notebook 709 79 Updated Apr 6, 2025

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,154 407 Updated Oct 25, 2025

osayamenja / FlashMoE

Distributed MoE in a Single Kernel [NeurIPS '25]

Cuda 89 11 Updated Sep 30, 2025

twitter / the-algorithm

Source code for the X Recommendation Algorithm

Scala 67,630 12,606 Updated Sep 8, 2025

Azure / MS-AMP

Microsoft Automatic Mixed Precision Library

Python 627 48 Updated Sep 29, 2024

ggml-org / ggml

Tensor library for machine learning

C++ 13,321 1,375 Updated Oct 22, 2025

wanghenshui / cppweeklynews

c++中文周刊

SCSS 525 26 Updated Aug 11, 2025

BBuf / how-to-optim-algorithm-in-cuda

how to optimize some algorithm in cuda.

Cuda 2,578 232 Updated Oct 21, 2025

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 13,956 3,188 Updated Oct 27, 2025

pytorch / pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 94,274 25,671 Updated Oct 27, 2025

idealvin / coost

A tiny boost library in C++11.

C++ 4,162 580 Updated May 27, 2025

eip-work / kuboard-press

Kuboard 是基于 Kubernetes 的微服务管理界面。同时提供 Kubernetes 免费中文教程，入门教程，最新版本的 Kubernetes v1.23.4 安装手册，(k8s install) 在线答疑，持续更新。

JavaScript 24,437 1,595 Updated Oct 12, 2025

eBay / NuRaft

C++ implementation of Raft core logic as a replication library

C++ 1,136 268 Updated Oct 20, 2025

google-ai-edge / mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

C++ 31,734 5,582 Updated Oct 24, 2025

agrechnev / trt-cpp-min

TensorRT 7 C++ (almost) minimal examples

C++ 83 7 Updated Nov 11, 2023

amazingyyc / DADT

A Decentrilized Asynchronously Distribute Training framework

C++ 7 Updated Apr 14, 2022

The-Run-Philosophy-Organization / run

润学全球官方指定GITHUB，整理润学宗旨、纲领、理论和各类润之实例；解决为什么润，润去哪里，怎么润三大问题；并成为新中国人的核心宗教，核心信念。

32,136 2,605 Updated Jul 31, 2024

lutzroeder / netron

Visualizer for neural network, deep learning and machine learning models

JavaScript 31,637 3,015 Updated Oct 27, 2025

tensorflow / gnn

TensorFlow GNN is a library to build Graph Neural Networks on the TensorFlow platform.

Python 1,492 198 Updated Oct 24, 2025

dmlc / dgl

Python package built to ease deep learning on graph, on top of existing DL frameworks.

Python 14,109 3,050 Updated Jul 31, 2025

cs01 / gdbgui

Browser-based frontend to gdb (gnu debugger). Add breakpoints, view the stack, visualize data structures, and more in C, C++, Go, Rust, and Fortran. Run gdbgui from the terminal and a new tab will …

TypeScript 10,179 520 Updated Jun 29, 2025

baoshengyu / H3R

Heatmap Regression via Randomized Rounding

Python 53 13 Updated Nov 25, 2021

pkhungurn / talking-head-anime-demo

Demo for the "Talking Head Anime from a Single Image."

Python 2,018 287 Updated Jun 29, 2022

lattice / quda

QUDA is a library for performing calculations in lattice QCD on GPUs.

C++ 328 109 Updated Oct 25, 2025

NVIDIA / nccl

Optimized primitives for collective multi-GPU communication

C++ 4,182 1,048 Updated Oct 18, 2025

4U6U57 / wsl-open

Open files with xdg-open on Bash for Windows in Windows applications. Read only mirror from GitLab, see link 👉

Shell 534 27 Updated May 26, 2022

996icu / 996.ICU

Repo for counting stars and contributing. Press F to pay respect to glorious developers.

274,696 21,033 Updated Aug 22, 2025