Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View amazingyyc's full-sized avatar
😞
😞

Block or report amazingyyc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

example code for using DC QP for providing RDMA READ and WRITE operations to remote GPU memory

C 145 36 Updated Jul 30, 2024

DLSlime: Flexible & Efficient Heterogeneous Transfer Toolkit

C++ 71 6 Updated Oct 26, 2025

High performance Transformer implementation in C++.

C++ 139 16 Updated Jan 18, 2025

Disaggregated serving system for Large Language Models (LLMs).

Jupyter Notebook 709 79 Updated Apr 6, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,154 407 Updated Oct 25, 2025

Distributed MoE in a Single Kernel [NeurIPS '25]

Cuda 89 11 Updated Sep 30, 2025

Source code for the X Recommendation Algorithm

Scala 67,630 12,606 Updated Sep 8, 2025

Microsoft Automatic Mixed Precision Library

Python 627 48 Updated Sep 29, 2024

Tensor library for machine learning

C++ 13,321 1,375 Updated Oct 22, 2025

c++中文周刊

SCSS 525 26 Updated Aug 11, 2025

how to optimize some algorithm in cuda.

Cuda 2,578 232 Updated Oct 21, 2025

Ongoing research training transformer models at scale

Python 13,956 3,188 Updated Oct 27, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 94,274 25,671 Updated Oct 27, 2025

A tiny boost library in C++11.

C++ 4,162 580 Updated May 27, 2025

Kuboard 是基于 Kubernetes 的微服务管理界面。同时提供 Kubernetes 免费中文教程,入门教程,最新版本的 Kubernetes v1.23.4 安装手册,(k8s install) 在线答疑,持续更新。

JavaScript 24,437 1,595 Updated Oct 12, 2025

C++ implementation of Raft core logic as a replication library

C++ 1,136 268 Updated Oct 20, 2025

Cross-platform, customizable ML solutions for live and streaming media.

C++ 31,734 5,582 Updated Oct 24, 2025

TensorRT 7 C++ (almost) minimal examples

C++ 83 7 Updated Nov 11, 2023

A Decentrilized Asynchronously Distribute Training framework

C++ 7 Updated Apr 14, 2022

润学全球官方指定GITHUB,整理润学宗旨、纲领、理论和各类润之实例;解决为什么润,润去哪里,怎么润三大问题; 并成为新中国人的核心宗教,核心信念。

32,136 2,605 Updated Jul 31, 2024

Visualizer for neural network, deep learning and machine learning models

JavaScript 31,637 3,015 Updated Oct 27, 2025

TensorFlow GNN is a library to build Graph Neural Networks on the TensorFlow platform.

Python 1,492 198 Updated Oct 24, 2025

Python package built to ease deep learning on graph, on top of existing DL frameworks.

Python 14,109 3,050 Updated Jul 31, 2025

Browser-based frontend to gdb (gnu debugger). Add breakpoints, view the stack, visualize data structures, and more in C, C++, Go, Rust, and Fortran. Run gdbgui from the terminal and a new tab will …

TypeScript 10,179 520 Updated Jun 29, 2025

Heatmap Regression via Randomized Rounding

Python 53 13 Updated Nov 25, 2021

Demo for the "Talking Head Anime from a Single Image."

Python 2,018 287 Updated Jun 29, 2022

QUDA is a library for performing calculations in lattice QCD on GPUs.

C++ 328 109 Updated Oct 25, 2025

Optimized primitives for collective multi-GPU communication

C++ 4,182 1,048 Updated Oct 18, 2025

Open files with xdg-open on Bash for Windows in Windows applications. Read only mirror from GitLab, see link 👉

Shell 534 27 Updated May 26, 2022

Repo for counting stars and contributing. Press F to pay respect to glorious developers.

274,696 21,033 Updated Aug 22, 2025
Next