Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View yuxuan-z19's full-sized avatar

Highlights

  • Pro

Block or report yuxuan-z19

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Programmable CUDA/C++ GPU Graph Analytics

C++ 1,042 214 Updated Jul 30, 2024

Benchmarking Deep Learning operations on different hardware

C++ 1,099 240 Updated Apr 25, 2021
C++ 267 90 Updated Oct 29, 2025

The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.

Python 1,477 406 Updated Oct 31, 2025

collection of benchmarks to measure basic GPU capabilities

C++ 441 66 Updated Oct 24, 2025

The official repository of ALE-Bench

Python 122 13 Updated Oct 27, 2025

Evaluating Large Language Models for CUDA Code Generation ComputeEval is a framework designed to generate and evaluate CUDA code from Large Language Models.

Python 69 12 Updated Oct 1, 2025

A microbenchmark support library

C++ 9,799 1,712 Updated Oct 29, 2025

CUDA Kernel Benchmarking Library

Cuda 757 90 Updated Oct 21, 2025

Learn CUDA with PyTorch

Cuda 96 13 Updated Sep 24, 2025

Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.

Python 272 49 Updated Nov 1, 2025

Open-source unified multimodal model

Python 5,234 454 Updated Oct 27, 2025

Apptainer: Application containers for Linux

Go 1,606 160 Updated Oct 31, 2025

A PyTorch native platform for training generative AI models

Python 4,623 590 Updated Nov 1, 2025
C++ 707 121 Updated Oct 29, 2025

A hybrid GPU cluster simulator for ML system performance estimation

Rust 7 Updated Oct 9, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 15,260 1,100 Updated Oct 31, 2025

A Git-compatible VCS that is both simple and powerful

Rust 21,838 772 Updated Oct 31, 2025

cloc counts blank lines, comment lines, and physical lines of source code in many programming languages.

Perl 21,933 1,077 Updated Oct 31, 2025

AgentScope: Agent-Oriented Programming for Building LLM Applications

Python 13,487 1,089 Updated Oct 31, 2025

Leave One Feature Out Importance

Python 847 86 Updated Feb 14, 2025
Python 50 4 Updated Sep 19, 2025

Capabench:A Game-Theoretic Evaluation Benchmark for Modular Attribution in LLM Agents

Python 7 Updated May 16, 2025

Qianfan-VL: Domain-Enhanced Universal Vision-Language Models

166 13 Updated Sep 22, 2025

Polyhedral Parallel Code Generation (source repository: http://repo.or.cz/ppcg.git)

C 131 36 Updated Jul 22, 2022

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

C++ 1,245 175 Updated Aug 19, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,186 415 Updated Oct 31, 2025

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 798 60 Updated Oct 31, 2025

A machine learning compiler for GPUs, CPUs, and ML accelerators

C++ 3,644 673 Updated Nov 1, 2025

KMS 激活服务,slmgr 命令激活 Windows 系统、Office

HTML 2,657 431 Updated Oct 5, 2025
Next