Simple intro to tma

Cuda 7 4 Updated Sep 5, 2025

Nano vLLM

Python 9,860 1,239 Updated Nov 3, 2025

A Python subset for a better MLIR programming experience

Python 43 7 Updated Oct 30, 2025

Efficient Triton Kernels for LLM Training

Python 5,962 452 Updated Dec 20, 2025

BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project.

C++ 91 29 Updated Oct 22, 2025

Tutorials on data assimilation (DA) and the EnKF

Python 176 55 Updated Dec 5, 2025

Modern CMake C++ project example, with codespell, cmake, cppcheck, clang-format, clang-tidy, lcov, and gcovr support.

CMake 3 Updated Jan 6, 2024

Pythonic tool for orchestrating machine-learning/high performance/quantum-computing workflows in heterogeneous compute environments.

Python 852 109 Updated Dec 8, 2025

ArrayFire: a general purpose GPU library.

C++ 4,839 552 Updated Sep 5, 2025

A parallel framework for deep learning

Fortran 456 100 Updated Dec 17, 2025

CUDA on non-NVIDIA GPUs

Rust 13,679 880 Updated Dec 19, 2025

The missing CMake project initializer

CMake 2,456 93 Updated Aug 31, 2025

A simple header-only C++ library providing a templated multiarray class

CMake 2 Updated Jan 5, 2024

timeprof is a simple C++ library for profiling code regions to measure execution time.

C++ 7 Updated Jul 31, 2023

OpenAI top-up guide

284 14 Updated Jul 10, 2024

Commonly used development libraries by 逆天 (being organized and updated)

Batchfile 43 28 Updated May 26, 2021

row-major matmul optimization

C++ 692 94 Updated Aug 20, 2025

Domain specific library for electronic structure calculations

C++ 158 47 Updated Dec 18, 2025

Convert AI papers to GUI. Make it easy and convenient for everyone to use cutting-edge artificial intelligence technology.

Jupyter Notebook 10,704 883 Updated Sep 20, 2024

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 16,042 1,270 Updated Jan 18, 2025

🤖 Scrape data from HTML websites automatically by just providing examples

Python 1,370 90 Updated Mar 17, 2024

A curated list of awesome CMake resources, scripts, modules and examples.

5,319 494 Updated Dec 15, 2025

Rodinia benchmark

C 194 106 Updated Apr 14, 2023

Spector: An OpenCL FPGA Benchmark Suite

Shell 49 18 Updated Feb 2, 2019

A list of awesome compiler projects and papers for tensor computation and deep learning.

2,701 323 Updated Oct 19, 2024

Must-read research papers and links to tools and datasets related to using machine learning for compilers and systems optimisation

1,639 173 Updated Sep 12, 2025

A collection of out-of-tree LLVM passes for teaching and learning

C++ 3,326 432 Updated Dec 16, 2025

A fast, simple & powerful blog framework, powered by Node.js.

TypeScript 41,092 5,027 Updated Dec 12, 2025

Dependence-Based Code Transformation for Coarse-Grained Parallelism

C++ 4 1 Updated Dec 8, 2018

An implementation of Johnson's circuit-finding algorithm

C++ 25 14 Updated Jul 26, 2017