Stars
A Python subset for a better MLIR programming experience
Efficient Triton Kernels for LLM Training
BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project.
Tutorials on data assimilation (DA) and the EnKF
Modern Cmake C++ project example, with codespell, cmake, cpppcheck clang-format clang-tidy lcov gcovr support.
Pythonic tool for orchestrating machine-learning/high performance/quantum-computing workflows in heterogeneous compute environments.
A parallel framework for deep learning
A simple C++ templated multiarray class for array, a header-only library
timeprof is a simple C++ library for profiling code regions to measure execution time.
Domain specific library for electronic structure calculations
Convert AI papers to GUI,Make it easy and convenient for everyone to use artificial intelligence technology。让每个人都简单方便的使用前沿人工智能技术
pix2tex: Using a ViT to convert images of equations into LaTeX code.
🤖 Scrape data from HTML websites automatically by just providing examples
A curated list of awesome CMake resources, scripts, modules and examples.
A list of awesome compiler projects and papers for tensor computation and deep learning.
Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation
A collection of out-of-tree LLVM passes for teaching and learning
A fast, simple & powerful blog framework, powered by Node.js.
Dependence-Based Code Transformation for Coarse-Grained Parallelism
An implementation of the Johnson's circuit finding algorithm