Fangtangtang
🐒 energetic


LLM Serving simulation for multi-core NPU

C++ 19 2 Updated Oct 30, 2025

Asynchronous semantics for architectural simulation and synthesis.

Python 55 13 Updated Nov 1, 2025

A machine learning accelerator core designed for energy-efficient AI at the edge.

Emacs Lisp 1,683 162 Updated Oct 31, 2025

kernels, of the mega variety

Python 594 26 Updated Sep 28, 2025

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 1,920 145 Updated Oct 31, 2025
C 488 87 Updated Oct 31, 2025
Python 155 49 Updated Feb 22, 2024

Open, Modular, Deep Learning Accelerator

Scala 312 86 Updated Apr 10, 2024

cricket is a virtualization solution for GPUs

C 218 49 Updated Sep 9, 2025
C++ 64 19 Updated Aug 30, 2024

A reference-counted netlist library for EDA tool development

Rust 1 3 Updated Oct 15, 2025

Run Time for AIE and FPGA based platforms

C++ 631 505 Updated Nov 1, 2025
C++ 114 35 Updated Oct 28, 2025

The CP-SAT Primer: Using and Understanding Google OR-Tools' CP-SAT Solver

Jupyter Notebook 595 52 Updated Oct 30, 2025

WaferLLM: Large Language Model Inference at Wafer Scale

Python 65 8 Updated Oct 31, 2025

Asterinas is a secure, fast, and general-purpose OS kernel, written in Rust and providing Linux-compatible ABI.

Rust 3,712 233 Updated Oct 31, 2025

A collection of pre-trained, state-of-the-art models in the ONNX format

Jupyter Notebook 9,147 1,528 Updated Sep 16, 2025

Fork of LLVM to support AMD AIEngine processors

LLVM 173 30 Updated Oct 31, 2025

AMD Ryzen™ AI Software includes the tools and runtime libraries for optimizing and deploying AI inference on AMD Ryzen™ AI powered PCs.

Python 678 106 Updated Oct 21, 2025

Exocompilation for productive programming of hardware accelerators

Python 676 49 Updated Nov 1, 2025

Berkeley's Spatial Array Generator

Scala 1,094 222 Updated Oct 31, 2025

An MLIR Compiler for PyTorch/C/C++ Codes into HLS Dataflow Designs

MLIR 49 10 Updated Aug 1, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels

C++ 3,813 293 Updated Nov 1, 2025

ARIES: An Agile MLIR-Based Compilation Flow for Reconfigurable Devices with AI Engines (FPGA 2025 Best Paper Nominee)

C++ 51 8 Updated Nov 1, 2025

An MLIR-based toolchain for AMD AI Engine-enabled devices.

MLIR 515 158 Updated Oct 31, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,848 729 Updated Oct 15, 2025

Bridging polyhedral analysis tools to the MLIR framework

C++ 117 23 Updated Sep 9, 2023