
chiro2001
💭
I may be slow to respond.

Organizations

@CrosSt-Chat @crarch


Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels

C++ 3,745 281 Updated Oct 28, 2025

Contexts Optical Compression

Python 18,097 1,183 Updated Oct 25, 2025

A Lightweight Recommendation System

Python 8,973 688 Updated Oct 13, 2025
C++ 2 1 Updated Aug 27, 2025

Arm C Language Extensions (ACLE)

Python 115 65 Updated Oct 24, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…

Python 2,859 533 Updated Oct 28, 2025

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 3,323 277 Updated Jul 17, 2025

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 2,209 183 Updated Mar 27, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 19,448 3,186 Updated Oct 28, 2025

Applied AI experiments and examples for PyTorch

Python 301 29 Updated Aug 22, 2025

Simple Go utility to download HuggingFace Models and Datasets

Go 754 87 Updated Sep 5, 2025

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 9,943 1,658 Updated Oct 27, 2025

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

Python 2,474 424 Updated Oct 24, 2025

A gem5 experimental repo in order to explore Data-dependent Access (DDA).

C++ 8 6 Updated Jun 5, 2024
Python 205 26 Updated May 5, 2025
Python 14 3 Updated Dec 5, 2024

BLAS-like Library Instantiation Software Framework

C 2,543 402 Updated Oct 21, 2025

symmetric int8 gemm

Assembly 67 13 Updated Jun 7, 2020

row-major matmul optimization

C++ 682 94 Updated Aug 20, 2025

Modelling DRAMs with Petrinets

JavaScript 11 5 Updated Jan 7, 2020

Fast and accurate DRAM power and energy estimation tool

C++ 181 53 Updated Oct 6, 2025

Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and evaluation of new memory system designs (e.g., new DRAM stan…

C++ 418 107 Updated Oct 20, 2025

A Fast and Extensible DRAM Simulator, with built-in support for modeling many different DRAM technologies including DDRx, LPDDRx, GDDRx, WIOx, HBMx, and various academic proposals. Described in the…

C++ 665 214 Updated Aug 29, 2023

DRAMSys is a SystemC TLM-2.0 based DRAM simulator.

C++ 311 74 Updated Sep 22, 2025

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 35,591 5,056 Updated Oct 24, 2025

Portable header-only C++ low level SIMD library

C++ 1,291 130 Updated Aug 26, 2024

Agenium Scale vectorization library for CPUs and GPUs

C 334 30 Updated Oct 21, 2021
C# 1 Updated Sep 17, 2020

A visualization and comparison tool for DRAMSim2

C# 5 4 Updated Jun 29, 2011