Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View kdw4537's full-sized avatar

Block or report kdw4537

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 63,039 11,266 Updated Nov 14, 2025

ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction (NIPS'24)

Python 44 6 Updated Dec 17, 2024

Artifact for paper "PIM is All You Need: A CXL-Enabled GPU-Free System for LLM Inference", ASPLOS 2025

Python 102 17 Updated May 3, 2025

GlazeWM is a tiling window manager for Windows inspired by i3wm.

Rust 10,497 311 Updated Nov 13, 2025

[HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design

Python 123 13 Updated Jun 27, 2023

A co-design architecture on sparse attention

Python 53 4 Updated Aug 23, 2021

AMBA bus generator including AXI, AHB, and APB

C 106 45 Updated Jul 29, 2021

Fast and memory-efficient exact attention

Python 20,518 2,135 Updated Nov 13, 2025

SiT Dataset: Socially Interactive Pedestrian Trajectory Dataset for Social Navigation Robots [NeurIPS 2023]

Python 70 6 Updated Oct 17, 2024

Verilog AXI components for FPGA implementation

Verilog 1,848 510 Updated Feb 27, 2025

IC implementation of TPU

Verilog 134 29 Updated Dec 18, 2019

Fast and accurate DRAM power and energy estimation tool

C++ 186 53 Updated Oct 6, 2025

A Fast and Extensible DRAM Simulator, with built-in support for modeling many different DRAM technologies including DDRx, LPDDRx, GDDRx, WIOx, HBMx, and various academic proposals. Described in the…

C++ 668 215 Updated Aug 29, 2023

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

LLVM 35,397 15,182 Updated Nov 14, 2025

A FPGA supported RISC-V CPU with 5-stage pipeline implemented in Verilog HDL

C 93 14 Updated Dec 5, 2019

Inference code for AI Challenge (Dec 2020)

Jupyter Notebook 6 Updated Feb 22, 2022

TernGEMM: General Matrix Multiply Library with Ternary Weights for Fast DNN Inference

C++ 13 1 Updated Feb 22, 2022

Layer-wise Pruning of Transformer Heads for Efficient Language Modeling

Python 22 1 Updated Feb 22, 2022
Python 9 2 Updated Nov 4, 2022

YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet )

C 22,169 7,949 Updated Aug 28, 2025

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 56,018 17,327 Updated Nov 9, 2025