Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View shidong-ai's full-sized avatar
:octocat:
:octocat:

Highlights

  • Pro

Block or report shidong-ai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A flexible, high-performance, user-friendly computer architecture simulator engine

Go 91 25 Updated Nov 7, 2025

NVDLA SW

C++ 508 213 Updated Jan 28, 2021

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 246 75 Updated Nov 14, 2025

MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.

C++ 5,029 824 Updated Jun 17, 2024

A package for Multiple Kernel Learning in Python

Python 131 46 Updated Apr 3, 2023

Repo for counting stars and contributing. Press F to pay respect to glorious developers.

274,803 21,017 Updated Aug 22, 2025

TensorFlow ROCm port

C++ 698 100 Updated Nov 13, 2025

[DEPRECATED] Moved to ROCm/rocm-libraries repo

C++ 388 191 Updated Nov 12, 2025

[DEPRECATED] Moved to ROCm/rocm-libraries repo

C++ 132 67 Updated Nov 13, 2025

Efficient GPU kernels for block-sparse matrix multiplication and convolution

Cuda 1,061 199 Updated Jun 8, 2023

a software library containing Sparse functions written in OpenCL

C++ 175 62 Updated Feb 21, 2020

[DEPRECATED] Moved to ROCm/rocm-libraries repo

Python 254 167 Updated Nov 6, 2025

A benchmark framework for Tensorflow

Python 1,148 633 Updated Oct 6, 2023

Models and examples built with TensorFlow

Python 77,671 45,420 Updated Nov 6, 2025

(Deprecated) hipCaffe: the HIP port of Caffe

C++ 124 25 Updated May 1, 2024

oneAPI Deep Neural Network Library (oneDNN)

C++ 3,917 1,083 Updated Nov 14, 2025

An Open Source Machine Learning Framework for Everyone

C++ 192,427 74,990 Updated Nov 14, 2025

A Fast and Extensible DRAM Simulator, with built-in support for modeling many different DRAM technologies including DDRx, LPDDRx, GDDRx, WIOx, HBMx, and various academic proposals. Described in the…

C++ 668 215 Updated Aug 29, 2023

Caffe2 is a lightweight, modular, and scalable deep learning framework.

Shell 8,404 1,923 Updated Feb 7, 2023

Benchmarking Deep Learning operations on different hardware

C++ 1,101 242 Updated Apr 25, 2021

The Radeon Compute Profiler (RCP) is a performance analysis tool that gathers data from the API run-time and GPU for OpenCL™ and ROCm/HSA applications. This information can be used by developers to…

C++ 85 15 Updated Jun 16, 2020

Real Time Face Detection

C++ 2 1 Updated Sep 11, 2018

C4.5 Decision Tree python implementation with validation, pruning, and attribute multi-splitting

Python 85 39 Updated Jun 21, 2018

scikit-learn: machine learning in Python

Python 64,019 26,434 Updated Nov 14, 2025

Visualizations for machine learning datasets

Jupyter Notebook 7,379 888 Updated May 24, 2023

HIP: C++ Heterogeneous-Compute Interface for Portability

C++ 4,240 572 Updated Nov 13, 2025

[DEPRECATED] Moved to ROCm/rocm-libraries repo

Assembly 1,185 270 Updated Nov 13, 2025

Optimized primitives for collective multi-GPU communication

C++ 4,224 1,066 Updated Nov 10, 2025

A Benchmark Suite for Heterogeneous System Computation

Jupyter Notebook 54 15 Updated Feb 20, 2025

Benchmarks of Deep Neural Networks

C++ 39 19 Updated May 19, 2021
Next