Stars
A flexible, high-performance, user-friendly computer architecture simulator engine
ROCm / pytorch
Forked from pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.
A package for Multiple Kernel Learning in Python
Repo for counting stars and contributing. Press F to pay respect to glorious developers.
ROCm / tensorflow-upstream
Forked from tensorflow/tensorflow
TensorFlow ROCm port
Efficient GPU kernels for block-sparse matrix multiplication and convolution
A software library containing sparse functions written in OpenCL
Models and examples built with TensorFlow
oneAPI Deep Neural Network Library (oneDNN)
An Open Source Machine Learning Framework for Everyone
A Fast and Extensible DRAM Simulator, with built-in support for modeling many different DRAM technologies including DDRx, LPDDRx, GDDRx, WIOx, HBMx, and various academic proposals. Described in the…
Caffe2 is a lightweight, modular, and scalable deep learning framework.
Benchmarking Deep Learning operations on different hardware
The Radeon Compute Profiler (RCP) is a performance analysis tool that gathers data from the API run-time and GPU for OpenCL™ and ROCm/HSA applications. This information can be used by developers to…
C4.5 Decision Tree Python implementation with validation, pruning, and attribute multi-splitting
scikit-learn: machine learning in Python
Visualizations for machine learning datasets
HIP: C++ Heterogeneous-Compute Interface for Portability
[DEPRECATED] Moved to ROCm/rocm-libraries repo
Optimized primitives for collective multi-GPU communication
A Benchmark Suite for Heterogeneous System Computation