-
heyi_composable_kernel Public
Forked from ROCm/composable_kernelComposable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
C++ Other UpdatedOct 23, 2025 -
aiter Public
Forked from ROCm/aiterAI Tensor Engine for ROCm
Python MIT License UpdatedOct 23, 2025 -
-
-
-
-
A high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedSep 11, 2025 -
-
-
-
-
-
-
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
Python BSD 3-Clause "New" or "Revised" License UpdatedJan 16, 2025