Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Latest commit

Β 

History

History
32 lines (21 loc) Β· 1.5 KB

File metadata and controls

32 lines (21 loc) Β· 1.5 KB

πŸŽ“ Hexagon-MLIR Tutorials

Welcome to the Hexagon-MLIR tutorials! These hands-on examples will guide you through the process of writing, compiling, and executing Triton kernels and PyTorch models on Qualcomm Hexagon NPUs.

πŸƒβ€β™‚οΈ Quick Start

πŸ“– Start with Triton Tutorials

πŸ“– Start with PyTorch Tutorials

πŸš€ What You'll Learn

These tutorials demonstrate how to leverage Qualcomm's Hexagon NPU targets for AI workloads. You'll discover how to:

Triton Kernels

  • Write Triton Kernels: Create kernels that run efficiently on Qualcomm Hexagon NPUs
  • Understand the Compilation Pipeline: Follow your code from Python through multiple IR transformations to optimized machine code
  • Optimize Performance: Leverage specific features like multi-threading, vector processing, and memory hierarchy optimization
  • Debug and Profile: Use built-in tools to analyze and improve your kernel performance

PyTorch Models

  • Use PyTorch Flow: Take PyTorch models and compile and execute in our flow
  • Understand the Compilation Pipeline: Follow your code from Python through multiple IR transformations to optimized machine code

πŸ› οΈ Prerequisites

Before diving into the tutorials, make sure you have:

  • βœ… Hexagon-MLIR framework installed (Installation Guide)
  • βœ… Python environment with required dependencies
  • βœ… Access to Hexagon hardware or simulator
  • βœ… Basic understanding of Python and tensor operations