TileAttention (TLA) is a high-performance attention kernel library built to run on the TileLang backend. It offers efficient, modular, and composable attention implementations optimized for AI workloads.
- Python 3.8+
- PyTorch >= 2.1
- TileLang
- Triton (optional, for selected fast kernels)
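Before installing, it can help to verify the prerequisites above. The snippet below is an illustrative check, not part of TLA; it assumes the TileLang backend is importable as `tilelang` and that you will run on a CUDA device, as in the quick-start example further down.

```python
import sys

import torch

# Illustrative prerequisite check; not part of the TLA package.
assert sys.version_info >= (3, 8), "Python 3.8+ is required"
major, minor = (int(x) for x in torch.__version__.split(".")[:2])
assert (major, minor) >= (2, 1), "PyTorch >= 2.1 is required"
assert torch.cuda.is_available(), "the quick-start example targets a CUDA device"

import tilelang  # assumed import name for the TileLang backend
```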
git clone https://github.com/tile-ai/TileAttention
cd TileAttention
git submodule update --init --recursive
pip install -e .

The following example runs the MLA kernel on randomly initialized inputs:

import torch
import tla
from tla import MLA_kernel
device = "cuda"
dtype = torch.float16
batch = 128
heads = 64
kv_heads = 1
kv_ctx = 8192
dim = 512
pe_dim = 64
# Query input: [batch, heads, dim]
q = torch.randn(batch, heads, dim, device=device, dtype=dtype)
# Query positional encoding: [batch, heads, pe_dim]
q_pe = torch.randn(batch, heads, pe_dim, device=device, dtype=dtype)
# KV cache input: [batch, kv_ctx, kv_heads, dim]
kv = torch.randn(batch, kv_ctx, kv_heads, dim, device=device, dtype=dtype)
# KV positional encoding: [batch, kv_ctx, kv_heads, pe_dim]
k_pe = torch.randn(batch, kv_ctx, kv_heads, pe_dim, device=device, dtype=dtype)
# Kernel tuning parameters: tile sizes block_N / block_H and the number of
# splits over the KV context (num_split)
block_N = 64
block_H = 64
num_split = 1

# Build and run the MLA kernel
mla = MLA_kernel(batch, heads, kv_heads, kv_ctx, dim, pe_dim, block_N, block_H, num_split)
out = mla(q, q_pe, kv, k_pe)
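To confirm the call works end to end, a simple CUDA-event timing loop is enough. The sketch below is not part of TLA; it only assumes that `mla(q, q_pe, kv, k_pe)` returns an output tensor, as in the call above.

```python
# Rough latency measurement for the call above (illustrative only).
warmup, iters = 3, 10
for _ in range(warmup):
    mla(q, q_pe, kv, k_pe)
torch.cuda.synchronize()

start = torch.cuda.Event(enable_timing=True)
end = torch.cuda.Event(enable_timing=True)
start.record()
for _ in range(iters):
    out = mla(q, q_pe, kv, k_pe)
end.record()
torch.cuda.synchronize()

print(f"output shape: {tuple(out.shape)}")
print(f"avg latency:  {start.elapsed_time(end) / iters:.3f} ms")
```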