-
Notifications
You must be signed in to change notification settings - Fork 10
Open
Description
Version
v0.1.0
Feature
- Python API refactor
- Gemm [Feature Request] Gemm #59
- Multi-Head Attention [Feature Request] Multi-Head Attention #60
- Group-Query Attention [Feature Request] Group Query Attention #61
- Multi-Head Attention Decode [Feature Request] Multi-Head Attention Decode #62
- Group-Query Attention Decode [Feature Request] Group-Query Attention Decode #63
- Multi-Head Latent Attention Decode [Feature Request] Multi-Head Latent Attention Decode #64
- DeepSeek Sparse Attention Decode [Feature Request] DeepSeek Sparse Attention Decode #65
- Benchmark
- CI
- Docs [Feature Request] Python API Documentation #66
- Readme
Metadata
Metadata
Assignees
Labels
No labels