Lists (3)
Sort Name ascending (A-Z)
Stars
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.
chang-wenbin / cutlass
Forked from NVIDIA/cutlassCUDA Templates for Linear Algebra Subroutines
chang-wenbin / FastDeploy
Forked from PaddlePaddle/FastDeployHigh-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle
High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle
chang-wenbin / Paddle
Forked from PaddlePaddle/PaddlePArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
chang-wenbin / triton
Forked from triton-lang/tritonDevelopment repository for the Triton language and compiler
chang-wenbin / PaddleNLP
Forked from PaddlePaddle/PaddleNLP👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search…
chang-wenbin / PaddleMIX
Forked from PaddlePaddle/PaddleMIXPaddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high …
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation