ymwangg

Yanming W. ymwangg

20 followers · 11 following

@aws

Achievements

nki-autotune Public
Forked from awslabs/nki-autotune

Python Apache License 2.0 Updated Oct 1, 2025
tokenizers Public
Forked from huggingface/tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust Apache License 2.0 Updated May 13, 2025
vllm-test Public

Misc test and benchmark code for vllm

Python 1 Updated Nov 20, 2024
BetterChatGPT Public
Forked from ztjhz/BetterChatGPT

An amazing UI for OpenAI's ChatGPT (Website + Windows + MacOS + Linux)

TypeScript Creative Commons Zero v1.0 Universal Updated Aug 14, 2024
vllm Public
Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 3 Apache License 2.0 Updated Jul 30, 2024
djl-serving Public
Forked from deepjavalibrary/djl-serving

A universal scalable machine learning model deployment solution

Java Apache License 2.0 Updated May 22, 2024
flash-attention Public
Forked from Dao-AILab/flash-attention

Fast and memory-efficient exact attention

Python 1 BSD 3-Clause "New" or "Revised" License Updated May 17, 2024
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python MIT License Updated Feb 2, 2024
PipeEdge Public
Forked from usc-isi/PipeEdge

PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices

Python BSD 3-Clause "New" or "Revised" License Updated Jan 31, 2024
ColossalAI-Documentation Public
Forked from hpcaitech/ColossalAI-Documentation

Documentation for Colossal-AI

JavaScript Apache License 2.0 Updated Jan 16, 2024
llm-awq Public
Forked from mit-han-lab/llm-awq

AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python MIT License Updated Sep 14, 2023
llama.cpp Public
Forked from ggml-org/llama.cpp

Port of Facebook's LLaMA model in C/C++

C MIT License Updated Sep 14, 2023
pytorch Public
Forked from pytorch/pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python Other Updated Sep 8, 2023
text-generation-webui Public
Forked from oobabooga/text-generation-webui

A Gradio web UI for Large Language Models. Supports transformers, GPTQ, llama.cpp (ggml/gguf), Llama models.

Python GNU Affero General Public License v3.0 Updated Aug 29, 2023
AITemplate Public
Forked from facebookincubator/AITemplate

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Python Apache License 2.0 Updated Jul 31, 2023
ColossalAI Public
Forked from hpcaitech/ColossalAI

Making large AI models cheaper, faster and more accessible

Python Apache License 2.0 Updated Jul 20, 2023
alpa Public
Forked from alpa-projects/alpa

Training and serving large-scale neural networks

Python Apache License 2.0 Updated May 19, 2023
transformers Public
Forked from huggingface/transformers

🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.

Python 1 Apache License 2.0 Updated Mar 21, 2023
xla Public
Forked from pytorch/xla

Enabling PyTorch on Google TPU

C++ 1 Other Updated Mar 15, 2023
accelerate Public
Forked from huggingface/accelerate

🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision

Python Apache License 2.0 Updated Feb 28, 2023
amazon-textract-transformer-pipeline Public

Python 1 Other Updated Oct 22, 2022
tensorflow-fork Public
Forked from tensorflow/tensorflow

An Open Source Machine Learning Framework for Everyone

C++ Apache License 2.0 Updated Oct 20, 2022
detr Public
Forked from facebookresearch/detr

End-to-End Object Detection with Transformers

Python 1 Apache License 2.0 Updated Oct 18, 2022
tensorflow Public

C++ 1 Apache License 2.0 Updated Aug 11, 2022
maskrcnn-benchmark Public
Forked from facebookresearch/maskrcnn-benchmark

Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.

Python MIT License Updated Mar 3, 2022
mlas Public

Assembly 3 3 MIT License Updated Feb 3, 2022
tvm Public
Forked from apache/tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python Apache License 2.0 Updated Jan 25, 2022
bench-bert Public

Jupyter Notebook 2 Updated Nov 11, 2021
TLCBench Public

Python Updated Jun 10, 2021
mlas-old Public

Assembly 1 Updated Jun 2, 2021

Yanming W. ymwangg

Achievements

Achievements

nki-autotune Public

Uh oh!

tokenizers Public

Uh oh!

vllm-test Public

Uh oh!

BetterChatGPT Public

Uh oh!

vllm Public

Uh oh!

djl-serving Public

Uh oh!

flash-attention Public

Uh oh!

lm-evaluation-harness Public

Uh oh!

PipeEdge Public

Uh oh!

ColossalAI-Documentation Public

Uh oh!

llm-awq Public

Uh oh!

llama.cpp Public

Uh oh!

pytorch Public

Uh oh!

text-generation-webui Public

Uh oh!

AITemplate Public

Uh oh!

ColossalAI Public

Uh oh!

alpa Public

Uh oh!

transformers Public

Uh oh!

xla Public

Uh oh!

accelerate Public

Uh oh!

amazon-textract-transformer-pipeline Public

Uh oh!

tensorflow-fork Public

Uh oh!

detr Public

Uh oh!

tensorflow Public

Uh oh!

maskrcnn-benchmark Public

Uh oh!

mlas Public

Uh oh!

tvm Public

Uh oh!

bench-bert Public

Uh oh!

TLCBench Public

Uh oh!

mlas-old Public

Uh oh!