Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View yufenglee's full-sized avatar

Organizations

@microsoft

Block or report yufenglee

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Microsoft Linear Algebra Subroutines

C++ 10 4 Updated Oct 18, 2025

Generative AI extensions for onnxruntime

C++ 863 221 Updated Oct 24, 2025

Source code examples from the Parallel Forall Blog

HTML 1,305 641 Updated Sep 23, 2025

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 18,188 3,514 Updated Oct 25, 2025

A JIT assembler for x86/x64 architectures supporting FPU, MMX, SSE (1-4), AVX (1-2, 512), APX, and AVX10.2

C++ 2,194 293 Updated Sep 2, 2025

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Python 2,514 282 Updated Oct 24, 2025

The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.

C++ 1,657 606 Updated Oct 24, 2025

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 3,414 777 Updated Oct 25, 2025

Cross-platform, customizable ML solutions for live and streaming media.

C++ 31,723 5,578 Updated Oct 24, 2025

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 22,192 4,339 Updated Oct 23, 2025

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…

C++ 13,328 2,088 Updated Oct 17, 2025

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 33,794 3,214 Updated Oct 25, 2025

Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators

C 1,546 219 Updated Aug 28, 2019

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

C++ 1,460 672 Updated Oct 25, 2025

Low-precision matrix multiplication

C++ 1,817 456 Updated Jan 29, 2024

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Jupyter Notebook 14,534 3,369 Updated Aug 12, 2024

[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl

Cuda 1,798 463 Updated Oct 9, 2023
Cuda 22 13 Updated Jul 31, 2017

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 94,243 25,666 Updated Oct 25, 2025

Optimized primitives for collective multi-GPU communication

C++ 4,178 1,047 Updated Oct 18, 2025

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 12,270 2,258 Updated Sep 24, 2025

nGraph has moved to OpenVINO

C++ 1,343 217 Updated Oct 15, 2020

[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl

C++ 4,985 765 Updated Feb 8, 2024

A natural language modeling framework based on PyTorch

Python 6,318 795 Updated Oct 17, 2022

oneAPI Deep Neural Network Library (oneDNN)

C++ 3,903 1,078 Updated Oct 25, 2025

TensorFlow code and pre-trained models for BERT

Python 39,598 9,701 Updated Jul 23, 2024

Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX

Jupyter Notebook 2,484 458 Updated Sep 12, 2025

An Open Source Machine Learning Framework for Everyone

C++ 192,196 74,916 Updated Oct 25, 2025

Code samples for my book "Neural Networks and Deep Learning"

Python 17,152 6,923 Updated Jun 2, 2024

Open standard for machine learning interoperability

Python 19,770 3,815 Updated Oct 25, 2025