NouamaneTazi

Nouamane Tazi NouamaneTazi

ML Research Engineer

428 followers · 197 following

Achievements

x4 x3 x3

Achievements

x4 x3 x3

Highlights

Developer Program Member

Organizations

Lists (11)

Sort

Starred repositories

mawaqit / home-assistant

Mawaqit integration - salat time and nearest mosque - in Home Assistant

Python 95 21 Updated Oct 18, 2024

Dao-AILab / sonic-moe

Accelerating MoE with IO and Tile-aware Optimizations

Python 544 43 Updated Jan 14, 2026

open-lm-engine / lm-engine

LM engine is a library for pretraining/finetuning LLMs

Python 109 24 Updated Jan 12, 2026

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 8,893 1,065 Updated Dec 29, 2025

IST-DASLab / marlin

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python 985 83 Updated Sep 4, 2024

allenai / asta-paper-finder

frozen-in-time version of our Paper Finder agent for reproducing evaluation results

Python 220 23 Updated Aug 20, 2025

huggingface / text-clustering

Easily embed, cluster and semantically label text datasets

Python 589 46 Updated Mar 28, 2024

huggingface / evaluation-guidebook

Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!

Jupyter Notebook 2,035 118 Updated Dec 3, 2025

ByteDance-Seed / Triton-distributed

Distributed Compiler based on Triton for Parallel Systems

Python 1,314 116 Updated Dec 27, 2025

gaogaotiantian / viztracer

A debugging and profiling tool that can trace and visualize python code execution

Python 7,509 471 Updated Jan 12, 2026

alibaba / ChatLearn

A flexible and efficient training framework for large-scale alignment tasks

Python 447 39 Updated Oct 23, 2025

meta-pytorch / torchcomms

torchcomms: a modern PyTorch communications API

C++ 321 64 Updated Jan 16, 2026

eth-cscs / UserLabDay

CSCS User Lab Day – Meet the Swiss National Supercomputing Centre

Jupyter Notebook 12 10 Updated Sep 12, 2025

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 40,362 5,203 Updated Jan 16, 2026

NVIDIA / nccl-tests

NCCL Tests

Cuda 1,403 342 Updated Jan 15, 2026

thinking-machines-lab / tinker-cookbook

Post-training with Tinker

Python 2,737 296 Updated Jan 15, 2026

perplexityai / libfabric-efa-demo

C++ 72 13 Updated Feb 10, 2025

esnet / iperf

iperf3: A TCP, UDP, and SCTP network bandwidth measurement tool

C 8,193 1,393 Updated Jan 16, 2026

NVIDIA / nvbandwidth

A tool for bandwidth measurements on NVIDIA GPUs.

C++ 607 68 Updated Apr 15, 2025

aws-samples / awsome-distributed-training

Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.

Shell 384 159 Updated Jan 16, 2026

deepseek-ai / profile-data

Analyze computation-communication overlap in V3/R1.

1,134 144 Updated Mar 21, 2025

Victarry / PP-Schedule-Visualization

Pipeline Parallelism Emulation and Visualization

Python 75 8 Updated Jan 8, 2026

ISEEKYAN / mbridge

Bridge Megatron-Core to Hugging Face/Reinforcement Learning

Python 183 45 Updated Jan 9, 2026

antgroup / DeepXTrace

DeepXTrace is a lightweight tool for precisely diagnosing slow ranks in DeepEP-based environments.

Python 87 7 Updated Jan 16, 2026

illuin-tech / colpali

The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

Python 2,450 226 Updated Jan 8, 2026

jlfwong / speedscope

🔬 A fast, interactive web-based viewer for performance profiles.

TypeScript 6,432 306 Updated Dec 11, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 18,416 3,046 Updated Jan 16, 2026

Dao-AILab / quack

A Quirky Assortment of CuTe Kernels

Python 749 73 Updated Jan 14, 2026

reframe-hpc / reframe

A powerful Python framework for writing and running portable regression tests and benchmarks for HPC systems.

Python 263 118 Updated Jan 14, 2026

Narsil / safetensors_distributed

Rust 2 1 Updated Jun 16, 2025

amazon-sagemaker-lab

PyTorch

Hacktoberfest

Windows

Vue.js

Ubuntu

TypeScript

Terminal

Tensorflow

SQL

See all starred topics

Nouamane Tazi NouamaneTazi

Highlights

Organizations

Lists (11)

💻 Competitive programming

👁️ Computer Vision

🧑‍⚕️Interpretability

Knowledge Graphs

🧠 Machine Learning

Misc

📖 NLP

PINN

🏭 Production

🗣️Speech

TinyML