Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View NouamaneTazi's full-sized avatar

Organizations

@huggingface @Hugging-Face-Supporter @embeddings-benchmark

Block or report NouamaneTazi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Mawaqit integration - salat time and nearest mosque - in Home Assistant

Python 95 21 Updated Oct 18, 2024

Accelerating MoE with IO and Tile-aware Optimizations

Python 544 43 Updated Jan 14, 2026

LM engine is a library for pretraining/finetuning LLMs

Python 109 24 Updated Jan 12, 2026

DeepEP: an efficient expert-parallel communication library

Cuda 8,893 1,065 Updated Dec 29, 2025

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python 985 83 Updated Sep 4, 2024

frozen-in-time version of our Paper Finder agent for reproducing evaluation results

Python 220 23 Updated Aug 20, 2025

Easily embed, cluster and semantically label text datasets

Python 589 46 Updated Mar 28, 2024

Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!

Jupyter Notebook 2,035 118 Updated Dec 3, 2025

Distributed Compiler based on Triton for Parallel Systems

Python 1,314 116 Updated Dec 27, 2025

A debugging and profiling tool that can trace and visualize python code execution

Python 7,509 471 Updated Jan 12, 2026

A flexible and efficient training framework for large-scale alignment tasks

Python 447 39 Updated Oct 23, 2025

torchcomms: a modern PyTorch communications API

C++ 321 64 Updated Jan 16, 2026

CSCS User Lab Day – Meet the Swiss National Supercomputing Centre

Jupyter Notebook 12 10 Updated Sep 12, 2025

The best ChatGPT that $100 can buy.

Python 40,362 5,203 Updated Jan 16, 2026

NCCL Tests

Cuda 1,403 342 Updated Jan 15, 2026

Post-training with Tinker

Python 2,737 296 Updated Jan 15, 2026

iperf3: A TCP, UDP, and SCTP network bandwidth measurement tool

C 8,193 1,393 Updated Jan 16, 2026

A tool for bandwidth measurements on NVIDIA GPUs.

C++ 607 68 Updated Apr 15, 2025

Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.

Shell 384 159 Updated Jan 16, 2026

Analyze computation-communication overlap in V3/R1.

1,134 144 Updated Mar 21, 2025

Pipeline Parallelism Emulation and Visualization

Python 75 8 Updated Jan 8, 2026

Bridge Megatron-Core to Hugging Face/Reinforcement Learning

Python 183 45 Updated Jan 9, 2026

DeepXTrace is a lightweight tool for precisely diagnosing slow ranks in DeepEP-based environments.

Python 87 7 Updated Jan 16, 2026

The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

Python 2,450 226 Updated Jan 8, 2026

🔬 A fast, interactive web-based viewer for performance profiles.

TypeScript 6,432 306 Updated Dec 11, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 18,416 3,046 Updated Jan 16, 2026

A Quirky Assortment of CuTe Kernels

Python 749 73 Updated Jan 14, 2026

A powerful Python framework for writing and running portable regression tests and benchmarks for HPC systems.

Python 263 118 Updated Jan 14, 2026
Next