Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View adam-smnk's full-sized avatar

Block or report adam-smnk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This repository contains companion software for the Colfax Research paper "Categorical Foundations for CuTe Layouts".

Python 71 2 Updated Sep 24, 2025

A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.

Python 517 52 Updated Oct 27, 2025

Meta project around MLIR

Python 17 4 Updated Oct 27, 2025

Kimi K2 is the large language model series developed by Moonshot AI team

8,394 550 Updated Sep 11, 2025

Templight is a Clang-based tool to profile the time and memory consumption of template instantiations and to perform interactive debugging sessions to gain introspection into the template instantia…

C++ 784 41 Updated Dec 7, 2024

🇪🇺 💶 Generate e-invoices (E-Rechnung in German) conforming to EN16931 (Factur-X/ZUGFeRD, UBL, CII, XRechnung aka X-Rechnung) from LibreOffice Calc/Excel data or JSON.

TypeScript 106 14 Updated Oct 26, 2025

Next-generation JavaScript analysis tooling

C++ 99 7 Updated Oct 25, 2025

Distributed Compiler based on Triton for Parallel Systems

Python 1,200 98 Updated Oct 17, 2025

Custom Bindings for Enzyme Automatic Differentiation Tool and Interfacing with JAX.

MLIR 100 20 Updated Oct 27, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,370 660 Updated Oct 27, 2025
Python 143 12 Updated Dec 27, 2024

Intel® Tensor Processing Primitives extension for Pytorch*

C++ 17 10 Updated Oct 5, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 3,740 277 Updated Oct 27, 2025

This is a repository listing companies which offer full-time remote jobs with Spanish contracts

2,674 211 Updated Oct 20, 2025

A feature-rich command-line audio/video downloader

Python 132,876 10,666 Updated Oct 27, 2025

KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problems

Python 631 77 Updated Oct 24, 2025

Efficient Triton Kernels for LLM Training

Python 5,773 419 Updated Oct 27, 2025

A modern model graph visualizer and debugger

JavaScript 1,324 131 Updated Oct 27, 2025

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,649 186 Updated Jun 25, 2024

MLIR-based partitioning system

MLIR 142 24 Updated Oct 27, 2025

Type in Morse code by repeatedly slamming your laptop shut

Shell 2,410 24 Updated Apr 28, 2020

A set of short tests designated to check Intel GPU SW environment and ability to execute user-generated code.

C++ 3 Updated Jan 10, 2024

The Linux Kernel Module Programming Guide (updated for 5.0+ kernels)

TeX 8,194 590 Updated Sep 28, 2025

Best practice for training LLaMA models in Megatron-LM

Python 659 56 Updated Jan 2, 2024

Ongoing research training transformer models at scale

Python 13,968 3,190 Updated Oct 27, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 48,180 8,077 Updated Dec 9, 2024

[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling

Python 917 46 Updated Apr 30, 2025

An experimental CPU backend for Triton

MLIR 154 31 Updated Oct 20, 2025

GIM: Learning Generalizable Image Matcher From Internet Videos (ICLR 2024 Spotlight)

Python 811 55 Updated Aug 3, 2025

PArallelLOOPgEneratoR: Threaded Loops Code Generation Infrastructure targeting Tensor Contraction Applications such as GEMMs, Convolutions and Fused Deep Learning Primitives

C++ 19 5 Updated May 29, 2025
Next