Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View retrogyro's full-sized avatar

Block or report retrogyro

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 1,261 111 Updated Nov 11, 2025

Transformer related optimization, including BERT, GPT

C++ 6,346 921 Updated Mar 27, 2024

"Everyday life is like programming, I guess. If you love something you can put beauty into it." ― Donald E. Knuth

2,119 549 Updated Jun 21, 2024

Machine Learning Engineering Open Book

Python 15,708 961 Updated Oct 27, 2025

πŸ”₯ Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.

2,598 178 Updated Aug 5, 2025

The Book of Statistical Proofs

HTML 385 78 Updated Nov 7, 2025
Python 1,198 114 Updated Oct 9, 2025

😎 Awesome list of Infrastructure-from-Code

28 3 Updated May 28, 2024

A concise but complete full-attention transformer with a set of promising experimental features from various papers

Python 5,677 487 Updated Nov 7, 2025

Vector (and Scalar) Quantization, in Pytorch

Python 3,679 300 Updated Nov 12, 2025

H-Net: Hierarchical Network with Dynamic Chunking

Python 778 92 Updated Sep 30, 2025

Foundation Architecture for (M)LLMs

Python 3,118 221 Updated Apr 11, 2024

An implementation of local windowed attention for language modeling

Python 485 51 Updated Jul 16, 2025

πŸš€ Efficient implementations of state-of-the-art linear attention models

Python 3,824 299 Updated Nov 12, 2025

Large Context Attention

Python 749 52 Updated Oct 13, 2025

Torch implementation of ResNet from http://arxiv.org/abs/1512.03385 and training scripts

Lua 2,348 666 Updated Aug 24, 2022

Useful resources on data quality for machine learning and artificial intelligence.

21 Updated Apr 14, 2025
Python 136 11 Updated Jul 27, 2025

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

Python 13,249 1,758 Updated Nov 3, 2025
Python 619 56 Updated Oct 24, 2025

A collection of research papers on low-precision training methods

43 2 Updated May 10, 2025

Quantized Neural Networks - networks trained for inference at arbitrary low precision.

Python 147 43 Updated Nov 28, 2017

This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.

C++ 192 80 Updated Nov 12, 2025

Ring attention implementation with flash attention

Python 909 88 Updated Sep 10, 2025
Python 4,166 448 Updated Jul 31, 2025

official code for "Large Language Models as Optimizers"

Python 653 85 Updated Dec 4, 2024

TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients. Published in Nature.

Python 3,076 255 Updated Jul 25, 2025

Triton-based Symmetric Memory operators and examples

Python 62 10 Updated Oct 17, 2025

Write a fast kernel and run it on Discord. See how you compare against the best!

Python 61 19 Updated Nov 11, 2025

A curated list of awesome resources combining Transformers with Neural Architecture Search

268 30 Updated Jun 15, 2023
Next