Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View keroro824's full-sized avatar

Block or report keroro824

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 26,188 1,845 Updated Jan 9, 2026

MAGI-1: Autoregressive Video Generation at Scale

Python 3,628 230 Updated Jun 17, 2025

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 1,173 68 Updated Aug 7, 2025

LLM Inference on consumer devices

Python 128 15 Updated Mar 17, 2025

[ICLR2025 Spotlight] MagicPIG: LSH Sampling for Efficient LLM Generation

Python 246 17 Updated Dec 16, 2024

A large-scale information-rich web dataset, featuring millions of real clicked query-document labels

345 18 Updated Dec 16, 2024

scalable and robust tree-based speculative decoding algorithm

Python 366 37 Updated Jan 28, 2025

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ 1,854 248 Updated Jan 17, 2026

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 3,418 288 Updated Jul 17, 2025

[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

Python 501 74 Updated Aug 1, 2024

SkyAGI: Emerging human-behavior simulation capability in LLM

TypeScript 787 55 Updated Sep 21, 2023

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,982 2,217 Updated Jul 29, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,271 4,024 Updated Jul 17, 2024
Jupyter Notebook 660 88 Updated Sep 17, 2025

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX 69,272 7,391 Updated Jan 16, 2026

DSPy: The framework for programming—not prompting—language models

Python 31,617 2,569 Updated Jan 19, 2026

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,379 589 Updated Oct 28, 2024

Exploring finetuning public checkpoints on filter 8K sequences on Pile

Python 116 14 Updated Mar 22, 2023

A framework for few-shot evaluation of language models.

Python 11,224 2,970 Updated Jan 16, 2026

Vienna Graph Clustering

C++ 17 3 Updated Nov 12, 2025
C++ 107 28 Updated Oct 19, 2023

User-friendly secure computation engine based on secure multi-party computation

Rust 377 6 Updated Aug 4, 2023

Python package built to ease deep learning on graph, on top of existing DL frameworks.

Python 14,220 3,060 Updated Jul 31, 2025

Reformer, the efficient Transformer, in Pytorch

Python 2,193 256 Updated Jun 21, 2023

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 32,100 6,648 Updated Sep 30, 2025

Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"

Python 1,607 191 Updated Aug 12, 2020

Pytorch implementation of the image transformer for unconditional image generation

Python 118 31 Updated Jul 25, 2024

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Python 16,914 3,709 Updated Jun 2, 2023

Implementations of several fast approximate algorithms for geometric optimal transport (OT)

C++ 117 8 Updated Apr 26, 2020
Next