Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View tmm1's full-sized avatar

Sponsoring

@mattn
@jart
@mschoch
@joshdholtz
@BurntSushi
@matthuisman
@ziglang
@formkit

Highlights

  • Pro

Organizations

@rubinius @postrank-labs @graphite-project @fancybits

Block or report tmm1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

ML

25 repositories

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,929 383 Updated Mar 14, 2024

Curated list of useful LLM / Analytics / Datascience resources

2,567 220 Updated Apr 25, 2025

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

C++ 76,980 8,303 Updated May 27, 2025

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

C++ 1,554 122 Updated Mar 23, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,325 4,780 Updated Jun 2, 2025

LLM inference in C/C++

C++ 91,850 14,197 Updated Dec 23, 2025
Python 1,498 113 Updated May 12, 2023

Awesome AI Coding

745 64 Updated Dec 19, 2025

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,986 2,216 Updated Jul 29, 2024

Locally run an Instruction-Tuned Chat-Style LLM

C 10,192 874 Updated Apr 19, 2023

A school for camelids

Python 1,206 74 Updated May 1, 2023

Finetuning large language models for GDScript generation.

Python 558 26 Updated May 26, 2023

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 12,637 1,300 Updated Dec 17, 2025

Binding to transformers in ggml

C++ 64 11 Updated Dec 11, 2025

A system for prompted weak supervision. Alfred is a powerful tool that leverages large language models to accelerate data annotation.

Python 56 8 Updated Apr 3, 2025

Official repository for LongChat and LongEval

Python 533 31 Updated May 24, 2024

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 5,012 529 Updated Apr 11, 2025

JAX implementation of the Llama 2 model

Python 216 24 Updated Feb 2, 2024

Landmark Attention: Random-Access Infinite Context Length for Transformers

Python 427 36 Updated Dec 20, 2023

Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk

C++ 14,101 1,216 Updated Oct 29, 2025

Burn is a next generation tensor library and Deep Learning Framework that doesn't compromise on flexibility, efficiency and portability.

Rust 13,713 756 Updated Dec 22, 2025

A Rust implementation of OpenAI's Whisper model using the burn framework

Rust 336 48 Updated May 6, 2024

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,760 622 Updated Feb 21, 2025

Convert PDF to markdown + JSON quickly with high accuracy

Python 30,506 2,076 Updated Nov 19, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,335 1,450 Updated Nov 28, 2025