sparkyigniter95

sparkyigniter95

37 followers · 1.4k following

Starred repositories

google-deepmind / learning-to-learn

Learning to Learn in TensorFlow

Python 4,064 606 Updated Jun 29, 2021

lima-vm / lima

Linux virtual machines, with a focus on running containers

Go 18,262 712 Updated Oct 24, 2025

stepfun-ai / Step-Audio2

Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation.

Python 1,178 83 Updated Sep 22, 2025

Kwai-Klear / KlearReasoner

Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization

Python 77 9 Updated Sep 28, 2025

NousResearch / atropos

Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments

Python 731 167 Updated Oct 25, 2025

stepfun-ai / StepFun-Prover-Preview

Large language models designed for formal theorem proving through tool-integrated reasoning.

29 Updated Aug 13, 2025

ars22 / scaling-LLM-math-synthetic-data

Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"

31 Updated Jun 16, 2024

meta-pytorch / torchforge

PyTorch-native post-training at scale

Python 363 34 Updated Oct 25, 2025

vibevoice-community / VibeVoice

VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)

Python 635 256 Updated Oct 22, 2025

rsxdalv / TTS-WebUI

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, …

TypeScript 2,676 282 Updated Oct 23, 2025

huggingface / lighteval

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,032 368 Updated Oct 21, 2025

voicepowered-ai / VibeVoice-finetuning

Unofficial WIP LoRa Finetuning repository for VibeVoice

Python 233 59 Updated Sep 24, 2025

blackfeather-wang / AdaFocus

Reducing spatial redundancy in video recognition. SOTA computational efficiency.

Python 126 17 Updated Dec 15, 2024

jianghaojun / Awesome-Parameter-Efficient-Transfer-Learning

A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.

410 26 Updated Sep 26, 2024

SandAI-org / MAGI-1

MAGI-1: Autoregressive Video Generation at Scale

Python 3,524 208 Updated Jun 17, 2025

ymcui / Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,933 1,876 Updated Jul 15, 2025

ymcui / Chinese-LLaMA-Alpaca-2

中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

Python 7,173 569 Updated Jul 15, 2025

yule-BUAA / MergeLM

Codebase for Merging Language Models (ICML 2024)

Python 853 51 Updated May 5, 2024

openai / prm800k

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 2,060 122 Updated Jun 1, 2023

togethercomputer / RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,834 366 Updated Dec 7, 2024

lamini-ai / lamini

The Official Python Client for Lamini's API

Python 2,543 154 Updated Apr 7, 2025

Gengzigang / TokenSet

Official PyTorch implementation of TokenSet.

Python 126 1 Updated Mar 21, 2025

karpathy / minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,081 966 Updated Jul 1, 2024

NVIDIA / TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…

Python 2,851 529 Updated Oct 25, 2025