Stars
Use PEFT or full-parameter training for CPT/SFT/DPO/GRPO on 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
Python tool for converting files and office documents to Markdown.
Transforms complex documents like PDFs into LLM-ready Markdown/JSON for your agentic workflows.
Repo for NAACL 2025 Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization"
verl: Volcano Engine Reinforcement Learning for LLMs
Free and Open Source, Distributed, RESTful Search Engine
Multi-agent framework, runtime and control plane. Built for speed, privacy, and scale.
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
DeepEP: an efficient expert-parallel communication library
🤖 A WeChat bot built on WeChaty and AI services such as DeepSeek / ChatGPT / Kimi / iFLYTEK. It can auto-reply to WeChat messages for you, manage WeChat groups/friends, detect zombie fans (contacts who have deleted you), and more...
FlashMLA: Efficient Multi-head Latent Attention Kernels
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
Fully open reproduction of DeepSeek-R1
Fully open data curation for reasoning models
[ICLR 2025] Released code for paper "Spurious Forgetting in Continual Learning of Language Models"
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving a 3x+ generation speedup on reasoning tasks
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
A project for training a large language model from scratch, covering pretraining, fine-tuning, and direct preference optimization; the model has 1B parameters and supports both Chinese and English.
OCR, layout analysis, reading order, table recognition in 90+ languages
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …
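The last entry describes swapping OpenAI for a self-hosted model by redirecting the client. A minimal sketch of that idea, assuming a locally running Xinference server that exposes an OpenAI-compatible endpoint; the URL, port, and model name below are illustrative assumptions, not verified defaults:

```python
# Sketch: point the standard OpenAI client at a local Xinference endpoint
# instead of api.openai.com. Endpoint URL and model name are assumptions.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:9997/v1",  # assumed local Xinference endpoint
    api_key="not-needed",                 # local servers typically ignore the key
)

response = client.chat.completions.create(
    model="qwen2.5-instruct",  # whichever model you have launched in Xinference (assumption)
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)
```

The rest of the application code stays unchanged; only the client construction line is redirected to the local server.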