Stars
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).
Reference PyTorch implementation and models for DINOv3
This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
LLM Finetuning with peft
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR team.
1st Place Solution for LLM - Detect AI Generated Text Kaggle Competition
Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT, ReRanker.
AI-Powered Python & Python-Powered AI (Python-Use)
A Framework of Small-scale Large Multimodal Models
Natural Language Processing Best Practices & Examples
all kinds of text classification models and more with deep learning
Open-source search and retrieval database for AI applications.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
Curated list of datasets and tools for post-training.
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
🌈Python3网络爬虫实战:淘宝、京东、网易云、B站、12306、抖音、笔趣阁、漫画小说下载、音乐电影下载等
[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。