Lists (1)
Sort Name ascending (A-Z)
Stars
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
《大模型白盒子构建指南》:一个全手搓的Tiny-Universe
Dora is an experiment management framework. It expresses grid searches as pure python files as part of your repo. It identifies experiments with a unique hash signature. Scale up to hundreds of exp…
nateraw / audiocraft
Forked from facebookresearch/audiocraftAudiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
This is a cog implementation of the fine-tuner for Meta's MusicGen
Generative models for conditional audio generation
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…
Yi-1.5 is an upgraded version of Yi, delivering stronger performance in coding, math, reasoning, and instruction-following capability.
A series of large language models trained from scratch by developers @01-ai
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Code and documentation to train Stanford's Alpaca models, and generate the data.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
[OpenPAR] An open-source framework for Pedestrian Attribute Recognition, based on PyTorch
OpenMMLab Detection Toolbox and Benchmark
The code of the paper "NExT-Chat: An LMM for Chat, Detection and Segmentation".
[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
convert dataset to coco/voc format
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
Use lmdb to speed up imagenet dataset
This repository provides data for the VAW dataset as described in the CVPR 2021 paper titled "Learning to Predict Visual Attributes in the Wild" and the ECCV 2022 paper titled "Improving Closed and…