Stars
KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
Scripting tool for downloading Dify plugin package from Dify Marketplace and Github and repackaging [true] offline package.
Next-generation AI Agent Optimization Platform: Cozeloop addresses challenges in AI agent development by providing full-lifecycle management capabilities from development, debugging, and evaluation…
An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.
Retrieval and Retrieval-augmented LLMs
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Community maintained hardware plugin for vLLM on Ascend
NVIDIA DeepStream SDK 8.0 / 7.1 / 7.0 / 6.4 / 6.3 / 6.2 / 6.1.1 / 6.1 / 6.0.1 / 6.0 / 5.1 implementation for YOLO models
Sample apps to demonstrate how to deploy models trained with TAO on DeepStream
DeepStream SDK Python bindings and sample applications
DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, AlignedOTA, and distillation enhancement.
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
Python framework that facilitates the quick development of complex video analysis applications and other series-processing based applications in a multiprocessing environment.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
Production-ready platform for agentic workflow development.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
⏩ Ship faster with Continuous AI. Build and run custom agents across your IDE, terminal, and CI
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
VS Code extension for CodeGeeX
超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
A modular graph-based Retrieval-Augmented Generation (RAG) system
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)