Stars
SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.
中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Blockchain dark forest selfguard handbook. Master these, master the security of your cryptocurrency.
基于多智能体LLM的中文金融交易框架 - TradingAgents中文增强版
VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo
Enjoy the magic of Diffusion models!
[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing
👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
Text-audio foundation model from Boson AI
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
本仓库包含对 Claude Code v1.0.33 进行逆向工程的完整研究和分析资料。包括对混淆源代码的深度技术分析、系统架构文档,以及重构 Claude Code agent 系统的实现蓝图。主要发现包括实时 Steering 机制、多 Agent 架构、智能上下文管理和工具执行管道。该项目为理解现代 AI agent 系统设计和实现提供技术参考。
【Accepted by ACM MM'25 】MS-DETR: Towards Effective Video Moment Retrieval and Highlight Detection by Joint Motion-Semantic Learning
The ultimate training toolkit for finetuning diffusion models
[NeurIPS' 2025] JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent
[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt released! Only 4GB VRAM is enough to run!
Universal memory layer for AI Agents; Announcing OpenMemory MCP - local and secure memory management.
🚀 One-stop solution for creating your digital avatar from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. …
鸣潮 后台自动战斗 自动刷声骸 一键日常 Automation for Wuthering Waves
崩坏:星穹铁道 - 一条龙 Honkai Star Rail - One Dragon | 全日常自动 |
MAGI-1: Autoregressive Video Generation at Scale
Lets make video diffusion practical!
Thera: Aliasing-Free Arbitrary-Scale Super-Resolution with Neural Heat Fields
[NeurIPS 2025] OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs), capable of generating complex and detailed SVGs, from sim…
Production-ready platform for agentic workflow development.
📦BetterGI · 更好的原神 - 自动拾取 | 自动剧情 | 全自动钓鱼(AI) | 全自动七圣召唤 | 自动伐木 | 自动刷本 | 自动采集/挖矿/锄地 | 一条龙 | 全连音游 - UI Automation Testing Tools For Genshin Impact
Finetuning and inference tools for the CogView4 and CogVideoX model series.