- pandalla.ai
- Hangzhou
- https://scholar.google.com/citations?user=QRV7CjgAAAAJ
- @Jiaxi_Cui
Stars
The official implementation of the paper “VChain: Chain-of-Visual-Thought for Reasoning in Video Generation”
Industry leading face manipulation platform
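TrendPublish: fully automated AI content generation and publishing system | WeChat Official Account automation | Multi-source data scraping (Twitter/X, websites) | DeepseekAI, Qwen, and iFlytek models | Intelligent content analysis and ranking | Scheduled publishing | Multi-template support | Node.js | TypeScript | AI technology trend tracking tool
AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or pictures, producing output files at the original resolution. No third-party API is required; everything runs locally.
Proxy setup adapted for AutoDL platform servers, using Clash as the proxy tool
One-click generation of product marketing and general-content short videos, AI batch automatic clipping, and a polished cross-platform desktop tool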
Forked vLLM that supports higgs-audio model
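🎬 Kaka Subtitle Assistant (卡卡字幕助手) | VideoCaptioner - an LLM-based intelligent subtitle assistant covering the full workflow of video subtitle generation, sentence segmentation, correction, and subtitle translation. An LLM-powered tool for easy and efficient video subtitling.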
Text-audio foundation model from Boson AI
EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation
Open Agent Coding CLI: Koding with GLM, Qwen, Kimi, DeepSeek, etc. (you are welcome to use Kode to submit PRs)
This repository delivers end-to-end, code-first tutorials covering every layer of production-grade GenAI agents, guiding you from spark to scale with proven patterns and reusable blueprints for re…
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
JimmyMa99 / train-higgs-audio
Forked from boson-ai/higgs-audio. Text-audio foundation model from Boson AI
Turn any browser into your terminal & command your agents on the go.
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Control Claude Code remotely via email, Discord, and Telegram. Start tasks locally, receive notifications when Claude completes them, and send new commands by simply replying to emails.
CRS: a self-hosted Claude Code mirror and one-stop open-source relay service that unifies access to Claude, OpenAI, Gemini, and Droid subscriptions, supports pooled account sharing to split costs more efficiently, and works seamlessly with native tools.
Multi-channel AI proxy with intelligent key rotation.
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
The open-source CapCut alternative
OpenAI-compatible API server for Apple on-device models
Agentic RAG R1 Framework via Reinforcement Learning
Video translation and dubbing tool powered by LLMs. The video translator offers 100 language translations and one-click full-process deployment. The video translation output is optimized for platfo…
Two conversational AI agents switching from English to sound-level protocol after confirming they are both AI agents