Stars
[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.
Krita is a free and open source cross-platform application that offers an end-to-end solution for creating digital art files from scratch built on the KDE and Qt frameworks.
FlashVSR:Towards Real-Time Diffusion-Based Streaming Video Super-Resolution,you can use it in comfyUI
Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny conditional de…
lihaoyun6 / FlashVSR_plus
Forked from OpenImagingLab/FlashVSRTowards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny conditional de…
A sparse attention kernel supporting mix sparse patterns
[ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints
[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
Unlimited-length talking video generation that supports image-to-video and video-to-video generation
Official Pytorch Implementation for "Time-to-Move: Training-Free Motion Controlled Video Generation via Dual-Clock Denoising"
[NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance
Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching
Wan: Open and Advanced Large-Scale Video Generative Models
MoCha: End-to-End Video Character Replacement without Structural Guidance
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
This project is to remove the watermark from the sora2 generated videos, with best quality.
HunyuanVideoFoley generates SFX audio to match your video and text prompt
In order to make it easier to use the ComfyUI, I have made some optimizations and integrations to some commonly used nodes.
Kronos: A Foundation Model for the Language of Financial Markets
FaceCat-Kronos是由 花卷猫量化研究团队 打造的一款金融量化工具。本项目基于清华大学最新开源的K线预测模型 Kronos,融合了前沿的人工智能技术,旨在为金融市场提供科学的分析与预测能力。 本工具能够对股票历史数据进行深度预训练,实现精准的做市商K线规划,并对未来市场走势进行科学推演,适用于量化研究、策略研发、交易决策支持、投研汇报、教学演示、二次开发。无论是基金、私募、荐股机构…
Bytebot is a self-hosted AI desktop agent that automates computer tasks through natural language commands, operating within a containerized Linux desktop environment.
🦜🔗 The platform for reliable agents.
AI-powered reverse engineering assistant that bridges IDA Pro with language models through MCP.
The open-source CapCut alternative