
mktn makoton27

6 followers · 85 following

Starred repositories

asfdrwe / Anime-Llasa-3B-Captions-Demo

local version for OmniAICreator/Anime-Llasa-3B-Captions-Demo

Python 6 2 Updated Oct 28, 2025

drozbay / ComfyUI-WanVaceAdvanced

Python 59 5 Updated Oct 30, 2025

EzioBy / Ditto

[Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

Python 442 36 Updated Oct 29, 2025

brannondorsey / PassGAN

A Deep Learning Approach for Password Guessing (https://arxiv.org/abs/1709.00440)

Python 1,930 382 Updated Feb 24, 2023

laksjdjf / llasa-trainer

Python 3 Updated Oct 28, 2025

character-ai / Ovi

Python 1,149 106 Updated Oct 11, 2025

AIGeeksGroup / UniVid

UniVid: The Open-Source Unified Video Model

Python 24 Updated Oct 13, 2025

SOTAMak1r / Infinite-Forcing

Forked from guandeh17/Self-Forcing

Infinite-Forcing: Towards Infinite-Long Video Generation

Python 88 2 Updated Oct 22, 2025

ai-forever / Kandinsky-5

Kandinsky 5.0: A family of diffusion models for Video & Image generation

Python 175 11 Updated Nov 1, 2025

nv-tlabs / lyra

Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation

Python 568 31 Updated Oct 2, 2025

TencentARC / RollingForcing

Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time

Python 202 4 Updated Oct 31, 2025

WeChatCV / Wan-Alpha

High-Quality Text-to-Video Generation with Alpha Channel

Python 268 20 Updated Oct 1, 2025

jupo-ai / comfy-jupo-prompt-preset

JavaScript 2 1 Updated Sep 28, 2025

bytedance / lynx

Lynx: Towards High-Fidelity Personalized Video Generation

Python 279 34 Updated Sep 26, 2025

p1atdev / danbot-comfy-node

Python 24 3 Updated Mar 29, 2025

Zuntan03 / EasyLlasa

EasyLlasa is a TSTS (TextSpeechToSpeech) tool that generates Japanese speech from 5 to 15 seconds of Japanese audio and Japanese text.

Python 20 2 Updated Sep 29, 2025

eddyhhlure1Eddy / auto_wan2.2animate_freamtowindow_server

to server only

Python 55 1 Updated Sep 20, 2025

komikndr / raylight

Enable true multi-GPU capability in ComfyUI using XDiT XFuser and FSDP

Python 177 18 Updated Oct 31, 2025

Phantom-video / OmniInsert

OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models

141 12 Updated Sep 24, 2025

kijai / ComfyUI-WanAnimatePreprocess

ComfyUI nodes for WanAnimate model input preprocessing

Python 295 22 Updated Oct 6, 2025

Hypfer / Valetudo

Cloud replacement for vacuum robots enabling local-only operation

JavaScript 8,034 435 Updated Nov 2, 2025

methmx83 / Ace-Step_Data-Tool

Ace-Step Dataset Generator

Python 8 2 Updated Sep 27, 2025

XiaomiMiMo / MiMo-Audio

MiMo-Audio: Audio Language Models are Few-Shot Learners

Python 830 84 Updated Sep 20, 2025

ic005k / Xplist

Cross-platform Plist Editor

C++ 485 46 Updated Feb 2, 2024

lodestone-rock / RamTorch

RAM is all you need

Python 210 21 Updated Nov 5, 2025

voicepowered-ai / VibeVoice-finetuning

Unofficial WIP LoRA fine-tuning repository for VibeVoice

Python 246 62 Updated Sep 24, 2025

vibevoice-community / VibeVoice

VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)

Python 681 270 Updated Oct 27, 2025

HorizonWind2004 / reconstruction-alignment

Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Learning.

Python 300 10 Updated Oct 16, 2025

FireRedTeam / FireRedTTS2

Long-form streaming TTS system for multi-speaker dialogue generation

Python 1,188 106 Updated Oct 26, 2025

Phantom-video / HuMo

HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning

Python 797 92 Updated Oct 19, 2025

Starred topics

loupedeck

loupedeck-plugin