Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View makoton27's full-sized avatar

Block or report makoton27

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

local version for OmniAICreator/Anime-Llasa-3B-Captions-Demo

Python 6 2 Updated Oct 28, 2025

[Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

Python 442 36 Updated Oct 29, 2025

A Deep Learning Approach for Password Guessing (https://arxiv.org/abs/1709.00440)

Python 1,930 382 Updated Feb 24, 2023
Python 3 Updated Oct 28, 2025
Python 1,149 106 Updated Oct 11, 2025

UniVid: The Open-Source Unified Video Model

Python 24 Updated Oct 13, 2025

Infinite-Forcing: Towards Infinite-Long Video Generation

Python 88 2 Updated Oct 22, 2025

Kandinsky 5.0: A family of diffusion models for Video & Image generation

Python 175 11 Updated Nov 1, 2025

Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation

Python 568 31 Updated Oct 2, 2025

Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time

Python 202 4 Updated Oct 31, 2025

High-Quality Text-to-Video Generation with Alpha Channel

Python 268 20 Updated Oct 1, 2025
JavaScript 2 1 Updated Sep 28, 2025

Lynx: Towards High-Fidelity Personalized Video Generation

Python 279 34 Updated Sep 26, 2025
Python 24 3 Updated Mar 29, 2025

EasyLlasa は 5~15秒の日本語音声と日本語テキストから日本語音声を生成する TSTS (TextSpeechToSpeech) です。

Python 20 2 Updated Sep 29, 2025

Enable true multi gpu capability in Comfy UI using XDiT XFuser and FSDP

Python 177 18 Updated Oct 31, 2025

OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models

141 12 Updated Sep 24, 2025

ComfyUI nodes for WanAnimate model input preprocessing

Python 295 22 Updated Oct 6, 2025

Cloud replacement for vacuum robots enabling local-only operation

JavaScript 8,034 435 Updated Nov 2, 2025

Ace-Step Dataset Generator

Python 8 2 Updated Sep 27, 2025

MiMo-Audio: Audio Language Models are Few-Shot Learners

Python 830 84 Updated Sep 20, 2025

Cross-platform Plist Editor

C++ 485 46 Updated Feb 2, 2024

RAM is all you need

Python 210 21 Updated Nov 5, 2025

Unofficial WIP LoRa Finetuning repository for VibeVoice

Python 246 62 Updated Sep 24, 2025

VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)

Python 681 270 Updated Oct 27, 2025

Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Learning.

Python 300 10 Updated Oct 16, 2025

Long-form streaming TTS system for multi-speaker dialogue generation

Python 1,188 106 Updated Oct 26, 2025

HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning

Python 797 92 Updated Oct 19, 2025
Next