-
Freelance
- China
-
14:46
(UTC +08:00) - @bdsqlsz
- https://ko-fi.com/bdsqlsz
Lists (9)
Sort Name ascending (A-Z)
Starred repositories
Official implementation of "LoFA: Learning to Predict Personalized Prior for Fast Adaptation of Visual Generative Models".
STEP-GUI: The top GUI agent solution in the galaxy. Developed by the StepFun-GELab team and powered by StepFun’s cutting-edge research capabilities.
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…
Native and Compact Structured Latents for 3D Generation
Taming large-scale few-step training with self-adversarial flows! 👏🏻
RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards.
The official repo for "Vidi: Large Multimodal Models for Video Understanding and Editing"
[ICML 2025] Gaussian Mixture Flow Matching Models (GMFlow)
微舆:人人可用的多Agent舆情分析助手,打破信息茧房,还原舆情原貌,预测未来走向,辅助决策!从0实现,不依赖任何框架。
Open-Source AI Presentation Generator and API (Gamma, Beautiful AI, Decktopus Alternative)
FIBO is a SOTA, first open-source, JSON-native text-to-image model built for controllable, predictable, and legally safe image generation.
Transform your 3D texturing workflow with the power of generative AI, directly within Blender!
The official implementation of SDPose: Exploiting Diffusion Priors for Out-of-Domain and Robust Pose Estimation
Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference
Instant Skinned Gaussian Avatars for Web, Mobile and VR Applications
💾 Self-hosted online file converter. Supports 1000+ formats ⚙️
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …
A family of highly efficient, lightweight yet powerful optimizers.
「来剪」轻量级视频编辑器。网页版、桌面版等均可免费使用,功能灵感源自 CapCut 等编辑器。A Lightweight Video Editor. Free for the web, desktop, and more, with features inspired by editors like CapCut.
The official code repository for SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement
Implementation of "S^2-Guidance: Stochastic Self Guidance for Training-Free Enhancement of Diffusion Models"
Source code for the X Recommendation Algorithm
Tiny AutoEncoder for Hunyuan Video (and other video models)
Scalable group inference for generating high quality and diverse images with diffusion models.