bobo-fun is a project that collects interesting AI open-source projects that run on a Mac (All-In-One).
All code has been tested on a MacBook Pro (M4 Max / 128GB RAM).
- Support QwenLM/Qwen2.5-Omni
- Support HiDream-ai/HiDream-I1
- Support Wan-Video/Wan2.1
- Support HunyuanVideo-I2V
To set up our environment, please run:
conda create --name bobo python=3.11
conda activate bobopip install --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/cpupip install -r requirements.txtbrew install ffmpegJust run you want as belows
python run_flux.pypip install git+https://github.com/huggingface/diffusers.git
python run_hidream.pypython run_glm.pypython run_phi.pypython run_deepseek_r1.pypython run_omnigen.pypython run_story_diffusion.pypython run_flow_edit.pyPYTORCH_ENABLE_MPS_FALLBACK=1 python run_rmbg.pypython run_cogvideo.pyPYTORCH_ENABLE_MPS_FALLBACK=1 python run_cogvideofun.py- LTX-Video with STGuidance
python run_ltxvideo.py- mochi ---------> PS. black video bug
python run_mochi.pypython run_hyvideo.pyPYTORCH_ENABLE_MPS_FALLBACK=1 python run_wan_video.pyconda remove pytorch torchvision torchaudio
conda install pytorch==2.5.1 torchvision==0.20.1 torchaudio==2.5.1 -c pytorch
pip install -r requirements.txt
python run_mimic_motion.pypython run_realesrgan.pypython run_rife.pypython run_kokoro_82m.pypython run_suno_bark.pypython run_mmaudio.pypip install git+https://github.com/huggingface/[email protected]
huggingface-cli login (for downloading model)
python run_qwen_omni.py-
目前几乎所有文生视频项目, 当分辨率或生成帧数较大时, 都有 total bytes of NDArray > 2**32 或 Invalid buffer size 报错问题。这似乎是 mac 本身内部实现问题, 暂无太好方法。
-
Mochi 目前有纯黑视频 bug。
Mochi: sample_model() ---> model_fn() ---> out_cond 和 out_uncond 模型采样值为 tensor([[nan, nan, ..., nan, nan]...]), 导致纯黑问题。
-
HiDream 生成的图片呈现奇怪的“五彩斑斓”, 有待研究原因
-
1. THUDM/glm-4-9b-chat (文生文)
-
2. microsoft/Phi-3.5-mini-instruct (文生文)
-
3. deepseek-ai/DeepSeek-R1-Distill (文生文)
-
4. flux.1 (文生图)
-
5. HiDream (文生图)
-
6. VectorSpaceLab/OmniGen (文生图)
-
7. HVision-NKU/StoryDiffusion (文生图, 擅长故事)
-
8. fallenshock/FlowEdit (修图)
-
9. briaai/RMBG-2.0 (图片背景剔除)
-
10. THUDM/CogVideo (文生视频)
-
11. aigc-apps/CogVideoX-Fun (文生视频)
-
12. Lightricks/LTX-Video (文生视频)
-
13. genmoai/mochi (文生视频)
-
14. Tencent/HunyuanVideo (文生视频)
-
15. Wan-Video/Wan2.1 (文生视频)
-
16. Tencent/MimicMotion (动作/跳舞)
-
17. xinntao/Real-ESRGAN (视频超分)
-
18. hzwer/ECCV2022-RIFE (视频插帧)
-
19. thewh1teagle/kokoro-onnx (文生音频)
-
20. suno-ai/bark (文生音频)
-
21. hkchengrex/MMAudio (文生音频, 擅长音效/音乐)
-
22. QwenLM/Qwen2.5-Omni (多模态)