AniDoc: Animation Creation Made Easier

anidoc_video.mp4

AniDoc: Animation Creation Made Easier

Yihao Meng^1,2, Hao Ouyang², Hanlin Wang^3,2, Qiuyu Wang², Wen Wang^4,2, Ka Leong Cheng^1,2 , Zhiheng Liu⁵, Yujun Shen², Huamin Qu^†,2
¹HKUST ²Ant Group ³NJU ⁴ZJU ⁵HKU ^†corresponding author

AniDoc colorizes a sequence of sketches based on a character design reference with high fidelity, even when the sketches significantly differ in pose and scale.

Strongly recommend seeing our demo page.

Showcases:

Flexible Usage:

Same Reference with Varying Sketches

Satoru Gojo from Jujutsu Kaisen

Same Sketch with Different References.

Anya Forger from Spy x Family

TODO List

Release the paper and demo page. Visit https://yihao-meng.github.io/AniDoc_demo/
Release the inference code.
Build Gradio Demo
Release the training code.
Release the sparse sketch setting interpolation code.

Requirements:

The training is conducted on 8 A100 GPUs (80GB VRAM), the inference is tested on RTX 5000 (32GB VRAM). In our test, the inference requires about 14GB VRAM.

Environment

All the tests are conducted in Linux. We suggest running our code in Linux. To set up our environment in Linux, please run:

sudo apt-get update && sudo apt-get install cbm git-lfs ffmpeg
conda create -n anidoc python=3.8 -y
conda activate anidoc
pip install ipykernel
python -m ipykernel install --user --name anidoc --display-name "anidoc"

git clone https://huggingface.co/spaces/svjack/AniDoc && cd AniDoc
pip install -r requirements.txt
python gradio_app_with_frames.py

OR

git clone https://github.com/svjack/AniDoc.git
cd AniDoc
bash install.sh

Checkpoints

please download the pre-trained stable video diffusion (SVD) checkpoints from here, and put the whole folder under pretrained_weight, it should look like ./pretrained_weights/stable-video-diffusion-img2vid-xt
please download the checkpoint for our Unet and ControlNet from here, and put the whole folder as ./pretrained_weights/anidoc.
please download the co_tracker checkpoint from here and put it as ./pretrained_weights/cotracker2.pth.

mkdir pretrained_weights

git clone https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt
cp -r stable-video-diffusion-img2vid-xt pretrained_weights

git clone https://huggingface.co/Yhmeng1106/anidoc
cp -r anidoc/anidoc pretrained_weights

#git clone https://huggingface.co/facebook/cotracker
#cp -r cotracker/cotracker2.pth pretrained_weights
wget https://huggingface.co/facebook/cotracker/resolve/main/cotracker2.pth?download=true -O cotracker2.pth
cp cotracker2.pth pretrained_weights

Video prepare

from moviepy.editor import VideoFileClip, ImageSequenceClip
import os
import shutil
import numpy as np

def extract_frames_and_save(input_video_path, output_video_path, target_frames=14):
    """
    将视频的总帧数调整为指定的帧数，并保存到本地。

    :param input_video_path: 输入视频文件的路径
    :param output_video_path: 输出视频文件的路径
    :param target_frames: 目标帧数（默认为 14）
    """
    try:
        # 创建临时路径
        temp_dir = "temp_frames"
        if not os.path.exists(temp_dir):
            os.makedirs(temp_dir)

        # 加载视频文件
        video_clip = VideoFileClip(input_video_path)

        # 提取所有帧并保存到临时路径
        total_frames = int(video_clip.fps * video_clip.duration)
        for i, frame in enumerate(video_clip.iter_frames()):
            frame_path = os.path.join(temp_dir, f"frame_{i:04d}.png")
            if i < total_frames:  # 只保存有效帧
                video_clip.save_frame(frame_path, t=i / video_clip.fps)

        # 选取 14 帧
        frame_files = sorted(os.listdir(temp_dir))
        selected_frames = np.linspace(0, len(frame_files) - 1, target_frames, dtype=int)
        selected_frame_files = [frame_files[i] for i in selected_frames]

        # 读取选取的帧
        frames = [os.path.join(temp_dir, frame) for frame in selected_frame_files]

        # 合并为新视频
        new_clip = ImageSequenceClip(frames, fps=video_clip.fps)
        new_clip.write_videofile(output_video_path, codec="libx264")

        print(f"视频已成功保存到: {output_video_path}")
    except Exception as e:
        print(f"处理视频时出错: {e}")
    finally:
        # 关闭视频对象
        if 'video_clip' in locals():
            video_clip.close()
        if 'new_clip' in locals():
            new_clip.close()

        # 删除临时路径
        if os.path.exists(temp_dir):
            shutil.rmtree(temp_dir)

# 示例调用
input_video = "刻晴摇_short.mp4"  # 替换为你的输入视频文件路径
output_video = "刻晴摇_short_14fps.mp4"  # 替换为你的输出视频文件路径
extract_frames_and_save(input_video, output_video, target_frames=14)

Generate Your Animation!

To colorize the target lineart sequence with a specific character design, you can run the following command:

python scripts_infer/anidoc_inference.py --all_sketch --matching --tracking --control_image '刻晴摇_short_14fps.mp4' --ref_image '刻晴白背景.png' --output_dir 'results' --max_point 10

OR

bash  scripts_infer/anidoc_inference.sh

We provide some test cases in data_test folder. You can also try our model with your own data. You can change the lineart sequence and corresponding character design in the script anidoc_inference.sh, where --control_image refers to the lineart sequence and --ref_image refers to the character design.

Citation:

Don't forget to cite this source if it proves useful in your research!

@article{meng2024anidoc,
      title={AniDoc: Animation Creation Made Easier},
      author={Yihao Meng and Hao Ouyang and Hanlin Wang and Qiuyu Wang and Wen Wang and Ka Leong Cheng and Zhiheng Liu and Yujun Shen and Huamin Qu},
      journal={arXiv preprint arXiv:2412.14173},
      year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
LightGlue		LightGlue
cotracker		cotracker
data_test		data_test
datasets		datasets
figure		figure
lineart_extractor/annotator		lineart_extractor/annotator
models_diffusers		models_diffusers
pipelines		pipelines
scripts_infer		scripts_infer
.gitignore		.gitignore
README.md		README.md
install.sh		install.sh
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AniDoc: Animation Creation Made Easier

Showcases:

Flexible Usage:

Same Reference with Varying Sketches

Same Sketch with Different References.

TODO List

Requirements:

Environment

Checkpoints

Video prepare

Generate Your Animation!

Citation:

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AniDoc: Animation Creation Made Easier

Showcases:

Flexible Usage:

Same Reference with Varying Sketches

Same Sketch with Different References.

TODO List

Requirements:

Environment

Checkpoints

Video prepare

Generate Your Animation!

Citation:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages