We provide official PyTorch implementation for:
WAIT: Feature Warping for Animation to Illustration video Translation using GANs
- (March, 2025) Our paper is accepted to Neurocomputing!
Teaser:
Dataset Stats:
Sample Images:
WAIT:
Here we compare the WAIT results with baseline methods. From left the right;
Input, CycleGAN, OpticalFlowWarping, ReCycleGAN, ReCycleGANv2, WAIT
WAIT results on AS Style:
WAIT results on BP Style:
- Linux, macOS or Windows
- Python 3.6+
- NVIDIA GPU
- CUDA 11.1
- CuDNN 8.0.5
Please refer to datasets.md for details.
- Clone this repo:
git clone https://github.com/giddyyupp/wait.git
cd wait- Install PyTorch 1.5+ and torchvision from http://pytorch.org and other dependencies (e.g., visdom and dominate). You can install all the dependencies by
pip install -r requirements.txt- Build Deformable Conv layers:
cd models/deform_conv
python setup.py install develop
- 
Download a GANILLA illustrator dataset and corresponding animation movies (e.g. BP). For illustration dataset please follow the steps explained in Ganilla repository. For animations, we use Peter Rabbit movie to curate BP dataset, and videos from ZOG Youtube channel for AS dataset. 
- 
Train a model: 
python train.py --dataroot ./datasets/bp_dataset --name bp_wait --model cycle_gan_warp --netG resnet_9blocks \ 
--centerCropSize 256 --resize_or_crop resize_and_centercrop --batch_size 8 --lr 0.0008 --niter_decay 200 --verbose \ 
--norm_warp "batch" --use_warp_speed_ups --rec_bug_fix --final_conv --merge_method "concat" --time_gap 2 \ 
--offset_network_block_cnt 8 --warp_layer_cnt 5
- 
To view training results and loss plots, run python -m visdom.serverand click the URL http://localhost:8097. To see more intermediate results, check out./checkpoints/bp_wait/web/index.html
- 
Test the model: With assigning correct variables to dataset, EXP_ID, and backbone; 
#!./scripts/test_warp_models.sh ./datasets/"$dataset" $EXP_ID $backbone $dataset --norm_warp "batch" --rec_bug_fix --use_warp_speed_ups --final_conv --merge_method "concat"or
python test.py --dataroot ./datasets/bp_dataset --name bp_wait --model cycle_gan_warp --netG resnet_9blocks \ 
--centerCropSize 256 --resize_or_crop center_crop --no_flip --phase test --epoch 200 --time_gap 0 --norm_warp "batch" \
--rec_bug_fix --final_conv --merge_method "concat" --offset_network_block_cnt 8 --warp_layer_cnt 5
The test results will be saved to a html file here: ./results/bp_wait/latest_test/index.html.
- To calculate FID & MSE you can directly use our scripts in scripts/metricsdirectory.
cd scripts/metrics
./calculate_FID_batch.sh path_to_source path_to_result
We put 2 helper scripts in the metrics/FWE folder, just copy paste them to the main directory of the above repo.
More specifically, lets assume you would like to calculate a model trained on AS dataset, and you run the inference and want to calculate the FWE.
First, rename your generated images so that they start from 000000.png and end at 001234.png using:
python rename_video_frames.py
here you need to fix data_dir and target_dir parameters based on your needs.
Second, if your testA images are not in the same order as mentioned above, again fix the input parameters and run python rename_video_frames.py again on the test images.
Now we should put the real and generated images to correct folders to proceed with flow occlusion and FWE calculations.
For generated images, create a folder in the path ./data/test/wait/fwe/AS/testA and put all the renamed generated images there.
For real images, create a folder in the path ./data/test/input/AS/testA and put all the renamed real images there.
Now you have real and generated images at the same order and renamed properly and put in the correct folders. Lets calculate flow occlusions. The script to use is compute_flow_occlusion.py.
If you inspect it, there is one required parameter which is -dataset. For our case it is AS. Create a .txt file in the path ./lists/AS_test.txt and just put testA in the first line and save. Run below to generate flow occlusions.
python compute_flow_occlusion.py -dataset "AS" 
Finally, run below command to compute the FWE score.
python evaluate_WarpError.py -method "wait" -task fwe -dataset AS
- We also provide a single script to calculate all metrics in single run.
cd scripts/
./calculate_metrics_all.sh path_to_wait_repo exp_name dataset_name path_to_fwe_repo
You can find more scripts at scripts directory.
- You can download our pretrained models (AS, BP in the paper, and IRVI traffic from I2V-GAN using following GDrive.
Put the pretrained models under ./checkpoints/.
- Then generate the results for BP dataset using:
python test.py --dataroot ./datasets/bp_dataset --name bp_wait --model cycle_gan_warp --netG resnet_9blocks \ 
--centerCropSize 256 --resize_or_crop center_crop --no_flip --phase test --epoch 200 --time_gap 0 --norm_warp "batch" \
--rec_bug_fix --final_conv --merge_method "concat" --offset_network_block_cnt 8 --warp_layer_cnt 5The results will be saved at ./results/. Use --results_dir {directory_path_to_save_result} to specify the results directory.
We also experimented with a dataset from another domain, infrared Traffic dataset presented in the I2V-GAN. Below table depicts a comparison between WAIT and the method presented in I2V-GAN. These results show the effectiveness of our method on another domain, Infrared to RGB video translation.
| Method | FID | KID | FWE | MSE | 
|---|---|---|---|---|
| I2V-GAN | 43.59 | 0.0212 | 0.002057 | 8144.97 | 
| WAIT | 43.68 | 0.0227 | 0.002438 | 8336.48 | 
If you use this code for your research, please cite our papers.
@article{HICSONMEZ_WAIT,
      title = {WAIT: Feature warping for animation to illustration video translation using GANs},
      journal = {Neurocomputing},
      volume = {637},
      year = {2025},
      issn = {0925-2312},
      author = {Samet Hicsonmez and Nermin Samet and Fidan Samet and Oguz Bakir and Emre Akbas and Pinar Duygulu}
}
Our code is heavily inspired by GANILLA.
The numerical calculations reported in this work were fully performed at TUBITAK ULAKBIM, High Performance and Grid Computing Center (TRUBA resources).