- Ubuntu 18.04.1
- Python 3.7
- Do not use an RTX 4090; use a 2080 Ti (the CUDA 11.3 / torch 1.11 stack below does not support the 4090).
What I used:
- conda create -n faceformer python=3.7
- conda activate faceformer
- pip install torch==1.11.0+cu113 torchvision==0.12.0+cu113 torchaudio==0.11.0 --extra-index-url https://download.pytorch.org/whl/cu113
- Do not use pyopengl 3.1.0; it fails with a "fail render" error. Use 3.1.4 instead. pip will warn that pyrender requires pyopengl 3.1.0, but that warning can be ignored.
- pip install -r requirements.txt
- pip install pyopengl==3.1.4
- conda install ffmpeg
- sudo apt update
- sudo apt-get install libboost-dev
- git clone https://github.com/MPI-IS/mesh.git
- cd mesh
- Empty the mesh repo's requirements.txt
- python -m pip install pip==22.2.1
- pip install pyyaml zmq
- BOOST_INCLUDE_DIRS=/path/to/boost/include make all
- make tests
- If the test output ends with OK (skipped=5), the installation should be fine.
- After switching to a different conda environment, redo the libboost-dev step and rebuild mesh in the new environment.
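
A quick sanity check of the environment (a minimal sketch; it only assumes the packages installed above are in the active faceformer environment):

```python
# Verify the key runtime dependencies import before running any demo or training.
import torch
import pyrender                  # imports pyopengl; works fine with pyopengl 3.1.4
from psbody.mesh import Mesh     # built from the MPI-IS/mesh repo above

print("torch", torch.__version__, "| CUDA available:", torch.cuda.is_available())
print("pyrender", pyrender.__version__, "| psbody.mesh OK")
```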
Place data_verts.npy, raw_audio_fixed.pkl, templates.pkl and subj_seq_to_idx.pkl in the folder VOCASET.
Download "FLAME_sample.ply" from voca and put it in VOCASET/templates.
Request the BIWI dataset from Biwi 3D Audiovisual Corpus of Affective Communication. The dataset contains the following subfolders:
- 'faces' contains the binary (.vl) files for the tracked facial geometries.
- 'rigid_scans' contains the templates stored as .obj files.
- 'audio' contains audio signals stored as .wav files.
Place the folders 'faces' and 'rigid_scans' in BIWI and place the wav files in BIWI/wav.
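
Optionally, the BIWI templates can be checked the same way (a sketch; the .obj file name under rigid_scans is a placeholder, as the actual names depend on the corpus release). The vertex count times 3 should match the --vertice_dim 70110 (23370 vertices) used in the commands below.

```python
from psbody.mesh import Mesh

# Placeholder name: pick any template from BIWI/rigid_scans.
tmpl = Mesh(filename="BIWI/rigid_scans/F1.obj")
print("BIWI template vertices:", tmpl.v.shape[0], "-> vertice_dim =", tmpl.v.shape[0] * 3)
```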
Download the pretrained models from biwi.pth and vocaset.pth. Put the pretrained models under BIWI and VOCASET folders, respectively. Given the audio signal,

- to animate a mesh in BIWI topology, run:

    python demo.py --model_name biwi --wav_path "demo/wav/test.wav" --dataset BIWI --vertice_dim 70110 --feature_dim 128 --period 25 --fps 25 --train_subjects "F2 F3 F4 M3 M4 M5" --test_subjects "F1 F5 F6 F7 F8 M1 M2 M6" --condition M3 --subject M1

- to animate a mesh in FLAME topology, run:

    python demo.py --model_name vocaset --wav_path "demo/wav/test.wav" --dataset vocaset --vertice_dim 15069 --feature_dim 64 --period 30 --fps 30 --train_subjects "FaceTalk_170728_03272_TA FaceTalk_170904_00128_TA FaceTalk_170725_00137_TA FaceTalk_170915_00223_TA FaceTalk_170811_03274_TA FaceTalk_170913_03279_TA FaceTalk_170904_03276_TA FaceTalk_170912_03278_TA" --test_subjects "FaceTalk_170809_00138_TA FaceTalk_170731_00024_TA" --condition FaceTalk_170913_03279_TA --subject FaceTalk_170809_00138_TA

This script will automatically generate the rendered videos in the demo/output folder. You can also put your own test audio file (.wav format) under the demo/wav folder and specify the argument --wav_path "demo/wav/test.wav" accordingly.
If an osmesa error appears, run apt-get install -y python-opengl libosmesa6.
You may need to enable the proxy first with source /etc/network_turbo, otherwise a connection error will occur.
The "Failed to open PLY file." error means FLAME_sample.ply needs to be placed in vocaset/templates.
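
If rendering still fails after installing libosmesa6, pyrender's backend can be forced to OSMesa through an environment variable; this is a common workaround rather than part of the original scripts, and it must be set before pyrender is imported:

```python
import os

# Must be set before pyrender / OpenGL are imported anywhere in the process.
os.environ["PYOPENGL_PLATFORM"] = "osmesa"   # or "egl" on headless machines with GPU drivers

import pyrender  # now backed by the OSMesa software renderer
```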
- Read the vertices/audio data and convert them to .npy/.wav files stored in vocaset/vertices_npy and vocaset/wav (a quick shape check for these files and the later results is sketched after this list):

    cd vocaset
    python process_voca_data.py
- To train the model on VOCASET and obtain the results on the testing set, run:

    python main.py --dataset vocaset --vertice_dim 15069 --feature_dim 64 --period 30 --train_subjects "FaceTalk_170728_03272_TA FaceTalk_170904_00128_TA FaceTalk_170725_00137_TA FaceTalk_170915_00223_TA FaceTalk_170811_03274_TA FaceTalk_170913_03279_TA FaceTalk_170904_03276_TA FaceTalk_170912_03278_TA" --val_subjects "FaceTalk_170811_03275_TA FaceTalk_170908_03277_TA" --test_subjects "FaceTalk_170809_00138_TA FaceTalk_170731_00024_TA"

  The results and the trained models will be saved to vocaset/result and vocaset/save.
- To visualize the results, run:

    python render.py --dataset vocaset --vertice_dim 15069 --fps 30

  You can find the outputs in the vocaset/output folder.
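
As referenced in the list above, a quick way to sanity-check both the converted data and the test-set predictions (a sketch; it just globs whatever .npy files exist, so run it from the FaceFormer root after the steps above):

```python
import glob
import numpy as np

# Converted training data: each file should be (num_frames, 15069), i.e. 5023 FLAME vertices * 3.
verts_files = sorted(glob.glob("vocaset/vertices_npy/*.npy"))
print(verts_files[0], np.load(verts_files[0]).shape)

# Test-set predictions written by main.py: flattened vertex sequences in the same layout.
for path in sorted(glob.glob("vocaset/result/*.npy"))[:3]:
    pred = np.load(path).reshape(-1, 5023, 3)   # frames x vertices x xyz
    print(path, "->", pred.shape)
```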
- (to do) Read the geometry data and convert them to .npy files stored in BIWI/vertices_npy (a possible conversion skeleton is sketched after this list).
- To train the model on BIWI and obtain the results on the testing set, run:

    python main.py --dataset BIWI --vertice_dim 70110 --feature_dim 128 --period 25 --train_subjects "F2 F3 F4 M3 M4 M5" --val_subjects "F2 F3 F4 M3 M4 M5" --test_subjects "F1 F5 F6 F7 F8 M1 M2 M6"

  The results will be available in the BIWI/result folder. The trained models will be saved in the BIWI/save folder.
- To visualize the results, run:

    python render.py --dataset BIWI --vertice_dim 70110 --fps 25

  The rendered videos will be available in the BIWI/output folder.
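
For the (to do) conversion step above, one possible structure is sketched here. The .vl decoding is left as a placeholder because the exact binary layout should be taken from the BIWI corpus documentation; parse_vl, the directory layout, and the file naming are all assumptions.

```python
import os
import numpy as np

def parse_vl(path):
    """Placeholder: decode one BIWI .vl frame into an (N, 3) float32 vertex array.
    Implement according to the format described in the corpus documentation."""
    raise NotImplementedError

def convert_sequence(frame_dir, out_path):
    # Stack all frames of one tracked sequence into (num_frames, 70110),
    # i.e. 23370 vertices * 3 coordinates, matching --vertice_dim for BIWI.
    frames = sorted(f for f in os.listdir(frame_dir) if f.endswith(".vl"))
    verts = np.stack([parse_vl(os.path.join(frame_dir, f)).reshape(-1) for f in frames])
    np.save(out_path, verts.astype(np.float32))
    print(out_path, verts.shape)
```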
- Create the dataset directory <dataset_dir> in the FaceFormer directory.
- Place your vertices data (.npy format) and audio data (.wav format) in the <dataset_dir>/vertices_npy and <dataset_dir>/wav folders, respectively.
- Save the templates of all subjects to a templates.pkl file and put it in <dataset_dir>, as done for BIWI and vocaset (a sketch of this step is given after this list). Export an arbitrary template to .ply format and put it in <dataset_dir>/templates/.
- Create the train, val and test splits by specifying the arguments --train_subjects, --val_subjects and --test_subjects in main.py.
- Train a FaceFormer model on your own dataset by specifying the arguments --dataset and --vertice_dim (number of vertices in your mesh * 3) in main.py. You might need to adjust --feature_dim and --period for your dataset. Run main.py.
- The results and models will be saved to <dataset_dir>/result and <dataset_dir>/save.
- Specify the arguments --dataset, --vertice_dim and --fps in render.py. Run render.py to visualize the results. The rendered videos will be saved to <dataset_dir>/output.
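
A minimal sketch of the templates.pkl step from the list above, assuming (as for BIWI and vocaset) that it is a pickled dict mapping each subject name to its (num_vertices, 3) neutral template. The subject names, mesh paths and dataset_dir below are placeholders:

```python
import os
import pickle
from psbody.mesh import Mesh

dataset_dir = "my_dataset"   # i.e. <dataset_dir> inside the FaceFormer directory

# Placeholder subject names and neutral-scan paths for your own data.
subject_meshes = {"subject01": "raw/subject01_neutral.ply",
                  "subject02": "raw/subject02_neutral.ply"}

templates = {}
for name, path in subject_meshes.items():
    m = Mesh(filename=path)
    templates[name] = m.v                            # (num_vertices, 3) array
    print(name, "vertice_dim =", m.v.shape[0] * 3)   # value to pass as --vertice_dim

with open(os.path.join(dataset_dir, "templates.pkl"), "wb") as f:
    pickle.dump(templates, f)

# Export one arbitrary template to .ply so render.py has a mesh topology to load.
m.write_ply(os.path.join(dataset_dir, "templates", "template.ply"))
```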
If you need to save the intermediate per-frame .obj results, just use demo_mine. It is recommended to use demo_mine anyway, because the video generated by the original demo has no sound; the fix is easy and is mentioned in the issues.
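
If you stick with the original demo/render scripts, the missing audio can also be muxed in afterwards with the ffmpeg installed above (a sketch; the file names are placeholders):

```python
import subprocess

# Placeholder paths: the silent video written by the renderer and the driving wav.
video_in = "demo/output/test.mp4"
audio_in = "demo/wav/test.wav"
video_out = "demo/output/test_with_audio.mp4"

# Copy the video stream, encode the wav as AAC, and stop at the shorter stream.
subprocess.run(["ffmpeg", "-y", "-i", video_in, "-i", audio_in,
                "-c:v", "copy", "-c:a", "aac", "-shortest", video_out], check=True)
```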