Sound Reconstruction

This repository provides the data and demo code for the following papers:

Park JY, Tsukamoto M, Tanaka M, Kamitani Y (2025) Natural sounds can be reconstructed from human neuroimaging data using deep neural network representation. PLOS Biology 23(7): e3003293. https://doi.org/10.1371/journal.pbio.3003293
Park JY, Tsukamoto M, Tanaka M, Kamitani Y (2023) Sound reconstruction from human brain activity via a generative model with brain-like auditory features. arXiv:2306.11629. http://arxiv.org/abs/2306.11629

Dataset

Raw fMRI data: OpenNeuro
Preprocessed fMRI data, DNN features extracted from sound clips: figshare
Trained transformer models: figshare
Stimulus sound clips: see data/README.md.

Code

Setup

Clone the SoundReconstruction repository to your local machine (a machine with a GPU is preferred).

git clone git@github.com:KamitaniLab/SoundReconstruction.git

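Create the conda environment from specvqgan.yaml, activate it, and check that PyTorch can see the GPU.

conda env create --name specvqgan -f specvqgan.yaml
conda activate specvqgan
python -c "import torch; print(torch.cuda.is_available())"
# True (expected on a machine with a working CUDA setup)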

Clone the SpecVQGAN repository next to the SoundReconstruction directory. Use the following fork rather than the original SpecVQGAN repository, because the fork rewrites the path to the Transformer configuration file.

git clone git@github.com:KamitaniLab/SpecVQGAN.git
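
Since the fork is expected to sit next to the SoundReconstruction directory, a quick layout check can catch a misplaced clone before anything is run. The following is a minimal sketch, not part of the repository: the function name check_sibling_repos is hypothetical, and it only assumes that both directories keep their default clone names.

```shell
# Sketch (hypothetical helper, not part of the repository):
# confirm that SpecVQGAN was cloned next to SoundReconstruction.
# The first argument is the parent directory holding both clones
# (defaults to the current directory).
check_sibling_repos() {
    local parent="${1:-.}"
    local name missing=0
    for name in SoundReconstruction SpecVQGAN; do
        if [ ! -d "$parent/$name" ]; then
            echo "missing directory: $parent/$name" >&2
            missing=1
        fi
    done
    # Return 0 when both directories exist, 1 otherwise.
    return "$missing"
}
```

Called as check_sibling_repos .. from inside the SoundReconstruction directory, it should return success only when both directories are present.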

Download datasets and models

See data/README.md.

Usage

We provide scripts that reproduce the main results of the original paper. Run the shell scripts in the following order.

Train feature decoders to predict the VGGishish features.

./1_train_batch.sh

Predict features using the decoders trained in the previous step. (Predictions for the attention task dataset are made at the same time.)

./2_test_batch.sh

Evaluate the accuracy of the predicted features.

./3_eval_batch.sh

Visualize the prediction accuracy with the following notebook. This notebook draws Fig.3D and Fig.3E of the original paper.

feature_decoding/makefigures_featdec_eval.ipynb

Reconstruct sound clips from the predicted features.

./4_recon_batch.sh

Evaluate the quality of the reconstructed sounds.

./5_recon_eval_batch.sh

Visualize the reconstruction quality with the following notebooks. These notebooks draw Fig.4C and Fig.8C of the original paper.

reconstruction/makefigures_recon_eval.ipynb
reconstruction/makefigures_recon_eval_attention.ipynb
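
The five batch scripts listed above can also be chained so that a failure in one step stops the later ones. This is a minimal sketch rather than something the repository ships: the function name run_pipeline is hypothetical, and it assumes that it is called from the repository root and that each script exits with a non-zero status on error, which has not been verified here. The notebooks are not part of the chain and still need to be opened separately.

```shell
# Sketch (hypothetical helper, not part of the repository):
# run the five batch scripts in order and stop at the first failure.
# Assumes the current directory is the SoundReconstruction repository root
# and that each script reports errors through a non-zero exit status.
run_pipeline() {
    local script
    for script in 1_train_batch.sh 2_test_batch.sh 3_eval_batch.sh \
                  4_recon_batch.sh 5_recon_eval_batch.sh; do
        if [ ! -f "./$script" ]; then
            echo "missing: $script" >&2
            return 1
        fi
        echo "running: $script"
        # Invoke through bash so the executable bit is not required.
        bash "./$script" || { echo "failed: $script" >&2; return 1; }
    done
}
```

After sourcing the snippet, run_pipeline starts from the first script every time; it does not skip steps that have already completed.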
