UMind: A Unified Multitask Network for Zero-Shot M/EEG Visual Decoding

This repository is the official implementation of UMind. 📄 Paper

Abstract

  • Unified Multitask Framework: We introduce a zero-shot M/EEG-based multitask model for retrieval, classification, and reconstruction, surpassing single-task methods through joint optimization and mutual feature reinforcement.
  • Multimodal Alignment Strategy: Our approach integrates M/EEG, images, and text, using dual-granularity text fusion to enhance neural-visual and semantic representation learning.
  • Dual-Conditional Diffusion Model: We separately extract neural visual and semantic features and employ them as dual conditions to guide image generation, ensuring a more comprehensive and accurate reconstruction (see the guidance sketch after this list).
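To make the dual-conditioning idea concrete, below is a minimal sketch of compositional classifier-free guidance with two condition signals. This illustrates the general technique only, not the paper's exact scheme; the denoiser interface, embedding shapes, and guidance scales are all assumptions.

```python
import torch

def dual_guided_eps(denoiser, x_t, t, visual_cond, semantic_cond,
                    g_visual=5.0, g_semantic=5.0):
    """Combine two conditions via compositional classifier-free guidance.

    `denoiser(x_t, t, cond)` predicts noise; `cond=None` means unconditional.
    Guidance scales are illustrative, not values from the paper.
    """
    eps_uncond = denoiser(x_t, t, None)
    eps_visual = denoiser(x_t, t, visual_cond)
    eps_semantic = denoiser(x_t, t, semantic_cond)
    return (eps_uncond
            + g_visual * (eps_visual - eps_uncond)
            + g_semantic * (eps_semantic - eps_uncond))
```

Each condition pulls the noise prediction away from the unconditional one, so visual and semantic cues can jointly steer a single denoiser.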

Framework

The framework of UMind.


The reconstruction cases based on EEG.

Datasets

  1. THINGS-EEG
  2. THINGS-MEG

Multimodal data preparation

M/EEG pre-processing

  • ./EEG-preprocessing/
  • ./MEG-preprocessing/

Image and corresponding text preparation

  • coarse-grained and fine-grained text generation
python detail_text_generation.py
  • image and text feature extraction with a pretrained model
python img_text_feature_load.py
  • prompt embeddings and pooled prompt embeddings for reconstruction (sketched below)
python text_features_load_SDXL.py
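For the last step, here is a minimal sketch of how prompt embeddings and pooled prompt embeddings can be extracted with diffusers' StableDiffusionXLPipeline. The model ID, prompt source, and saved dictionary keys are assumptions; text_features_load_SDXL.py may differ in its details.

```python
import torch
from diffusers import StableDiffusionXLPipeline

# Load SDXL; encode_prompt runs both of its text encoders.
pipe = StableDiffusionXLPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0", torch_dtype=torch.float16
).to("cuda")

# Hypothetical prompt source, e.g. lines from ./data/class_names.txt.
prompts = ["a photo of an aardvark"]

prompt_embeds, _, pooled_prompt_embeds, _ = pipe.encode_prompt(
    prompt=prompts,
    device="cuda",
    num_images_per_prompt=1,
    do_classifier_free_guidance=True,
)

# Cache both embedding types for the reconstruction stage.
torch.save(
    {"prompt_embeds": prompt_embeds.cpu(),
     "pooled_prompt_embeds": pooled_prompt_embeds.cpu()},
    "./data/SDXL-text-encoder_prompt_embeds.pt",
)
```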

Data path

  • raw coarse-grained text data: ./data/class_names.txt

  • raw fine-grained text data: ./data/detail_caption.txt

  • preprocessed EEG data: ./data/Things-EEG2/Preprocessed_data_250Hz/

  • preprocessed image and text data: ViT-H-14_detail_class_features.pt

  • prompt embeddings and pooled prompt embeddings: ./data/SDXL-text-encoder_prompt_embeds.pt
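A quick way to sanity-check the cached feature files before training; the exact structure of each file depends on the preparation scripts, so this only prints a summary.

```python
import torch

# Print the top-level structure of each cached feature file.
for path in ["ViT-H-14_detail_class_features.pt",
             "./data/SDXL-text-encoder_prompt_embeds.pt"]:
    obj = torch.load(path, map_location="cpu")
    if isinstance(obj, dict):
        for key, value in obj.items():
            print(path, key, getattr(value, "shape", type(value)))
    else:
        print(path, getattr(obj, "shape", type(obj)))
```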

Visual Decoding

Environment setup

pip install -r requirements.txt

Multimodal Alignment Pretraining

python EEG_image_retrieval_classification.py
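This stage aligns M/EEG embeddings with pretrained image and text features for retrieval and classification. Below is a minimal sketch of the CLIP-style symmetric InfoNCE objective such alignment typically uses; the embedding dimensions, temperature, and encoder are assumptions rather than the script's actual values.

```python
import torch
import torch.nn.functional as F

def clip_style_loss(eeg_emb, img_emb, temperature=0.07):
    """Symmetric InfoNCE between matched EEG and image embeddings.

    eeg_emb, img_emb: (batch, dim) tensors where row i of each forms a
    matched pair. The temperature is an illustrative default.
    """
    eeg_emb = F.normalize(eeg_emb, dim=-1)
    img_emb = F.normalize(img_emb, dim=-1)
    logits = eeg_emb @ img_emb.t() / temperature
    targets = torch.arange(logits.size(0), device=logits.device)
    # Contrast EEG-to-image and image-to-EEG directions symmetrically.
    return (F.cross_entropy(logits, targets)
            + F.cross_entropy(logits.t(), targets)) / 2

# Toy usage: 32 pairs of 1024-d embeddings (ViT-H-14 features are 1024-d).
loss = clip_style_loss(torch.randn(32, 1024), torch.randn(32, 1024))
```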

Visual Reconstruction

  1. Semantic guidance:
python text_condition.py
python text_pool_condition.py
  2. Visual guidance:
python image_condition.py
  3. EEG-based visual reconstruction:
python EEG_image_generation.py
  4. Reconstruction metrics computation (see the sketch after this list):
python recon_metrics.py
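Low-level reconstruction quality in this literature is commonly reported with pixel-wise correlation (PixCorr) and SSIM. Here is a minimal sketch of both, assuming recon_metrics.py reports similar quantities; the actual script may also compute higher-level (e.g., CLIP-based) metrics.

```python
import numpy as np
from skimage.metrics import structural_similarity as ssim

def pixcorr(gt, recon):
    """Pearson correlation between flattened ground truth and reconstruction."""
    return np.corrcoef(gt.reshape(-1), recon.reshape(-1))[0, 1]

# Toy usage on random RGB images; replace with real (ground truth, recon) pairs.
gt = np.random.randint(0, 256, (256, 256, 3), dtype=np.uint8)
recon = np.random.randint(0, 256, (256, 256, 3), dtype=np.uint8)
print("PixCorr:", pixcorr(gt.astype(np.float64), recon.astype(np.float64)))
print("SSIM:", ssim(gt, recon, channel_axis=-1))
```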

Acknowledgment

We would like to express our sincere gratitude to the authors of the following works for their valuable contributions, which have greatly inspired and guided our research:

  1. Decoding Natural Images from EEG for Object Recognition;
  2. Visual Decoding and Reconstruction via EEG Embeddings with Guided Diffusion;
  3. Reconstructing the Mind’s Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors.

Citation

We hope this code is helpful, and we would appreciate it if you cite us in your paper. 😊

@article{xu2025umind,
  title={{UMind}: {A} {Unified} {Multitask} {Network} for {Zero-Shot} {M/EEG} {Visual} {Decoding}},
  author={Xu, Chengjian and Song, Yonghao and Liao, Zelin and Zhang, Haochuan and Wang, Qiong and Zheng, Qingqing},
  journal={arXiv preprint arXiv:2509.14772},
  year={2025}
}
