FOMO25 Challenge Code Team: FOMO2JOMO

This repository contains the official code for the FOMO25 Challenge of the team FOMO2JOMO. For more information on the challenge, please visit the FOMO25 Challenge website.

Tasks and Data

This codebase supports three tasks:

Task 1: Infarct Detection - Binary classification
Task 2: Meningioma Segmentation - Binary segmentation
Task 3: Brain Age Regression - Regression

Data for the challenge includes:

Pretraining Data: 11,187 subjects, 13,900 sessions, 60,529 scans
Finetuning Data: Limited few-shot data (~20-200 cases per task)

Requirements

Install required dependencies:

# Install basic dependencies
pip install -e .

# For development
pip install -e ".[dev]"

# For testing
pip install -e ".[test]"

# For all dependencies
pip install -e ".[dev,test]"

Data Preparation

While the data included in this challenge is already preprocessed (co-registered, transposed to RAS orientation and defaced/skull-stripped), to run this code, one needs to further preprocess with the following highly opinionated preprocessing steps.

This "Opinionated Preprocessing" can be done in the following way

Preprocess Pretraining Data

For preprocessing the pretraining (FOMO60K) data:

python src/data/fomo-60k/preprocess.py --in_path=/path/to/raw/pretrain/data --out_path=/path/to/output/preprocessed/data

This will:

Store each tensor in numpy format for easy loading.
Treat each scan as a separate datapoint which can be sampled iid.
Crop to the minimum bounding box.
Z-normalize on a per-volume level.
Resample to isotropic (1mm, 1mm, 1mm) spacing.

Preprocess Finetuning Data (required)

For preprocessing the finetuning data for tasks 1-3:

python src/data/preprocess/run_preprocessing.py --taskid=1 --source_path=/path/to/raw/finetuning/data

Replace --taskid=1 with --taskid=2 or --taskid=3 for the other tasks.

This will apply a preprocessing akin to the one of the pre-trained data:

Assemble each session into a single 4D tensor and store it as a numpy array for easy loading.
Treat each scan as a separate datapoint which can be sampled iid.
Crop to the minimum bounding box.
Z-normalize on a per-volume level.
Resample to isotropic (1mm, 1mm, 1mm) spacing.

Pretraining

To pretrain a model using the proposed framework solution:

python src/pretrain.py \
    --save_dir=/path/to/save/models \
    --pretrain_data_dir=/path/to/preprocessed/pretrain/data \
    --model_name=mmunetvae \
    --patch_size=96 \
    --batch_size=2 \
    --epochs=100 \
    --warmup_epochs=5 \
    --num_workers=64 \
    --augmentation_preset=all

Key pretraining parameters:

--model_name: Supported models include unet_b_lw_dec, unet_xl_lw_dec, etc.
--patch_size: Size of 3D patches (must be divisible by 8)
--mask_patch_size: Size of masking unit for MAE (default is 4)
--mask_ratio: Ratio of patches to mask (default is 0.6)
--augmentation_preset: Choose from all, basic, or none

Finetuning

To finetune a pretrained model on one of the three tasks:

python src/finetune.py \
    --data_dir=/path/to/preprocessed/data \
    --save_dir=/path/to/save/finetuned/models \
    --pretrained_weights_path=/path/to/pretrained/checkpoint.pth \
    --model_name=mmunetvae \
    --patch_size=96 \
    --taskid=1 \
    --batch_size=2 \
    --epochs=500 \
    --train_batches_per_epoch=100 \
    --augmentation_preset=basic

Key finetuning parameters:

--taskid: Task ID (1: Infarct Detection, 2: Meningioma Segmentation, 3: Brain Age Regression)
--model_name: Must match the architecture of the pretrained checkpoint
--pretrained_weights_path: Path to the pretrained model checkpoint
--augmentation_preset: Choose from all, basic, or none

💾 Model Checkpoints

💻 Hardware Requirements

The reference implementation was pretrained on 1 A100 GPU with 80GB of memory. Depending on your hardware, you may need to adjust batch sizes and patch sizes accordingly.

📚 Citation

If you use this code, please cite:

@article{llambias2024yucca,
  title={Yucca: A deep learning framework for medical image analysis},
  author={Llambias, Sebastian N{\o}rgaard and Machnio, Julia and Munk, Asbj{\o}rn and Ambsdorf, Jakob and Nielsen, Mads and Ghazi, Mostafa Mehdipour},
  journal={arXiv preprint arXiv:2407.19888},
  year={2024}
}

@article{munk2024amaes,
  title={AMAES: Augmented Masked Autoencoder Pretraining on Public Brain MRI Data for 3D-Native Segmentation},
  author={Munk, Asbjørn and Ambsdorf, Jakob and Llambias, Sebastian and Nielsen, Mads},
  journal={MICCAI Workshop on Advancing Data Solutions in Medical Imaging AI (ADSMI 2024)},
  year={2024}
}

Our work is currently under review. Full citation details will be available once published.

@article{brainfm2025,
  title={TITLE},
  author={[Authors]},
  journal={Under Review},
  year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
src		src
.flake8		.flake8
.gitignore		.gitignore
.python-version		.python-version
Dockerfile		Dockerfile
README.md		README.md
pyproject.toml		pyproject.toml
run_finetune_tasks.sh		run_finetune_tasks.sh
run_preprocessing.sh		run_preprocessing.sh
run_pretraining.sh		run_pretraining.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

FOMO25 Challenge Code Team: FOMO2JOMO

Tasks and Data

Requirements

Data Preparation

Preprocess Pretraining Data

Preprocess Finetuning Data (required)

Pretraining

Finetuning

💾 Model Checkpoints

💻 Hardware Requirements

📚 Citation

About

Uh oh!

Releases

Packages

Languages

jbanusco/fomo25

Folders and files

Latest commit

History

Repository files navigation

FOMO25 Challenge Code Team: FOMO2JOMO

Tasks and Data

Requirements

Data Preparation

Preprocess Pretraining Data

Preprocess Finetuning Data (required)

Pretraining

Finetuning

💾 Model Checkpoints

💻 Hardware Requirements

📚 Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages