# Overcoming Small Data Limitations in Video-Based Infant Respiration Estimation (WACV 2026)

This is the official repository of our WACV 2026 paper:
Song, L.*, Bishnoi, H.*, Manne, S.K.R., Ostadabbas, S., Taylor, B.J., Wan, M., "Overcoming Small Data Limitations in Video-Based Infant Respiration Estimation" (*equal contribution). 2026 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV). [arXiv link]
Here we provide our model code, training checkpoints, and annotated dataset to support automatic estimation of infant respiration waveforms and respiration rate from natural video footage, with the help of spatiotemporal computer vision models and infant-specific region-of-interest tracking.
*Sample dataset preprocessing (demo figure).*

Contents:
- Requirements & Setup
- Quickstart: Inference
- Annotated Infant Respiration Dataset (AIR-400)
- Reproducing Paper Results
- Citation
- License
## Requirements & Setup

1. Create the conda environment:

   ```bash
   conda env create -f environment.yml
   ```

2. Compile the pyflow library and import it as a module (a quick import check follows these steps):

   ```bash
   git clone https://github.com/pathak22/pyflow.git
   (cd pyflow && python setup.py build_ext -i && mv pyflow.cpython-*.so ..)
   ```
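To verify that the compiled module works, here is a minimal smoke test; it is not part of this repo, and the parameter values simply mirror those used in pyflow's own `demo.py`:

```python
# Minimal pyflow smoke test (not part of this repo). Parameter values mirror
# the defaults used in pyflow's demo.py.
import numpy as np
import pyflow

# Two random grayscale "frames": float64 in [0, 1], C-contiguous, with a
# trailing channel axis (colType=1 below selects grayscale input).
im1 = np.ascontiguousarray(np.random.rand(64, 64, 1))
im2 = np.ascontiguousarray(np.random.rand(64, 64, 1))

u, v, im2_warped = pyflow.coarse2fine_flow(
    im1, im2,
    0.012,  # alpha: regularization weight
    0.75,   # ratio: pyramid downsampling ratio
    20,     # minWidth: width of the coarsest pyramid level
    7,      # nOuterFPIterations
    1,      # nInnerFPIterations
    30,     # nSORIterations
    1,      # colType: 0 = RGB, 1 = grayscale
)
print("flow field shape:", u.shape, v.shape)  # (64, 64) each
```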
## Quickstart: Inference

*Sample inference output (demo figure).*

1. Download a trained model and the ROI detector files. Download our demo video, or provide your own video as input.
2. Fill in the `DATA_PATH` fields of a config YAML in the `configs/inference` folder:
   - Set the path for the output directory.
   - Set valid detector paths (YOLO weights) if ROI cropping is enabled; otherwise, set `DO_CROP_INFANT_REGION: False`.
   - Set the input video file or video folder path.
   ```yaml
   DATA_PATH:
     OUTPUT_DIR: /absolute/path/to/output_dir/
     BODY_DETECTOR_PATH: /absolute/path/to/yolov8m.pt
     FACE_DETECTOR_PATH: /absolute/path/to/yolov8n-face.pt
     # Provide exactly one of the following:
     VIDEO_FILE: /absolute/path/to/video.mp4
     # VIDEO_DIR: /absolute/path/to/videos/
   ```

3. Use `run_infer.sh` to preprocess the input video(s) and run a trained model for respiration rate estimation. Specify the required config YAML file path and model checkpoint file path in `run_infer.sh`.
Example run:
```bash
./run_infer.sh
```

Outputs:

- A per-video directory under `OUTPUT_DIR/inference/{video}_{datetime}` containing the prediction result JSON file and generated artifacts (HDF5 time series and PNG waveform plots).
- A summary JSON across all processed videos (`summary_{datetime}.json`).
- Logs saved under `OUTPUT_DIR/logs/`.
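For programmatic inspection of these artifacts, a rough sketch like the following may help; the JSON schema and HDF5 dataset names are not documented here, so it enumerates whatever it finds, and the 30 Hz sampling rate used in the rate estimate is purely an assumption:

```python
# Hedged sketch, not part of the repo: inspect one per-video output folder.
# The JSON schema and HDF5 dataset names are not documented here, so this
# enumerates whatever it finds rather than assuming field names.
import json
import pathlib

import h5py
import numpy as np

inference_dir = pathlib.Path("/absolute/path/to/output_dir/inference")
run_dir = next(p for p in inference_dir.iterdir() if p.is_dir())  # {video}_{datetime}

for json_path in run_dir.glob("*.json"):
    print(json_path.name, json.loads(json_path.read_text()))

for h5_path in run_dir.glob("*.hdf5"):
    with h5py.File(h5_path, "r") as f:
        for name, obj in f.items():
            if not isinstance(obj, h5py.Dataset):
                continue
            sig = np.asarray(obj).squeeze()
            print(f"{h5_path.name}/{name}: shape={sig.shape}")
            if sig.ndim == 1 and sig.size > 1:
                # Rough rate estimate from the dominant FFT frequency,
                # ASSUMING (hypothetically) a 30 Hz waveform sampling rate.
                fs = 30.0
                freqs = np.fft.rfftfreq(sig.size, d=1.0 / fs)
                mag = np.abs(np.fft.rfft(sig - sig.mean()))
                f_peak = freqs[np.argmax(mag[1:]) + 1]  # skip the DC bin
                print(f"  dominant frequency: {f_peak:.2f} Hz"
                      f" = {60.0 * f_peak:.1f} breaths/min")
```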
## Annotated Infant Respiration Dataset (AIR-400)

The AIR-400 dataset consists of two parts:

- **AIR-125**: the original dataset (125 videos from 8 subjects, labeled S01 through S08, with S06, S07, and S08 provided as public web links)
- **AIR-400**: the expanded dataset (275 videos from 10 additional subjects from the same study, labeled S01 through S10; despite the shared labels, these are not the same subjects as in AIR-125)
Each subject directory contains synchronized video files (.mp4) and breathing signal annotations (.hdf5).
In the AIR_125 folder, each subject directory (S01, S02, ... S08) includes paired video and annotation files:
```
AIR_125/
├── S01/
│   ├── 001.mp4
│   ├── 001.hdf5
│   ├── 002.mp4
│   ├── 002.hdf5
│   │   ...
│   ├── n.mp4
│   └── n.hdf5
│
├── S02/
│   ├── 001.mp4
│   ├── 001.hdf5
│   │   ...
└── ...
```
In the AIR_400 folder, annotation files are stored separately inside each subject's out/ directory:
```
AIR_400/
├── S01/
│   ├── 001.mp4
│   ├── 002.mp4
│   ├── 003.mp4
│   │   ...
│   ├── n.mp4
│   │
│   └── out/
│       ├── 001.hdf5
│       ├── 002.hdf5
│       ├── 003.hdf5
│       │   ...
│       └── n.hdf5
│
├── S02/
│   ├── 001.mp4
│   ├── ...
│   └── out/
│       ├── 001.hdf5
│       │   ...
└── ...
```
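As a quick sanity check on a downloaded sample, something along these lines can pair a video with its annotation; the paths are illustrative, and since the internal HDF5 layout is not documented above, the snippet simply lists the datasets each file contains:

```python
# Hedged example with illustrative paths: pair an AIR_400 video with its
# annotation file. (For AIR_125, the .hdf5 sits next to the .mp4 instead of
# under out/.)
import cv2   # pip install opencv-python
import h5py

video_path = "/path/to/AIR_400/S01/001.mp4"
annot_path = "/path/to/AIR_400/S01/out/001.hdf5"

cap = cv2.VideoCapture(video_path)
n_frames = int(cap.get(cv2.CAP_PROP_FRAME_COUNT))
fps = cap.get(cv2.CAP_PROP_FPS)
cap.release()
print(f"video: {n_frames} frames at {fps:.1f} fps")

with h5py.File(annot_path, "r") as f:
    # Print every dataset's path and shape.
    f.visititems(lambda name, obj: print(name, getattr(obj, "shape", "")))
```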
## Reproducing Paper Results

1. Set up Weights & Biases logging, and set `USE_WANDB: True` in the YAML file:

   ```bash
   export WANDB_API_KEY=<your_api_key>
   wandb login
   ```

2. Download the AIR-400 dataset and ROI detector files.
3. Fill in the `DATA_PATH` fields of the config YAML:
   ```yaml
   DATA_PATH:
     AIR_125: [air-125-dir-path]
     AIR_400: [air-400-dir-path]
     COHFACE: [cohface-dir-path]
     CACHE_DIR: [your-cache-dir]
     OUTPUT_DIR: [your-output-dir]
     BODY_DETECTOR_PATH: [yolov8-path]
     FACE_DETECTOR_PATH: [yolov8-face-path]
   ```

4. Specify the required config YAML file path in `run.sh`, then uncomment `--preprocess` after `python main.py --config "$CONFIG"` to enable preprocess-only mode. Run this mode first to make sure the dataset is preprocessed correctly before moving on to training and testing:
   ```bash
   ./run.sh
   ```

5. Comment out `--preprocess` after `python main.py --config "$CONFIG"` in `run.sh` to start the training and testing process:
   ```bash
   ./run.sh
   ```

## Citation

```bibtex
@inproceedings{song_bishnoi_overcoming_2026,
  booktitle = {2026 {IEEE}/{CVF} {Winter} {Conference} on {Applications} of {Computer} {Vision} ({WACV})},
  publisher = {IEEE},
  title = {Overcoming {Small} {Data} {Limitations} in {Video}-{Based} {Infant} {Respiration} {Estimation}},
  author = {Song, Liyang and Bishnoi, Hardik and Manne, Sai Kumar Reddy and Ostadabbas, Sarah and Taylor, Brianna J. and Wan, Michael},
  year = {2026},
}
```

## License

This project is licensed under the MIT License. See the LICENSE file for details.