Thanks to visit codestin.com
Credit goes to zenodo.org

Published June 2024 | Version 1.0.0
Dataset Open

OAM-TCD: A globally diverse dataset of high-resolution tree cover maps

  • 1. ROR icon ETH Zurich
  • 2. Restor
  • 3. ROR icon University of Chicago
  • 4. ROR icon Swiss Federal Institute for Forest, Snow and Landscape Research
  • 5. University of Florida
  • 6. Gainforest

Description

This repository contains files for the OAM-TCD dataset. This repository contains:

  • GeoTIFF format images (images.tar.gz)
  • Semantic segmentation annotation masks (masks.tar.gz)
  • MS-COCO annotation files (train.tar.gz and test.tar.gz)
  • Associated metadata for images in each split

Images are named <oam_id>_<image_id>.tif

For more information, see our arXiv paper here: https://arxiv.org/abs/2407.11743

We recommend that you download the dataset via HuggingFace Hub and we provide a utility to convert the dataset (including folds) to disk in our repository. This archive is provided mainly for long-term availability and reference.

The data are split into three groups depending on image license. The vast majority of the data are CC BY 4.0 licensed (approx. 90%), with smaller portions as CC BY-NC 4.0 and CC BY-SA 4.0. These subsets have the zip extension '-nc' and '-sa' respectively. All CC BY-SA images are in the test set.

Additionally, we provide dataset split indices that can be used for 5-fold cross-validation. To avoid duplication, we do not provide separate annotation files for each fold. You can find these indices in the JSON files in the metadata using the image_id as a key. Each image is given a validation_fold which is an integer in [0,4], a value of -1 indicates that the image belongs to the holdout dataset and should not be used for training with this split arrangement.

All images in the dataset are courtesy of contributors of the Open Imagery Network via Open Aerial Map.

 

Files

Files (4.0 GB)

Name Size Download all
md5:8f4c271f37df0cab06adf8ffe656a4ab
8.4 MB Download
md5:ce9c1e6e8b821d9954418c62cc180208
6.7 MB Download
md5:0ce6f951a38d85b8c4f9f9f038889360
3.0 GB Download
md5:a45153d442d70251a1ec5a7368d44b43
409.9 MB Download
md5:b19b1f757564c66185ab6bb34c6e6d6d
335.4 MB Download
md5:ea889e87d5af386d806c43bbb38aaaa4
140.2 MB Download
md5:504b4e48d621f03972e3e4f2b1105e79
354.0 kB Download
md5:149d72e68550e2abccf2c730bd45f3ef
1.1 MB Download
md5:2e878689487c2ff703caa3f7c23110e8
5.6 MB Download
md5:56ea52b31419dbc49c2e0372a7344133
11.2 MB Download
md5:f2cc4984fc50d84ce5ecf14eb8868b5d
45 Bytes Download
md5:805822d17eee63325894f39598b0edbb
99.4 MB Download

Additional details

Related works

Is published in
Publication: arXiv:2407.11743 (arXiv)

Funding

Google (United States)
AI and ML for advancing the monitoring of Forest Restoration TF2012-096892

Software

Repository URL
https://github.com/restor-foundation/tcd
Programming language
Python
Development Status
Active