OAM-TCD: A globally diverse dataset of high-resolution tree cover maps
Creators
Description
This repository contains files for the OAM-TCD dataset. This repository contains:
- GeoTIFF format images (images.tar.gz)
- Semantic segmentation annotation masks (masks.tar.gz)
- MS-COCO annotation files (train.tar.gz and test.tar.gz)
- Associated metadata for images in each split
Images are named <oam_id>_<image_id>.tif
For more information, see our arXiv paper here: https://arxiv.org/abs/2407.11743
We recommend that you download the dataset via HuggingFace Hub and we provide a utility to convert the dataset (including folds) to disk in our repository. This archive is provided mainly for long-term availability and reference.
The data are split into three groups depending on image license. The vast majority of the data are CC BY 4.0 licensed (approx. 90%), with smaller portions as CC BY-NC 4.0 and CC BY-SA 4.0. These subsets have the zip extension '-nc' and '-sa' respectively. All CC BY-SA images are in the test set.
Additionally, we provide dataset split indices that can be used for 5-fold cross-validation. To avoid duplication, we do not provide separate annotation files for each fold. You can find these indices in the JSON files in the metadata using the image_id
as a key. Each image is given a validation_fold
which is an integer in [0,4], a value of -1 indicates that the image belongs to the holdout dataset and should not be used for training with this split arrangement.
All images in the dataset are courtesy of contributors of the Open Imagery Network via Open Aerial Map.
Files
Files
(4.0 GB)
Name | Size | Download all |
---|---|---|
md5:8f4c271f37df0cab06adf8ffe656a4ab
|
8.4 MB | Download |
md5:ce9c1e6e8b821d9954418c62cc180208
|
6.7 MB | Download |
md5:0ce6f951a38d85b8c4f9f9f038889360
|
3.0 GB | Download |
md5:a45153d442d70251a1ec5a7368d44b43
|
409.9 MB | Download |
md5:b19b1f757564c66185ab6bb34c6e6d6d
|
335.4 MB | Download |
md5:ea889e87d5af386d806c43bbb38aaaa4
|
140.2 MB | Download |
md5:504b4e48d621f03972e3e4f2b1105e79
|
354.0 kB | Download |
md5:149d72e68550e2abccf2c730bd45f3ef
|
1.1 MB | Download |
md5:2e878689487c2ff703caa3f7c23110e8
|
5.6 MB | Download |
md5:56ea52b31419dbc49c2e0372a7344133
|
11.2 MB | Download |
md5:f2cc4984fc50d84ce5ecf14eb8868b5d
|
45 Bytes | Download |
md5:805822d17eee63325894f39598b0edbb
|
99.4 MB | Download |
Additional details
Related works
- Is published in
- Publication: arXiv:2407.11743 (arXiv)
Funding
- Google (United States)
- AI and ML for advancing the monitoring of Forest Restoration TF2012-096892
Software
- Repository URL
- https://github.com/restor-foundation/tcd
- Programming language
- Python
- Development Status
- Active