CloudCast: A large-scale dataset and baseline for forecasting clouds

The CloudCast dataset contains 70080 images with 11 different cloud types for multiple layers of the atmosphere annotated on a pixel level. The dataset has a spatial resolution of 928 x 1530 pixels recorded with 15-min intervals for the period 2017-2018, where each pixel represents an area of 3×3 km.

To enable standardized datasets for benchmarking computer vision methods, we include a full-resolution dataset centered and projected dataset over Europe (728×728). To support small-scale experiments and analysis, we also include a downsampled low-resolution dataset 128×128 (15×15 km), which is significantly smaller in size compared to the full dataset.

Example observation from the CloudCast dataset for the 2017-05-01 13:00 UTC time.

License

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Citation

If you use this dataset in your research or elsewhere, please cite/reference the following paper:
CloudCast: A Satellite-Based Dataset and Baseline for Forecasting Clouds

@article{nielsen2020cloudcast,
title={CloudCast: A Satellite-Based Dataset and Baseline for Forecasting Clouds},
author={A. H. Nielsen and A. Iosifidis and H. Karstoft},
year={2020},
eprint={2007.07978},
archivePrefix={arXiv},
url={https://arxiv.org/abs/2007.07978},
primaryClass={cs.CV}
}

Download links

Small Dataset (128×128, 249 MB compressed)

Cropped Full Dataset (728×728, 8.5 GB compressed)

Raw Full Dataset (928×1530, 328 GB compressed)