DisasterM3 includes 26,988 bi-temporal satellite images and 123k instruction pairs across 5 continents, with three characteristics:
Multi-hazard: 36 historical disaster events with significant impacts, which are categorized into 10 common natural and man-made disasters
Multi-sensor: Extreme weather during disasters often hinders optical sensor imaging, making it necessary to combine Synthetic Aperture Radar (SAR) imagery for post-disaster scenes
Multi-task: 9 disaster-related visual perception and reasoning tasks, harnessing the full potential of VLM's reasoning ability
News
2025/10/23, We released the DisasterM3 instruct set.
2025/10/17, We released the benchmark set of DisasterM3.
2025/09/22, We are preparing the dataset and code.
2025/09/22, Our paper got accepted by NeurIPS 2025.
Benchmark
Please run this code for benchmarking the DisasterM3 dataset.
Two examples:
Qwen2.5 VL:
If you use DisasterM3 in your research, please cite our following papers.
text
@article{wang2025disasterm3,
title={DisasterM3: A Remote Sensing Vision-Language Dataset for Disaster Damage Assessment and Response},
author={Wang, Junjue and Xuan, Weihao and Qi, Heli and Liu, Zhihao and Liu, Kunyi and Wu, Yuhan and Chen, Hongruixuan and Song, Jian and Xia, Junshi and Zheng, Zhuo and Yokoya, Naoto},
booktitle={Proceedings of the Neural Information Processing Systems},
year={2025}
}
Acknowledgments
This dataset builds upon the following excellent open datasets: