CrisisNLP

Home
About

Deep Learning Benchmarks and Datasets for Social Media Image Classification for Disaster Response

Description of the dataset

The crisis image benchmark dataset consists data from several data sources such as CrisisMMD, data from AIDR and Damage Multimodal Dataset (DMD). The purpose of this work was to develop a consolidated dataset, create non-overlapping train/dev/test set and provide a benchmark results for the community.

The dataset consists labels for four tasks:

Task 1: Disaster types
- Earthquake
- Fire
- Flood
- Hurricane
- Landslide
- Not disaster
- Other disaster
Task 2: Informativeness
- Informative
- Not informative
Task 3: Humanitarian categories
- Affected, injured, or dead people
- Infrastructure and utility damage
- Not humanitarian
- Rescue volunteering or donation effort
Task 4: Damage severity
- Little or none
- Mild
- Severe

Downloads: Labeled data

Crisis Image Benchmarks Dataset version v1.0: download (11G)
- Individual task sets:
  - Disaster types: download (3.2G)
  - Informativeness: download (8.81G)
  - Humanitarian categories: download (2.29G)
  - Damage severity: download (6.06G)
Alternate download link: Dataverse

Please cite the following papers, if you use any of these resources in your research.

Firoj Alam, Ferda Ofli, Muhammad Imran, Tanvirul Alam, Umair Qazi, Deep Learning Benchmarks and Datasets for Social Media Image Classification for Disaster Response, In 2020 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), 2020. [Bibtex] [Arxiv]
Firoj Alam, Ferda Ofli, and Muhammad Imran, CrisisMMD: Multimodal Twitter Datasets from Natural Disasters. In Proceedings of the 12th International AAAI Conference on Web and Social Media (ICWSM), 2018, Stanford, California, USA. [Bibtex]
Hussein Mozannar, Yara Rizk, and Mariette Awad, Damage Identification in Social Media Posts using Multimodal Deep Learning, In Proc. of ISCRAM, May 2018, pp. 529–543.