Resources for Research on Humanitarian Computing

The following resources are made available to advance research on humanitarian and crisis computing by developing new computational methods, techniques, and systems useful for humanitarian aid.

RESOURCE # 1
This resource consists of Twitter data collected during 19 natural and man-made disasters. Each dataset contains crisis-related tweets, human-labeled tweets, dictionaries of out-of-vocabulary(OOV) words, word2vec embeddings, and other related tools. Please cite the following paper, if you use any of these resources in your research.

Muhammad Imran, Prasenjit Mitra, and Carlos Castillo: Twitter as a Lifeline: Human-annotated Twitter Corpora for NLP of Crisis-related Messages. In Proceedings of the 10th Language Resources and Evaluation Conference (LREC), pp. 1638-1643. May 2016, Portoro┼ż, Slovenia. [Bibtex]

Resource details and downloading »
RESOURCE # 2
This resource consists of human-labeled tweets collected during the 2012 Hurricane Sandy and the 2011 Joplin tornado. Please cite the following paper, if you use this resource in your research.

Muhammad Imran, Shady Elbassuoni, Carlos Castillo, Fernando Diaz, and Patrick Meier. Practical Extraction of Disaster-Relevant Information from Social Media. In Proceedings of the 22nd international conference on World Wide Web companion, May 2013, Rio de Janeiro, Brazil. [Bibtex]

Download

RESOURCE # 3
This resource consists of human-labeled tweets collected during the 2011 Joplin tornado and labeled into humanitarina categories. Please cite the following paper, if you use this resource in your research.

Muhammad Imran, Shady Elbassuoni, Carlos Castillo, Fernando Diaz, and Patrick Meier. Extracting Information Nuggets from Disaster-Related Messages in Social Media.In Proceedings of the 10th International Conference on Information Systems for Crisis Response and Management (ISCRAM), May 2013, Baden-Baden, Germany. [Bibtex]

Download

RESOURCE # 4 NEW
This resource provides read-to-use Python implementation of a number of neural network and non-neural network baesd classifiers for the classification of crisis-related Twitter data. Please cite the following paper, if you use this resource in your research.

Dat Tien Nguyen, Kamela Ali Al-Mannai, Shafiq Joty, Hassan Sajjad, Muhammad Imran, Prasenjit Mitra. Robust Classification of Crisis-Related Data on Social Networks using Convolutional Neural Networks. In Proceedings of the 11th International AAAI Conference on Web and Social Media (ICWSM), 2017, Montreal, Canada.

Resource details and downloading »

RESOURCE # 5 NEW
This resource provides human-labeled multimodal datasets comprised of tweets and images collected during seven major natural disasters. Please cite the following paper, if you use this resource in your research.

Firoj Alam, Ferda Ofli, Muhammad Imran. CrisisMMD: Multimodal Twitter Datasets from Natural Disasters. To appear at the 12th International AAAI Conference on Web and Social Media (ICWSM), 2018, Stanford, California, USA. [Bibtex]

Download (~1.8GB)

RESOURCE # 6 NEW
This resource comprised of tweet-ids and a sample of raw tweets (50k) collected during three devastating hurricanes in 2017 namely Hurricane Harvey, Hurricane Irma, and Hurricane Maria.

Firoj Alam, Ferda Ofli, Muhammad Imran, Michael Aupetit. A Twitter Tale of Three Hurricanes: Harvey, Irma, and Maria. In proceedings of the 15th International Conference on Information Systems for Crisis Response and Management (ISCRAM), May 2018, Rochester NY, USA. [Bibtex]

Download (~64MB)

Please carefully read our Terms of use before using resources available on this site.

Subscribe to CrisisNLP to receive announcements about these and new resources.
Follow us on Twitter: @NLP4Crisis
For inquiries, issues, feedback, or collaborations, contact: Admins