Resources for Research on Humanitarian Computing

The following resources are made available to research communities to advance the research on humanitarian & crisis computing by tailoring existing or developing new Natural Language Processing (NLP) techniques and computational methods useful for humanitarian purposes.

RESOURCE # 1
This resource consists of 19 humanitarian crises datasets. Each dataset contains crisis-related posts collected from Twitter, human-labeled tweets, dictionaries of out-of-vocabulary(OOV) words, word2vec embeddings, and other related tools. Please cite the following article, if you use any of these resources in your research.

Muhammad Imran, Prasenjit Mitra, and Carlos Castillo: Twitter as a Lifeline: Human-annotated Twitter Corpora for NLP of Crisis-related Messages. In Proceedings of the 10th Language Resources and Evaluation Conference (LREC), pp. 1638-1643. May 2016, Portoro┼ż, Slovenia. [Bibtex]

» Resource details and download
RESOURCE # 2
This resource consists of labeled tweets which were collected from two humanitarian crises (Sandy hurricane, Joplin tornado). Please cite the following article, if you use any of these resources in your research.

Muhammad Imran, Shady Elbassuoni, Carlos Castillo, Fernando Diaz, and Patrick Meier. Practical Extraction of Disaster-Relevant Information from Social Media. In Proceedings of the 22nd international conference on World Wide Web companion, May 2013, Rio de Janeiro, Brazil. [Bibtex]

Full labeled data v1.0.zip

RESOURCE # 3
This resource consists of tweets collected from Joplin tornado and labeled into humanitarina categories. Please cite the following article, if you use any of these resources in your research.

Muhammad Imran, Shady Elbassuoni, Carlos Castillo, Fernando Diaz, and Patrick Meier. Extracting Information Nuggets from Disaster-Related Messages in Social Media.In Proceedings of the 10th International Conference on Information Systems for Crisis Response and Management (ISCRAM), May 2013, Baden-Baden, Germany. [Bibtex]

Full labeled data v1.0.zip

Do you have a resource that can help advance humanitarian computing research? If yes, we can link or host your resource, please contact: Muhammad Imran


Subscribe to CrisisNLP to receive announcements about these and new resources.
Follow us on Twitter: @NLP4Crisis
For inquiries, issues, feedback, or collaborations, contact: Muhammad Imran