This resource consists of Twitter data collected during 19 natural and man-made disasters. Each dataset contains crisis-related tweets, human-labeled tweets, dictionaries of out-of-vocabulary(OOV) words, word2vec embeddings, and other related tools. Please cite the following article, if you use any of these resources in your research.
Muhammad Imran, Prasenjit Mitra, and Carlos Castillo: Twitter as a Lifeline: Human-annotated Twitter Corpora for NLP of Crisis-related Messages.
In Proceedings of the 10th Language Resources and Evaluation Conference (LREC), pp. 1638-1643. May 2016, Portorož, Slovenia. [Bibtex
Resource details and downloading