Automatic Classification of Disaster-Related Tweets

International conference on Innovative Engineering Technologies (ICIET’2014) Dec. 28-29, 2014 Bangkok (Thailand) Automatic Classification of Disaster-Related Tweets Beverly Estephany Parilla-Ferrer, Proceso L. Fernandez Jr., PhD, and Jaime T. Ballena IV, PhD Abstract— The social networking site Twitter has become one of of its growing ubiquity, communications rapidity, and cross- the quickest sources of news and other information. Twitter platform accessibility [1]. The interactions on social media information feeds known as tweets, are voluntarily sent by registered being highly distributed, decentralised and occurring in real users and reach even non-registered users, sometimes ahead of time, provide the necessary breadth and immediacy of traditional sources of mass news. In this study, we develop some machine learning models that can automatically detect informative information required in times of emergencies [2]. disaster-related tweets. Twitter is one microblogging service that allows its A dataset of tweets, collected during the Habagat flooding of subscribers to broadcast short messages, called tweets, of up Metro Manila in 2012, was used in building the classifier models. A to 140 characters. These tweets are used to share relevant random subset of this dataset was manually labeled as either information and report news [3]. In emergency situations, informative or uninformative to produce the ground truth. Two tweets provide either first-person observations or bring machine learning algorithms, Naive Bayes and Support Vector Machine (SVM), were used to build models for the automatic relevant knowledge from external sources [1]. Twitter is classification of the tweets, and these models were evaluated across becoming a valuable tool in disaster and emergency situations, the metrics of accuracy, precision, recall, area under curve and F- as there is increasing evidence that it is not just a social measure. Experimental results show that the model generated from network, it is also a news service [4]. Relevant tweets shared SVM has significantly better results compared to that of the Naive by users is a vital source of information and is useful in Bayes. understanding and visualizing the situation of affected parties. This study also revealed that uninformative tweets outnumbered informative tweets, suggesting that the subscribers used Twitter to This medium is seen as a place for ―harvesting‖ information broadcast more of tweets that express their subjective messages and during a crisis event to determine what is happening on the emotions regarding the Habagat event. However, the informative ground [5]. The growing use of social media during crises tweets were more likely to be retweeted than uninformative tweets, offers new information sources from which the right indicating that subscribers retweet messages they deem informative authorities can enhance emergency situation awareness which and useful for public awareness. These insights, together with the is significantly recognized as a critical part of making built classifier models, can help in the development of a system that can sift through the voluminous Twitter data and in real-time detect successful and effective decisions for emergency response [6]. informative disaster-related tweets so that appropriate action may be Tweets highly vary in terms of subject and content and the done promptly. influx of tweets particularly in the event of a disaster may be overwhelming. It consists of socio-behaviors that include Keywords— Disaster, Machine learning, Text mining, Tweets intensified information search and information contagion [7]. Microblogging offers ways to retrieve, produce and spread I. INTRODUCTION information; the nature of that sharing has a lifecycle of URING disasters and emergencies, microblogs, have information production and consumption that is rapid and D been used by people whether from the private or public repetitive [1]. Since these varied tweets are rapidly sector, local or international community, as a medium to broadcasted, it is imperative to automatically classify the broadcast their messages. This social medium is being tweets in order to extract needed information. The availability considered as a means for emergency communications and accessibility of disaster-relevant information can because contribute to an effective and efficient disaster response mechanism, which eventually can alleviate damages or loss of life and property during a disaster or crisis. Beverly Estephany Parilla-Ferrer is with the Department of Information Disaster-related tweets are one of the many subjects of text Technology and Computer Science, School of Information and Computing mining researches nowadays. Specifically on the area of Science Saint Louis University, Baguio City, Philippines(+637409205884351; e-mail:[email protected]). classifying and extracting information from disaster-related Proceso L. Fernandez Jr., is with the Department of Information Systems tweets, Caragea et al conducted a study to classify the set of and Computer Science, School of Science and Engineering, Ateneo de Manila tweets collected during the Haiti earthquake for the University, Quezon City, Philippines (e-mail:[email protected]). Jaime T. Ballena IV is with the Math Department, School of Information emergency response sector [8]. The authors compared and Computing Science Saint Louis University, Baguio City Philippines (e- different feature representations for training SVM classifiers mail: [email protected]). http://dx.doi.org/10.15242/IIE.E1214072 62 International conference on Innovative Engineering Technologies (ICIET’2014) Dec. 28-29, 2014 Bangkok (Thailand) to classify tweets. Imran et al extracted information using and classifies tweets into user-defined categories. The disaster-related message ontology to classify tweets using the experimental results returned 75% classification accuracy Joplin dataset [9]. Multi-level and multi-label classification based on AUC metric. Prasetyo et al used SVM algorithm to using Naive Bayes classifier in Weka was used in the study. classify software related microblogs as relevant or irrelevant Another paper of Imran et al focused on the classification and to engineering software systems with text from URLs and extraction of disaster-relevant information from the Joplin and microblogs as features [15]. Training and testing was Sandy dataset using conditional random fields for training performed using 10-fold cross validation and the model [10]. revealed a significantly good performance based on accuracy, These studies have clearly presented that disaster–relevant precision, recall and F-measure. information can be classified and can provide information that On the area of comparing machine learning algorithms for can augment people's awareness on incidents. However, these classification using short text messages as dataset, the studies did not cover statistical analysis on the big data of following researches were conducted. Duwairi and Qarqaz tweets and the performance evaluation of machine learning compared Naive Bayes, SVM and K-nearest Neighbor as algorithms for the classification of tweets. Although there are implemented in Rapidminer to classify sentiments of tweets as several studies that have evaluated machine learning positive, negative or neutral using a dataset on general topics algorithms for the classification of tweets, these studies dealt such as education, sports and political news [16]. With 10-fold with sentiment or opinion analysis. In this study, we aim to cross validation, SVM returned the highest precision, while K- create a machine learning model to classify disaster-related Nearest Neighbor (KNN) with the highest recall. A study on tweets as informative or uninformative and compare the the classification of Reuters headline news as dataset, Khamar performance of two of the most common machine classifying compared SVM, K-Nearest Neighbor and other algorithms algorithms Naive Bayes and Support Vector Machine. [17]. After a process of training and testing, KNN returned a Performance evaluation is based on the validation of results higher accuracy compared to Naive Bayes and SVM. Lu across the metrics of accuracy, precision, recall, area under conducted a study to identify online messages using C4.5, curve and F-measure, with the application of statistical tools. Naive Bayes and SVM [18]. Based on experiment, SVM Furthermore, the research investigates the information that can outperformed C4.5 and Naive Bayes in terms of accuracy and be extracted from the statistics of broadcasted tweets during F-measure. Zielenski and Bugel investigated on classifying the Habagat incident which caused widespread flooding in tweets posted in 4 different languages (English and 3 Metro Manila in 2012. Mediterranean languages in Turkey, Greece and Romania) as relevant or not relevant to an earthquake event by testing a II.RELATED WORKS language-specific detection classifier with keywords that are There are several researches on text mining for synonyms or translations of the word earthquake as features in classification and prediction on various domains such as in the the classification [19]. Training and testing used different medical, business, crime investigation, e-mail detection, etc. datasets using regular expression and Naive Bayes. The The following works are focused on the classification of results showed a best performance on the official languages tweets and the comparison of classifying algorithms.

Automatic Classification of Disaster-Related Tweets

Volume Quantification by Contrast-Enhanced Ultrasound

On the Sampling Variance of Intraclass Correlations and Genetic Correlations

HERITABILITY ESTIMATES from TWIN STUDIES I. Formulae of Heritability Estimates

Title: Assessing Test-Retest Reliability of Psychological Measures: Persistent

Inference from Complex Samples

We Need to Talk About Reliability: Making Better Use of Test-Retest Studies for Study Design and Interpretation

The Intraclass Covariance Matrix

Psychometrics Using Stata

Confidence Interval Estimation for Intraclass Correlation Coefficient

Statistical Methods for Research Workers BIOLOGICAL MONOGRAPHS and MANUALS

Estimation and Inference of the Three-Level Intraclass Correlation Coefficient

Guidelines for Reporting Reliability and Agreement Studies (GRRAS) Were Proposed Jan Kottnera,*, Laurent AudigEb, Stig Brorsonc, Allan Donnerd, Byron J