Detecting Citizen Problems and Their Locations Using Twitter Data
Gizem Abalı
Department of Computer Engineering
Muğla Sıtkı Koçman University
Mugla, Turkey

Enis Karaarslan
Department of Computer Engineering
Muğla Sıtkı Koçman University
Mugla, Turkey

Ali Hürriyetoğlu
Centre for Language Studies
Radboud University
Nijmegen, Netherlands

Feriştah Dalkılıç
Department of Computer Engineering
Dokuz Eylul University
Izmir, Turkey

2018 6th International Istanbul Smart Grids and Cities Congress and Fair (ICSG)
978-1-5386-4478-2/18/$31.00 ©2018 IEEE

Abstract: Twitter is a social network that contains information about city events (concerts, festivals, etc.), city problems (traffic, collisions, and road incidents), the news, people's feelings, etc. For these reasons, many studies use tweet data to detect useful information to support smart city management. In this paper, ways of finding citizen problems and their locations by using tweet data are discussed. Tweets in Turkish from the Aegean Region of Turkey were used for the study. The aim is to form a smart system that detects citizens' problems and extracts the problems' exact locations from tweet texts. First, the collected data was analyzed for information about any city event or any citizen's complaint or request about a problem. After confirming that tweets reporting a city problem can be detected, two datasets were created. The first consists of tweets that contain event information or a problem, and the second of tweets that contain other information not related to our study. Then a Naive Bayes classifier was trained on the annotated tweets and tested on a separate set of tweets. The accuracy, precision, recall, and F-measure of the classifier are given. A location recognizer, which finds Turkish place names in a text, was created and applied to the tweets marked as information-containing by the classifier, in order to detect the location of the problem precisely. The first findings of the project are promising. The high accuracy obtained by the classifier shows that it is suitable for our study. We plan to improve the location recognizer and to detect place names in real-time tweet data.

Index Terms: Data Analysis, Machine Learning, Smart Cities, Social Media Analysis, Text Mining.

I. INTRODUCTION

In recent years, technology has developed and brought big changes to people's lives. Many people have migrated to the cities. The increase in the number of citizens brought along many city problems such as pollution, accidents, and traffic. While these changes were occurring, many social networking services (such as Facebook and Snapchat) that people use to communicate were created and grew rapidly. Users have started using them not only to communicate with others but also to report the problems of the cities where they live. Because of this usage of social platforms, social media analysis became a popular research area in urban projects, and some research teams were formed to work on creating better urban areas. To get information about an area, people tend to use many social network platforms. One of these platforms is Twitter. Twitter had 330 million monthly active users as of the third quarter of 2017, according to the Statista Statistics Portal. Twitter differs from other social networks in that its main purpose is not interaction with friends; it was formed for sharing and seeking information [1]. Because people use Twitter to express their wishes or to announce problems, it becomes a research tool that can be used to analyze the situation of an area and find its problems.

II. RELATED WORKS

A review of past studies using Twitter data shows that many of them analyze traffic tweets [2-4], while others aim to understand happiness in a city [5], find the locations from which a user tweets [6], analyze disaster tweets [7, 8], or detect place names mentioned in a tweet text [9].

Some studies [10, 11] show that Twitter is the most important social network that people use to communicate with other users and give them information during a disaster or a critical event. Moreover, these studies observe that tweet texts contain information about traffic, transportation in a city, security, and environmental factors [10, 11]. Considered together with our aim of forming a smart system, these studies suggest that Twitter data in Turkish can also be used for our work.

In the next section, the basic concepts (the smart city concept, tweet data, and the usage of machine learning in tweet data analysis) will be discussed. Then, the methodology and the

IV. IMPLEMENTATION

A. Using The Tweet Data For Smart Cities

For this project, we aim to detect the tweets that report a problem in a part of a city and to find the exact location of the problem. In Figure 1, some tweets that are related to our work are given. In the first tweet, the user says, “Küçükbakkalköy Beyaz Street, Nergis, Çiğdem and Yenidoğan Avenues are closed. We have a hard time.
Please immediately help