A Data Mining Method for Facebook Social Network: Take "New Row Mian (Beef Noodle)" in Taiwan for Example

Jong-Shin Chen1, Chi-Yueh Hsu2*, Cheng-Ying Yang3, Ching-Chuan Wei4, Han Guo Ciang5

Department of Information and Communication Engineering, Chaoyang University of Technology, Wufeng, Taichung 41349, Taiwan, R.O.C. 1,4,5
Department of Leisure Services Management, Chaoyang University of Technology, Wufeng, Taichung 41349, Taiwan, R.O.C. 2
Department of Computer Science, University of Taipei, Taipei 10048, Taiwan, R.O.C. 3
E-mail: [email protected]*

Abstract—The Facebook penetration rate in Taiwan is the highest in the world: as of July 2015, the number of daily users had reached 13 million out of a population of approximately 23 million. The location-based Facebook check-in service is a hot topic, and numerous Facebook users visit places that interest them and check in there. Taiwan beef noodle is considered a national dish. At the 2011 Taipei International Beef Noodle Festival, Taiwan beef noodle was translated as "New Row Mian". The name follows the example of Japanese sushi and Korean kimchi, whose names are literal renderings from the local language, and highlights the unique culture of the dish. Human activities around this culture also produce related Facebook check-in places and check-in behaviors. In this study, we propose a method to collect the big data of Facebook check-in places, find the places related to "New Row Mian", and position these places.

Keywords—big data; Facebook; public opinion; New Row Mian

I. INTRODUCTION

The Facebook penetration rate in Taiwan is the highest in the world: as of July 2015, the number of daily users had reached 13 million out of a population of approximately 23 million. Check-in for places is a location-based function of Facebook. A Facebook user can go to a famous scenic spot (a Facebook place) or take part in an activity and check in on Facebook to show that participation [1-2]. If there is no suitable name for the Facebook place, the user can create a new place for the scenic spot. In the rest of this article, a 'Facebook place' is referred to simply as a 'place', and "…" is used to present the name of a specific Facebook place. After several years, there are numerous places, and numerous check-in behaviors at these places, in Taiwan. In this study, we intend to explore the public opinion on a special topic from Facebook places.

Beef noodle exists in a variety of forms throughout East Asia and Southeast Asia. Beef noodle soup was first created by the Hui people (a Chinese Muslim ethnic group) during the Tang Dynasty of China. The red braised beef noodle soup was invented in Kaohsiung, Taiwan, by veterans who fled from mainland China during the Chinese civil war. In Taiwan it is considered a national dish, and Taipei City has held the International Beef Noodle Festival several times, where a variety of chefs and restaurants compete for the 'best beef noodle' title in Taiwan. At the 2011 Taipei International Beef Noodle Festival, Taiwan beef noodle was translated as "New Row Mian". The name follows the example of Japanese sushi and Korean kimchi, whose names are literal renderings from the local language, and highlights the unique characteristics of the dish. Accordingly, we selected "New Row Mian" as the topic for exploring related places in the Facebook social network.

Numerous places, and the check-in behaviors at these places, can form public opinion, for example hot places and regions with a high density of places. In which administrative regions are the related places popular? Unfortunately, it is difficult to get the administrative regions of places from Facebook. The 1st and 2nd divisions of Taiwan Island include 6 special municipalities, 3 cities, and 10 counties. The 3rd division of Taiwan includes 352 regions, which are 170 districts, 13 county-controlled cities, and 169 townships [3-4]. In this study, we position the places in the 1st, 2nd, and 3rd divisions of administrative regions.

In this study, we propose a data mining method for the Facebook social network and take "New Row Mian" in Taiwan as an example. In section 2, the methods to collect places, to update the data of places, and to position the administrative regions of places are introduced. In section 3, we demonstrate the results. Finally, the conclusion and future work are given in section 4.

II. RELATED WORK

Current research on check-ins in social networks is divided into two types. The first type is based on big data and uses technologies of Geographic Information Science [5-8]. The other type is the Ethnography-style study. Big-data based check-in studies generally depend on open Application Programming Interfaces (APIs) to acquire data from open social networking platforms and then perform data mining and analysis. The disadvantage of this method is that the topic cannot be discussed in depth with the persons concerned.

These studies almost always use Foursquare as the research field. Facebook is the most popular check-in platform in the world; however, few studies use it as the research field. One of the major reasons is that the Facebook platform only allows limited data access. The other reason is that the Facebook API often changes. Therefore, it is difficult to acquire mass data from the Facebook platform by programming. Foursquare is another platform on which users can check in at places. When a Foursquare user checks in at a place, the information is simultaneously displayed on Twitter. All of the information on Twitter is open. Accordingly, mass data related to Foursquare can be acquired from the Twitter platform.

Ethnography is the systematic study of people and cultures. It is designed to explore cultural phenomena where the researcher observes society from the point of view of the subject of the study. An ethnography is a means to represent graphically and in writing the culture of a group. The resulting field study or case report reflects the knowledge and the system of meanings in the lives of a cultural group. For an Ethnography-style study, our study can help to find the hot regions and hot locations. It is worth mentioning that each Facebook place has a corresponding web page. The names of the Facebook users who check in at this place can be found on this page. By following the hyperlinks of the Facebook users on the place page, we can visit the web pages of these users and acquire some information related to them. In this indirect way, the real persons behind these Facebook users can be contacted without much difficulty. These specific Facebook users could be the candidates in an Ethnography-style study. For the general public, many of the Facebook places in this paper are popular locations that many people have visited. Moreover, a Facebook place page with a higher like count indicates that many Facebook users follow this page. Such a place can serve as an outdoor recreation location, or its page as a good web page to visit on the Internet.

III. RESEARCH METHOD

Our research method is divided into 3 steps: place collection, content maintenance, and place positioning [11].

A. Place Collection

The data of Facebook places is open data. With an access token ak, a developer can acquire the data from Facebook servers. The ak of a developer can be requested from the web page with url "https://developers.facebook.com". Each Facebook place has a unique identification, termed id. A developer can acquire several identifications by sending a request to the Facebook server over the Hypertext Transfer Protocol Secure (https) protocol. The format of this request is shown in Fig. 1, where c is a latitude and longitude coordinate, r is a distance, and n is a limit number. When a Facebook server receives this request, it returns at most n identifications in the geographical circle-area A with center c and radius r. If the number of places in A is larger than n, the responded data will contain a next url. The data responded from the Facebook server is in JSON format [9] and encoded in utf-8 format [10]. Figure 2 presents an overview of the JSON formatted data, in which there are data and paging fields. The data field contains the data of n places. The data of each place includes the identification, name, location, and category of this place. The next field contains the next url if there are other places in this area.

Format: https://graph.facebook.com/search?type=place&center=c&distance=r&access_token=ak&limit=n
Example: https://graph.facebook.com/search?type=place&center=24.069093,120.7127943&distance=150&access_token=1368411763*****|k95dZzlRoYVqg9I9NF_QxU*****&limit=50

Fig. 1. The https request format for acquiring identifications from the Facebook server

{
  "data": [ {data of the first place}, {data of the second place}, ..., {data of the nth place} ],
  "paging": {
    "cursors": {...},
    "next": "the next url"
  }
}

Fig. 2. An overview of the responded JSON formatted data from the Facebook server

B. Content Maintenance

The detailed data of a place can be acquired by sending a request with its place id, according to the format shown in Fig. 3. After the Facebook server receives the request, it returns the data of this place. The data includes the name, location (latitude and longitude), category, description, about, checkins, and so on, where 'checkins' is the number of check-in behaviors at this place.

Format: https://graph.facebook.com/id/?access_token=ak
Example: https://graph.facebook.com/1789770481248040/?access_token=1368411763*****|k95dZzlRoYVqg9I9NF_QxU*****

Fig. 3. A request for maintaining the data of a place

Fig. 4. The boundary of Kaohsiung City in Taiwan
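
As an illustration of the requests in Fig. 1 and Fig. 3 and of the paged response in Fig. 2, the following Python sketch builds the two request urls and follows the next url until the server reports no further page. The function names (build_search_url, build_place_url, parse_page, collect_places), the max_pages guard, and the caller-supplied fetch function are assumptions made for illustration and are not part of the original method. The endpoint and parameter names are those shown in the figures; the sketch does not check whether the Facebook server still accepts them.

```python
import json
from urllib.parse import urlencode

GRAPH_URL = "https://graph.facebook.com"


def build_search_url(center, distance, access_token, limit):
    """Build the place-search request of Fig. 1.

    center is a (latitude, longitude) pair c, distance is the radius r,
    access_token is ak, and limit is n.
    """
    lat, lon = center
    query = urlencode(
        {
            "type": "place",
            "center": f"{lat},{lon}",
            "distance": distance,
            "access_token": access_token,
            "limit": limit,
        },
        safe=",|*",  # keep the comma and token characters readable, as in Fig. 1
    )
    return f"{GRAPH_URL}/search?{query}"


def build_place_url(place_id, access_token):
    """Build the per-place request of Fig. 3."""
    query = urlencode({"access_token": access_token}, safe="|*")
    return f"{GRAPH_URL}/{place_id}/?{query}"


def parse_page(text):
    """Split one JSON response (Fig. 2) into its places and its next url.

    Returns (places, next_url); next_url is None when no further page exists.
    """
    payload = json.loads(text)
    places = payload.get("data", [])
    next_url = payload.get("paging", {}).get("next")
    return places, next_url


def collect_places(first_url, fetch, max_pages=100):
    """Follow next urls and gather places keyed by id.

    fetch is a hypothetical caller-supplied function that takes a url and
    returns the utf-8 decoded JSON text of the response. max_pages is an
    assumed safety limit, not something the paper specifies.
    """
    places_by_id = {}
    url = first_url
    pages = 0
    while url is not None and pages < max_pages:
        places, url = parse_page(fetch(url))
        for place in places:
            places_by_id[place["id"]] = place  # duplicates across pages collapse
        pages += 1
    return places_by_id
```

Passing fetch as a parameter keeps the sketch independent of any particular https client, so the paging logic can be exercised with recorded responses before it is pointed at a live server.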