Research Article a Behavior-Driven Forum Spammer Recognition Method with Its Application in Automobile Forums

Hindawi Mathematical Problems in Engineering Volume 2021, Article ID 7682579, 11 pages https://doi.org/10.1155/2021/7682579 Research Article A Behavior-Driven Forum Spammer Recognition Method with Its Application in Automobile Forums Han Su,1 Minglun Ren,1 Anning Wang,1 Xiaoan Tang ,1 Xin Ni,2 and Zhao Fang1,3 1School of Management, Hefei University of Technology, Hefei 230009, China 2Department of Design, Information System and Inventive Processes, INSA de Strasbourg, Strasbourg, France 3Department of Information Systems and Analytics, National University of Singapore, 13 Computing Drive, Singapore Correspondence should be addressed to Xiaoan Tang; [email protected] Received 10 June 2021; Revised 9 August 2021; Accepted 13 August 2021; Published 31 August 2021 Academic Editor: Xin Tian Copyright © 2021 Han Su et al. )is is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Forum comments are valuable information for enterprises to discover public preferences and market trends. However, extensive marketing and malicious attack behaviors in forums are always an obstacle for enterprises to make effective use of this information. And these forum spammers are constantly updating technology to prevent detection. )erefore, how to accurately recognize forum spammers has become an important issue. Aiming to accurately recognize forum spammers, this paper changes the research target from understanding abnormal reviews and the suspicious relationship among forum spammers to discover how they must behave (follow or be followed) to achieve their monetary goals. First, we classify forum spammers into automated forum spammers and marketing forum spammers based on different behavioral features. )en, we propose a support vector machine-based automated spammer recognition (ASR) model and a k-means clustering-based marketing spammer recognition (MSR) model. )e experimental results on the real-world labelled dataset illustrate the effectiveness of our methods on clas- sification spammer from common users. To the best of our knowledge, this work is among the first to construct behavior-driven recognition models according to the different behavioral patterns of forum spammers. 1. Introduction happened, the collective memory becomes relatively resis- tant to competing information [3]. )us, fake reviews in In recent years, with the background of social media, forums forums are now the biggest problem for forum users and have become a specific community for users who have the enterprises. same interests. An increasing number of users post related Lots of current studies indirectly identify fake reviews by reviews in forums [1]. )ese reviews cover a wide variety of recognizing forum spammers based on behavioral features content, ranging from breaking news, discussions on various or sentiment analysis methods [4–7]. However, forum topics, posts about one’s personal life, and the sharing of spammers are constantly updating their technology or activities and interests [2]. As a significant platform for the changing their posting methods to prevent them from being users’ discussion, some forums maintain a high level of user detected by the fake reviews recognition system, which activity. In addition, the feedback from forum users is makes many methods no longer useful for recognizing fo- usually an important source of information for potential rum spammers. Although the forum spammers try to dis- consumers to access product features. Enterprises also aim guise themselves as ordinary users, this purposeful posting to discover product defects and real users’ requirements via will eventually show different behaviors from ordinary users. reviews in forums. )erefore, this paper changes the research target from Due to the strong negative response to the initial ex- understanding abnormal reviews and the suspicious rela- posure to erroneous information, it is difficult to correct tionship among forum spammers to discovering how they such influences later. Once a network agrees on what must behave (follow or be followed) to achieve their 2 Mathematical Problems in Engineering monetary goals. Firstly, we classify forum users as automated integrated semantic analysis, was quite promising in the spammers, marketing spammers, and normal users real-world case study, as confirmed with both supervised according to the different behavior patterns of forum users. learning and unsupervised learning techniques by com- Automated spammers are those forum users who are paring a nonsemantic and semantic analysis. As demon- controlled by the spam software. )ey disguise themselves as strated in [16], by analyzing the features of forum users, normal users who display an intention to purchase the forum spammer, and forums, the authors found that every related product or express dissatisfaction toward a related forum has many fake reviews, including some forums with product. Normally, automated spammers mislead forum good reputations. users by posting reviews with a biased emotional tendency. However, our work found that the methods mentioned Marketing spammers are real users who are hired by a spam above are no longer working well. For instance, most users company. In contrast to automated spammers, marketing are now able to easily distinguish rough and fake websites spammers disguise themselves as leading users in forums to with many advertisements, so the number of fake reviews promote related products. )ey post deep, detailed, and with URLs [12, 13] has become much lower. Additionally, positive reviews to overstate the quality of related products. we found that the recognition effect of the method in [14] In general, the more detailed analysis, the more useful in- would be compromised if a large number of forum spam- formation for forum users [8–10]. Moreover, marketing mers have occupied the forums. In our study, the abnormal spammers, as a new but contemptible marketing mode, are feature named spamicity in the first post in [5] does not work emerging in many forums [11]. )en, we propose a be- currently for recognizing forum spammers. At the same havior-driven automated spammer recognition (ASR) time, we found that marketing spammers have a similar model and a marketing spammer recognition (MSR) model abnormal behavioral feature named the submission time of to recognize forum spammers based on the above three types replies in [5] but we cannot find the same behavioral pattern of forum users. Final experimental results illustrate our among automated spammers. In [16], the method that behavior-driven recognition models are able to accurately recognizes spam pages based on spam content features is still detect forum spammers. effective, but this method cannot efficiently recognize forum )e paper is organized as follows: Section 2 reviews the spammers who have many reviews that are similar to those related works. In Section 3, we define some variables to of normal users. In [6], the authors mentioned that once a measure the behavior features of forum users. )e proposed mission is finished, a paid spam poster normally discards the ASR and MSR models are introduced in Section 4. Subse- user ID and never uses it again, potential paid spam posters quently, we describe the experimental dataset and discuss are not willing to continue their activities for a long time. the main experimental results in Section 5. Finally, we In recent years, research on spammers in social media conclude with a summary in Section 6. and e-business websites has been increasing. Liu [17] proposed a two-stage cascading model, named ProZombie, 2. Related Works which balanced effectiveness and accuracy well in recognizing spammers in Weibo. In [18], message content, user At present, the research on recognizing spammers and fake behavior, and social relationship information were fully used reviews is mainly focused on social media like Twitter. Some to recognize spammers in Weibo. )e work by Hayati et al. e-business websites, such as Amazon and Taobao, have also [19] proposed using a self-organizing map and neural achieved more research attention. In terms of recognizing networks to determine the features of spammers on the forum spammers, a few studies have been conducted in Internet. )ey classified spammers into four categories based recent years, mainly focusing on the recognition of fake on the different behavioral patterns of spammers: content forums and forum spam automator tools. Some recognition submitters, profile editors, content viewers, and mixed be- methods based on abnormal text content have also been havior. Radford et al. [20] constructed an unsupervised proposed by researchers. Some researchers attempt to use representation learning system, which reached an accuracy abnormal URL characteristics in reviews and the link of 91.8% in sentiment analysis by using reviews in Amazon structure of the graph rooted at the posted URL to recognize as training datasets. Furthermore, the authors in [12, 21] posts from the forum spammers [12, 13]. Additionally, recognized fake reviews via the difference of emoticons, contents unrelated to the target posts in the forum were used URLs, @ symbols, and photos in different reviews from to recognize forum spammers [14]. Shin [15] discovered spammers and normal users. Dewang et al. [22] proposed a some features and operational mechanisms of a forum spam spam detection framework combining the PageRank algo- automator tool

Research Article a Behavior-Driven Forum Spammer Recognition Method with Its Application in Automobile Forums

Address Munging: the Practice of Disguising, Or Munging, an E-Mail Address to Prevent It Being Automatically Collected and Used

Adversarial Web Search by Carlos Castillo and Brian D

Battling the Internet Water Army: Detection of Hidden Paid Posters

The History of Digital Spam

Image Spam Detection: Problem and Existing Solution

Understanding the Business Ofonline Pharmaceutical Affiliate Programs

Internet Security

Administrator's Guide for Synology Mailplus Server

An Empirical Assessment of the CAN SPAM Act

Criminal Abuse of Domain Names Bulk Registration and Contact Information Access

Anti-Spam Methods

Spam (Spam 2.0) Through Web Usage