SocialSpamGuard: A Data Mining-Based Spam Detection System for Social Media Networks

Xin Jin
Department of Computer Science, University of Illinois at Urbana-Champaign
201 N. Goodwin Ave., Urbana, IL USA
[email protected]

Cindy Xide Lin
Department of Computer Science, University of Illinois at Urbana-Champaign
201 N. Goodwin Ave., Urbana, IL USA
[email protected]

Jiebo Luo
Kodak Research Laboratories, Eastman Kodak Company
1999 Lake Avenue, Rochester, USA
[email protected]

Jiawei Han
Department of Computer Science, University of Illinois at Urbana-Champaign
201 N. Goodwin Ave., Urbana, IL USA
[email protected]

ABSTRACT

We have entered the era of social media networks represented by Facebook, Twitter, YouTube and Flickr. Internet users now spend more time on social networks than on search engines. Business entities and public figures set up social networking pages to enhance direct interactions with online users. Social media systems depend heavily on users for content contribution and sharing. Information spreads across social networks quickly and effectively. However, at the same time social media networks become susceptible to different types of unwanted and malicious spammer or

actor/director, artist, athlete, author, book, health/beauty, movie, cars, clothing, community, food/beverages, games, toys, government organization, interest, sports, TV channel, TV show, and website. The most popular page at present (Texas Hold'em Poker) has 40 million fans, and there are around 3000 pages that have more than 500,000 fans, with an average of 2 million. Fans can not only see information submitted by the page but also post comments, photos, and videos to the page.

Social media websites allow users to freely distribute and share information with friends.