Arxiv:2003.05096V1 [Cs.MM] 11 Mar 2020
Total Page:16
File Type:pdf, Size:1020Kb
Exploring the Role of Visual Content in Fake News Detection Juan Cao1;2, Peng Qi1;2[0000−0002−6458−8320], Qiang Sheng1;2[0000−0002−2481−5023], Tianyun Yang1;2, Junbo Guo1, and Jintao Li1 1 Key Laboratory of Intelligent Information Processing & Center for Advanced Computing Research, Institute of Computing Technology, CAS, China fcaojuan,qipeng,shengqiang18z,yangtianyun19z,guojunbo,[email protected] 2 University of Chinese Academy of Sciences, China Abstract. The increasing popularity of social media promotes the pro- liferation of fake news, which has caused significant negative societal effects. Therefore, fake news detection on social media has recently be- come an emerging research area of great concern. With the development of multimedia technology, fake news attempts to utilize multimedia con- tent with images or videos to attract and mislead consumers for rapid dissemination, which makes visual content an important part of fake news. Despite the importance of visual content, our understanding about the role of visual content in fake news detection is still limited. This chapter presents a comprehensive review of the visual content in fake news, including the basic concepts, effective visual features, representa- tive detection methods and challenging issues of multimedia fake news detection. This chapter can help readers to understand the role of visual content in fake news detection, and effectively utilize visual content to assist in detecting multimedia fake news. Keywords: fake news detection · fake-news images · social media · im- age forensics · image repurposing · multimedia · multi-modal · deep learn- ing · computer vision. 1 Introduction 1.1 Motivation arXiv:2003.05096v1 [cs.MM] 11 Mar 2020 Social media platforms, such as Twitter1 and Chinese Sina Weibo2, have become important access where people acquire the latest news and express their opin- ions freely34. However, the convenience and openness of social media have also 1 https://twitter.com/ 2 https://weibo.com/ 3 http://www.cac.gov.cn/2019-08/30/c 1124938750.htm 4 https://www.journalism.org/2018/09/10/news-use-across-social-media-platforms- 2018/ 2 J Cao et al. promoted the proliferation of fake news, i.e., news with intentionally false infor- mation, which not only disturbed the cyberspace order but also caused many detrimental effects on real-world events. For example, in the political field, during the month before the 2016 U.S. presidential election campaign, the Americans encountered between one and three fake stories on average from known publish- ers [1], which inevitably misled the voters and influenced the election results; In the economic field, a piece of fake news claiming that Barack Obama was injured in an explosion wiped out $130 billion in stock value5; In the social field, dozens of innocent people were beaten to death by locals in India because of a piece of fake news about child trafficking that was widely spread on social media6. Hence, the automatic detection of fake news has become an urgent problem of great concern in recent years [18, 33, 49]. The development of multi-media technology promotes the evolution of self- media news from text-based posts to multimedia posts with images or videos, which attracts more attention from consumers and provides more credible sto- rytelling. On the one hand, as a vivid description form, the visual content in- cluding images and videos is more attractive and salient than plain text and consequently boosts the news propagation. For instance, tweets with images get 18% more clicks, 89% more likes, and 150% more retweets than those without images7. On the other hand, visual content is often used as evidence of a story in our common sense, which can increase the credibility of the news8. Unfor- tunately, this advantage is also taken by fake news. For rapid dissemination, fake news usually contains misrepresented or even tampered images or videos to attract and mislead consumers. As a result, visual content has become an important part of fake news that cannot be neglected, making multimedia fake news detection a new challenge. Multimedia fake news detection aims at effectively utilizing the information of several modalities, such as textual, visual and social modalities, to detect fake news. Visual modality can provide abundant visual information, which is pre- liminarily proven to be effective in fake news detection [15]. However, although the importance of exploiting visual content have been revealed, our understand- ing about the role of visual content in fake news detection remains limited. To further facilitate research on this problem, we present a comprehensive review of the visual content in fake news in this chapter, including the problem defi- nition, available visual characteristics, representative detection approaches and challenging problems. 5 https://www.telegraph.co.uk/finance/markets/10013768/Bogus-AP-tweet-about- explosion-at-the-White-House-wipes-billions-off-US-markets.html 6 https://www.washingtonpost.com/world/asia pacific/as-mob-lynchings- fueled-by-whatsapp-sweep-india-authorities-struggle-to-combat-fake- news/2018/07/02/683a1578-7bba-11e8-ac4e-421ef7165923 story.html 7 https://www.invid-project.eu/tools-and-services/invid-verification-plugin/ 8 https://www.businesswire.com/news/home/20190204005613/en/Visual- SearchWins-Text-Consumers%E2%80%99-Trusted-Information Exploring the Role of Visual Content in Fake News Detection 3 1.2 Problem Definition In this subsection, we introduce the concept of fake news and analyze the differ- ent types of visual content in fake news. Fake news is widely defined as news articles that are intentionally and ver- ifiably false and could mislead consumers [1, 20, 33]. On the context of social multimedia, news articles refer to news posts with multimedia content that are published by users, so the general definition of fake news has been further re- fined [3, 5, 6, 46]. Formally, we state the refined definition as follows, Definition 1.1 A piece of fake news is a news post that shares mul- timedia content that does not faithfully represent the event that it refers to. In real-world scenarios, the visual content in fake news can be broadly clas- sified into three categories: (1) visual content that is deliberately manipulated (also known as tampering, doctoring or photoshopping) or automatically gen- erated by deep generative networks, which equals to fake images/videos in our common sense (see Figure 1a), (2) visual content from an irrelevant event, such as a past event, a staged work or an artwork, that is reposted as being captured in the context of an emerging event (see Figure 1b), or (3) visual content that is real (not edited) but is published together with a false claim about the de- picted event (see Figure 1c). All examples in Figure 1 fall under our definition of fake news, because the images and associated texts jointly convey the mislead- ing information regardless of the veracity of the textual or the visual content itself. For this reason, fake news is also referred to as misleading content [6] or fauxtography [46] in the context of social multimedia. (a) (b) (c) Fig. 1: Examples of the visual content in fake news: (a) A tampered image where Putin is spliced on the middle seat at G-20 to show that he is in the center position of an intense discussion among other world leaders; (b) A real image captured in 2009 New York air crash, but it is claimed to be the wrecked Malaysia Airlines MH370 in 2014; (c) A real image taken at the moment when Hillary Clinton accidentally stumbled, but it was maliciously interpreted as evidence of Clinton's failing health. 4 J Cao et al. 1.3 Organization The remainder of this chapter is organized as follows. In Chapter 2, we introduce available visual features for fake news detection. We continue to present existing approaches utilizing visual content to detect fake news in Chapter 3. In Chapter 4, we discuss several challenging problems for multimedia fake news detection. Finally, we summarize available data repositories, tools (or software systems) and relevant competitions about multimedia fake news detection research in the appendix. 2 What Visual Content Tells? Visual content has been shown as an important promoter for fake news propa- ganda9. At the same time, visual content also tells abundant cues for detecting fake news. To capture the distinctive characteristics of fake news, works ex- tracted visual features from visual content (generally, images and videos), which can be categorized into four types: forensics features, semantic features, statis- tical features and context features. 2.1 Forensics Features Since the addressed problem is the verification of multimedia posts, one rea- sonable approach would be to directly verify the truth of visual content, i.e., whether the image or video is captured in the event. Intuitively, if the visual content has undergone manipulation or severe re-compression, or is generated by deep learning techniques, the news post that it belongs to is likely to be fake. To access the authenticity, (blind) forensics features which can highlight the dig- itally edited traces of the visual content, are exploited in fake news detection from different perspectives, including the manipulation detection, generation de- tection and re-compression detection. Manipulation Detection Manipulation detection aims at looking for patterns or discontinuities left by operations such as splicing, copy-move and removal. The splicing refers to copy- ing a part of one image and inserting it into another, while the copy-move and removal both happen in the same image. Because very few works [3] directly used these features in fake news detection yet, we also investigated the features mentioned in related works and summarized as follows: { Camera-related features are particular patterns caused by the imaging pipeline, such as the sensor pattern noise and color filter array interpola- tion patterns, which can be destroyed by manipulation.