Artificial Intelligence As a Disinformation Engine Revision

Total Page:16

File Type:pdf, Size:1020Kb

Artificial Intelligence As a Disinformation Engine Revision Artificial Intelligence as a Disinformation Engine Revision #76 Dylan W. Wheeler University of New Hampshire Philosophy Thesis – In Review May 13, 2020 Wheeler 2 ACKNOWLEDGMENTS This research would not have been possible without the mentorship and direction given by Nicholas J. Smith, Ph.D. His undergraduate classes in topics of science, technology, and society originally inspired me to consider these pressing problems. Appreciation is due to Brandon Bryant for his expertise in cryptography and decentralized currencies. I am grateful for the wisdom of Tom Haines and Jaed Coffin, whom each helped me crystalize the broad scope of disinformation and its implications. My heartfelt thanks to my parents for their unwavering support that empowered me to enjoy so many opportunities. Wheeler 3 Table of Contents Introduction ..................................................................................................................................... 4 Contemporary Disinformation ........................................................................................................ 5 The Role of Artificial Intelligence .................................................................................................. 7 The Role of Social Media ............................................................................................................. 11 Fuel for the Engine........................................................................................................................ 15 Shutting Down the Engine ............................................................................................................ 18 The Implications ........................................................................................................................... 23 Social Media .............................................................................................................................. 23 Governments ............................................................................................................................. 25 Ideological Challenges .............................................................................................................. 26 Recommendations ......................................................................................................................... 28 Awareness Through Digital Literacy ........................................................................................ 29 Transparency Through New Technologies ............................................................................... 30 Increase the Cost of Spam ......................................................................................................... 33 Conclusion .................................................................................................................................... 36 Figures........................................................................................................................................... 38 Appendix A ................................................................................................................................... 39 Bibliography ................................................................................................................................. 40 Wheeler 4 Introduction New forms of artificial intelligence (A.I.) can autonomously replace people in existing images or videos with the likeness of others. The resulting media are called deepfakes. These replacements could take any form—anything from basic face-swapping (think: Snapchat augmented reality filters) to full-body mapping, where a person’s body can be mapped onto videos of professional dancers to make it appear that they, too, are highly coordinated. People of the Internet are now using various kinds of A.I. that are excellent at face-swapping to deepfake actor Nicholas Cage into various movies that did not cast him. These algorithms are allowing anyone in the world to easily make it look like a target is doing or saying anything. In the wrong hands, these innocent algorithms enable the mass manufacture of disinformation (or “fake news”), false information spread deliberately to deceive harmfully. If we are not careful, we could enter a post-truth future, where the truth about issues becomes overshadowed by emotionally charged headlines and discourse, making it even more challenging to identify truthful and genuine media. With little ways to determine which images and videos have been synthesized, the most sensationalist disinformation will go viral and attract the most attention, usually at the expense of a population or demographic. In this paper, I argue that AI- powered disinformation tools are becoming increasingly ubiquitous. I argue that the best way to address disinformation is by facilitating widespread digital literacy while building new technologies that provide increased context for news articles and social media posts. My central recommendations will be advocating for digital literacy education, investments in technologies to provide ample context and transparency, and to pursue new technologies that deter inauthentic behavior. Wheeler 5 Contemporary Disinformation Disinformation exists in many forms but is generally characterized by a deliberate attempt to mislead and deceive people. American engineer and YouTuber Destin Sandlin interviewed Renée DiResta, a 2019 Mozilla Fellow in Media, Misinformation, and Trust, who said that perpetrators of disinformation can be both corporations and individuals and have two primary motivations: financial and ideological.1 That is, they usually seek to extract advertisement revenue from their manufactured content, or try to manipulate public opinion (usually for financial or political gain). This reality is not surprising, given that money is power in our global economic system. Disinformation is not new, either, for its earlier forms were often government-sponsored propaganda. Propaganda has taken various forms, with earlier examples being pamphlets posted in town squares. This model’s cost per impression ratio is high; governments would design and manufacture these pamphlets for the masses, but only people who were walking by and happened to stop would see them. Computers and computer graphics have helped pave the way for the next generation of disinformation: synthetic media, or algorithmically created/modified media. Contemporary disinformation is far more dangerous because it is created fully autonomously using A.I. that can synthesize and spread it without a single human interaction. No longer must curious people seek out and read propaganda; it is now being delivered to peoples’ inboxes and newsfeeds, without consent, and backed by social features that make engaging with it easier than ever before. Disinformation can now reach more people with comparatively little cost and effort. Industry-grade tools like Adobe Photoshop (released in 1988) and Adobe After Effects (released five years later in 1993) gave humanity the ability to manipulate images and videos at 1 Sandlin, Destin W. 2019. Manipulating the YouTube Algorithm. Wheeler 6 large scale for the first time. No longer was it as expensive or time-consuming to cut and paste physical pieces of paper for mass production—we could now produce near-perfect computer- generated imagery (CGI) from the digital realm because all the computationally-intensive rendering took place by a computer. However, the use of Adobe’s robust software programs is still costly and offers an overwhelmingly vast selection of features. Only a team of talented graphic design artists can, with time and money, learn how to doctor images and videos convincingly. This phenomenon is why CGI is typically solely used by large-scale production companies with massive budgets like Marvel Studios, whose budget for Avengers: Endgame was $356 million.2 As with all forms of nascent technology, the landscape is rapidly changing. With the advent of community-driven content-generating algorithms, the process of creating disinformation is automated. Developers from all over the world are researching, refining, and optimizing these algorithms and are making them free to the public. Instead of needing an expensive computer graphics company to alter the visual data with proprietary tools, we can have free algorithms do the job just as well—if not better. FaceSwap, for example, is an open- source algorithmic face-swapping tool that anyone with an internet connection can utilize.3 The software is maintained by over sixty contributors who have built an infrastructure that takes this highly technical process and simplifies it to a matter of (1) gathering image data for a subject and (2) metaphorically pressing “go.” Real-Time Voice Cloning is another open-source tool that can not only clone a voice from mere minutes of a sample sound but can also make that cloned voice 2 n.d. Avengers: Endgame. 3 https://github.com/deepfakes/faceswap Wheeler 7 speak whatever the author desires.4 The software has received rave reviews5 and spawned over 2,800 open-source variations. Research and progress in these spaces are booming. For little-to-no cost, anyone can use these algorithms to produce deepfakes that look or sound utterly genuine to our senses. We are observing a shift in resources away from expensive creativity and talent toward inexpensive training data and processing power. Data and computational speed are resources that are becoming more accessible among individuals and corporations all
Recommended publications
  • Arxiv:1907.05276V1 [Cs.CV] 6 Jul 2019 Media Online Are Rapidly Outpacing the Ability to Manually Detect and Respond to Such Media
    Human detection of machine manipulated media Matthew Groh1, Ziv Epstein1, Nick Obradovich1,2, Manuel Cebrian∗1,2, and Iyad Rahwan∗1,2 1Media Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA 2Center for Humans & Machines, Max Planck Institute for Human Development, Berlin, Germany Abstract Recent advances in neural networks for content generation enable artificial intelligence (AI) models to generate high-quality media manipulations. Here we report on a randomized experiment designed to study the effect of exposure to media manipulations on over 15,000 individuals’ ability to discern machine-manipulated media. We engineer a neural network to plausibly and automatically remove objects from images, and we deploy this neural network online with a randomized experiment where participants can guess which image out of a pair of images has been manipulated. The system provides participants feedback on the accuracy of each guess. In the experiment, we randomize the order in which images are presented, allowing causal identification of the learning curve surrounding participants’ ability to detect fake content. We find sizable and robust evidence that individuals learn to detect fake content through exposure to manipulated media when provided iterative feedback on their detection attempts. Over a succession of only ten images, participants increase their rating accuracy by over ten percentage points. Our study provides initial evidence that human ability to detect fake, machine-generated content may increase alongside the prevalence of such media online. Introduction The recent emergence of artificial intelligence (AI) powered media manipulations has widespread societal implications for journalism and democracy [1], national security [2], and art [3]. AI models have the potential to scale misinformation to unprecedented levels by creating various forms of synthetic media [4].
    [Show full text]
  • Editorial Algorithms
    March 26, 2021 Editorial Algorithms The next AI frontier in News and Information National Press Club Journalism Institute Housekeeping - You will receive slides after the presentation - Interactive polls throughout the session - Use QA tool on Zoom to ask questions - If you still have questions, reach out to Francesco directly [email protected] or connect on twitter @fpmarconi What will your learn during today’s session 1. How AI is being used and will be used by journalists and communicators, and what that means for how we work 2. The ethics of AI, particularly related to diversity, equity and representation 3. The role of AI in the spread of misinformation and ‘fake news’ 4. How journalists and communications professionals can avoid pitfalls while taking advantage of the possibilities AI provides to develop new ways of telling stories and connecting with audiences Poll time! What’s your overall feeling about AI and news? A. It’s the future of news and communications! B. I’m cautiously optimistic C. I’m cautiously pessimistic D. It’s bad news for journalism Journalism was designed to solve information scarcity Newspapers were established to provide increasingly literate audiences with information. Journalism was originally designed to solve the problem that information was not widely available. The news industry went through a process of standardization so it could reliably source and produce information more efficiently. This led to the development of very structured ways of writing such as the “inverted pyramid” method, to organizing workflows around beats and to plan coverage based on news cycles. A new reality: information abundance The flood of information that humanity is now exposed to presents a new challenge that cannot be solved exclusively with the traditional methods of journalism.
    [Show full text]
  • Deepfaked Online Content Is Highly Effective in Manipulating People's
    Deepfaked online content is highly effective in manipulating people’s attitudes and intentions Sean Hughes, Ohad Fried, Melissa Ferguson, Ciaran Hughes, Rian Hughes, Xinwei Yao, & Ian Hussey In recent times, disinformation has spread rapidly through social media and news sites, biasing our (moral) judgements of other people and groups. “Deepfakes”, a new type of AI-generated media, represent a powerful new tool for spreading disinformation online. Although Deepfaked images, videos, and audio may appear genuine, they are actually hyper-realistic fabrications that enable one to digitally control what another person says or does. Given the recent emergence of this technology, we set out to examine the psychological impact of Deepfaked online content on viewers. Across seven preregistered studies (N = 2558) we exposed participants to either genuine or Deepfaked content, and then measured its impact on their explicit (self-reported) and implicit (unintentional) attitudes as well as behavioral intentions. Results indicated that Deepfaked videos and audio have a strong psychological impact on the viewer, and are just as effective in biasing their attitudes and intentions as genuine content. Many people are unaware that Deepfaking is possible; find it difficult to detect when they are being exposed to it; and most importantly, neither awareness nor detection serves to protect people from its influence. All preregistrations, data and code available at osf.io/f6ajb. The proliferation of social media, dating apps, news and copy of a person that can be manipulated into doing or gossip sites, has brought with it the ability to learn saying anything (2). about a person’s moral character without ever having Deepfaking has quickly become a tool of to interact with them in real life.
    [Show full text]
  • The Nooscope Manifested: AI As Instrument of Knowledge Extractivism
    AI & SOCIETY https://doi.org/10.1007/s00146-020-01097-6 ORIGINAL ARTICLE The Nooscope manifested: AI as instrument of knowledge extractivism Matteo Pasquinelli1 · Vladan Joler2 Received: 27 March 2020 / Accepted: 14 October 2020 © The Author(s) 2020 Abstract Some enlightenment regarding the project to mechanise reason. The assembly line of machine learning: data, algorithm, model. The training dataset: the social origins of machine intelligence. The history of AI as the automation of perception. The learning algorithm: compressing the world into a statistical model. All models are wrong, but some are useful. World to vector: the society of classifcation and prediction bots. Faults of a statistical instrument: the undetection of the new. Adversarial intelligence vs. statistical intelligence: labour in the age of AI. Keywords Nooscope · Political economy · Mechanised knowledge · Information compression · Ethical machine learning 1 Some enlightenment The purpose of the Nooscope map is to secularize AI regarding the project to mechanise from the ideological status of ‘intelligent machine’ to one of reason knowledge instruments. Rather than evoking legends of alien cognition, it is more reasonable to consider machine learn- The Nooscope is a cartography of the limits of artifcial ing as an instrument of knowledge magnifcation that helps intelligence, intended as a provocation to both computer to perceive features, patterns, and correlations through vast science and the humanities. Any map is a partial perspec- spaces of data beyond human reach. In the history of science tive, a way to provoke debate. Similarly, this map is a mani- and technology, this is no news; it has already been pursued festo—of AI dissidents.
    [Show full text]
  • Detecting Synthetic Faces by Understanding Real Faces
    Detecting Synthetic Faces by Understanding Real Faces Shruti Agarwal Electrical Engineering and Computer Sciences University of California, Berkeley Technical Report No. UCB/EECS-2021-171 http://www2.eecs.berkeley.edu/Pubs/TechRpts/2021/EECS-2021-171.html August 4, 2021 Copyright © 2021, by the author(s). All rights reserved. Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission. Detecting Synthetic Faces by Understanding Real Faces By Shruti Agarwal A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Engineering – Electrical Engineering and Computer Sciences in the Graduate Division of the University of California, Berkeley Committee in charge: Professor Hany Farid, Chair Professor Emily Cooper Professor Alexei A. Efros Professor Ren Ng Summer 2021 Detecting Synthetic Faces by Understanding Real Faces Copyright 2021 by Shruti Agarwal 1 Abstract Detecting Synthetic Faces by Understanding Real Faces by Shruti Agarwal Doctor of Philosophy in Engineering – Electrical Engineering and Computer Sciences University of California, Berkeley Professor Hany Farid, Chair The creation of sophisticated fake videos has been largely relegated to Hollywood studios or state actors. Recent advances in deep learning, however, have democratized the creation of sophisticated and compelling fake images, videos, and audios. This synthetically-generated media – so-called deep fakes – continue to capture the imagination of the computer-graphics and computer-vision communities.
    [Show full text]
  • Deep Fakes Understanding the Illicit Economy for Synthetic Media
    Deep Fakes Understanding the illicit economy for synthetic media Author: Rob Volkert Contributions: Michael Eller, Dev Badlu, and Kyle Ehmke (Threat Connect) NISOS ™ Are deep fakes actually being weaponized? In this paper we will examine the illicit ecosystem for deep fakes, their technology evolution and migration paths from surface web to deep and dark sites, and uncover some of the actors creating and disseminating these videos. Nisos undertook research into deep fake technology (superimposing video footage of a face onto a source head and body) to determine if we could find the existence of a deep fake illicit digital underground economy or actors offering these services. Our research for this white paper focused specifically on the commoditization of deep fakes: whether deep fake videos or technology is being sold for illicit purposes on the surface, deep, or dark web. Our research found that indeed deep fakes are being commoditized but primarily on the surface web and in the open. Most deep fake commoditization appears to be focused on research and basic cloud automation for satire or parody purposes. The underpinnings of the illicit ecosystem is nonconsensual face swapping pornography where commoditization happens primarily via ad revenue and subscription fees. While we expected to find a community, we did not find evidence of a marketplace (selling as a service) for e-crime or disinformation purposes. We assess the lack of an underground economy for uses other than satire, parody, or pornography is due to the resource and technological barrier to entry, lack of convincing quality in the videos. In short, the dark market is not yet lucrative enough.
    [Show full text]
  • Fighting Deepfakes by Detecting GAN DCT Anomalies
    Journal of Imaging Article Fighting Deepfakes by Detecting GAN DCT Anomalies Oliver Giudice 1,* , Luca Guarnera 1,2 and Sebastiano Battiato 1,2 1 Department of Mathematics and Computer Science, University of Catania, 95125 Catania, Italy; [email protected] (L.G.); [email protected] (S.B.) 2 iCTLab s.r.l., Spinoff of University of Catania, 95125 Catania, Italy * Correspondence: [email protected] Abstract: To properly contrast the Deepfake phenomenon the need to design new Deepfake detection algorithms arises; the misuse of this formidable A.I. technology brings serious consequences in the private life of every involved person. State-of-the-art proliferates with solutions using deep neural networks to detect a fake multimedia content but unfortunately these algorithms appear to be neither generalizable nor explainable. However, traces left by Generative Adversarial Network (GAN) engines during the creation of the Deepfakes can be detected by analyzing ad-hoc frequencies. For this reason, in this paper we propose a new pipeline able to detect the so-called GAN Specific Frequencies (GSF) representing a unique fingerprint of the different generative architectures. By employing Discrete Cosine Transform (DCT), anomalous frequencies were detected. The b statistics inferred by the AC coefficients distribution have been the key to recognize GAN-engine generated data. Robustness tests were also carried out in order to demonstrate the effectiveness of the technique using different attacks on images such as JPEG Compression, mirroring, rotation, scaling, addition of random sized rectangles. Experiments demonstrated that the method is innovative, exceeds the state of the art and also give many insights in terms of explainability.
    [Show full text]
  • FBI Warns Companies of “Almost Certain” Threats from Deepfakes Matthew F
    Volume 4, No. 4 | July–August 2021 The Journal of Robotics, R A I L Artificial Intelligence & Law Editor’s Note: AI Developments Steven A. Meyerowitz National Security Commission on Artificial Intelligence Final Report Prioritizes U.S. Global Competition, Conflict Preparation, and Enhanced Protection of Privacy and Civil Liberties Katherine Sheriff and K.C. Halm Advancing America’s Dominance in AI: The 2021 National Defense Authorization Act’s AI Developments Jonathan M. Baker, Adelicia R. Cliffe, Kate M. Growley, Laura J. Mitchell Baker, and Michelle D. Coleman FDA Releases Action Plan for Artificial Intelligence/Machine Learning-Enabled Software as a Medical Device Nathan A. Brown, Christin Helms Carey, and Emily I. Gerry Deepfake Litigation Risks: The Collision of AI’s Machine Learning and Manipulation Erin M. Bosman, Christine E. Lyon, Michael Burshteyn, and Benjamin S. Kagel FBI Warns Companies of “Almost Certain” Threats from Deepfakes Matthew F. Ferraro, Jason C. Chipman, and Benjamin A. Powell Prepare for the Impending Wave of Facial Recognition Technology Regulation—Before It’s Too Late David J. Oberly Considerations in Machine Learning-Led Programmatic Underwriting Scott T. Lashway, Christopher A. Lisy, and Matthew M.K. Stein Making Safer Robotic Devices William D. Kennedy, James D. Burger, and Frank A. Bruno OFAC Settles With Digital Currency Services Provider for Apparent Violations of Multiple Sanctions Programs Gustavo J. Membiela and Natalia San Juan Report on ExamSoft’s ExamID Feature (and a Method to Bypass It) Gabe Teninbaum Current Developments: AI Research, Crypto Cases Make News Victoria Prussen Spears Everything Is Not Terminator: The AI Genie Bottle John Frank Weaver ® FULL COURT PRESS The Journal of Robotics, Artificial Intelligence & Law RAIL Volume 4, No.
    [Show full text]
  • Forbidden Knowledge in Machine Learning Reflections on the Limits of Research and Publication
    Forbidden knowledge in machine learning Reflections on the limits of research and publication Dr. Thilo Hagendorff [email protected] University of Tuebingen Cluster of Excellence “Machine Learning: New Perspectives for Science” Abstract – Certain research strands can yield “forbidden knowledge”. This term refers to knowledge that is considered too sensitive, dangerous or taboo to be produced or shared. Discourses about such publication restrictions are already entrenched in scientific fields like IT security, synthetic biology or nuclear physics research. This paper makes the case for transferring this discourse to machine learning research. Some machine learning applications can very easily be misused and unfold harmful consequences, for instance with regard to generative video or text synthesis, personality analysis, behavior manipulation, software vulnerability detection and the like. Up to now, the machine learning research community embraces the idea of open access. However, this is opposed to precautionary efforts to prevent the malicious use of machine learning applications. Information about or from such applications may, if improperly disclosed, cause harm to people, organizations or whole societies. Hence, the goal of this work is to outline norms that can help to decide whether and when the dissemination of such information should be prevented. It proposes review parameters for the machine learning community to establish an ethical framework on how to deal with forbidden knowledge and dual-use applications. Keywords – forbidden knowledge, machine learning, artificial intelligence, governance, dual-use, publication norms 1 Introduction Currently, machine learning research is, not much different from other scientific fields, embracing the idea of open access. Research findings are publicly available and can be widely shared amongst other researchers and foster idea flow and collaboration.
    [Show full text]
  • Applied Imagery Pattern Recognition Workshop
    0 Applied Imagery Pattern Recognition Workshop49th Annual IEEE AIPR 2020 Trusted Computing, Privacy, and Securing Multimedia Washington DC. (On-line) 13-15 October, 2020 1 2020 Applied Imagery Pattern Recognition Workshop AIPR is sponsored by: 2 2020 Applied Imagery Pattern Recognition Workshop Table of Contents Schedule At-a-Glance ................................................................................................................................................... 3 Welcome ....................................................................................................................................................................... 4 AIPR Executive Committee ........................................................................................................................................... 5 Guest Speakers ............................................................................................................................................................. 5 Schedule ...................................................................................................................................................................... 15 Tuesday, OctoBer 13 ............................................................................................................................................... 16 Session I AI and Cyber Physical .................................................................................................................. 16 Session II Security with IoT/Cloud/BlockChains .........................................................................................
    [Show full text]
  • Synthetic Media & Content
    Synthetic Media & Content New Realities, Synthetic Media, 00 01 02 03 04 05 06 07 08 09 10 11 12 Watch Closely Informs Strategy Act Now News & Information 2ND YEAR ON THE LIST Synthetic Media and Content KEY INSIGHT EXAMPLES DISRUPTIVE IMPACT EMERGING PLAYERS Synthetic media is created using artificial Watch for synthetic media to appear • Samsung Next Synthetic media con- intelligence. Algorithms use an initial set more frequently in 2021, representing • Loudly sists of algorithmically of data to learn—people, voices, pho- new opportunities and risks for busi- tos, objects, motions, videos, text, and • Endel generated digital con- nesses, and reshaping the entertainment, other types of media. The end result is service, and communications landscape. • Replica Studios tent, including audio, realistic-looking and realistic-sounding artificial digital content. Voice clones, • Lovo video, deepfakes, virtual voice skins, unique gestures, photos, • Modulate characters and environ- and interactive bots are all part of the ecosystem. Synthetic media can be used • Rephrase.ai ments, and more. The for practical reasons, such as generating • Synthesia technology will become characters in animated movies or act- ing as a stand-in for live action movies. • Alethea.ai an integral aspect of Synthetic media can automate dubbing in • Carv3d future XR experiences. foreign languages on video chats and fill in the blanks when video call frames are • Animatico dropped because of low bandwidth issues. • Narrativa Imagine an entirely new genre of soap • DeepNatural opera, where AI systems learn from your digital behavior, biometrics, and personal • Baidu Research data, and use details from your personal life for the storylines of synthetic charac- ters.
    [Show full text]
  • Deepfakes: Pro & Contra of Democratic Order
    DeepFakes: pro & contra of democratic order © 2020, Vitaliy Goncharuk Agenda 1. State of technology in 2020 2. State of technology in 3-5 years 3. Is it possible to control DeepFakes? 4. Synthetic media concept 5. Is DeepFakes the right to freedom and counter-technology against AI control? 6. Synthetic media and politics 7. Synthetic media and the change of centres of power 8. Dynamics of changes in society and dynamics of AI 9. Summary 10. Sources About the speaker Vitaliy Goncharuk: 1. Founder of Augmented Pixels, Inc (autonomous navigation for robotics) and HLAB.AI (AI for human behaviour prediction); 2. The Head of The AI Committee of Ukraine 3. Member of AD HOC AI Committee / Council De Europe What is Deepfakes? Deepfakes - synthetic media in which a person in an existing image or video is replaced with someone else's likeness (WiKi). Relevance Politics Taboo of the body Thefts Fake people Zoom + DeepFakes Watch this video Questions - Is it possible to detect DeepFakes? - Is it possible to control DeepFakes? - How will DeepFakes and synthetic content affect elections, society, government? - What is the impact on ethics, social norms in 1- 2-5-10 years? Trends Mass social media Facebook users Availability of DeepFakes technologies Photoshop DeepFakes - long - 3-4 clicks - expensive - 1 minute - cheap and mass use DeepFakes opportunities in 2020 It is possible to change, completely synthesize or simulate: • visual series (photo-video); • voice; • body, head, lip movements (photorealistic 3D model); • environment (including physical
    [Show full text]