Abusive Advertising: Scrutinizing Socially Relevant Algorithms in a Black Box Analysis to Examine Their Impact on Vulnerable Patient Groups in the Health Sector

Total Page:16

File Type:pdf, Size:1020Kb

Abusive Advertising: Scrutinizing Socially Relevant Algorithms in a Black Box Analysis to Examine Their Impact on Vulnerable Patient Groups in the Health Sector Abusive Advertising: Scrutinizing socially relevant algorithms in a Black Box analysis to examine their impact on vulnerable patient groups in the health sector Master Thesis by Martin Reber March 2, 2020 arXiv:2101.02018v1 [cs.CY] 4 Jan 2021 Technische Universit¨atKaiserslautern, Department of Computer Science, 67653 Kaiserslautern, Germany Examiner: Prof. Dr. Katharina Zweig Tobias Krafft Eigenst¨andigkeitserkl¨arung Hiermit versichere ich, dass ich die von mir vorgelegte Arbeit mit dem Thema Abusive Advertising: Scrutinizing socially relevant algorithms in a Black Box analysis to examine their impact on vulnerable patient groups in the health sectorselbstst¨andigverfasst habe, dass ich die verwendeten Quellen und Hilf- smittel vollst¨andigangegeben habe und dass ich die Stellen der Arbeit - ein- schließlich Tabellen und Abbildungen -, die anderen Werken oder dem Internet im Wortlaut oder dem Sinn nach entnommen sind unter Angabe der Quelle als Entlehnung kenntlich gemacht habe. Kaiserslautern, den 2.3.2020 Martin Reber ii Abstract The targeted direct-to-customer marketing of unap- proved stem cell treatments by a questionable online industry is directed at vulnerable users who search the Internet in the hope of a cure. This behavior espe- cially poses a threat to individuals who find themselves in hopeless and desperate phases in their lives. They might show low reluctance to try therapies that solely promise a cure but are not scientifically proven to do so. In the worst case, they suffer serious side-effects. Therefore, this thesis examines the display of advertise- ments of unapproved stem cell treatments for Parkin- son's Disease, Multiple Sclerosis, Diabetes on Google's results page. The company announced a policy change in September 2019 that was meant to prohibit and ban the practices in question. However, there was evidence that those ads were still being delivered. A browser extension for Firefox and Chrome was devel- oped and distributed to conduct a crowdsourced Black Box analysis. It was delivered to volunteers and vir- tual machines in Australia, Canada, the USA and the UK. Data on search results, advertisements and top stories was collected and analyzed. The results showed that there still is questionable advertising even though Google announced to purge it from its platform. iii Zusammenfassung Die Direktvermarktung von nicht zugelassenen Stammzellbehandlungen von der fragwurdigen¨ Online- Industrie dahinter zielt auf Patienten, die das Internet in der Hoffnung auf Heilung durchsuchen. Dieses Verhalten stellt eine besondere Gefahr fur¨ Menschen dar, die sich in verzweifelten Phasen in ihrem Leben befinden. Sie k¨onnten wenig Zuruckhaltung¨ zeigen, die beworbenen Therapien auszuprobieren, die zwar eine Heilung versprechen, diese aber nicht durch anerkann- te klinische Tests belegen k¨onnen. Im schlimmsten Fall erwarten die Patienten schwerwiegende Neben- wirkungen. Daher untersucht diese Theis die oben genannten Werbeanzeigen auf der Ergebnisseite der Online- Suchmaschine Google nach einer Anderung¨ der Plat- formrichtlinien im September 2019. Besonders ging es dabei um Anzeigen bezuglich¨ Behandlugen von Parkin- son, Multipler Sklerose und Diabetes, die in den Ver- haltensregeln ausdrucklich¨ verboten wurden. Browsererweiterungen fur¨ Firefox und Chrome wur- den entwickelt und verteilt, um damit eine crowdsour- ced Black Box Analyse durchzufuhren.¨ Freiwillige Teil- nehmer und virtuelle Maschinen in Australien, Kana- da, den USA und Großbritannien wurden rekrutiert. Es wurden Daten zu Suchergebnissen, Werbung und Schlagzeilen auf der Ergebnisseite von Google gesam- melt. Die Analyse derer ergab, dass es trotz des explizi- ten Verbots dieser Praktiken noch immer fragwurdige¨ Werbung gab. Contents List of Figures vii 1. Introduction1 1.1. Motivation . .2 1.2. Outline . .4 2. Fundamentals5 2.1. Information . .5 2.2. Models . .6 2.3. Socially Relevant Algorithms . .7 2.3.1. Algorithms . .7 2.3.2. Social Relevance . .9 2.4. Communication . 11 2.4.1. Shannon's technical Model of Communication . 12 2.4.2. Watzlawick's psychological Communication Model . 13 2.4.3. Context-oriented Communication Model . 14 2.5. Socio-technical Systems . 18 2.5.1. Technical System . 19 2.5.2. Social System . 20 2.5.3. Kienle and Kunau's Socio-Technical System . 21 2.6. Data Economy and advertising . 22 2.6.1. Web-tracking and data collection . 22 2.6.2. Tracking protection . 25 2.6.3. Targeted advertising . 25 2.7. Integrated Search Engines: Google as an advertisement enabler 26 2.7.1. Integrated Search Engines . 27 2.7.2. Web Advertisement . 29 2.7.3. Business Models . 30 2.7.4. The Integrated Search Engine's role as an intermediary 34 2.8. Application to Web Search . 37 3. Related Work 41 3.1. Digitalized Health in the Realm of Stem-Cell-Tourism . 41 3.2. Governance . 46 3.2.1. The need for control . 46 3.2.2. Proposals of governance . 48 3.2.3. Challenges . 52 3.3. Algorithm Accountability . 53 3.3.1. Transparency . 54 3.3.2. Responsibility . 55 v Contents 3.4. Black Box Analysis . 60 3.4.1. Methodology . 62 3.4.2. Challenges . 67 4. EuroStemCell Data Donation 2019 / 2020 (EDD) 69 4.1. The Donation Plugin . 70 4.1.1. Requirements . 70 4.1.2. Conceptual Design . 70 4.1.3. Development . 72 4.1.4. Deployment . 75 4.2. Findings . 76 4.2.1. Data Analysis . 76 4.2.2. Limitations . 88 4.3. Lessons learned . 89 4.3.1. Study Design . 89 4.3.2. Technical . 92 5. Conclusion 97 6. Future Work 99 Bibliography 103 A. EuroStemCell Data Donation: Development 133 A.1. My Code . 133 A.2. User Story . 133 A.3. Product Backlog . 134 A.4. Participant survey . 136 A.5. Query composition and crawled HTML elements . 138 A.5.1. Query composition . 138 A.5.2. Crawled HTML elements . 139 B. EuroStemCell Data Donation: Data Analysis and Visualizations 141 B.1. Downloads . 141 B.2. Participants . 143 B.3. Advertisements and Advertisers . 143 C. Functionality of a Search Engine 145 vi List of Figures 1.1. Sample SERP before policy change I . .3 1.2. Sample SERP before policy change II . .3 2.1. Shannon's Communication Model . 12 2.2. Luhmann's triple selection . 15 2.3. Kienle & Kunau's Communication Model . 16 2.4. Kienle's sender activities . 17 2.5. Organic vs. sponsored search . 29 2.6. Yuan et al.'s online ad ecosystem . 31 2.7. Muthukrishan's ad paths . 32 2.8. WSS communication processes . 38 2.9. STS of web search and advertising . 39 3.1. Lessig's four forces . 50 3.2. Zweig's Chain of Responsibility . 58 3.3. Black Box Chain of Responsibility . 63 3.4. Krafft’s Black Box Concept . 64 4.1. Plugin-Server-Communication . 72 4.2. EDD Registration . 74 4.3. EDD collection process . 75 4.4. Overview all studies . 76 4.5. Overview of PD studies . 77 4.6. Donations over time . 78 4.7. Daily donation distribution . 78 4.8. Donation events . 79 4.9. Donations by individual participant and cumulative submissions 79 4.10. Histogram of donor's donation distributions . 80 4.11. Advertiser histogram . 80 4.12. Top 20 advertisers . 83 4.13. Critical actors in host categories . 84 4.14. Overview of donation statistics . 85 4.15. Proportion of critical ads . 86 4.16. Critical ads by group . 87 4.17. Critical ads by keyword . 87 4.18. Typical examples for swissmedica advertisement creatives . 88 4.19. Keyword suggestions . 90 4.20. Krafft’s Black Box Concept . 92 4.21. Security incidents . 94 A.1. On-bording Process . 136 vii List of Figures A.2. Description of crawled elements . 140 B.1. Firefox plugin statistics . 141 B.2. Daily chrome users . 142 B.3. Cumulative Chrome registrations . 142 B.4. Donations by group . 143 B.5. Absolute Number of advertisements per group . 143 B.6. Fraction of Prescription Treatment Advertisements per group . 144 viii 1. Introduction Digitalization has changed mankind in many ways. One technology that has grown to be indispensable is the search engine. They serve as an entry point to the WWW separating the websites most relevant to a user from noise. Most search engines provide their service to Internet users at no monetary cost. They finance their operation through advertisements (or short: ads) displayed along with search results on the search engine's result page (SERP). They call this sponsored or affiliated search. These integrated search engines (ISEs) combine search utility with the capabilities of an advertising exchange and thus connect advertisers, content providers and searchers. However, most ISEs are privately operated Internet platforms. They rose to be powerful intermediaries that take the role of algorithmic gatekeepers. Not only do they control the flow of communication between users and content providers. On their platform, they also organize ad distribution and direct attention. Concurrently, they get to retain transaction data of all involved participants, e.g. user data, website content, ad efficacy, conversion costs of businesses. In some domains, misconfiguration of algorithms has only minor conse- quences, like irrelevant search results or dysfunctional technical components. When it comes to medicine and health though, digitalization is probably go- ing to have the \most immediate and profound personal and social conse- quences" (Petersen, Tanner, et al., 2019, p.368). ISEs and their advertising partners combine various personal data to explore \the most intimate aspects of our selves" (Petersen, Tanner, et al., 2019, p.368). In the realm of health, they can have immediate effect on the well-being of citizens and their sur- roundings. Although society is heavily affected by privately-operated Internet-based platforms it cannot assess the functionality and safety of those software sys- tems. A Guardian journalist describes this situation as \operating on blind, ignorant, misplaced trust" (Goldacre, 2014) and adds that choices in algorithm design are generally being made without.
Recommended publications
  • Arguments for USPTO Error in Granting U.S. Patent 9,904,924 to Square, Inc
    Arguments for USPTO Error in Granting U.S. Patent 9,904,924 to Square, Inc. Umar Ahmed Badami Annabel Guo [email protected] [email protected] Samantha Jackman Kathryn Ng Sharon Samuel [email protected] [email protected] [email protected] Brandon Theiss* [email protected] New Jersey’s Governor’s School of Engineering and Technology July 24, 2020 *Corresponding Author Abstract—The United States Patent and Trademark Office how their invention works. When the patent expires 2, the (USPTO) erred in granting U.S. Patent 9,904,924 to Square Inc. public is permitted to make, use, or sell the invention without U.S. Patent 9,904,924 claims a system for the transfer of funds the inventor’s permission. from a sender to a recipient via email. However, this system was disclosed prior to the U.S. Patent 9,904,924’s effective filing For an inventor to patent their invention, there are some date of March 15, 2013. Claims 1 and 9 were disclosed by U.S. requirements that must be fulfilled as per the U.S. Code. One 8,762,272 and U.S. Patent 8,725,635 while claim 20 was taught by requirement for patentability, as stated under 35 U.S.C. § 101 a combination of U.S. Pat. App. Publication 2009/0006233, U.S. Patent 10,395,223, and U.S. Patent 10,515,345, thereby rendering is that the invention must be useful, defined as having “a at least one of the claims of U.S. Patent 9,904,924 invalid.
    [Show full text]
  • Application Number
    Intellectual Property India· Application Number. 4302WHE/2014 0 Home (h ttp://i pin di a.gov. in/index. htm) Contact Us (hrtp :/I i pi ndia.gov. in/ contact-us. h tm) Feed back (http:/ I i pi nd ia .gov. in/feedback. h tm) FAQ s (http:// i pi nd i a .gov. i n/faq-patents .h tm) Si tem a p (http://i pi nd i a .gov. i n/sitemap. htm) ., v ~~i&l lnck; ,, P ..1renr Advat<r.~:d. Se.1rch System (http:/ I i pi ndia. gov. in/index. htm) (http:/ /ipin dia .gov.in/index.h t Application Number: 4302/CHE/2014 Bib/agraphic Data Complete Specification Application Status Invention Title PHARMACEUTICAL COMPOSITIONS COMPRISING GERANIAL AS ATPASE INHIBITORS "" Publication Number 27/2016 Publication Date 2016/07/01 Publication Type INA Application Number 4302/CHE/2014 Application Filing Date 2014/09/03 Priority Number Priority Country Priority Date Field Of Invention (F/11) PHARMACEUTICALS Classification (IPC) A61 K-31 /00 Inventor Name Address Country Nationality V SAISHA Department of Biotechnology, BMSCE, PB No. 1908, Bull Temple Road, Bangalore · 560019, Karnataka (India) IN IN http://i pi ndi aservi ces .g ov.1 nlpubli csearchlneltvpublicsearchlpatent_detail.php?UC ID= SGF6l.J Di1JN zJhSjJ KYkhZM jJGaF dzUG 1mSW1 NZ2F TaWCZ01\f\JRzkxRjF GST 0%30 L"'S~·il/5·31~2'0~13 ' lir~iiA~~~~=:"~ l~~&~f&;~g:~c,~;~(i) I #GS/4, N~G,A;Sf.\Em H,t.;Ciii) lNOfAN g, f>c, G. JYOf:IHI, #,3'6/:t7~B-.f0l Ofv1~RSAR®Vf4:AAI PAPW ·~OFr~G~; [email protected];Afi<APlaM•M#lN~R()AP, ~SklSM'ANAGWE>l, ·BA'NGJl;llQRE-560004'• IJIJDIAI'f .
    [Show full text]
  • Artificial Intelligence, Economics, and Industrial Organization Hal Varian1
    Artificial Intelligence, Economics, and Industrial Organization Hal Varian1 Nov 2017 Revised June 2018 Introduction Machine learning overview What can machine learning do? What factors are scarce? Important characteristics of data Data ownership and data access Decreasing marginal returns Structure of ML-using industries Machine learning and vertical integration Firm size and boundaries Pricing Price differentiation Returns to scale Supply side returns to scale Demand side returns to scale Learning by doing Algorithmic collusion Structure of ML-provision industries Pricing of ML services Policy questions Security Privacy 1 I am a full time employee of Google, LLC, a private company. I am also an emeritus professor at UC Berkeley. Carl Shapiro and I started drafting this chapter with the goal of producing a joint work. Unfortunately, Carl became very busy and had to drop out of the project. I am grateful to him for the time he was able to put in. I also would like to thank Judy Chevalier and the participants of the NBER Economics of AI conference in Toronto, Fall 2017. Explanations Summary References Introduction Machine learning (ML) and artificial intelligence (AI) have been around for many years. However, in the last 5 years, remarkable progress has been made using multilayered neural networks in diverse areas such as image recognition, speech recognition, and machine translation. AI is a general purpose technology that is likely to impact many industries. In this chapter I consider how machine learning availability might affect the industrial organization of both firms that provide AI services and industries that adopt AI technology. My intent is not to ​ ​ ​ ​ provide an extensive overview of this rapidly-evolving area, but instead to provide a short summary of some of the forces at work and to describe some possible areas for future research.
    [Show full text]
  • Data Analytics: the Future of Legal JEREMIAH CHAN1
    International In-house Counsel Journal Vol. 9, No. 34, Winter 2016, 1 Data Analytics: The Future of Legal JEREMIAH CHAN1 Legal Director, Google Inc., USA & JAY YONAMINE Senior Data Scientist, Google Inc., USA & NIGEL HSU Head of Patent Operations, Verily Life Sciences LLC, USA INTRODUCTION During the 2002 season of Major League Baseball (MLB), the Oakland Athletics and New York Yankees both won an astounding 103 games, a feat only topped three times by any team in the last 24 years. Despite identical records, the New York Yankees payroll on opening day was $125,928,583 compared to only $39,679,746 for the Oakland Athletics. On a per win basis, the Oakland Athletics paid $385,240 per win compared to the New York Yankee’s $1,222,608. The New York Yankees were not an outlier. All other MLB teams (excluding Oakland) paid on average $853,252 per win in 2002. So how were the Oakland Athletics able to generate wins with over two to three times the cost efficiency of other teams in the league? By now, the answer has been well documented by Michael Lewis in the book and subsequent movie Moneyball, in which Lewis recounts how Oakland Athletics’ General Manager Billy Beane replaced traditional approaches to scouting and roster management with a rigorous use of data and statistical models. This is often referred to as “data analytics.” The quest for increased efficiency is certainly not unique to MLB. Be it a widget factory, technology company, or legal department, the desire to maximize the quality of outcomes while maintaining or reducing costs is ubiquitous.
    [Show full text]
  • Improving Google Patents with European Patent Office Patents And
    Improving Google Patents with European Patent Office patents and the Prior Art Finder Tuesday, August 14, 2012 at 10:23 AM ET Posted by Jon Orwant, Engineering Manager Cross-posted from the Google Research blog At Google, we're constantly trying to make important collections of information more useful to the world. Since 2006, we’ve let people discover, search, and read United States patents online. Starting this week, you can do the same for the millions of ideas that have been submitted to the European Patent Office, such as this one. [An example Moisture-tight edible film dispenser and method thereof EP 1692064 B1Abstract (text from WO2005051822A1]. Typically, patents are granted only if an invention is new and not obvious. To explain why an invention is new, inventors will usually cite prior art such as earlier patent applications or journal articles. Determining the novelty of a patent can be difficult, requiring a laborious search through many sources, and so we’ve built a Prior Art Finder to make this process easier. With a single click, it searches multiple sources for related content that existed at the time the patent was filed. From googlepublicpolicy.blogspot.com/2012/08/improving-google-patents-with-european.html 21 April 2013 Patent pages now feature a “Find prior art” button that instantly pulls together information relevant to the patent application. The Prior Art Finder identifies key phrases from the text of the patent, combines them into a search query, and displays relevant results from Google Patents, Google Scholar, Google Books, and the rest of the web.
    [Show full text]
  • Google Patents
    USPTO University Outreach: Patent Searching and Search Resources Presenter Date Objectives • Where can you search? • What are the benefits of searching? • When should you search? and most importantly: • How do you search using USPTO search tools? • What are some important strategies? Overview of IP: Why Get a Patent? • A patent can be – Used to gain entry to a market – Used to exclude others from a market – Used as a marketing tool to promote unique aspects of a product – Sold or licensed, like other property What is patentable? compositi on Manufact NEW, USEFUL, Machine ure NONOBVIOUS, ENABLED & CLEARLY DESCRIBED Method/ Improvements or process thereof Types of Patents • Utility - New and useful process, machine, article of manufacture, or composition of matter, or any new and useful improvement thereof Term – 20 years from earliest effective filing date • Design - Any new, original and ornamental design Term - 15 years from issue date • Plant - Whoever invents or discovers and asexually produces any distinct and new variety of plant... Term – 20 years from earliest effective filing date Overview of IP: What is a Patent? • A Property Right – Right to exclude others from making, using, selling, offering for sale or importing the claimed invention – Limited term – Territorial: protection only in territory that granted patent; NO world-wide patent • Government grants the property right in exchange for the disclosure of the invention What Type of Application? • Provisional – abandoned after one year, no claims required, written disclosure must meet same requirements as non- provisional, not allowed for design. • Non-Provisional - claims required, written disclosure must meet requirements of 35 USC 112(a).
    [Show full text]
  • Machine Learning for Marketers
    Machine Learning for Marketers A COMPREHENSIVE GUIDE TO MACHINE LEARNING CONTENTS pg 3 Introduction pg 4 CH 1 The Basics of Machine Learning pg 9 CH. 2 Supervised vs Unsupervised Learning and Other Essential Jargon pg 13 CH. 3 What Marketers can Accomplish with Machine Learning pg 18 CH. 4 Successful Machine Learning Use Cases pg 26 CH. 5 How Machine Learning Guides SEO pg 30 CH. 6 Chatbots: The Machine Learning you are Already Interacting with pg 36 CH. 7 How to Set Up a Chatbot pg 45 CH. 8 How Marketers Can Get Started with Machine Learning pg 58 CH. 9 Most Effective Machine Learning Models pg 65 CH. 10 How to Deploy Models Online pg 72 CH. 11 How Data Scientists Take Modeling to the Next Level pg 79 CH. 12 Common Problems with Machine Learning pg 84 CH. 13 Machine Learning Quick Start INTRODUCTION Machine learning is a term thrown around in technol- ogy circles with an ever-increasing intensity. Major technology companies have attached themselves to this buzzword to receive capital investments, and every major technology company is pushing its even shinier parentartificial intelligence (AI). The reality is that Machine Learning as a concept is as days that only lives and breathes data science? We cre- old as computing itself. As early as 1950, Alan Turing was ated this guide for the marketers among us whom we asking the question, “Can computers think?” In 1969, know and love by giving them simpler tools that don’t Arthur Samuel helped define machine learning specifi- require coding for machine learning.
    [Show full text]
  • Implications of the Google's US 8,996,429 B1 Patent in Cloud
    DOI: 10.5772/intechopen.70279 Provisional chapter Chapter 8 Implications of the Google’s US 8,996,429 B1 Patent in ImplicationsCloud Robotics-Based of the Google’s Therapeutic US 8,996,429 Researches B1 Patent in Cloud Robotics-Based Therapeutic Researches Eduard Fosch Villaronga and Jordi Albo-Canals Eduard Fosch Villaronga and Jordi Albo-Canals Additional information is available at the end of the chapter Additional information is available at the end of the chapter http://dx.doi.org/10.5772/intechopen.70279 Abstract Intended for being informative to both legal and engineer communities, this chapter raises awareness on the implications of recent patents in the field of human-robot interaction (HRI) studies. Google patented the use of cloud robotics to create robot personality(-ies). The broad claims of the patent could hamper many HRI research projects in the field. One of the possible frustrated research lines is related to robotic therapies because the person- alization of the robot accelerates the process of engagement, which is extremely beneficial for robotic cognitive therapies. This chapter presents, therefore, the scientific examina- tion, description, and comparison of the Tufts University CEEO project “Data Analysis and Collection through Robotic Companions and LEGO® Engineering with Children on the Autism Spectrum project” and the US 8,996,429 B1 Patent on the Methods and Systems for Robot Personality Development of Google. Some remarks on ethical implica- tions of the patent will close the chapter and open the discussion to both communities. Keywords: cognitive therapeutic robots, cloud robotics, Google patent, personality, personalization, ASD research, ethical implications 1.
    [Show full text]
  • Do Patents Disclose Useful Information?
    Harvard Journal of Law & Technology Volume 25, Number 2 Spring 2012 DO PATENTS DISCLOSE USEFUL INFORMATION? Lisa Larrimore Ouellette* TABLE OF CONTENTS I. INTRODUCTION .............................................................................. 532 II. THE DISCLOSURE JUSTIFICATION FOR PATENTS .......................... 536 A. Legal Requirements and Judicial Interpretation ...................... 536 B. Is Disclosure a Compelling Justification for Patents? ............. 540 C. Previous Surveys of the Value of Patent Disclosures .............. 547 III. UTILITY OF PATENTS FOR NANOTECH RESEARCHERS ................ 552 A. Survey Method and Summary Statistics ................................... 553 B. Why Do Scientists Read Patents (or Not)? ............................... 557 1. Reasons Researchers Avoid Patents ...................................... 557 2. Reasons Researchers Read Patents ........................................ 559 C. Do Patents Contain Useful Technical Information? ................ 560 D. Are Patents Reproducible? ...................................................... 562 E. Do Researchers Worry About Willful Infringement? ............... 564 IV. NANOTECHNOLOGY PATENT CASE STUDIES .............................. 566 A. Carbon Nanotube Resonators .................................................. 566 B. Carbon Nanotube Sensors ........................................................ 569 V. IMPLICATIONS FOR PATENT POLICY ............................................. 571 A. Strengthen Enforcement of Disclosure Requirements
    [Show full text]
  • The Future According to Google
    Travis: THE FUTURE ACCORDING TO GOOGLE THE FUTURE ACCORDING TO GOOGLE: TECHNOLOGY POLICY FROM THE STANDPOINT OF AMERICA'S FASTEST-GROWING TECHNOLOGY COMPANY By Hannibal Travis* 11 YALE J.L. & TECH. 209 (2009) As the fastest-growing technology company in the United States,' Google has been at the center of some of the most contentious technology policy disputes of recent years. In the federal courts, these disputes focus on the fair or noncommercial use of copyrighted work and trademarks on the Internet. In Congress, Google is leading the charge in favor of laws protecting innovative Internet companies from discriminatory or exorbitant charges by broadband and wireless infrastructure providers. It has also been a vocal opponent of excessive governmental control over Internet content. Copyright lawsuits arising out of search engines and user- generated content sites such as Google Video and YouTube have the potential to change the rules governing communication over the Internet. Similarly, trademark litigation alleging that comparative and Internet keyword-based advertising are infringing may limit the ability of technology companies and their customers to compete online. Many technology companies also believe that injunctive relief obtained by the owners of patents in comparatively minor components of complex software-enabled products may chill innovation and divert capital away from applied research. But it seems that the power of infrastructure providers to favor allied content providers has truly spooked technology leaders like Google. Meanwhile, Google, other technology and Internet companies, and members of Congress have demanded action to limit foreign governments' ability to block U.S.-based Web content from being accessed by persons present in their territory.
    [Show full text]
  • Adscape: Harvesting and Analyzing Online Display Ads
    Adscape: Harvesting and Analyzing Online Display Ads Paul Barford Igor Canadi Darja Krushevskaja University of University of Rutgers University Wisconsin-Madison Wisconsin-Madison [email protected] [email protected] [email protected] Qiang Ma S. Muthukrishnan Rutgers University Rutgers University [email protected] [email protected] ABSTRACT online users and the increasing amount of time that peo- Over the past decade, advertising has emerged as the pri- ple spend online has led to an estimated $36B in online ad mary source of revenue for many web sites and apps. In spending in the US for FY2012, which represents an 15% this paper we report a first-of-its-kind study that seeks to increase over the previous year [2]. The majority of this broadly understand the features, mechanisms and dynamics spending is on advertising that most commonly appears in of display advertising on the web - i.e., the Adscape. Our search results as text ads. There is, however, a growing pref- study takes the perspective of users who are the targets of erence for display ads | typically image and video ads that display ads shown on web sites. We develop a scalable crawl- appear in response to users' browsing and other activities ing capability that enables us to gather the details of dis- on the web | that can convey more robust and visual mes- play ads including creatives and landing pages. Our crawling sages to users. A recent report by Forrester estimates that strategy is focused on maximizing the number of unique ads $12.7B was spent in the US on display and video advertising harvested.
    [Show full text]
  • Privacy Leakage in Personalized Mobile In-App Ads
    The Price of Free: Privacy Leakage in Personalized Mobile In-App Ads Wei Meng, Ren Ding, Simon P. Chung, Steven Han, and Wenke Lee College of Computing Georgia Institute of Technology {wei, rding, pchung34, s.han}@gatech.edu, [email protected] Abstract—In-app advertising is an essential part of the ecosys- always a concern. While ad personalization has been well tem of free mobile applications. On the surface, this creates a studied for web, relatively little research explores mobile ad win-win situation where app developers can profit from their personalization in terms of what user information is being work without charging the users. However, as in the case of collected. We believe research focused on mobile ad personal- web advertising, ad-networks behind in-app advertising employ ization is a significant pursuit for the following reasons: 1) personalization to improve the effectiveness/profitability of their Mobile devices are a lot more intimate to users; they are ad-placement. This need for serving personalized advertisements in turn motivates ad-networks to collect data about users and carried around at all times and are being used more and more profile them. As such, “free” apps are only free in monetary for sensitive operations like personal communications, dating, terms; they come with the price of potential privacy concerns. banking, etc. Therefore, privacy concerns regarding what infor- The question is, how much data are users giving away to pay for mation is collected for ad personalization are more serious. 2) “free apps”? Unlike in-browser advertising, where the advertisement content is strictly isolated from the rest of the displayed page by the In this paper, we study how much of the user’s interest and demographic information is known to these major ad networks well-known “same origin policy”, in-app advertising operates on the mobile platform.
    [Show full text]