Visualization-Driven Data Aggregation

Total Page:16

File Type:pdf, Size:1020Kb

Visualization-Driven Data Aggregation Computer Science Dissertation - Technische Universität Berlin VLDB BEST PAPER inside Uwe Jugel Visualization-Driven Data Aggregation Rethinking data acquisition for data visualizations 1010 10 00 0101001 001 10101 00 10 01001 01100 000 111 00 1001 001 101011010 10 00 010 010111 010 010100 000 111 00 10 10 0011 1011010 10 00 010 100 000 111 0010111 001 0010 10111 001 001010111 001 0010 1010 10 00 01000 10 01001 01 00 10 01001 01100 000 111 00 1001 001 101011010 10 00 010 010111 010 010100 000 111 00 10 10 0011 1011001 001 10101 100 000 111 001001 001 10101 10111 001 00101010 10 00 010 Visualization-Driven Data Aggregation Rethinking data acquisition for data visualizations vorgelegt von Dipl.-Inf. Uwe Jugel geb. in Dresden von der Fakultät IV – Elektrotechnik und Informatik der Technischen Universität Berlin zur Erlangung des akademischen Grades Doktor der Ingenieurwissenschaften – Dr.-Ing. – genehmigte Dissertation Promotionsausschuss: Vorsitzender Prof. Dr. Ziawasch Abedjan Gutachter Prof. Dr. rer. nat. Volker Markl Gutachter Prof. Dr.-Ing. Wolfgang Lehner Gutachter Prof. Dr.-Ing. Stefan Tai Tag der wissenschaftlichen Aussprache: 13. April 2017 Berlin 2017 Dissertation Visualization-Driven Data Aggregation Rethinking data acquisition for data visualizations Uwe Jugel Abstract Visual analysis of high-volume numerical data is traditionally required for un- derstanding sensor data in manufacturing and engineering scenarios. Today, the visual analysis of any kind of big data has become ubiquitous and is a most-wanted feature for data analysis tools. It is vital for commerce, finance, sales, and an ever-growing number of industries, whose data are traditionally stored in relational database management systems (RDBMS). Unfortunately, contemporary RDBMS-based data visualization and data analysis systems have difficulty to cope with the hard latency requirements and high ingestion rates required for interactive visualizations of big data. Disregarding the spatial properties of the visualization, they are incapable of effectively sampling or aggregating the big data for subsequent data visu- alization. The resulting big data visualizations suffer from measurable and perceivable visualization errors. Moreover, existing visualization-related tech- niques for data reduction are domain-specific and focus on a few custom types of visualizations. The underlying problems have neither been analyzed com- prehensively and in general, nor in the context of RDBMS-based systems in particular. A general-purpose solution for visualization-related data reduction has been missing. To facilitate truly interactive visualizations of the growing volume and vari- ety of big data, computer systems need to change the way they acquire data for the purpose of data visualization. Therefore, the present work introduces the Visualization-Driven Data Aggregation (VDDA) that facilitates high-quality to error-free visualizations of high-volume datasets at high data reduction rates. Built on an in-depth analysis of the underlying problem of visual aggregation, VDDA defines specific data reduction operators for the most common chart types and for chart matrices. For RDBMS-based systems in particular, these operators can be used at the query level in a transparent query rewriting sys- tem, making VDDA applicable to any visualization system that consumes data stored in relational databases. Using real-world datasets from high-tech manufacturing, stock markets, and sports analytics domains, this work demonstrates the applicability of VDDA, reducing data volumes and query answer times by up to two orders of magni- tude, while preserving pixel-perfect visualizations of the raw data. Zusammenfassung Eine visuelle Analyse großer numerischer Datenmengen ist herkömmlich not- wendig, um Sensordaten aus Konstruktions- und Fertigungsprozessen auszuwer- ten und zu verstehen. Heute ist die visuelle Analyse jeglicher Art von großen Datenmengen (Big Data) allgegenwärtig und eine meistgesuchte Funktion von Softwareprodukten zur Datenanalyse. Sie ist unverzichtbar für den Handelsver- kehr, das Finanzwesen, den Einzelhandel und einer ständig wachsenden Anzahl an Branchen, deren Daten herkömmlich in relationalen Datenbankmanagement- systemen (RDBMS) gespeichert werden. Leider haben gegenwärtige RDBMS-basierte Datenvisualisierungs- und Da- tenanalysesysteme Schwierigkeiten mit den hohen Anforderungen an Latenz- zeiten und den hohen Dateneingangsraten fertig zu werden. Sie missachten die räumlichen Eigenschaften der Visualisierungen und sind dadurch nicht imstande, die für die Datenvisualisierung benötigten Stichproben und Aggregationswerte der großen Datenmengen effektiv zu erzeugen. Die resultierenden Datenvisua- lisierungen haben mess- und sichtbare Visualisierungsfehler. Bestehende Me- thoden zur visualisierungsbezogenen Datenreduktion sind anwendungsspezifisch und werden nur auf einige wenige, individuelle Visualisierungstypen angewandt. Die zugrunde liegenden Probleme wurden weder umfassend und allgemein, noch mit speziellem Bezug zu RDBMS-basierten Systemen analysiert. Es existiert keine universelle Methode zur visualisierungsbezogenen Datenreduktion. Um zukünftig interaktive Visualisierungen der vielfältigen und wachsenden Datenmengen zu ermöglichen, müssen Computersysteme die Art der visualisie- rungsbezogenen Datenerfassung anpassen. Die vorliegende Arbeit entwirft da- für eine visualisierungsgesteuerte Datenaggregation (VDDA, engl. Visualization- Driven Data Aggregation), die einerseits hohe Datenreduktionsraten und ande- rerseits Visualisierungen von sehr hoher Qualität ermöglicht. Aufbauend auf ei- ner detaillierten Analyse des Problems der visuellen Aggregation, bietet VDDA spezifische Datenreduktionsoperatoren für alle gängigen Typen von Diagram- men und für Diagramm-Matrizen. In RDBMS-basierten Systemen können diese Operatoren dann für eine Datenreduktion auf Anfrage-Ebene angewandt wer- den, und sie ermöglichen dadurch eine transparente Anpassung der ursprüngli- chen visualisierungsbezogenen Anfrage. Die entwickelte Methode ist allgemein anwendbar und bietet eine verbesserte Datenerfassung für jegliche Art von Da- tenvisualisierungssystemen, die Daten aus relationalen Datenbanken beziehen. Die Arbeit demonstriert die Anwendbarkeit von VDDA anhand realer Da- ten aus den Bereichen der Hochtechnologie-Fertigung, Aktienmärkte und Sport- Datenanalyse. Die übertragenen Datenvolumen und Antwortzeiten werden dabei um bis zu zwei Größenordnungen reduziert, während die Datenvisualisierungen pixelgenau erhalten bleiben. Disclaimer I declare that the content of the present dissertation is based on my own thoughts and research. To the best of my knowledge, I listed and referenced in this document all sources and tools used to develop the dissertation. Acknowledgments The idea for this work arose during my first year as Ph.D. candidate at SAP Research Dresden and the Database Systems and Information Management Group at TU Berlin. I would like to thank Prof. Dr. rer. nat. Volker Markl for supervising this dissertation, for the many suggestions and discussions, and for relentlessly pushing for quality rather than quantity. Furthermore, I would like to thank Prof. Dr.-Ing. Wolfgang Lehner and Prof. Dr.-Ing. Stefan Tai for their helpful advice and for acting as second and third reviewer for this dissertation. During my studies, I got a lot of support from my colleagues at SAP. In par- ticular, I would like to thank Dr. Zbigniew Jerzak and Dr. habil. Gregor Hack- enbroich. They significantly contributed to the progress of my work through many discussions, professional, scientific, and methodological advice, and their co-authorship in my publications. Special thanks also go to Dr. Anja Jugel for validating and improving the formal and orthographic quality of the thesis. She helped a lot in improving the figures and making complicated paragraphs more understandable. I cordially thank my family, my friends, and especially and again my beloved wife Anja, who provided me the freedom and time required to conduct the endeavor of a dissertation. Her contribution to the progress of this work cannot be underestimated, since she took a lot of work off my hands, caring for our two children, who were both born during the course of the dissertation. Uwe Jugel, September 2016 Contents Publicationsv Notation vii List of Abbreviations . vii Mathematical Symbols . ix Relational Algebra Notation . .x Essential SQL Statements . xi 1. Introduction1 1.1. Background and Motivation . .1 1.2. Goal of this work . .5 1.3. Solution Overview . .7 1.3.1. Considered Big Data Visualization System . .8 1.3.2. Visual Aggregation for Basic Charts . .9 1.3.3. Advanced Visual Aggregation Techniques . 10 1.4. Structure of the work . 11 2. Elements and Principles of Data Visualization Systems 13 2.1. System Components . 13 2.2. SQL Databases . 17 2.3. Pixels on the Canvas . 18 2.4. The Visualization Pipeline . 22 2.5. Visualization System Architectures . 25 2.6. Existing Data Visualization Systems . 28 2.6.1. Visual Analytics Tools . 28 2.6.2. Big Data Visualization Systems . 28 2.6.3. Common Web Applications . 29 2.6.4. Additional Systems . 31 2.7. Data Reduction . 33 2.7.1. Piecewise Data Reduction Techniques . 33 2.7.2. Geometric Data Reduction Techniques . 34 2.7.3. Additional Data Reduction Techniques . 36 i Contents 2.8. Data Visualization . 39 2.8.1. Data Visualization Terminology . 40 2.8.2. Considered Chart Types . 42 2.8.3. Scalability of Visualizations . 45 2.8.4. Data Visualization in Interactive Systems . 46 Summary
Recommended publications
  • Statement of Natalie S. Talpas, Senior Vice President and Digital Product Management Group Manager, PNC Bank
    Statement of Natalie S. Talpas, Senior Vice President and Digital Product Management Group Manager, PNC Bank Consumer Financial Protection Bureau Symposium on Consumer Access to Financial Records, Section 1033 of the Dodd-Frank Act Wednesday, February 26, 2020 I. Introduction and Executive Summary PNC Bank, National Association (PNC) appreciates the opportunity to participate in the Consumer Financial Protection Bureau’s (Bureau) symposium regarding data aggregators and consumer access to financial records. PNC is a Main Street bank focused on serving the financial needs of our customers and communities. We have employees in more than 40 states across the country and a retail branch network located primarily in the Mid-Atlantic, Midwest and Southeast, with approximately 2,300 branches and 9,100 ATMs. We are proud of our longstanding history of supporting our customers, communities and employees while operating a sustainable long-term business. We are committed to providing our customers with access to convenient technology tools and the financial applications of their choice, while protecting the personal and financial information they entrust to us and maintaining the integrity of our systems. We support our customers’ use of financial applications (apps) and informed consumer choice in accessing and sharing their financial data. In fact, we process millions of trouble-free logins by financial apps, and the data aggregators that support these apps, each week. What are data aggregators? Data aggregators are nonbank financial services companies that gather financial data on consumers from banks and other financial institutions, such as broker- dealers, and make this information available to financial apps and, potentially, other purchasers of consumer data.
    [Show full text]
  • Technology and Innovation in the Insurance Sector
    Technology and innovation in the insurance sector TECHNOLOGY AND INNOVATION IN THE INSURANCE SECTOR Please cite this publication as: OECD (2017), Technology and innovation in the insurance sector This work is published under the responsibility of the Secretary-General of the OECD. The opinions expressed and arguments employed herein do not necessarily reflect the official views of the OECD or of the governments of its member countries or those of the European Union. This document and any map included herein are without prejudice to the status or sovereignty over any territory, to the delimitation of international frontiers and boundaries and to the name of any territory, city, or area. © OECD 2017 FOREWORD │ 3 Foreword “Insurtech” is the term being used to describe the new technologies with the potential to bring innovation to the insurance sector and impact the regulatory practices of insurance markets. This report catalogues these technologies and examines how InsurTech is being funded and how insurers are engaging with the start-ups entering the market. This report was prepared as part of the programme of work of the OECD Insurance and Private Pensions Committee, the international forum for addressing policy and regulatory issues in insurance and private pensions for governments, international organisations and industry. It has benefited from input from the insurance market and industry, including from a number of insurance start-ups. This report contributes to the OECD’s Going Digital project which is examining from a wide range of perspectives how technology and innovation is affecting the economy. TECHNOLOGY AND INNOVATION IN THE INSURANCE SECTOR TABLE OF CONTENTS │ 5 Table of contents Executive summary ..............................................................................................................................
    [Show full text]
  • Do Data Breach Disclosure Laws Reduce Identity Theft?
    Do Data Breach Disclosure Laws Reduce Identity Theft? Sasha Romanosky, Rahul Telang, Alessandro Acquisti The Heinz College Carnegie Mellon University ABSTRACT In the United States, identity theft resulted in corporate and consumer losses of $56 billion dollars in 2005, with up to 35 percent of known identity thefts caused by corporate data breaches. Many states have responded by adopting “data breach disclosure laws” that require firms to notify consumers if their personal information has been lost or stolen. While the laws are expected to reduce identity theft, their effect has yet to be empirically measured. We used panel data from the U.S. Federal Trade Commission to estimate the impact of data breach disclosure laws on identity theft from 2002 to 2009. We find that adoption of data breach disclosure laws reduce identity theft caused by data breaches by, on average, 6.1 percent. Keywords Data breach disclosure, economics of information security, identity theft, fixed effects regression 2 Do Data Breach Disclosure Laws Reduce Identity Theft? INTRODUCTION Data breaches occur when personally identifiable information such as names, social security numbers, and credit card numbers are accidentally lost or maliciously stolen. These breaches can result in hundreds of thousands (sometimes millions) of compromised records, and lead to identity theft and related crimes (Givens, 2000).1 In the United States, identity theft resulted in corporate and consumer losses of around $56 billion dollars in 2005 (Javelin Research, 2006).2 In an effort to reduce these crimes, many states have responded by adopting data breach disclosure (or “security breach notification”) laws, requiring firms to notify individuals when their personal information has been compromised.
    [Show full text]
  • E-Aggregation: the Present and Future of Online Financial Services in Asia-Pacific (PACIS)
    E-Aggregation: The Present and Future of Online Financial Services in Asia-Pacific (PACIS) Hiroshi Fujii Taeko Okano Stuart Madnick Michael Siegel Working Paper CISL# 2002-06 September 2002 Composite Information Systems Laboratory (CISL) Sloan School of Management, Room E53-320 Massachusetts Institute of Technology Cambridge, MA 02142 E-Aggregation: The Present and Future of Online Financial Services in Asia-Pacific Hiroshi Fujii1, Taeko Okano1, Stuart Madnick2, Michael Siegel2 1Suruga Bank, Japan 2Sloan School of Management, Massachusetts Institute of Technology, USA Abstract Financial institutions see the Internet as an important channel and many have built websites to inform and attract customers. Financial aggregation services as represented by “Account Aggregator” present an opportunity by which financial institutions can build stronger relationships with customers. Account aggregation services began in the United States, but they are now widely used by financial institutions in other countries. In this paper, we examine financial aggregation services by classifying aggregator types and the method for implementing their service. Second, we explain the differences between financial account relationship aggregation services in the U.S. and in Asia-Pacific countries, including Australia, South Korea, and Japan. We then discuss the status of financial comparison aggregation services and related issues. Owing to the popularity of WAP phones and the fast- growing market for mobile phone service in Asia-Pacific, we will also look into the development of mobile aggregation services. Finally, we examine future directions for aggregators in conjunction with universal and global banking concepts. Keywords: Financial institution, Aggregation service, Universal banking, Global banking I. Introduction Financial institutions are one of the most influential businesses in the information technology (IT) revolution.
    [Show full text]
  • Identity Software – May 2021 EXECUTIVE SUMMARY Identity Software Sector Update
    SECTOR SNAPSHOT: Identity Software – May 2021 EXECUTIVE SUMMARY Identity Software Sector Update The pandemic has driven digital transformation into overdrive with retailers and financial institutions rapidly adopting to consumer demand shifting from offline to online transactions, and enterprises accelerating migration to cloud to manage millions of workers working remotely. This unprecedented growth in digital activity saw a corresponding increase in intensity and regularity of identity fraud and data breaches on an almost daily basis. Advanced persistent threats are fueling rapid innovation and adoption of identity security and data protection solutions across different market verticals, which in turn have catalyzed some of the most significant M&A and funding activities in recent history Recent M&A activity has been robust across multiple sub categories especially in IAM, highlighted by: Select M&A Transactions ($MM) $28,940 $1,400 $6,408 $850 Financing and investment volume in the overall identity sector has continued unabated with sponsors making bold bets to unleash growth and create new unicorns: Select Financing Transactions ($MM) $210 $205 $150 $100 $100 $100 GCA expects these trends to continue with even stronger identity related market activity excepted in the remainder of 2021 2 I. GCA OVERVIEW II. IDENTITY SOFTWARE MARKET OBSERVATIONS III. APPENDIX 3 GCA’S DEDICATED SECTOR KNOWLEDGE AND EXECUTION EXCELLENCE GCA’s Identity Software Leadership Team Chris Gough Saif Malik Managing Director / Head of Cybersecurity Executive
    [Show full text]
  • Symposium on Consumer Access to Financial Records February 26, 2020
    Symposium on Consumer Access to Financial Records February 26, 2020 Written Statement of Natalie R. Williams Managing Director & Associate General Counsel, Responsible Banking, Data & Privacy JPMorgan Chase & Co. Submitted to the Consumer Financial Protection Bureau Thank you for the opportunity to serve as a panelist at the Consumer Financial Protection Bureau’s (CFPB) Data Aggregation Symposium. I. Data Aggregation Presents Both Benefits and Risks Financial data aggregation has grown dramatically in recent years and has yielded consumer benefits. Among other things, it has facilitated the creation of an array of innovative products and services that can help consumers better understand and manage their financial lives. However, data aggregation should be undertaken in the best interest of consumers. It therefore must be executed with appropriate measures to ensure data safety and security and informed consumer consent. Most aggregators require consumers to provide financial account login credentials so that they can “screen scrape” the consumer’s financial information. Through this practice of “screen scraping,” aggregators have the ability to collect all of the consumer’s financial information, even if the financial application the customer desires to use doesn’t need it. That information can include account numbers, payees, and contact information as well as details on mortgage, investment and credit card accounts, joint accounts, and children’s accounts. Aggregators also generally store account login credentials and scraped customer data, creating highly attractive targets for hackers and malicious insiders. Moreover, current data aggregation practices often are insufficiently transparent regarding the use and sharing of consumer financial data. This, in turn, limits the control that consumers have over their own financial information, placing data privacy and data security at risk.
    [Show full text]
  • RIN 3064-ZA24 Attachments: SSRN-Id3622468.Pdf; SSRN-Id3864965.Pdf
    From: Susan Von Struensee Sent: Saturday, June 19, 2021 12:22 PM To: Comments Subject: [EXTERNAL MESSAGE] RIN 3064-ZA24 Attachments: SSRN-id3622468.pdf; SSRN-id3864965.pdf COMMENTS OF SUSAN VON STRUENSEE, JD, MPH to the Request for Information and Comment on Financial Institutions' Use of Artificial Intelligence, Including Machine Learning 86 FR 16837-38 (March 31, 2021) Agency/Docket Numbers: Docket ID OCC-2020-0049 Docket No. OP-1743 Docket No. CFPB-2021-0004 Docket No. NCUA-2021-0023 Please see attached file A Survey of Fintech Research and Policy Discussion The intersection of finance and technology, known as fintech, has resulted in the dramatic growth of innovations and has changed the entire financial landscape. While fintech has a critical role to play in democratizing credit access to the unbanked and thin-file consumers around the globe, those consumers who are currently well served also turn to fintech for faster services and greater transparency. Fintech, particularly the blockchain, has the potential to be disruptive to financial systems and intermediation. Our aim in this paper is to provide a comprehensive fintech literature survey with relevant research studies and policy discussion around the various aspects of fintech. The topics include marketplace and peer-to-peer lending, credit scoring, alternative data, distributed ledger technologies, blockchain, smart contracts, cryptocurrencies and initial coin offerings, central bank digital currency, robo-advising, quantitative investment and trading strategies, cybersecurity, identity theft, cloud computing, use of big data and artificial intelligence and machine learning, identity and fraud detection, anti- money laundering, Know Your Customers, natural language processing, regtech, insuretech, sandboxes, and fintech regulations.
    [Show full text]
  • Dara Aggregators and Financial Institutions Kimberly L
    NORTH CAROLINA BANKING INSTITUTE Volume 5 | Issue 1 Article 17 2001 If You Can't Beat Them, Join Them: Dara Aggregators and Financial Institutions Kimberly L. Wierzel Follow this and additional works at: http://scholarship.law.unc.edu/ncbi Part of the Banking and Finance Law Commons Recommended Citation Kimberly L. Wierzel, If You Can't Beat Them, Join Them: Dara Aggregators and Financial Institutions, 5 N.C. Banking Inst. 457 (2001). Available at: http://scholarship.law.unc.edu/ncbi/vol5/iss1/17 This Comments is brought to you for free and open access by Carolina Law Scholarship Repository. It has been accepted for inclusion in North Carolina Banking Institute by an authorized administrator of Carolina Law Scholarship Repository. For more information, please contact [email protected]. If You Can't Beat Them, Join Them: Data Aggregators and Financial Institutions I. INTRODUCTION: Data aggregation has been a hot topic in the financial industry since First Union's December 1999 suit against Secure Commerce Services.! Data aggregation is the process of gathering information from multiple websites and delivering it to a consumer on a single website.2 There are essentially two ways for an aggregator to gather information-direct feed and screen scraping? Both methods require a customer to create on-line account access with the institutions they want on their single website.4 The customer then turns over their account numbers, I.D. numbers and passwords to their selected aggregator.5 1. First Union Alleges Online Payment Service Used Bank Customers' Information Illegally, B.N.A., Feb. 1, 2000, at 2437 (citing First Union Corp v.
    [Show full text]
  • Americans for Financial Reform Education Fund Center for Responsible Lending Consumer Action Consumer Federation of America USPIRG
    National Consumer Law Center (on behalf of its low-income clients) Americans for Financial Reform Education Fund Center for Responsible Lending Consumer Action Consumer Federation of America USPIRG February 4, 2021 Via regulations.gov Comment Intake Consumer Financial Protection Bureau 1700 G Street NW Washington, DC 20552 Re: Consumer Access to Financial Records, Docket No. CFPB–2020–0034/RIN 3170- AA78. The National Consumer Law Center (on behalf of its low-income clients) (NCLC), Americans for Financial Reform Education Fund, Center for Responsible Lending, Consumer Action, Consumer Federation of America, and USPIRG are pleased to submit these comments in response to the Consumer Financial Protection Bureau (CFPB)’s Advanced Notice of Proposed Rulemaking regarding Consumer Access to Financial Records, Docket No. 2020-0034, issued November 6, 2020. The CFPB has requested comment on 46 questions to assist it in developing a proposed rule to implement Section 1033 of the Dodd-Frank Wall Street Reform and Consumer Protection Act. These comments address a selected number of these questions that we as consumer advocacy groups are particularly suited to answer. In summary, these comments discuss how: 1. The potential benefits for consumers of authorized data access, assuming strong provisions for consumer control, security, and use limitations, are significant, as consumer use of their own data could provide a better alternative and provide true competition to the Big Three credit bureaus. The CFPB should issue a strong rule under 1033 to ensure protections for consumers accessing their own account data. 2. Authorized data access also poses significant risks to consumers. Consumers face the dangers of losing control over the data, having it used against them, and having their privacy invaded.
    [Show full text]
  • The Use of Cash-Flow Data in Underwriting Credit Market Context & Policy Analysis
    The Use of Cash-Flow Data in Underwriting Credit Market Context & Policy Analysis FEBRUARY 2020 About FinRegLab FinRegLab is a non-profit research organization that was founded on the premise that independent, rigorous research is a primary ingredient in helping develop market norms and policy solutions that enable responsible innovation in financial services. Acknowledgments This report, The Use of Cash-Flow Data in Underwriting Credit: Market Context & Policy Analysis, and the accompanying Policy Overview were made possible through founding support from Flourish/Omidyar Network. Additional support was provided by the Milken Institute, which enabled FinRegLab to evaluate cash-flow data in small business credit underwriting. Detailed information on our funders can be found on the inside back cover. We would like to extend a special note of appreciation for their leadership and facilitation of FinRegLab’s policy working groups to Melanie Brody, Mayer Brown; Mark Schultz, Capital One; P-R Stark, Oliver Wyman; and Corey Stone, Financial Health Network. Thanks also to the Fintech Team at the Federal Reserve Bank of San Francisco for co-hosting a symposium entitled “The Role of Consumers in the Data Ecosystem.” FinRegLab would also like to recognize the individuals and organizations who participated in the policy working groups as listed in Appendix A, the participants of the Data Ecosystem symposium, and stakeholders who participated in interviews. We are particularly grateful to Marsha Courchane and Arthur Baines, Charles River Associates, for the role they played in designing and executing the empirical research analysis. We are also grateful to Thomas P. Brown and Stephen J. Bandrowsky, Paul Hastings, for legal assistance throughout the course of the research process.
    [Show full text]
  • Data Aggregation Services – De-Identification
    TECHNOLOGY MAY-RATHON DIGITAL HEALTH PRIVACY: THERE’S AN APP FOR THAT Reece Hirsch Partner, Morgan Lewis, San Francisco Co-chair, Privacy and Cybersecurity Practice May 19, 2017 © 2017 Morgan, Lewis & Bockius LLP The Technologies are New, The Laws … Not So Much • When the Health Insurance Portability and Accountability Act was enacted in 1996, there were no smart phones, no mobile apps, no cloud computing – HIPAA Privacy Rule became effective April 14, 2003 – HIPAA Security Rule became effective April 21, 2005 – Compliance date of HIPAA Final Rule: September 23, 2013 • In recent years regulators and digital health companies have had to stretch, tweak and interpret existing laws to fit this new landscape of – Healthcare mobile apps – Cloud hosting services – Personal health records 2 Privacy by Design • For companies venturing into the digital health space, privacy and security are critical issues that must be addressed from Day One – For startups, questions about privacy and security will be among the first that get asked by customers and potential acquirers – The due diligence process will show when a company scrambled to improve privacy and security immediately prior to potential acquisition – For established companies venturing into digital health, a stumble in the digital privacy space can damage a brand and customer relationships • Privacy by design is the FTC’s mantra, baking in privacy and security during the development of a product or service 3 The FTC and OCR • One overarching theme in digital health privacy is the overlapping jurisdiction of: – The Federal Trade Commission, the primary U.S. privacy regulator with the broadest purview – The Dept.
    [Show full text]
  • Do Data Breach Disclosure Laws Reduce Identity Theft? / 257 Mitigate, Any Resulting Harm
    Do Data Breach Disclosure Sasha Romanosky Rahul Telang Laws Reduce Identity Theft? Alessandro Acquisti Abstract In the United States, identity theft resulted in corporate and consumer losses of $56 billion dollars in 2005, with up to 35 percent of known identity thefts caused by corporate data breaches. Many states have responded by adopting data breach disclosure laws that require firms to notify consumers if their personal informa- tion has been lost or stolen. Although the laws are expected to reduce identity theft, their effect has yet to be empirically measured. We use panel data from the U.S. Federal Trade Commission to estimate the impact of data breach disclosure laws on identity theft from 2002 to 2009. We find that adoption of data breach disclo- sure laws reduce identity theft caused by data breaches, on average, by 6.1 percent. © 2011 by the Association for Public Policy Analysis and Management. INTRODUCTION Data breaches occur when personally identifiable information such as names, Social Security numbers, and credit card numbers are accidentally lost or maliciously stolen. These breaches can result in hundreds of thousands (sometimes millions) of compromised records and lead to identity theft and related crimes (Givens, 2000).1 In the United States, identity theft resulted in corporate and consumer losses of around $56 billion in 2005 (Javelin Research, 2006).2 In an effort to reduce these crimes, many states have responded by adopting data breach disclosure (also known as secu- rity breach notification) laws, requiring firms to notify individuals when their per- sonal information has been compromised. However, to date, no empirical analysis has investigated the effectiveness of such legislative initiatives in reducing identity theft.
    [Show full text]