Devops and Dataops
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Getting Data Right Tackling the Challenges of Big Data Volume and Variety
Compliments of Getting Data Right Tackling the Challenges of Big Data Volume and Variety Jerry Held, Michael Stonebraker, Thomas H. Davenport, Ihab Ilyas, Michael L. Brodie, Andy Palmer & James Markarian Getting Data Right Tackling the Challenges of Big Data Volume and Variety Jerry Held, Michael Stonebraker, Thomas H. Davenport, Ihab Ilyas, Michael L. Brodie, Andy Palmer, and James Markarian Getting Data Right by Jerry Held, Michael Stonebraker, Thomas H. Davenport, Ihab Ilyas, Michael L. Brodie, Andy Palmer, and James Markarian Copyright © 2016 Tamr, Inc. All rights reserved. Printed in the United States of America. Published by O’Reilly Media, Inc., 1005 Gravenstein Highway North, Sebastopol, CA 95472. O’Reilly books may be purchased for educational, business, or sales promotional use. Online editions are also available for most titles (http://safaribooksonline.com). For more information, contact our corporate/institutional sales department: 800-998-9938 or [email protected]. Editor: Shannon Cutt Interior Designer: David Futato Production Editor: Nicholas Adams Cover Designer: Randy Comer Copyeditor: Rachel Head Illustrator: Rebecca Demarest Proofreader: Nicholas Adams January 2016: First Edition Revision History for the First Edition 2015-12-15: First Release The O’Reilly logo is a registered trademark of O’Reilly Media, Inc. Getting Data Right and related trade dress are trademarks of O’Reilly Media, Inc. While the publisher and the authors have used good faith efforts to ensure that the information and instructions contained in this work are accurate, the publisher and the authors disclaim all responsibility for errors or omissions, including without limitation responsibility for damages resulting from the use of or reliance on this work. -
University of Waterloo SENATE EXECUTIVE COMMITTEE Notice of Meeting Date: Monday 1 February 2021 Time: 3:30 P.M. Place: Microsof
University of Waterloo SENATE EXECUTIVE COMMITTEE Notice of Meeting Date: Monday 1 February 2021 Time: 3:30 p.m. Place: Microsoft Teams Videoconference OPEN SESSION Action 1. Minutes of the 4 January 2021 Meeting Decision 2. Business Arising from the Minutes 3. Draft 22 February 2021 Senate Agenda Decision 4. Other Business CONFIDENTIAL SESSION 5. Chancellor Review Committee Information KJJ/ees Karen Jack 25 January 2021 University Secretary Secretary to the Executive Committee 1 of 48 University of Waterloo SENATE EXECUTIVE COMMITTEE Minutes of the 4 January 2021 Meeting Present: David Billedeau, Dan Brown, Jeff Casello, Joan Coutu, George Freeman, Feridun Hamdullahpur (chair), Karen Jack (secretary), Christiane Lemieux, William Power, Sam Rubin, James Rush, Abbie Simpson, Richard Staines, Johanna Wandel Regrets: Kofi Campbell 1. MINUTES OF THE 2 NOVEMBER 2020 MEETING Members heard a motion to approve the minutes of the 2 November 2020 meeting. Casello and Simpson. Carried unanimously. 2. BUSINESS ARISING FROM THE MINUTES There was no business arising. 3. APPOINTMENT OF CHANCELLOR REVIEW COMMITTEE The chair spoke to the historical practice of Senate appointing this committee to undertake the role of the nominating, or in this case, review committee for the role of chancellor. Following discussion, the committee heard a motion to recommend to Senate that it appoint the Senate Executive Committee as a chancellor review committee. Freeman and Staines. Carried unanimously. 4. DRAFT 18 JANUARY 2021 SENATE AGENDA The chair provided members with a brief review of the agenda including notice that in addition to his regular update from the president, the provost and vice-president, research and international jointly will present an update on the strategic plan working groups; that good news is coming regarding an award announcement; and he advised that the provost will offer commentary on current academic matters. -
SIGMOD ’19 Proceedings of the 2019 International Conference on Management of Data
June 30–July 5, 2019 Amsterdam, Netherlands SIGMOD ’19 Proceedings of the 2019 International Conference on Management of Data General Chairs: Peter Boncz & Stefan Manegold (CWI, Netherlands) Program Chair: Anastasia Ailamaki (EPFL, Switzerland) Program Vice-Chairs: Amol Deshpande (University of Maryland, USA) Tim Kraska (MIT, USA) The Association for Computing Machinery 1601 Broadway, 10th Floor New York, NY 10019-7434 Copyright © 2019 by the Association for Computing Machinery, Inc. (ACM). Permission to make digital or hard copies of portions of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyright for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permission to republish from: [email protected] or Fax +1 (212) 869-0481. For other copying of articles that carry a code at the bottom of the first or last page, copying is permitted provided that the per-copy fee indicated in the code is paid through www.copyright.com. ISBN: 978-1-4503-5643-5 Additional copies may be ordered prepaid from: ACM Order Department Phone: 1-800-342-6626 (USA and Canada) PO Box 30777 +1-212-626-0500 (Global) New York, NY 10087-0777, USA Fax: +1-212-944-1318 E-mail: [email protected] Hours of Operation: 8:30 am – 4:30 pm ET Printed in the USA ii Welcome to SIGMOD 2019 - The 2019 ACM SIGMOD International Conference on the Management of Data! This year, the conference is held in the city center of Amsterdam, capital of The Netherlands. -
The Pragmatic Wisdom of Michael Stonebraker Editor: Michael L
This book celebrates Michael Stonebraker’s accomplishments that led to his 2014 ACM A.M. Turing Award “for fundamental contributions to the concepts and practices underlying modern database systems.” The book describes, for the broad computing community, the unique nature, significance, and impact of Mike’s achievements in advancing modern database systems over more than forty years. Today, data is considered the world’s most valuable resource, whether it is in the tens of millions of databases used to manage the world’s businesses and governments, in the billions of databases in our smartphones and watches, or residing elsewhere, as yet unmanaged, awaiting the elusive next generation of database systems. Every one of the millions or billions of databases includes features that are celebrated by the 2014 Turing Award and are described in this book. Why should I care about databases? What is a database? What is data management? What is a database management system (DBMS)? These are just some of the questions that this book answers, in describing the development of data management through the achievements of Mike Stonebraker and his over 200 collaborators. In reading the stories in this book, you will discover core data management concepts that were developed over the two greatest eras—so far— of data management technology. The book is a collection of 36 toriess written by Mike and 38 of his collaborators: 23 world-leading database researchers, 11 world-class systems engineers, and 4 business partners. If you are an aspiring researcher, engineer, or entrepreneur you might read these stories to find these turning points as practice to tilt at your own computer-science windmills, to spur yourself to your next step of innovation and achievement. -
MASTER of DATA SCIENCE and ARTIFICIAL INTELLIGENCE and MASTER of MATHEMATICS in DATA SCIENCE Submitted to the Ontario Universities Council on Quality Assurance
GRADUATE PROGRAM PROPOSAL OF MASTER OF DATA SCIENCE AND ARTIFICIAL INTELLIGENCE AND MASTER OF MATHEMATICS IN DATA SCIENCE Submitted to the Ontario Universities Council on Quality Assurance VOLUME II – FACULTY CURRICULA VITAE SEPTEMBER 2018 Curriculum Vitae – M. Data Science and Artificial Intelligence and M.Math in Data Science TABLE OF CONTENTS Ben-David, Shai ............................................................................................................................... 2 Browne, Ryan ................................................................................................................................ 14 Chenouri, Shoja ............................................................................................................................. 20 Coleman, Thomas (Tom) ............................................................................................................... 30 Cormack, Gordon (Gord) ............................................................................................................... 37 Fukasawa, Ricardo ........................................................................................................................ 41 Ghodsi, Ali ..................................................................................................................................... 51 Grossman, Maura.......................................................................................................................... 62 Hoey, Jesse ................................................................................................................................... -
16Looking Back at Postgres
Looking Back at Postgres Joseph M. Hellerstein Postgres was Michael Stonebraker’s most ambitious project—his grand effort to build a one-size-fits-all database system. A decade long, it generated more papers, Ph.Ds., professors, and companies than anything else he did. It also covered more technical ground than any other single system he built. Despite the risk inherent in taking on that scope, Postgres also became the most successful software artifact to come out of Stonebraker’s research groups, and his main contribution to open source. As of the time of writing, the open-source PostgreSQL system is the most popular, independent open-source database system in the world, and the fourth most popular database system in the world. Meanwhile, companies built from a 16Postgres base have generated a sum total of over $2.6 billion in acquisitions. By any measure, Stonebraker’s Postgres vision resulted in enormous and ongoing impact. Context Stonebraker had enormous success in his early career with the Ingres research project at Berkeley (see Chapter 15), and the subsequent startup he founded with Larry Rowe and Eugene Wong: Relational Technology, Inc. (RTI). As RTI was developing in the early 1980s, Stonebraker began working on data- base support for data types beyond the traditional rows and columns of Codd’s original relational model. A motivating example current at the time was to pro- vide database support for CAD tools for the microelectronics industry. In a 1983 paper, Stonebraker and students Brad Rubenstein and Antonin Guttman explained how that industry-needed support for “new data types such as polygons, rectangles, text strings, etc.,” “efficient spatial searching,” “complex integrity constraints,” and “design hierarchies and multiple representations” of the same physical con- structions [Stonebraker 1983a]. -
Making Databases Work: the Pragmatic Wisdom of Michael Stonebraker Editor: Michael L
Making Databases Work ACM Books Editor in Chief M. Tamer Ozsu,¨ University of Waterloo ACM Books is a new series of high-quality books for the computer science community, published by ACM in collaboration with Morgan & Claypool Publishers. ACM Books publications are widely distributed in both print and digital formats through booksellers and to libraries (and library consortia) and individual ACM members via the ACM Digital Library platform. Making Databases Work: The Pragmatic Wisdom of Michael Stonebraker Editor: Michael L. Brodie 2018 The Handbook of Multimodal-Multisensor Interfaces, Volume 2: Signal Processing, Architectures, and Detection of Emotion and Cognition Editors: Sharon Oviatt, Monash University Bjorn¨ Schuller, University of Augsburg and Imperial College London Philip R. Cohen, Monash University Daniel Sonntag, German Research Center for Artificial Intelligence (DFKI) Gerasimos Potamianos, University of Thessaly Antonio Kr¨uger, Saarland University and German Research Center for Artificial Intelligence (DFKI) 2018 Declarative Logic Programming: Theory, Systems, and Applications Editors: Michael Kifer, Stony Brook University Yanhong Annie Liu, Stony Brook University 2018 The Sparse Fourier Transform: Theory and Practice Haitham Hassanieh, University of Illinois at Urbana-Champaign 2018 The Continuing Arms Race: Code-Reuse Attacks and Defenses Editors: Per Larsen, Immunant, Inc. Ahmad-Reza Sadeghi, Technische Universit¨at Darmstadt 2018 Frontiers of Multimedia Research Editor: Shih-Fu Chang, Columbia University 2018 Shared-Memory -
2014 Annual Report
Annual Report 2014 Looking Back, Looking Forward 2014 was yet another wonderful year. Our entrepreneurial ecosystem continued to blossom and so did our stakeholders: startups, social entrepreneurs, job creators, accelerators, angel investors, venture capitalists, active mentors, research centers, and university spinouts. Silicon Valley and MENA-based Arab entrepreneurs, women and men, sparkled. Serial entrepreneurs created in excess of $5 billion in shareholders’ value. Startups mushroomed across the board from mobile payments, mapping, media, entertainment, and 3D printing to LiDAR, Cloud Computing, Security, and Digital Health. Supporting these entrepreneurs, angel investor groups, notably the wonderful WOMENA, formed to provide early stage financing. $50 million+ funds to fill the Series A gap were launched by TechWadi100 members Fadi Ghandour and Hala Fadel, as well as MEVP, Berytech and Badia Impact. And London Mayor Boris Johnson unveiled plans for a $166 million fund to encourage entrepreneurs in the Middle East to go global with their business! Strategic partner organizations expanded our reach. Corporate partners such as Flextronics, SAP and Microsoft scaled their effective programs and platforms to accelerate product deployment. UC Berkeley and Google played significant roles in hosting networking events. We co-hosted the MIT Arab Startup Competition Global Track, which brought 19 budding startups from MENA to San Francisco. Working with Technicolor, we launched the first Wearable World incubator in San Francisco to help accelerate entrepreneurs in Silicon Valley. Our relationship with the Harvard Arab Alumni Association and with Stanford University AMENDS Conference grew for the third year in a row. Last but not least, our collaborations with entrepreneurship initiatives in the Middle East ensured our efforts were focused on what we are committed to: building bridges for entrepreneurship. -
Issue Editor
Bulletin of the Technical Committee on Data Engineering March 2021 Vol. 44 No. 1 IEEE Computer Society Letters Letter from the Editor-in-Chief . Haixun Wang 1 Letter from the Special Issue Editor . Sebastian Schelter 2 Opinions ML-In-Databases: Assessment and Prognosis . Tim Kraska, Umar Farooq Minhas, Thomas Neumann, Olga Papaemmanouil, Jignesh M. Patel, Chris Ré, Michael Stonebraker 3 Special Issue on Data Validation for Machine Learning Models and Applications A Data Quality-Driven View of MLOps. ............ Cedric Renggli, Luka Rimanic, Nezihe Merve Gürel, Bojan Karlaš, Wentao Wu and Ce Zhang 11 From Cleaning before ML to Cleaning for ML . Felix Neutatz, Binger Chen, Ziawasch Abedjan and Eugene Wu 24 Validating Data and Models in Continuous ML Pipelines . ......... Mike Dreves, Gene Huang, Zhuo Peng, Neoklis Polyzotis, Evan Rosen and Paul Suganthan G. C. 42 Automated Data Validation in Machine Learning Systems . ................. Felix Biessmann, Jacek Golebiowski, Tammo Rukat, Dustin Lange and Philipp Schmidt 51 Enhancing the Interactivity of Dataframe Queries by Leveraging Think Time . Doris Xin, Devin Petersohn, Dixin Tang, Yifan Wu, Joseph E. Gonzalez, Joseph M. Hellerstein, Anthony D. Joseph and Aditya G. Parameswaran 66 Responsible AI Challenges in End-to-end Machine Learning . ..................................... Steven Euijong Whang, Ki Hyun Tae, Yuji Roh and Geon Heo 79 Conference and Journal Notices TCDE Membership Form. 92 Editorial Board TCDE Executive Committee Editor-in-Chief Chair Haixun Wang Erich J. Neuhold Instacart -
Ranked XML Querying Dagstuhl Seminar
08111 Abstracts Collection Ranked XML Querying Dagstuhl Seminar Sihem Amer-Yahia1, Divesh Srivastava2 and Gerhard Weikum3 1 Yahoo Research New York, US 2 AT&T Florham Park, US [email protected] 3 MPI Saarbrücken, DE [email protected] Abstract. From 09.03. to 14.03.08, the Dagstuhl Seminar 08111 Ranked XML Querying was held in the International Conference and Research Center (IBFI), Schloss Dagstuhl. During the seminar, several partici- pants presented their current research, and ongoing work and open prob- lems were discussed. Abstracts of the presentations given during the sem- inar as well as abstracts of seminar results and ideas are put together in this paper. The rst section describes the seminar topics and goals in general. Links to extended abstracts or full papers are provided, if available. Keywords. Scoring methods for XML, Ranking approximate XML an- swers, Top-K query processing, Querying structured and unstructured data, XML Full-Text Querying, Querying heterogeneous XML, Extract- ing structure from unstructured data, Text mining, XML data integra- tion 08111 Report Ranked XML Querying This paper is based on a ve-day workshop on "Ranked XML Querying" that took place in Schloss Dagstuhl in Germany in March 2008 and was attended by 27 people from three dierent research communities: database systems (DB), information retrieval (IR), and Web. The seminar title was interpreted in an IR-style "andish" sense (it covered also subsets of Ranking, XML, Querying, with larger sets being favored) rather than the DB-style strictly conjunctive manner. So in essence, the seminar really addressed the integration of DB and IR technologies with Web 2.0 being an important target area.