(BTW 2017) – Workshopband

Total Page:16

File Type:pdf, Size:1020Kb

(BTW 2017) – Workshopband GI-Edition Lecture Notes in Informatics Bernhard Mitschang, Norbert Ritter, Holger Schwarz, Meike Klettke, Andreas Thor, Oliver Kopp, Matthias Wieland (Hrsg.) Datenbanksysteme für Business, Technologie und Web (BTW 2017) Workshopband 6.–10. März 2017 Bernhard Mitschang, Norbert Ritter, Holger Schwarz, Meike Klettke, Andreas Thor, Thor, Andreas Klettke, Meike Schwarz, Holger Norbert Ritter, Bernhard Mitschang, Workshopband BTW 2017 – (Hrsg.): Wieland Matthias Kopp, Oliver Stuttgart 266 Proceedings Bernhard Mitschang, Norbert Ritter, Holger Schwarz, Meike Klettke, Andreas Thor, Oliver Kopp, Matthias Wieland (Hrsg.) Datenbanksysteme für Business, Technologie und Web (BTW 2017) Workshopband 06. – 07.03.2017 in Stuttgart, Deutschland Gesellschaft für Informatik e.V. (GI) Lecture Notes in Informatics (LNI) - Proceedings Series of the Gesellschaft für Informatik (GI) Volume P-266 ISBN 978-3-88579-660-2 ISSN 1617-5468 Volume Editors Bernhard Mitschang Universität Stuttgart Institut für Parallele und Verteilte Systeme, Abteilung Anwendersoftware 70569 Stuttgart, Germany E-Mail: [email protected] Norbert Ritter Universität Hamburg Fachbereich Informatik, Verteilte Systeme und Informationssysteme (VSIS) 22527 Hamburg, Germany E-Mail: [email protected] Holger Schwarz Universität Stuttgart Institut für Parallele und Verteilte Systeme, Abteilung Anwendersoftware 70569 Stuttgart, Germany E-Mail: [email protected] Meike Klettke Universität Rostock Institut für Informatik 18051 Rostock, Germany E-Mail: [email protected] Andreas Thor Universität Leipzig Institut für Informatik, Abteilung Datenbanken 04109 Leipzig, Germany E-Mail: [email protected] Oliver Kopp Universität Stuttgart Institut für Parallele und Verteilte Systeme, Abteilung Anwendersoftware 70569 Stuttgart, Germany E-Mail: [email protected] Matthias Wieland Universität Stuttgart Institut für Parallele und Verteilte Systeme, Abteilung Anwendersoftware 70569 Stuttgart, Germany E-Mail: [email protected] Series Editorial Board Heinrich C. Mayr, Alpen-Adria-Universität Klagenfurt, Austria (Chairman, [email protected]) Dieter Fellner, Technische Universität Darmstadt, Germany Ulrich Flegel, Infineon, Germany Ulrich Frank, Universität Duisburg-Essen, Germany Andreas Thor, HFT Leipzig, Germany Michael Goedicke, Universität Duisburg-Essen, Germany Ralf Hofestädt, Universität Bielefeld, Germany Michael Koch, Universität der Bundeswehr München, Germany Axel Lehmann, Universität der Bundeswehr München, Germany Thomas Roth-Berghofer, University of West London, Great Britain Peter Sanders, Karlsruher Institut für Technologie (KIT), Germany Torsten Brinda, Universität Duisburg-Essen, Germany Ingo Timm, Universität Trier, Germany Karin Vosseberg, Hochschule Bremerhaven, Germany Maria Wimmer, Universität Koblenz-Landau, Germany Dissertations Steffen Hölldobler, Technische Universität Dresden, Germany Thematics Andreas Oberweis, Karlsruher Institut für Technologie (KIT), Germany 0 Gesellschaft für Informatik, Bonn 2017 printed by Köllen Druck+Verlag GmbH, Bonn This book is licensed under a Creative Commons Attribution-NonCommercial 3.0 License. Vorwort In den letzten Jahren hat es auf dem Gebiet des Datenmanagements große Verän- derungen und Herausforderungen gegeben und auch für die Zukunft zeichnen sich interessante Weiterentwicklungen ab. Im Fokus der Datenbankforschung stehen neben Kerndatenbanktechnologien insbesondere auch Herausforderungen von „Big Data“, also die Analyse von riesigen Datenmengen unterschiedlicher Struktur mit kurzen Antwortzeiten. Neue Architekturansätze für skalierbares Datenmanagement auf der Basis von Cloud Computing gewinnen weiter an Be- deutung, sind aber noch nicht umfassend erforscht. Zunehmend werden Fragen der effizienten Speicherung, Verarbeitung und Nut- zung großer Mengen unstrukturierter Daten adressiert, wie sie unter anderem in sozialen Netzwerken und wissenschaftlichen Anwendungen generiert werden. Hierbei gilt es sowohl allgemeine Konzepte für skalierbares Datenmanagement und effiziente Analysen zu erarbeiten als auch das breite Spektrum an Anwen- dungsgebieten zu berücksichtigen. Große Datenmengen zu beherrschen ist heute in allen Geschäftsbereichen von Unternehmen ein ebenso essentieller Erfolgsfak- tor wie in allen Wissenschaftsbereichen, von den Natur- über Ingenieur- bis hin zu Geisteswissenschaften. Wie auf jeder BTW-Konferenz gruppieren sich um die Haupttagung eine Reihe von Workshops, die solche speziellen Themen in kleinen Gruppen aufgreifen und sowohl Wissenschaftlern als auch Praktikern und Anwendern einen Rahmen bie- ten für die intensive Diskussion aktueller Arbeiten in diesen Themenbereichen. Im Rahmen der BTW 2017 finden folgende Workshops statt: Big Data Management Systems in Business and Industrial Applications (BigBIA17) Big (and small) Data in Science and Humanities (BigDS17) Workshop on Industrial Applications of Artificial Intelligence (IAAI’17) Präferenzen und Personalisierung in der Informatik (PPI17) Scalable Cloud Data Management Workshop (SCDM 2017) Mit diesen Schwerpunkten reflektiert das Workshop-Programm aktuelle For- schungsgebiete von hoher praktischer Relevanz. Zusätzlich präsentieren Studie- rende im Rahmen des Studierendenprogramms die Ergebnisse 11 ausgewählter aktueller Abschlussarbeiten im Bereich Datenmanagement. Für jeden Geschmack sollten im Rahmen der BTW 2017 somit interessante Beiträge zu finden sein! Bei der BTW erstmals neu mit dabei ist eine sog. Data Science Challenge, bei der die Teilnehmer bzw. Teilnehmergruppen einen eigenen Ansatz zur Cloud- basierten Datenanalyse im Rahmen der gegebenen Aufgabenstellung entwickeln werden. Hierzu müssen zur Verfügung gestellte Datenquellen integriert und aus- gewertet werden. Die Teilnehmer haben freie Auswahl der verwendeten Cloud- Plattformen und Analysetechniken. Die Data Science Challenge richtet sich spe- ziell an Doktoranden sowie Studierende. Eine kurze Beschreibung zur Data Sci- ence Challenge findet sich hier im Workshop-Band. Die Materialien zur BTW 2017 werden über die Web-Seiten der Tagung unter http://btw2017.informatik.uni-stuttgart.de/ zur Verfügung stehen. Die Organisation der BTW-Tagung nebst allen angeschlossenen Veranstaltungen ist nicht ohne die Unterstützung vieler Partner möglich. Diese sind auf den fol- genden Seiten aufgeführt; zu ihnen zählen insbesondere die Sponsoren, das In- formatik-Forum Stuttgart (infos e.V.), die Universität Stuttgart und auch die Ge- schäftsstelle der Gesellschaft für Informatik. Ihnen allen gilt unser besonderer Dank! Stuttgart, im Januar 2017 Bernhard Mitschang, Tagungsleitung/Vorsitzender des Organisationskomitees Norbert Ritter und Holger Schwarz, Leitung Workshopkomitee Meike Klettke, Andreas Thor, Leitung Studierendenprogramm Oliver Kopp und Matthias Wieland, Tagungsband und Organisationskomitee Tagungsleitung Bernhard Mitschang, Universität Stuttgart Organisationskomitee Johanna Barzen, Univ. Stuttgart Martin Mähler, IBM Michael Behringer, Univ. Stuttgart Michael Matthiesen, Univ. Stuttgart Uwe Breitenbücher, Univ. Stuttgart Berhard Mitschang, Univ. Stuttgart Melanie Herschel, Univ. Stuttgart Holger Schwarz, Univ. Stuttgart Oliver Kopp, Univ. Stuttgart Matthias Wieland, Univ. Stuttgart Frank Leymann, Univ. Stuttgart Lena Wiese, Univ. Göttingen Studierendenprogramm Meike Klettke, Universität Rostock Andreas Thor, Hochschule für Telekommunikation Leipzig Koordination Workshops Norbert Ritter, Universität Hamburg Holger Schwarz, Universität Stuttgart Tutorienprogramm Johann-Christoph Freytag, HU Berlin Data Science Challenge Michael Behrendt, IBM Kai-Uwe Sattler, TU Ilmenau Pascal Hirmer, Univ. Stuttgart Tim Waizenegger, Univ. Stuttgart Martin Mähler, IBM Workshop on Big Data Management Systems in Business and Industrial Applications (BigBIA17) Vorsitz: Benjamin Klöpper, ABB Research; Lena Wiese, Georg-August- Universität Göttingen Stefan Edlich, Beuth Hochschule Daniel Ritter, SAP SE Beate Heisterkamp, Materials Consul- Klaus Schmid, Univ. Hildesheim ting Benedikt Schmidt, ABB Forschungs- Holger Kache, IBM zentrum GmbH Mikro Kämpf, Cloudera Kerstin Schneider, Hochschule Harz Carsten Lanquillion, HS Heilbronn Heiko Thimm, Hochschule Pforzheim Stefan Mandl, EXASOL AG Liesbeth Vanherpe, École polytechni- Carlos Paiz Gatica, Weidmüller Inter- que fédérale de Lausanne face GmbH & Co KG Workshop on Big (and Small) Data in Science and Humanities (BigDS17) Vorsitz: Anika Groß, Universität Leipzig; Birgitta König-Ries, Friedrich- Schiller-Universität Jena; Peter Reimann, Universität Stuttgart; Bernhard Seeger, Philipps-Universität Marburg Alsayed Algergawy, Univ. Jena Andreas Henrich, Univ. Bamberg Peter Baumann, Jacobs Universität Jens Kattge, MPI für Biogeochemie Matthias Bräger, CERN Alfons Kemper, TU München Thomas Brinkhoff, FH Oldenburg Meike Klettke Univ. Rostock Michael Diepenbroeck, Alfred- Frank Leymann, Univ. Stuttgart Wegner-Institut, Bremerhaven Bertram Ludäscher, University of Illi- Jana Diesner, University of Illinois at nois at Urbana-Champaign Urbana-Champaign Alex Markowetz, Univ. Bonn Christoph Freytag, HU Berlin Jens Nieschulze, Univ. Göttingen Michael Gertz, Univ. Heidelberg Eric Peukert, Univ. Leipzig Anton Güntsch, Botanischer Garten Kai-Uwe Sattler, TU Ilmenau und Botanisches Museum, Berlin- Uta Störl, Hochschule Darmstadt Dahlem Andreas Thor, HfT Leipzig Thomas Heinis, IC London Workshop on Industrial Applications of Artificial Intelligence 2017 (IAAI’17) Vorsitz: Alexander
Recommended publications
  • Asset Finder: a Search Tool for Finding Relevant Graphical Assets Using Automated Image Labelling
    Sjalvst¨ andigt¨ arbete i informationsteknologi 20 juni 2019 Asset Finder: A Search Tool for Finding Relevant Graphical Assets Using Automated Image Labelling Max Perea During¨ Anton Gildebrand Markus Linghede Civilingenjorsprogrammet¨ i informationsteknologi Master Programme in Computer and Information Engineering Abstract Institutionen for¨ Asset Finder: A Search Tool for Finding Rele- informationsteknologi vant Graphical Assets Using Automated Image Besoksadress:¨ Labelling ITC, Polacksbacken Lagerhyddsv¨ agen¨ 2 Max Perea During¨ Postadress: Box 337 Anton Gildebrand 751 05 Uppsala Markus Linghede Hemsida: http:/www.it.uu.se The creation of digital 3D-environments appears in a variety of contexts such as movie making, video game development, advertising, architec- ture, infrastructure planning and education. When creating these envi- ronments it is sometimes necessary to search for graphical assets in big digital libraries by trying different search terms. The goal of this project is to provide an alternative way to find graphical assets by creating a tool called Asset Finder that allows the user to search using images in- stead of words. The Asset Finder uses image labelling provided by the Google Vision API to find relevant search terms. The tool then uses synonyms and related words to increase the amount of search terms us- ing the WordNet database. Finally the results are presented in order of relevance using a score system. The tool is a web application with an in- terface that is easy to use. The results of this project show an application that is able to achieve good results in some of the test cases. Extern handledare: Teddy Bergsman Lind, Quixel AB Handledare: Mats Daniels, Bjorn¨ Victor Examinator: Bjorn¨ Victor Sammanfattning Skapande av 3D-miljoer¨ dyker upp i en mangd¨ olika kontexter som till exempel film- skapande, spelutveckling, reklam, arkitektur, infrastrukturplanering och utbildning.
    [Show full text]
  • Search, Analyse, Predict Image Spread on Twitter
    Image Search for Improved Law and Order: Search, Analyse, Predict image spread on Twitter Student Name: Sonal Goel IIIT-D-MTech-CS-GEN-16-MT14026 April, 2016 Indraprastha Institute of Information Technology New Delhi Thesis Committee Dr. Ponnurangam Kumaraguru (Chair) Dr. AV Subramanyam Dr. Samarth Bharadwaj Submitted in partial fulfillment of the requirements for the Degree of M.Tech. in Computer Science, in General Category ©2016 IIIT-D-MTech-CS-GEN-16-MT14026 All rights reserved Keywords: Image Search, Image Virality, Twitter, Law and Order, Police Certificate This is to certify that the thesis titled “Image Search for Improved Law and Order:Search, Analyse, Predict Image Spread on Twitter" submitted by Sonal Goel for the partial ful- fillment of the requirements for the degree of Master of Technology in Computer Science & Engineering is a record of the bonafide work carried out by her under our guidance and super- vision in the Security and Privacy group at Indraprastha Institute of Information Technology, Delhi. This work has not been submitted anywhere else for the reward of any other degree. Dr. Ponnurangam Kumarguru Indraprastha Institute of Information Technology, New Delhi Abstract Social media is often used to spread images that can instigate anger among people, hurt their religious, political, caste, and other sentiments, which in turn can create law and order situation in society. This results in the need for Law Enforcement Agencies (LEA) to inspect the spread of images related to such events on social media in real time. To help LEA analyse the image spread on microblogging websites like Twitter, we developed an Open Source Real Time Image Search System, where the user can give an image, and a supportive text related to image and the system finds the images that are similar to the input image along with their occurrences.
    [Show full text]
  • Reverse Image Search for Scientific Data Within and Beyond the Visible
    Journal, Vol. XXI, No. 1, 1-5, 2013 Additional note Reverse Image Search for Scientific Data within and beyond the Visible Spectrum Flavio H. D. Araujo1,2,3,4,a, Romuere R. V. Silva1,2,3,4,a, Fatima N. S. Medeiros3,Dilworth D. Parkinson2, Alexander Hexemer2, Claudia M. Carneiro5, Daniela M. Ushizima1,2,* Abstract The explosion in the rate, quality and diversity of image acquisition instruments has propelled the development of expert systems to organize and query image collections more efficiently. Recommendation systems that handle scientific images are rare, particularly if records lack metadata. This paper introduces new strategies to enable fast searches and image ranking from large pictorial datasets with or without labels. The main contribution is the development of pyCBIR, a deep neural network software to search scientific images by content. This tool exploits convolutional layers with locality sensitivity hashing for querying images across domains through a user-friendly interface. Our results report image searches over databases ranging from thousands to millions of samples. We test pyCBIR search capabilities using three convNets against four scientific datasets, including samples from cell microscopy, microtomography, atomic diffraction patterns, and materials photographs to demonstrate 95% accurate recommendations in most cases. Furthermore, all scientific data collections are released. Keywords Reverse image search — Content-based image retrieval — Scientific image recommendation — Convolutional neural network 1University of California, Berkeley, CA, USA 2Lawrence Berkeley National Laboratory, Berkeley, CA, USA 3Federal University of Ceara,´ Fortaleza, CE, Brazil 4Federal University of Piau´ı, Picos, PI, Brazil 5Federal University of Ouro Preto, Ouro Preto, MG, Brazil *Corresponding author: [email protected] aContributed equally to this work.
    [Show full text]
  • I Am Robot: (Deep) Learning to Break Semantic Image Captchas
    I Am Robot: (Deep) Learning to Break Semantic Image CAPTCHAs Suphannee Sivakorn, Iasonas Polakis and Angelos D. Keromytis Department of Computer Science Columbia University, New York, USA fsuphannee, polakis, [email protected] Abstract—Since their inception, captchas have been widely accounts and posting of messages in popular services. Ac- used for preventing fraudsters from performing illicit actions. cording to reports, users solve over 200 million reCaptcha Nevertheless, economic incentives have resulted in an arms challenges every day [2]. However, it is widely accepted race, where fraudsters develop automated solvers and, in turn, that users consider captchas to be a nuisance, and may captcha services tweak their design to break the solvers. Recent require multiple attempts before passing a challenge. Even work, however, presented a generic attack that can be applied simple challenges deter a significant amount of users from to any text-based captcha scheme. Fittingly, Google recently visiting a website [3], [4]. Thus, it is imperative to render unveiled the latest version of reCaptcha. The goal of their new the process of solving a captcha challenge as effortless system is twofold; to minimize the effort for legitimate users, as possible for legitimate users, while remaining robust while requiring tasks that are more challenging to computers against automated solvers. Traditionally, automated solvers than text recognition. ReCaptcha is driven by an “advanced have been considered less lucrative than human solvers risk analysis system” that evaluates requests and selects the for underground markets [5], due to the quick response difficulty of the captcha that will be returned. Users may times exhibited by captcha services in tweaking their design.
    [Show full text]
  • A Deep Learning Based CAPTCHA Solver for Vulnerability Assessment
    Deep-CAPTCHA: a deep learning based CAPTCHA solver for vulnerability assessment Zahra Noury ∗, Mahdi Rezaei y Faculty of Computer and Electrical Engineering, Qazvin Azad University Faculty of Environment, Institute for Transport Studies, The University of Leeds ∗[email protected], [email protected] Abstract— CAPTCHA is a human-centred test to distinguish a human operator from bots, attacking programs, or other computerised agents that tries to imitate human intelligence. In this research, we investigate a way to crack visual CAPTCHA tests by an automated deep learning based solution. The goal of this research is to investigate the weaknesses and vulnerabilities of the CAPTCHA generator systems; hence, developing more robust CAPTCHAs, without taking the risks of manual try and fail efforts. We develop a Convolutional Neural Network called Deep- CAPTCHA to achieve this goal. The proposed platform is able to investigate both numerical and alphanumerical CAPTCHAs. To train and develop an efficient model, we have generated a dataset of 500,000 CAPTCHAs to train our model. In this paper, we present our customised deep neural network model, we review the research gaps, the existing challenges, and the solutions to cope with the issues. Our network’s cracking accuracy leads to a high rate of 98.94% and 98.31% for the numerical and the alpha-numerical test datasets, respectively. That means more works is required to develop robust CAPTCHAs, to be non- crackable against automated artificial agents. As the outcome of this research, we identify some efficient techniques to improve the security of the CAPTCHAs, based on the performance analysis conducted on the Deep-CAPTCHA model.
    [Show full text]
  • Perspectives on Visual Search Technology What Is Visual Search? Retailers Are Looking for New Ways to Streamline Product Discovery and Visual Search Is Enabling Them
    July 2019 ComCap’s perspectives on Visual Search technology What is Visual Search? Retailers are looking for new ways to streamline product discovery and Visual Search is enabling them ▪ At it’s core, Visual Search can answer key questions that are more easily resolved using visual prompts, unlike traditional text-based searching − From a consumer standpoint, visual search enables product discovery, connecting consumers directly to the products they desire ▪ Visual search is helping blend physical experiences and online convenience as retailers are competing to offer the best service for high-demanding customers through innovative technologies ▪ Consumers are now actively seeking multiple ways to visually capture products across channels, resulting in various applications including: − Image Recognition: enabling consumers to connect pictures with products, either through camera- or online- produced snapshots through AI-powered tools − QR / bar code scanning: allowing quick and direct access between consumers and specific products − Augmented Reality enabled experiences: producing actionable experiences by recognizing objects and layering information on top of a real-time view Neiman Marcus Samsung Google Introduces Amazon Introduces Snap. Introduces Bixby Google Lens Introduces Find. Shop. (2015) Vision (2017) (2017) StyleSnap (2019) Source: eMarketer, News articles 2 Visual Search’s role in retail Visual Search capabilities are increasingly becoming the industry standard for major ecommerce platforms Consumers place more importance on
    [Show full text]
  • Microsoft ML for Apache Spark
    Microsoft ML for Apache Spark Unifying Machine Learning Ecosystems at Massive Scales Mark Hamilton Microsoft, MIT [email protected] Overview Background Spark + SparkML MMLSpark Unifying ML Ecosystems LightGBM, CNTK, Vowpal Wabbit Vowpal Wabbit LightGBM CNTK Multilingual Bindings Microservice Orchestration Cognitive Services on Spark Cognitive Services Kubernetes Model Deployment with Spark Serving Use Cases The Snow Leopard Trust ● Code Driver ● Data Worker Worker Worker Age: Name: Age: Name: Age: Name: Int String Int String Int String A fault-tolerant distributed 15 Bob 1 Sam 25 Bob computing framework 25 Alice 2 Claire 25 Tom 33 Mary 53 Tina 66 Ted Map Reduce + SQL Data Source Whole program optimization + SQL, AWS Bucket, Local, CosmosDB, etc… query pushdown 120 110 Elastic 100 80 Scala, Python, R, Java, Julia 60 40 20 ML, Graph Processing, Streaming 0.9 0 Running Time(s) Hadoop Spark ML High level library for distributed machine learning More general than SciKit-Learn All models have a uniform interface Load Data Can compose models into Tokenizer Load Data complex pipelines Term Hashing Pipeline Can save, load, and transport Logistic Reg Evaluate models ● Estimator Evaluate ● Transformer Microsoft Machine Learning for Apache Spark v0.18 Microsoft’s Open Source Contributions to Apache Spark #UnifiedAnalytics #SparkAISummit 5 Distributed Fast Model Microservice Multilingual Binding Machine Learning Deployment Orchestration Generation www.aka.ms/spark Azure/mmlspark Unifying Machine Learning Ecosystems Goals
    [Show full text]
  • Studying the Live Cross-Platform Circulation of Images with Computer Vision API: an Experiment Based on a Sports Media Event
    International Journal of Communication 13(2019), 1825–1845 1932–8036/20190005 Studying the Live Cross-Platform Circulation of Images With Computer Vision API: An Experiment Based on a Sports Media Event CARLOS D’ANDRÉA ANDRÉ MINTZ1 Federal University of Minas Gerais, Brazil This article proposes a novel digital methods approach for studying the cross-platform circulation of images, taking these as traffic tags connecting topics across platforms and the open Web. To that end, it makes use of a computer vision API (Google Cloud Vision) to perform automated content-based image searches. The method is experimented on with an analysis of the circulation of pictures shared on Twitter during the 2018 FIFA World Cup Final Draw ceremony. The proposed approach showed high potential for overcoming linguistic and geographical barriers in Internet research by its use of nonverbal objects for cross-linking online content. Moreover, the analysis of the results raised important questions about the opacity of the employed API mechanisms and the limitations imposed by platformization processes for cross-platform endeavors, calling for continued reflexivity in derived studies. Keywords: digital methods, cross-platform, computer vision, media event Understanding the cross-platform circulation of Web content is one of the current challenges of digital methods research (Rogers, 2018). Although most studies still focus on specific sociotechnical practices enacted within social media platforms such as Facebook or Twitter, some methodological proposals and empirical studies (e.g., Burgess & Matamoros-Fernández, 2016) have taken hashtags, likes, and/or URLs as traffic tags (Elmer & Langlois, 2013) that organize online activity across different platforms. Considering the growing “ubiquity of the visual” (Highfield & Leaver, 2016, p.
    [Show full text]
  • Practical Deep Learning for Cloud, Mobile, and Edge Real-World AI and Computer-Vision Projects Using Python, Keras, and Tensorflow
    Practical Deep Learning for Cloud, Mobile, and Edge Real-World AI and Computer-Vision Projects Using Python, Keras, and TensorFlow Anirudh Koul, Siddha Ganju, and Meher Kasam Beijing • Boston • Farnham • Sebastopol • Tokyo O'REILLY Table of Contents Preface xv 1. Exploring the Landscape of Artificial Intelligence 1 An Apology 2 The Real Introduction 3 What Is AI? 3 Motivating Examples 5 A Brief History of AI 6 Exciting Beginnings 6 The Cold and Dark Days 8 A Glimmer of Hope 8 How Deep Learning Became a Thing 12 Recipe for the Perfect Deep Learning Solution 15 Datasets 16 Model Architecture 18 Frameworks 20 Hardware 23 Responsible AI 26 Bias 27 Accountability and Explainability 30 Reproducibility 30 Robustness 31 Privacy 31 Summary 32 Frequently Asked Questions 32 in 2. What's in the Picture: Image Classification with Keras 35 Introducing Keras 36 Predicting an Images Category 37 Investigating the Model 42 ImageNet Dataset 42 Model Zoos 44 Class Activation Maps 46 Summary 48 3. Cats Versus Dogs: Transfer Learning in 30 Lines with Keras 49 Adapting Pretrained Models to New Tasks 50 A Shallow Dive into Convolutional Neural Networks 51 Transfer Learning 53 Fine Tuning 54 How Much to Fine Tune 55 Building a Custom Classifier in Keras with Transfer Learning 56 Organize the Data 57 Build the Data Pipeline 59 Number of Classes 60 Batch Size 61 Data Augmentation 61 Model Definition 65 Train the Model 65 Set Training Parameters 65 Start Training 66 Test the Model 68 Analyzing the Results 68 Further Reading 76 Summary 78 4. Building a Reverse Image
    [Show full text]
  • A Multi-Modal Approach to Infer Image Affect
    A Multi-Modal Approach to Infer Image Aect Ashok Sundaresan, Sugumar Murugesan Sean Davis, Karthik Kappaganthu, ZhongYi Jin, Divya Jain Anurag Maunder Johnson Controls - Innovation Garage Santa Clara, CA ABSTRACT Our approach combines the top-down and boom-up features. e group aect or emotion in an image of people can be inferred Due to the presence of multiple subjects in an image, a key problem by extracting features about both the people in the picture and the that needs to be resolved is to combine individual human emotions overall makeup of the scene. e state-of-the-art on this problem to a group level emotion. e next step is to combine multiple investigates a combination of facial features, scene extraction and group level modalities. We address the rst problem by using a Bag- even audio tonality. is paper combines three additional modali- of-Visual-Words (BOV) based approach. Our BOV based approach ties, namely, human pose, text-based tagging and CNN extracted comprises developing a code book using two well known clustering features / predictions. To the best of our knowledge, this is the algorithms: k-means clustering and a Gaussian mixture model rst time all of the modalities were extracted using deep neural (GMM) based clustering. e code books from these clustering networks. We evaluate the performance of our approach against algorithms are used to fuse individual subjects’ features to a group baselines and identify insights throughout this paper. image level features We evaluate the performance of individual features using 4 dif- KEYWORDS ferent classication algorithms, namely, random forests, extra trees, gradient boosted trees and SVM.
    [Show full text]
  • Learning Visual Similarity for Product Design with Convolutional Neural Networks
    Learning visual similarity for product design with convolutional neural networks Sean Bell Kavita Bala Cornell University∗ Convolutional (c) Results 1: visually similar products Neural (a) Query 1: Input scene and box Network Learned Parameters θ (a) Query 2: Product (b) Project into 256D embedding (c) Results 2: use of product in-situ Figure 1: Visual search using a learned embedding. Query 1: given an input box in a photo (a), we crop and project into an embedding (b) using a trained convolutional neural network (CNN) and return the most visually similar products (c). Query 2: we apply the same method to search for in-situ examples of a product in designer photographs. The CNN is trained from pairs of internet images, and the boxes are collected using crowdsourcing. The 256D embedding is visualized in 2D with t-SNE. Photo credit: Crisp Architects and Rob Karosis (photographer). Abstract 1 Introduction Popular sites like Houzz, Pinterest, and LikeThatDecor, have com- Home owners and consumers are interested in visualizing ideas for munities of users helping each other answer questions about products home improvement and interior design. Popular sites like Houzz, in images. In this paper we learn an embedding for visual search in Pinterest, and LikeThatDecor have large active communities of users interior design. Our embedding contains two different domains of that browse the sites for inspiration, design ideas and recommenda- product images: products cropped from internet scenes, and prod- tions, and to pose design questions. For example, some topics and ucts in their iconic form. With such a multi-domain embedding, we questions that come up are: demonstrate several applications of visual search including identify- • “What is this fchair, lamp, wallpaperg in this photograph? ing products in scenes and finding stylistically similar products.
    [Show full text]
  • Arxiv:2003.05096V1 [Cs.MM] 11 Mar 2020
    Exploring the Role of Visual Content in Fake News Detection Juan Cao1;2, Peng Qi1;2[0000−0002−6458−8320], Qiang Sheng1;2[0000−0002−2481−5023], Tianyun Yang1;2, Junbo Guo1, and Jintao Li1 1 Key Laboratory of Intelligent Information Processing & Center for Advanced Computing Research, Institute of Computing Technology, CAS, China fcaojuan,qipeng,shengqiang18z,yangtianyun19z,guojunbo,[email protected] 2 University of Chinese Academy of Sciences, China Abstract. The increasing popularity of social media promotes the pro- liferation of fake news, which has caused significant negative societal effects. Therefore, fake news detection on social media has recently be- come an emerging research area of great concern. With the development of multimedia technology, fake news attempts to utilize multimedia con- tent with images or videos to attract and mislead consumers for rapid dissemination, which makes visual content an important part of fake news. Despite the importance of visual content, our understanding about the role of visual content in fake news detection is still limited. This chapter presents a comprehensive review of the visual content in fake news, including the basic concepts, effective visual features, representa- tive detection methods and challenging issues of multimedia fake news detection. This chapter can help readers to understand the role of visual content in fake news detection, and effectively utilize visual content to assist in detecting multimedia fake news. Keywords: fake news detection · fake-news images · social media · im- age forensics · image repurposing · multimedia · multi-modal · deep learn- ing · computer vision. 1 Introduction 1.1 Motivation arXiv:2003.05096v1 [cs.MM] 11 Mar 2020 Social media platforms, such as Twitter1 and Chinese Sina Weibo2, have become important access where people acquire the latest news and express their opin- ions freely34.
    [Show full text]