HPI Future SOC Lab: Proceedings 2015

Total Page:16

File Type:pdf, Size:1020Kb

HPI Future SOC Lab: Proceedings 2015 HPI Future SOC Lab: Proceedings 2015 Christoph Meinel, Andreas Polze, Gerhard Oswald, Rolf Strotmann, Ulrich Seibold, Bernhard Schulzki (Eds.) HPI Future SOC Lab: Proceedings 2015 Christoph Meinel | Andreas Polze | Gerhard Oswald | Rolf Strotmann | Ulrich Seibold | Bernhard Schulzki (Eds.) HPI Future SOC Lab Proceedings 2015 Hasso-Plattner-Institut für Softwaresystemtechnik GmbH Bibliografische Information der Deutschen Nationalbibliothek Die Deutsche Nationalbibliothek verzeichnet diese Publikation in der Deutschen Nationalbibliografie; detaillierte bibliografische Daten sind im Internet über http://dnb.dnb.de/ abrufbar. Hasso-Plattner-Institut 2017 https://hpi.de/ Prof.-Dr.-Helmert-Straße 2-3, 14482 Potsdam Tel.: +49-(0)331 5509-0 7 / Fax: +49-(0)331 5509-325 E-Mail: [email protected] Das Manuskript ist urheberrechtlich geschützt. Online veröffentlicht auf dem Publikationsserver der Universität Potsdam URN urn:nbn:de:kobv:517-opus4-102516 http://nbn-resolving.de/urn:nbn:de:kobv:517-opus4-102516 Contents Spring 2015 Prof. Dr. Karl Kurbel, European University Viadrina Frankfurt (Oder) Solving of LP and CWLP Problems Using SAP HANA . 1 Prof. Dr. Christoph Meinel, Hasso Plattner Institute Security Monitoring and Analytics of HPI FutureSoC Lab (Phase I) . 7 Machine Learning for Security Analytics powered by SAP HANA (Phase II) . 11 Prof. Dr. Frank Morelli, Pforzheim University of Applied Sciences Sales Planning and Forecast . 19 Prof. Dr.-Ing. Jorge Marx Gomez,´ Carl von Ossietzky Universitat¨ Oldenburg Project OliMP: In-Memory Planning with SAP HANA . 25 Dr. Lena Wiese, Georg-August-Universitat¨ Gottingen¨ OntQA-Replica: Intelligent Data Replication for Ontology-Based Query Answering . 29 Prof. Dr. Hasso Plattner, Hasso Plattner Institute Natural Language Processing for In-Memory Databases: an Application to Biomedical Question Answering . 35 Provision of Analyze Genomes Services in a Federated In-Memory Database System for Life Sciences . 39 Prof. Dr. Andreas Polze, Hasso Plattner Institute Inspection and Evaluation of Modern Hardware Architectures . 43 Prof. Dr. Holger Giese, Hasso Plattner Institute Large-Scale Graph-Databases based on Graph Transformations & Multi-Core Architectures 49 Prof. Dr. Tadeusz Czachorski,´ Silesian University of Technology Modelling wide area networks using SAP HANA in-memory database . 53 Prof. Dr. Helmut Krcmar, Technical University of Munich Using Process Mining to Identify Fraud in the Purchase-to-Pay Process . 57 Dr. Harald Sack, Hasso Plattner Institute Comparison of Image Classification Models on Varying Dataset Sizes . 63 i Prof. Dr. Witold Abramowicz, Department of Information Systems, Poznan Uni- versity of Economics Sentiment Analysis for the needs of benchmarking the Energy Sector . 69 Prof. Dr. Katinka Wolter, Institute of Computer Science, Freie Universitat¨ Berlin Model-based Quantitative Security Analysis of Mobile Offloading Systems under Timing Attacks . 73 Fall 2015 Prof. Dr. Peter Fettke, Institut fur¨ Wirtschaftsinformatik (IWi) at Deutsches Forschungszen- trum fur¨ Kunstliche¨ Intelligenz (DFKI) and Saarland University Towards Process Mining on Big Data: Optimizing Process Model Matching Approaches on High Performance Computing Infrastructure . 79 Prof. Dr. Andreas Polze, Hasso Plattner Institute A survey of security-aware approaches for cloud-based storage and processing technologies 83 Dr. Lena Wiese, Institut fur¨ Informatik, Georg-August-Universitat¨ Gottingen¨ OntQA-Replica: Intelligent Data Replication for Ontology-Based Query Answering (Revis- ited and Verified) . 89 Prof. Dr. Hasso Plattner, Hasso Plattner Institute Natural Language Processing for In-Memory Databases: Boosting Biomedical Applications 95 Extending Analyze Genomes to a Federated In-Memory Database System For Life Sciences 99 Interactive Product Cost Simulation on Coprocessors . 103 Prof. Dr. Gunther Piller, University of Applied Sciences Mainz ActOnAir: Data Mining and Forecasting for the Personal Guidance of Asthma Patients . 109 Prof. Dr. Dr. h.c. Hans-Jurgen¨ Appelrath, Carl von Ossietzky Universitat¨ Olden- burg BICE: A Cloud-based Business Intelligence System . 113 Prof. Dr. Holger Giese, Hasso Plattner Institute Large-Scale Graph-Databases based on Graph Transformations & Multi-Core Architectures 117 Dr. Benjamin Fabian, Institute of Information Systems, Humboldt University of Berlin Analyzing the Global-Scale Internet Graph at Different Topology Levels: Data Collection and Integration . 121 Dr. Harald Sack, Hasso Plattner Institute Comparison of Feature Extraction Approaches for Image Classification . 127 ii Prof. Dr. Christoph Engels, University of Applied Sciences and Arts Dortmund Optimization of Data Mining Ensemble Algorithms on SAP HANA . 131 Prof. Dr. Christoph Meinel, Hasso Plattner Institute Simulation of User Behavior on a Security Testbed . 137 Prof. Dr. Jan Eloff, University of Pretoria, South Africa Protecting minors on social media platforms . 141 Prof. Dr. Helmut Krcmar, Technical University of Munich Using Process Mining to Identify Fraud in the Purchase-to-Pay Process . 145 Prof. Dr. Bernd Scheuermann, Hochschule Karlsruhe - Technik und Wirtschaft On the Potential of Big Data Boosting Bio-inspired Optimization A Study Using SAP HANA 151 iii Solving of LP and CWLP Problems Using SAP HANA Karl Kurbel Dawid Nowak European University Viadrina European University Viadrina Große Scharrnstraße 59 Große Scharrnstraße 59 15230 Frankfurt (Oder), Germany 15230 Frankfurt (Oder), Germany [email protected] [email protected] Abstract the test cases used. Section 5 discusses the test results, while the concluding section wraps up the findings and This paper explores the capabilities of SAP HANA for gives an outlook to further research. solving optimization problems, in particular linear programming and mixed-integer programming prob- 2 Solution approach lems. The study contrasts a tightly integrated solution approach (GENIOS) with an external solver approach From release SPS 08 on, SAP HANA can be equipped (R server) and with self-implemented optimization with its own solver, GENIOS (GENeric Integer heuristics. All solution approaches are integrated into Optimization System) [6]. GENIOS is capable of a test environment in HANA, and compared with solving continuous and mixed-integer linear program- respect to performance and solution quality. Based on ming (MILP) problems. It is a part of AFL (Applica- a series of test cases, performance indicators are tion Function Libraries) [7, p. 6]), which extends evaluated and factors influencing the performance are HANA's functionality by predefined functions. These discussed. functions can be called by user-defined stored proce- dures. They are native to the data engine underlying 1 Introduction and motivation SAP HANA’s index server, thus offering the best possible performance for data access and data pro- Today’s business software, such as enterprise resource cessing [8, p. 14]. planning (ERP) and supply chain management (SCM) Bearing in mind the architectural concept of SAP systems, provides solutions to many planning and HANA [8, pp. 13-15] and the results of previous decision problems [1]. A number of exact and studies [9], two approaches seem to be worth trying in heuristic optimization algorithms have been order to make a comparison with GENIOS. incorporated into the software (for example, mixed- Since HANA's power is exploited best when integer linear programming for supply network "everything" happens inside HANA, the first option is planning in SAP SNP [2]). to create a solver for optimization problems with the Optimization models representing real-life situations means that HANA provides for software development. can be very large, requiring a lot of computing power This means, in the first place, to use the native and efficient algorithms. programming language, SQLScript [10]. SAP HANA as an in-memory database exhibits very The second option is a linear programming package good performance where data access and data that is available on R servers. Although an R server is processing are concerned. Due to efficient techniques external to HANA, procedures in R can be written and such as parallel processing and column-oriented data invoked inside HANA, just like SQLScript procedures storage [3], SAP HANA outperforms typical data- [11]. An R server is a statistical server providing bases [4] [5]. advanced calculation functionalities. These function- The questions we are investigating in this paper is how alities can be easily extended through downloadable optimization models can be integrated into HANA and function libraries and R scripts. One of them is the whether optimization can also benefit from HANA's lpSolve package, which provides an application processing power. programming interface (API) for building and solving linear programs [12]. The next section discusses different approaches to solving optimization problems in HANA. Section 3 The types of problems we are considering in this study focuses on the solution architecture and the perfor- are linear programming (LP) and mixed-integer linear mance measures used for the test runs. The fourth programming (MILP) problems. A representative of section briefly describes the testing environment and the latter type is the so-called capacitated warehouse location problem (CWLP). The CWLP embraces the 1 decision in which locations warehouses with limited 3.2 Performance measuring concept capacities are to be built, and the decision which customers are to be served by which warehouses so
Recommended publications
  • Acatech Annual Report 2012 Acatech Jahresberichtcontents 2012
    > ANNUAL REPORT 2012 > acatech2012 ANNUALREPORT acatech Annual Report 2012 acatech JahresberichtContents 2012 CONTENTS PRESIDENTS’ FOREWORD 4 INNOVATION POLICY ADVICE 6 ENERGY, RESOURCES, SUSTAINABILITY 8 TECHNOLOGIES 10 EDUCATION AND TECHNOLOGY COMMUNICATION 12 INTERNATIONAL RELATIONS 14 DIALOGUE 16 PROJECTS 18 EVENTS 20 PUBLICATIONS 24 MEMBERS 28 SENATE 32 EXECUTIVE BOARD 34 FINANCIAL SUPPORTERS 36 FINANCES 40 acatech OFFICE 42 PUNKT PRIZE 44 3 acatech Annual Report 2012 Foreword > PRESIDENTS’ FOREWORD In this, its tenth year, the Academy can once again look back Germany’s decision to shift towards renewable energy produc- and partnership arrangements with the national academies on a busy and successful twelve months. As the independent tion could be one potential driver of innovation. Together of technology in China, India and the US, as well as chairing voice of the technological sciences community, acatech has with the German National Academy of Sciences Leopoldina the Euro-CASE Innovation Platform. contributed in a variety of ways to the public debate both at and the Union of the German Academies of Sciences and home and abroad and has raised a number of new issues. Humanities, we are developing science-based scenarios for Our work relies on the interdisciplinary, cross-sectoral, evi- the “energy systems of the future”. A concurrent interdiscipli- dence-based dialogue that is a key feature of the Academy. We have addressed several important social questions in the nary Dialogue Forum is engaged in drafting specific recom- We would therefore like to express our gratitude to everyone areas of “education and technology communication”, “ener- mendations on this topic. In 2012, as a starting point for this whose knowledge and commitment have helped make gy, resources and sustainability” and “technologies”, with the work, we put forward a series of proposals on how to fund the acatech a platform for lively exchanges on a variety of issues.
    [Show full text]
  • 10Th Annual Symposium on Future Trends in Service-Oriented
    10 th Annual Symposium on Future Trends in Service-Oriented Computing Hasso Plattner Institute Potsdam | June 17–19, 2015 Design IT. Create Knowledge. http://hpi.de/future-soc-symposium Agenda Wednesday, June 17, 2015 (Doctoral Symposium) Room: H.E-51 (Main Building) 10:00–10:30 Keynote Prof. Dr. Tobias FRIEDRICH , Hasso Plattner Institute Models of Large Real-World Networks 10:30–10:45 Co+ee Break 10:45–12:30 Doctoral Symposium GAO Heng, Ph.D. Student, Nanjing University Parallel Transformation to Programs on GPU Cluster Pedro LOPES , Ph.D. Student, Hasso Plattner Institute Proprioceptive Interaction: Our Own Muscles as Input & Output Devices Using Computer-controlled Electrical Muscle Stimulation Pacome AMBASSA , Ph.D. Student, University of Cape Town Security for Resilient Power Grids in Resource Constrained Environment Arian TREFFER , Ph.D. Student, Hasso Plattner Institute Omniscient Debugging in Database Applications Lighton PHIRI , Ph.D. Student, University of Cape Town Investigating the Impact of Organised Orchestration on Teaching Alexandra ION , Ph.D. Student, Hasso Plattner Institute Skin Drag Displays Yamiko MSOSA , Ph.D. Student, University of Cape Town Clinical Knowledge Framework for Low Resource Settings: A Case of Malawi Anton KROHMER , Ph.D. Student, Hasso Plattner Institute What is a Hyperbolic Random Graph? 12:30–13:30 Lunch 13:30–15:00 Poster & Ice Cream Session HPI Research School Ph.D. Students Share Their Ideas - Poster Session 16:00 Social Event Game Science Center Berlin 3 Thursday, June 18, 2015 (Symposium with Industry & Academia) Room: HS 1, HS Foyer (Lecture Building) 13:00–13:15 Opening of the Symposium on Future Trends in Service-Oriented Computing Prof.
    [Show full text]
  • HPI Future SOC Lab : Proceedings 2012
    HPI Future SOC Lab: Proceedings 2012 Christoph Meinel, Andreas Polze, Gerhard Oswald, Rolf Strotmann, Ulrich Seibold, Bernhard Schulzki (Hrsg.) Technische Berichte Nr. 85 des Hasso-Plattner-Instituts für Softwaresystemtechnik an der Universität Potsdam ISBN 978-3-86956-276-6 ISSN 1613-5652 Technische Berichte des Hasso-Plattner-Instituts für Softwaresystemtechnik an der Universität Potsdam Technische Berichte des Hasso-Plattner-Instituts für Softwaresystemtechnik an der Universität Potsdam | 85 Christoph Meinel | Andreas Polze | Gerhard Oswald | Rolf Strotmann | Ulrich Seibold | Bernhard Schulzki (Hrsg.) HPI Future SOC Lab Proceedings 2012 Universitätsverlag Potsdam Bibliografische Information der Deutschen Nationalbibliothek Die Deutsche Nationalbibliothek verzeichnet diese Publikation in der Deutschen Nationalbibliografie; detaillierte bibliografische Daten sind im Internet über http://dnb.dnb.de/ abrufbar. Universitätsverlag Potsdam 2014 http://verlag.ub.uni-potsdam.de/ Am Neuen Palais 10, 14469 Potsdam Tel.: +49 (0)331 977 2533 / Fax: 2292 E-Mail: [email protected] Die Schriftenreihe Technische Berichte des Hasso-Plattner-Instituts für Softwaresystemtechnik an der Universität Potsdam wird herausgegeben von den Professoren des Hasso-Plattner-Instituts für Softwaresystemtechnik an der Universität Potsdam. ISSN (print) 1613-5652 ISSN (online) 2191-1665 Das Manuskript ist urheberrechtlich geschützt. Druck: docupoint GmbH Magdeburg ISBN 978-3-86956-276-6 Zugleich online veröffentlicht auf dem Publikationsserver der Universität Potsdam: URL http://pub.ub.uni-potsdam.de/volltexte/2014/6899/ URN urn:nbn:de:kobv:517-opus-68991 http://nbn-resolving.de/urn:nbn:de:kobv:517-opus-68991 Contents Spring 2012 Prof. Dr. Ben Juurlink, Architektur eingebetteter Systeme, Technische Universitat¨ Berlin Parallelizing H.264 Decoding with OpenMP Superscalar . 1 Prof. Dr. Jurgen¨ Dollner,¨ Computer Graphics Systems, Hasso-Plattner-Institut Service-Based 3D Rendering and Interactive 3D Visualization .
    [Show full text]
  • Christoph Meinel (Univ
    Professor Dr. Christoph Meinel (Univ. Prof., Dr. sc. nat., Dr. rer. nat., 1954) is Scientific Director and CEO of the Hasso Plattner Institute for Software Systems Engineering GmbH (HPI). Hasso Plattner Institute is part of the University of Potsdam and privately financed by the non-profit foundation of Hasso Plattner, one of the founders of SAP. HPI provides outstanding Bachelor, Master and PhD degree pro- grams in IT-Systems Engineering as well as a study program in De- sign Thinking. The HPI course offerings are freely accessible via the tele-TASK portal and the openHPI online platform. In 2006, together with Hasso Plattner, Christoph Meinel hosted the 1st National IT Summit of German Chancellor Dr. Angela Merkel at HPI in Potsdam. Meinel is a member of acatech, the German “National Academy of Science and Engineering,” and numerous scientific committees and supervisory boards. Christoph Meinel is a full professor (C4) for computer science and serves as department chair of Internet Technologies and Systems at HPI. His areas of research focus on Internet and Information Security, Web 3.0, Semantic Web, Social and Service Web and the domains of e-learning, teleteach- ing and telemedicine. He is scientifically active in innovation research on all aspects of the Stanford innovation method “Design Thinking.” His earlier research work concentrated on the theoretical foundation of computer science in the areas of complexity theory and efficient OBDD-based algo- rithms and data structures. Christoph Meinel teaches courses in the Bachelor and Master’s programs on “IT-Systems Engineer- ing” and at the HPI School of Design Thinking.
    [Show full text]
  • An Attempt at MOOC Localization for Chinese-Speaking Users
    International Journal of Information and Education Technology, Vol. 6, No. 2, February 2016 An Attempt at MOOC Localization for Chinese-Speaking Users Xiaoyin Che, Sheng Luo, Cheng Wang, and Christoph Meinel who live in mainland China to join the course on openHPI.de. Abstract—―Internetworking with TCP/IP‖ is a massive open In order to offer better service on openHPI.cn, we don‘t online course (MOOC) provided by Germany-based MOOC want to simply clone the courses from openHPI.de. platform ―openHPI‖, which has been offered in German, Localizing them will not only help the Chinese learners English and – recently – Chinese respectively, with similar understand the content, but also show our sincerity. And content. In this paper, the authors, who worked jointly as a teacher (or as teaching assistants) in this course, want to share through the experience gained as the members of openHPI.cn their ideas derived from daily teaching experiences, analysis of teaching team, we have learned that a localized course should the statistics, comparison between the performance in different be more than a simply translated course. Just providing language offers and the feedback from user questionnaires. subtitles is far from enough. Additionally, the motivation, attempt and suggestion at MOOC The authors worked jointly as a teacher or as teaching localization will also be discussed. assistants in the course ―Internetworking with TCP/IP‖, simplified as ―Internetworking‖, which was available on both Index Terms—Chinese-speaking users, localization, MOOC, openHPI. openHPI.de and openHPI.cn. From 2012 to 2014, this course has been offered three times with similar content, but in German, English and Chinese respectively.
    [Show full text]