IBM Infosphere Streams V3.0

326

Copy Link

Front cover Addressing Data Volume, Velocity, and Variety with IBM InfoSphere Streams V3.0 Performing real-time analysis on big data by using System x Developing a drag and drop Streams application Monitoring and administering Streams Mike Ebbers Ahmed Abdel-Gayed Veera Bhadran Budhi Ferdiansyah Dolot Vishwanath Kamat Ricardo Picone João Trevelin ibm.com/redbooks International Technical Support Organization Addressing Data Volume, Velocity, and Variety with IBM InfoSphere Streams V3.0 March 2013 SG24-8108-00 Note: Before using this information and the product it supports, read the information in “Notices” on page ix. First Edition (March 2013) This edition applies to IBM InfoSphere Streams V3.0. © Copyright International Business Machines Corporation 2013. All rights reserved. Note to U.S. Government Users Restricted Rights -- Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. Contents Notices . ix Trademarks . .x Preface . xi The team who wrote this book . xi Now you can become a published author, too! . xiv Comments welcome. xiv Stay connected to IBM Redbooks . xiv Chapter 1. Think big with big data. 1 1.1 Executive summary. 2 1.2 What is big data? . 3 1.3 IBM big data strategy . 5 1.4 IBM big data platform . 7 1.4.1 InfoSphere BigInsights for Hadoop-based analytics . 7 1.4.2 InfoSphere Streams for low-latency analytics. 8 1.4.3 InfoSphere Information Server for Data Integration . 9 1.4.4 Netezza, InfoSphere Warehouse, and Smart Analytics System for deep analytics10 1.5 Summary. 13 Chapter 2. Exploring IBM InfoSphere Streams. 15 2.1 Stream computing . 16 2.1.1 Business landscape . 19 2.1.2 Information environment . 22 2.1.3 The evolution of analytics . 26 2.2 IBM InfoSphere Streams. 28 2.2.1 Overview of Streams. 29 2.2.2 Why use Streams . 34 2.2.3 Examples of Streams implementations. 36 Chapter 3. InfoSphere Streams architecture . 41 3.1 Streams concepts and terms . 42 3.1.1 Working with continuous data flow . 42 3.1.2 Component overview . 43 3.2 InfoSphere Streams runtime system. 45 3.2.1 SPL components. 45 3.2.2 Runtime components . 47 3.2.3 InfoSphere Streams runtime files . 49 3.3 Performance requirements . 51 3.3.1 InfoSphere Streams reference architecture . 52 3.4 High availability . 55 Chapter 4. IBM InfoSphere Streams V3.0 new features. 57 4.1 New configuration features . 58 4.1.1 Enhanced first steps after installation . 58 4.2 Development . 59 4.3 Administration . 59 4.3.1 Improved visual application monitoring. 59 4.3.2 Streams data visualization . 59 © Copyright IBM Corp. 2013. All rights reserved. iii 4.3.3 Streams console application launcher . 59 4.4 Integration . 60 4.4.1 DataStage integration . 60 4.4.2 Netezza integration . 60 4.4.3 Data Explorer integration . 60 4.4.4 SPSS integration. 60 4.4.5 XML integration. 60 4.5 Analytics and accelerators toolkits . 61 4.5.1 Geospatial toolkit . 61 4.5.2 Time series toolkit . 61 4.5.3 Complex Event Processing toolkit. 61 4.5.4 Accelerators toolkit . 61 Chapter 5. InfoSphere Streams deployment. 63 5.1 Architecture, instances, and topologies . 64 5.1.1 Runtime architecture. 64 5.1.2 Streams instances . 66 5.1.3 Deployment topologies . 69 5.2 Streams runtime deployment planning . 72 5.2.1 Streams environment . 72 5.2.2 Sizing the environment . 73 5.2.3 Deployment and installation checklists . 73 5.3 Streams instance creation and configuration . 80 5.3.1 Streams shared instance configuration. 80 5.3.2 Streams private developer instance configuration . 84 5.4 Application deployment capabilities . 88 5.4.1 Dynamic application composition . 89 5.4.2 Operator host placement . 91 5.4.3 Operator partitioning . 96 5.4.4 Parallelizing operators . ..

Recommended publications

Online Identity in the Case of the Share Phenomenon. a Glimpse Into the on Lives of Romanian Millennials

Online identity in the case of the share phenomenon. A glimpse into the on lives of Romanian millennials Demetra GARBAȘEVSCHI PhD Student National University of Political Science and Public Administration E-Mail: [email protected] Abstract. In less than a decade, the World Wide Web has evolved from a predominantly search medium to a predominantly share medium, from holding a functional role to being endowed with a social one.In the context of a reontologisation of the infosphere and of an unprecedented display of mass self-communication, the identity system has gained a legitimate dimension – online identity –, as individuals have become the sum of impressions openly offered online and decoded into a coherent story by the receiver. In the network society, there are consequences to both having and not having an online identity. Originating in an interactionist perspective, the present paper looks into Romanian Millennials in trying to find out whether online identity is undergoing a process of intentionalization, in other words whether it becomes a conscious, planned effort of the individual to build himself/ herself a legitimate and profitable dimension in the digital space. Keywords: online identity; infosphere; mass self-communication; Millennials; Generation Y. 1. Introduction and theoretical background This paper examines online identity as part of an individual’s identity system, in the specific context of current Internet development, generalized connectivity Journal of Media Research, Vol. 8 Issue 2(22) / 2015,14 pp. 14-26 and participation through the share web. The discussion centers on Romanian young adults, seeking to uncover their perceptions of online identity as a potentially strategic self-representation process.
Storming the Reality Studio

DRAFT Storming the Reality Studio: Leveraging Public Information in the War on Terror Brendan Matthew-Gordon Kelly Prepared for the 47th Annual International Studies Association Convention March 22-25, 2006 San Diego, CA Abstract This paper ar gues that the war on terror is understood on both sides as an idea war, an event that signifies the triumph of Constructivist theories over strictly Realist interpretations of international politics. It further argues that this is a watershed event, in which information operations have finally taken a primary role in military strategy. Finally, it argues that this is most visible in cyberspace. On February 17th, Defense Secretary Donald Rumsfeld spoke before the Council on Foreign Relations to argue that America was losing the information war in its struggle against radical Islam: Rumsfeld also said al-Qaida and other Islamic extremist groups have poisoned the Muslim public's view of the United States through deft use of the Internet and other modern communications methods that the American government has failed to master. "Our enemies have skillfully adapted to fighting wars in today's media age, but for the most part we - our country, our government - has not adapted," he said. 1 This argument is problematic for several reasons. First, it fails to consider the possibility that the Muslim world’s “poisoned” view of the United States has nothing to do with Al-Qaeda or other extremist organizations.2 But even if we accept Rumsfeld’s argument at face value, these statements are still problematic. The fact is that America, the home of Hollywood and Madison Avenue, has dominated the art of political spin for decades.
Data Management in Systems Biology I

Data management in systems biology I – Overview and bibliography Gerhard Mayer, University of Stuttgart, Institute of Biochemical Engineering (IBVT), Allmandring 31, D-70569 Stuttgart Abstract Large systems biology projects can encompass several workgroups often located in different countries. An overview about existing data standards in systems biology and the management, storage, exchange and integration of the generated data in large distributed research projects is given, the pros and cons of the different approaches are illustrated from a practical point of view, the existing software – open source as well as commercial - and the relevant literature is extensively overviewed, so that the reader should be enabled to decide which data management approach is the best suited for his special needs. An emphasis is laid on the use of workflow systems and of TAB-based formats. The data in this format can be viewed and edited easily using spreadsheet programs which are familiar to the working experimental biologists. The use of workflows for the standardized access to data in either own or publicly available databanks and the standardization of operation procedures is presented. The use of ontologies and semantic web technologies for data management will be discussed in a further paper. Keywords: MIBBI; data standards; data management; data integration; databases; TAB-based formats; workflows; Open Data INTRODUCTION the foundation of a new journal about biological The large amount of data produced by biological databases [24], the foundation of the ISB research projects grows at a fast rate. The 2009 (International Society for Biocuration) and special edition of the annual Nucleic Acids Research conferences like DILS (Data Integration in the Life database issue mentions 1170 databases [1]; alone Sciences) [25].
Four Modes of Travelling and Navigating the Knowledge Universe?

Alternative view of segmented documents via Kairos 21st December 2006 | Draft Noonautics Four modes of travelling and navigating the knowledge "universe"? -- / -- Annex 4 of Towards an Astrophysics of the Knowledge Universe: from astronautics to noonautics? (2007) Introduction Four modes of travel? "Vehicles" "Globe"... "planets"... "celestial sphere"? "Global" as integrative Travel where / when / why? Constraints and possibilities Travelling "songlines" to engender the knowledge universe Songlines as "elven pathways" through the knowledge universe? Embodying a universe of "stargates"? Introduction Following from the arguments of the main paper, four distinct "universes" can notably be distinguished, as in the following table Distinctive "universes" cognitively objective and subjective neither objective nor subjective "complex" (relevantial universe) (enactivism) cognitively objective ("external") subjective ("internal") "binary" (classificational universe) (relational universe) This distribution benefits from the case made by Kinhide Mushakoji (Global Issues and Interparadigmatic Dialogue: essays on multipolar politics, 1988) for a quadrilemma (A, not-A, A and not-A, neither A nor not-A) as well as from the distinction, noted earlier, by Magoroh Maruyama regarding the different kinds of information universe (classificational, relational, and relevantial): Objective: With respect to an "objective" understanding of the knowledge universe, the main approach to its organization and navigation has been widely addressed through conventional knowledge organization and information systems and the possibilities of the world wide web. Objective and subjective: Beyond what might be caricatured as the management of conceptual "pigeon holes" and the conceptual "tunnels" between them, there is the potential for a more meaningful integrative approach, necessarily dependent on a subjective dimension and appeals to "values". Features of this are emerging in the conceptual dimensions of Web 2.0 and may be significant in the future of the semantic web.
Implementing an Infosphere Optim Data Growth Solution

Front cover IBM® Information Management Software Implementing an InfoSphere Optim Data Growth Solution Understanding the InfoSphere Optim Data Growth solution architectures Implementing InfoSphere Optim Data Growth solutions Managing data growth with flexibility Whei-Jen Chen David Alley Barbara Brown Sunil Dravida Saunnie Dunne Tom Forlenza Pamela S Hoffman Tejinder S Luthra Rajat Tiwary Claudio Zancani ibm.com/redbooks International Technical Support Organization Implementing an InfoSphere Optim Data Growth Solution November 2011 SG24-7936-00 Note: Before using this information and the product it supports, read the information in “Notices” on page xi. First Edition (November 2011) This edition applies to IBM InfoSphere Optim Data Growth Solution Version 7.3.1. © Copyright International Business Machines Corporation 2011. All rights reserved. Note to U.S. Government Users Restricted Rights -- Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. Contents Notices . xi Trademarks . xii Preface . xiii The team who wrote this book . xiii Acknowledgements . xvi Now you can become a published author, too! . xvii Comments welcome. xvii Stay connected to IBM Redbooks . xviii Chapter 1. Introduction to IBM InfoSphere Optim . 1 1.1 Challenges . 2 1.1.1 Data explosion . 2 1.1.2 Current approaches . 3 1.2 Information governance. 3 1.3 IBM role in information governance. 4 1.3.1 History . 4 1.3.2 IBM approach to data governance . 5 1.3.3 Data governance maturity model . 7 1.4 Information lifecycle management. 8 1.4.1 Benefits of implementing the correct ILM strategy . 11 1.4.2 What is data archiving. 11 1.5 IBM InfoSphere Optim Data Growth Solution .
Dynamic Information with IBM Infosphere Data Replication CDC

Front cover IBM® Information Management Software Smarter Business Dynamic Information with IBM InfoSphere Data Replication CDC Log-based for real-time high volume replication and scalability High throughput replication with integrity and consistency Programming-free data integration Chuck Ballard Alec Beaton Mark Ketchie Anzar Noor Frank Ketelaars Judy Parkes Deepak Rangarao Bill Shubin Wim Van Tichelen ibm.com/redbooks International Technical Support Organization Smarter Business: Dynamic Information with IBM InfoSphere Data Replication CDC March 2012 SG24-7941-00 Note: Before using this information and the product it supports, read the information in “Notices” on page ix. First Edition (March 2012) This edition applies to Version 6.5 of IBM InfoSphere Change Data Capture (product number 5724-U70). © Copyright International Business Machines Corporation 2012. All rights reserved. Note to U.S. Government Users Restricted Rights -- Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. Contents Notices . ix Trademarks . x Preface . xi The team who wrote this book . xii Now you can become a published author, too! . xvi Comments welcome. xvii Stay connected to IBM Redbooks . xvii Chapter 1. Introduction and overview . 1 1.1 Optimized data integration . 2 1.2 InfoSphere architecture . 4 Chapter 2. InfoSphere CDC: Empowering information management. 9 2.1 The need for dynamic data . 10 2.2 Data delivery methods. 11 2.3 Providing dynamic data with InfoSphere CDC . 12 2.3.1 InfoSphere CDC architectural overview . 14 2.3.2 Reliability and integrity . 16 Chapter 3. Business use cases for InfoSphere CDC . 19 3.1 InfoSphere CDC techniques for transporting changed data .
Annual Meeting with the Financial Market

Annual meeting with the financial market Speech by the Chairman Professor Paolo Savona Rome, 16 June 2020 Speech by the Chairman to the financial market Introduction 1. Rethinking the institutional architecture while taking into account changes in the monetary policy framework, financial conditions and technological innovations available 2. The ultimate goal: to strengthen the risk capital of exporting companies, especially medium and small, and to protect all forms of savings by directing them to support real business 3. Actions to be taken immediately Conclusions Introduction The Report that Consob is presenting today was forwarded to the Minister of Economy and Finance on 31 March pursuant to Law 216/1974. It illustrates the work carried out in 2019, ranging from approving share and bond issue prospectuses to examining takeover bids, monitoring stock exchange transactions and the activities of financial operators, imposing sanctions and examining domestic, European and supranational regulatory measures. The extent of the commitment can be summarised by the 101 meetings of the Commission held to examine the 1246 files handled by the offices. The most important news was the closure of 117 websites (223 as at 11 June 2020) that were collecting savings illegally, which was ordered consequently to the power granted to Consob by Law 58 of 28 June 2019. In the first part of 2020, 21 measures were taken to tackle the problems arising from the outbreak of the Covid-19 pandemic; such measures were announced during the hearing at the Parliamentary Committee of Inquiry into the Banking and Financial System on 28 May. With the International supervisory bodies (ESMA and IOSCO), firms were given instructions, among other things, on the information to be disclosed to the market in the financial reporting and information on the measures taken to grant the continuity of companies’ business and to mitigate the dominant uncertainties about looming risks.
A Metalogue with Floridi's Information Ethics

APA NEWSLETTER | PHILOSOPHY AND COMPUTERS ———. “On the Intrinsic Value of Information Objects and the Infosphere.” Ethics and Information Technology 4, no. 4 (2002): 287– 304. Meaningful Reality: A Metalogue with ———. The Fourth Revolution, How the Infosphere Is Reshaping Human Reality. Oxford: Oxford University Press, 2014. Floridi’s Information Ethics ———. “The Method of Levels of Abstraction.” Minds and Machines 18, no. 3 (2008): 303–29. doi:10.1007/s11023-008-9113-7. Pompeu Casanovas ———. “Understanding Information Ethics.” APA Newsletter on INSTITUTE OF LAW AND TECHNOLOGY, AUTONOMOUS UNIVERSITY Philosophy and Computers 7, no. 1 (2007): 3–12. OF BARCELONA, [email protected] / CENTRE FOR APPLIED SOCIAL RESEARCH, ROYAL MELBOURNE Floridi, Luciano, and J. W. Sanders. “Artificial Evil and the Foundation INSTITUTE OF TECHNOLOGY, [email protected] of Computer Ethics.” Ethics and Information Technology 3, no. 1 (2001): 55–66. Floridi, Luciano, and Mariarosaria Taddeo. The Ethics of Information Abstract. This is a comment on some aspects of the The Warfare. New York: Springer, 2014. Ethics of the Information by Luciano Floridi. This paper Gelven, Michael. War and Existence: a Philosophical Inquiry. University explores some of the notions advanced in the book, its Park, PA: Pennsylvania State University Press, 1994. methodology, and its practical and ontological turn. In the Hayes, Carol M., and Jay P. Kesan. Law of Cyber Warfare. SSRN Scholarly end, some suggestions are made about the relationship Paper ID 2396078. Rochester, NY: Social Science Research Network, 2014. http://papers.ssrn.com/abstract=2396078. between Information Ethics (IE), policy, and law. Hepburn, Ronald W. “Wonder” and Other Essays: Eight Studies in Aesthetics and Neighbouring Fields.
APA Newsletter on Philosophy and Computers a Basic Cognitive Cycle, Including Several Modes of Learning, 08:2

APA Newsletters NEWSLETTER ON PHILOSOPHY AND COMPUTERS Volume 09, Number 1 Fall 2009 FROM THE EDITOR, PETER BOLTUC FROM THE CHAIR, MICHAEL BYRON NEW AND NOTEWORTHY: A CENTRAL APA INVITATION ARTICLES Featured Article RAYMOND TURNER “The Meaning of Programming Languages” GREGORY CHAITIN “Leibniz, Complexity, and Incompleteness” AARON SLOMAN “Architecture-Based Motivation vs. Reward-Based Motivation” DISCUSSION 1: ON ROBOT CONSCIOUSNESS STAN FRANKLIN, BERNARD J. BAARS, AND UMA RAMAMURTHY “Robots Need Conscious Perception: A Reply to Aleksander and Haikonen” PENTTI O. A. HAIKONEN “Conscious Perception Missing. A Reply to Franklin, Baars, and Ramamurthy” © 2009 by The American Philosophical Association ONTOLOGICAL STATUS OF WEB-BASED OBJECTS DAVID LEECH ANDERSON “A Semantics for Virtual Environments and the Ontological Status of Virtual Objects” ROBERT ARP “Realism and Antirealism in Informatics Ontologies” DISCUSSION 2: ON FLORIDI KEN HEROLD “A Response to Barker” JOHN BARKER “Reply to Herold” DISCUSSION 3: ON LOPES GRANT TAVINOR “Videogames, Interactivity, and Art” ONLINE EDUCATION MARGARET A. CROUCH “Gender and Online Education” H. E. BABER “Women Don’t Blog” BOOK REVIEW Christian Fuchs: Social Networking Sites and the Surveillance Society. A Critical Case Study of the Usage of studiVZ, Facebook, and MySpace by Students in Salzburg in the Context of Electronic Surveillance REVIEWED BY SANDOVAL MARISOL AND THOMAS ALLMER SYLLABUS DISCUSSION AARON SLOMAN “Teaching AI and Philosophy at School?” CALL FOR PAPERS Call for Papers with Ethics Information Technology on “The Case of e-Trust: A New Ethical Challenge” APA NEWSLETTER ON Philosophy and Computers Piotr Bołtuć, Editor Fall 2009 Volume 09, Number 1 The second topic area pertains to L.
De-Identified Multidimensional Medical Records for Disease Population Demographics and Image Processing Tools Development

DE-IDENTIFIED MULTIDIMENSIONAL MEDICAL RECORDS FOR DISEASE POPULATION DEMOGRAPHICS AND IMAGE PROCESSING TOOLS DEVELOPMENT DISSERTATION Presented in Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy in the Graduate School of The Ohio State University By Barbaros Selnur Erdal, D.D.S., M.S. ***** Electrical and Computer Engineering The Ohio State University 2011 Dissertation Committee: Prof. Bradley D. Clymer, Adviser Prof. Elliott D. Crouser Prof. Umit V. Catalyurek Prof. Kun Huang c Copyright by Barbaros Selnur Erdal 2011 ABSTRACT Recently, The National Institute of Health (NIH) has outlined its scientific priorities in a strategic plan, “NIH Roadmap for Medical Research”. In direct alignment with these pri- orities, many academic and research oriented medical institutions across The United States conduct numerous clinical and translational research studies on an ongoing basis. From a personalized health care and translational research perspective, quite often efforts of such nature will span across multiple departments or even institutions. We consider these activ- ities as a knowledge and information flow which is taking place around multidimensional, heterogeneous clinical and research data that is collected from disparate sources. The primary objective of the research and development described in this thesis is to pro- vide an integrative platform where multidimensional data from multiple disparate sources can be easily accessed, visualized, and analyzed. We believe that ability to execute such truly integrative queries, visualizations and analyses across multiple data types is critical to the ability to execute highly effective clinical and translational research. Therefore, to address the preceding gap in knowledge, we introduce a model computational frame- work that is intended to support the integrative query, visualization and analysis of struc- tured data, narrative text, and image data sets in support of translational research activi- ties.
A Review of Data Mining Using Big Data in Health Informatics Matthew Herland, Taghi M Khoshgoftaar and Randall Wald*

Herland et al. Journal of Big Data 2014, 1:2 http://www.journalofbigdata.com/content/1/1/2 SURVEY PAPER OpenAccess A review of data mining using big data in health informatics Matthew Herland, Taghi M Khoshgoftaar and Randall Wald* *Correspondence: [email protected] Florida Atlantic University, 777 Abstract Glades Road, Boca Raton, FL, USA The amount of data produced within Health Informatics has grown to be quite vast, and analysis of this Big Data grants potentially limitless possibilities for knowledge to be gained. In addition, this information can improve the quality of healthcare offered to patients. However, there are a number of issues that arise when dealing with these vast quantities of data, especially how to analyze this data in a reliable manner. The basic goal of Health Informatics is to take in real world medical data from all levels of human existence to help advance our understanding of medicine and medical practice. This paper will present recent research using Big Data tools and approaches for the analysis of Health Informatics data gathered at multiple levels, including the molecular, tissue, patient, and population levels. In addition to gathering data at multiple levels, multiple levels of questions are addressed: human-scale biology, clinical-scale, and epidemic-scale. We will also analyze and examine possible future work for each of these areas, as well as how combining data from each level may provide the most promising approach to gain the most knowledge in Health Informatics. Keywords: Big data; Health informatics; Bioinformatics; Neuroinformatics; Clinical informatics; Public health informatics; Social media Introduction The field of Health Informatics is on the cusp of its most exciting period to date, entering a new era where technology is starting to handle Big Data, bringing about unlimited poten- tial for information growth.
Troubleshooting.Pdf

Troubleshooting4 Troubleshooting a problem7 Understanding where to find troubleshooting information10 Searching knowledge bases11 Using Support Assistant in Management Console12 To collect source and target information using Support Assistant13 Locating event messages14 To change the number of messages in the event log15 To remove event messages daily16 Locating log files17 Enabling detailed tracing for InfoSphere CDC for z/OS18 Enabling detailed InfoSphere CDC tracing for distributed systems19 To enable tracing using Management Console20 To enable tracing using the dmset command line utility21 To disable tracing using Management Console23 To disable tracing using the dmset command line utility24 Using trace options in Management Console25 To enable tracing for Management Console messages26 To enable tracing for Access Server messages27 To enable tracing for Access Server log information28 To disable tracing for Management Console messages29 To disable tracing for Access Server messages30 To disable tracing for Access Server log information31 Troubleshooting installation and configuration32 Encountering a previous instance of InfoSphere CDC in the instance list33 Encountering an ./oraclenativeapi.dll is not a valid win2k application message34 Encountering problems while using InfoSphere CDC for InfoSphere DataStage with Direct Connect35 Encountering a You must perform a FULL DATABASE BACKUP to start the FULL or BULK- LOGGED RECOVERY MODEL message when configuring InfoSphere CDC for DB2 for LUW37 Encountering a You must perform a FULL DATABASE BACKUP to start the FULL or BULK- LOGGED RECOVERY MODEL message when configuring InfoSphere CDC for Microsoft SQL Server38 Encountering a The RECOVERY MODEL for the database must be FULL or BULK-LOGGED message when configuring InfoSphere CDC for Microsoft SQL Server39 Encountering a Microsoft SQL Server is not configured for distribution.