Announcement February 4, 2003

Preview: IBM DB2 Information Integrator (Beta Program) Integrates Diverse Information Across and Beyond the Enterprise

Overview

IBM is introducing the IBM DB2 Information Integrator family of products. It provides the foundation for a strategic information integration framework to help customers access, manipulate, and integrate diverse, distributed, and real-time data. The family consists of:

• IBM DB2 Information Integrator V8.1: A new product based on IBM DB2 technology
• IBM DB2 Information Integrator for Content V8.2: Formerly IBM Enterprise Information Portal

Both products enable customers to abstract a common data model across data and content sources and to access and manipulate them as though they were a single source. Each supports a user community defined primarily by the data they access and the development community that they support. Both products are now available in beta programs.

Key Prerequisites

The offerings will be available for these operating systems:

• Microsoft Windows NT
• Microsoft Windows 2000
• AIX
• Hewlett-Packard HP-UX
• Sun Solaris
• Linux on Intel

Preview Announcements

Preview announcements provide insight into IBM plans and direction. Availability, prices, ordering information, and terms and conditions will be provided when the product is announced.

At a Glance

With the IBM DB2 Information Integrator family of products you can:

• Choose the data access strategy to match business value
  − Centralize information for availability or performance
  − Manage distributed access to required data
• Integrate diverse and distributed data, without moving the data or changing the platforms
  − Access disparate data as though it were a single source
• Progress more quickly at a lower cost
  − Increase developer productivity for integrating diverse, distributed, and real-time data
  − Deploy current skills over a greater range of project requirements
• Rely on proven IBM technology
  − Trust 25 years of data management research and development
  − Integrate with complementary technologies
• Protect your current and future IT investments
  − Build on industry standards
  − Reduce the modification and replacement of systems to make them work together

This announcement is provided for your information only. For additional information, contact your IBM representative, call 800-IBM-4YOU, or visit the IBM home page at: http://www.ibm.com.

IBM United States 203-021
IBM is a registered trademark of International Business Machines Corporation.

Description

IBM DB2 Information Integrator Family Benefits

With the IBM DB2 Information Integrator family of products you can:

Choose the data access strategy to match business value: Consolidating data local to the application simplifies application development and provides better data access performance and availability, but it also introduces the cost of moving data, storing it, and managing its synchronization. IBM DB2 Information Integrator V8.1 supports centralized access by providing replication and caching to support performance and availability requirements.

Alternatively, when there is wide diversity in the data accessed, or when the data is owned outside the enterprise, it is impractical or too expensive to replicate it. The IBM DB2 Information Integrator family lets you leave distributed data where it is, yet access it transparently as though it were a single source. This approach maps well to the predominantly read-access scenarios common to enterprise-wide reporting, business intelligence, portal infrastructures, e-commerce applications, customer care, and trading partner linkage requirements.

Integrate data and content without moving the data or changing the platform: Access diverse and distributed data as if it were a single source, no matter where it resides. The product family provides a broad range of data source access out-of-the-box, covering structured and unstructured data across and beyond the enterprise. Sources currently include relational databases, flat files, XML documents, spreadsheets, content repositories, Web sites, Web services, and message queues. You can also extend access to proprietary or virtually any other data source.

Make more progress, more quickly, and at a lower cost: With the IBM DB2 Information Integrator family of products you can develop the new generation of composite applications that require efficient integration of disparate data. Depending on the product selected, developers can use either SQL or object-oriented access. You now have a practical way to integrate diverse relational data and combine it, for example, with unstructured data from content repositories, the World Wide Web, and spreadsheets. Developers can use SQL to create joins or unions over the data, to compute statistical functions, to aggregate data, to use online analytical processing (OLAP) features, and to compose or transform XML documents, among other choices. This speeds project deployment, leverages existing skills over a broader range of projects, and reduces ongoing maintenance costs.

Rely on proven IBM technology: These products, developed from proven IBM technologies, are used by customers today and are based on 25 years of data management research and development. They provide a scalable, cross-platform infrastructure that integrates with the IBM WebSphere business integration portfolio, including IBM WebSphere Business Integrator, IBM WebSphere Portal Server, IBM WebSphere MQ, and IBM WebSphere Studio, for a complete business integration infrastructure.

Protect your current and future Information Technology (IT) investments: The products are based on industry standards such as SQL, XML, Java, and Web services to provide broad interoperability. The access-in-place capabilities reduce rewriting or replacing systems to make them work together. These products provide a strategic, reusable, and open information integration platform that lets customers choose the right approach for their business.

IBM DB2 Information Integrator Product Family

IBM DB2 Information Integrator is a new family of products from IBM which provides the foundation for a strategic information integration framework to help customers access, manipulate, and integrate diverse, distributed, and real-time data. The family consists of IBM DB2 Information Integrator V8.1 and IBM DB2 Information Integrator for Content V8.2. Both products enable customers to abstract a common data model across diverse and distributed data and content sources and to access and manipulate them as though they were a single source. Each product supports a user community defined primarily by the data they access and the development community they support. The product family supports the predominantly read-access scenarios common to enterprise-wide reporting, knowledge management, business intelligence, portal infrastructures, e-commerce applications, customer care, and trading partners' linkage requirements.

IBM DB2 Information Integrator V8.1: IBM DB2 Information Integrator V8.1 is targeted primarily at the application development community familiar with relational database application development. Applications that use SQL or tools that generate SQL (integrated development environments, reporting, and analytical tools) can now access, integrate, and manipulate distributed and diverse data through a federated data server. This product is most appropriate for projects whose primary data sources are relational data augmented by other XML, Web, or content sources. IBM DB2 Information Integrator core capabilities include:

A Federated Data Server

• Administrators configure data source access and define integrated views across diverse and distributed data
  − Administrators use integrated graphical tools to configure access to source data, representing that data as logical tables in the federated data server.
  − Integrated views can be composed across these sources using standard SQL view definitions and expressions.
• Data sources include:
  − Relational: IBM DB2, Informix Dynamic Server, Informix Extended Parallel Server, Microsoft SQL Server, Oracle, Sybase SQL Server, Sybase Adaptive Server Enterprise, Teradata, and Open Database Connectivity (ODBC) sources.
  − Nonrelational: WebSphere MQ message queues, Web services, Microsoft Access, Microsoft Excel spreadsheets, flat files, XML documents, LDAP directories, and data sources accessible by Entrez, Blast, HMMer, BioRS, Documentum, and IBM Lotus Extended Search. IBM Lotus Extended Search provides access to multiple data stores, including Domino, IBM DB2 Information Integrator for Content sources (such as IBM Content Manager, IBM Content Manager OnDemand, and IBM ImagePlus), relational databases (IBM DB2, Oracle, Sybase, Microsoft SQL Server, Microsoft Access), Lotus Domino.Doc, Microsoft Index Server, Microsoft Site Server, Microsoft Exchange, and over 18 Web search sites.

  − A developer toolkit is provided to add access to other sources.
• Applications can query or search across the aggregated data sources as if they were in a single database.
  − The query is expressed using standard SQL. SQL expressions may be used to transform the data for business analysis or data exchange.
  − Text search semantics may be used within the query. A fast, versatile, and intelligent full text search capability is provided across all relational data sources, including those that either don't support native text search or don't provide as broad a range of text search capability. A large set of search operations is supported, such as Boolean, wildcard, free-text, fuzzy search, proximity search for words within the same sentence or paragraph, or search within XML documents.
  − The query may produce standard SQL answer sets or XML documents, which can be:
    -- Generated from the federated source data to facilitate interchange
    -- Automatically validated against DTDs or XML schemas
    -- Transformed using XSL for flexible presentation
  − Results can be made available to the rest of the organization by publishing them to a WebSphere MQ message queue using built-in functions.
  − The federated server uses cost-based distributed query optimization to select the best access paths for higher query performance. It leverages intelligence about optimizing access to the data sources provided by the data source wrapper, by database statistics, and by the administrator.
  − The administrator can define data caches over the federated data (called Materialized Query Tables) to improve query responsiveness and availability for read-only access. If cache use is enabled by the application, the optimizer can automatically redirect the query to exploit the cache. Cache refresh is managed by the administrator.
• Applications can access the server by either traditional database or Web service clients.

A Replication Server for Mixed Relational Databases

• Customers can replicate data between mixed relational data sources. DB2, Informix, Microsoft, Oracle, and Sybase are supported as replication sources and targets; Teradata is supported as a replication target.
• Customers can configure a variety of topologies, latency, and consistency characteristics:
  − The replication server supports distribution (moving data from one database to many) and consolidation (moving data from many databases to one) scenarios.
  − Transformation can be applied in-line with the data movement via standard SQL expressions or stored procedure execution.
  − Data movement can be automated to occur on a specific schedule, at designated intervals, continuously, or as triggered by events.
  − Data movement can be managed table-at-a-time (such as for warehouse loading during batch windows) or with transaction consistency (for data that is never off-line).

IBM DB2 Information Integrator V8.1 is currently available in beta. If you are interested in participating in the beta program, contact your IBM representative for additional information.

IBM DB2 Information Integrator for Content V8.2: IBM DB2 Information Integrator for Content V8.2 is targeted at the application development community familiar with content management application development. These customers primarily access content management sources but have requirements for additional sources across the enterprise. This product, the new generation of IBM Enterprise Information Portal, is suited for solutions where the developer is a content application developer familiar with content management programming interfaces (and object-oriented APIs). It is indicated where:

• Customers need federated access to content sources such as IBM Content Manager, IBM Content Manager OnDemand, ImagePlus, and Lotus Domino.Doc. The application may also need integrated access across non-IBM content repositories such as FileNET Panagon Image Services, or other content sources. Enterprise content may be augmented with relational databases including DB2, Oracle, and Open Database Connectivity (ODBC) sources.
• Customers need sophisticated analysis of the textual information in their applications, content repositories, e-mail repositories, databases, and file systems. To leverage this information, it must be indexed, summarized, and organized by content or classified according to a taxonomy. Text analytics gathers and summarizes information about individual documents as well as groups of documents:
  − Language identification determines the language of each document, which is important for international businesses.
  − Information extraction identifies information contained in the document and classifies it into meaningful entities such as names of people or organizations, domain technical terms, abbreviations, dates, numbers, or currency amounts.
  − Categorization assigns documents to pre-existing categories based on a taxonomy predefined by the firm (product line or competitors).
  − Clustering groups related documents automatically based on content. This differs from categorization as it does not require predefined classes.
  − Summarization extracts the most relevant sentences from each document to create a document synopsis.
• Customers need to search across a multitude of other information sources, including file systems, Lotus Domino databases, Microsoft Exchange Servers, and Web search sites. To access these data sources, IBM DB2 Information Integrator for Content V8.2 integrates with IBM Lotus Extended Search, increasing the range of the data accessible by the application.
• An optional integrated workflow component can include case management with data residing in IBM Content Manager repositories and other supported data sources. IBM DB2 Information Integrator for Content V8.2 enables all accessed information to be included in workflow processes. An advanced workflow application provides a graphical
workflow builder to easily define the advanced workflow processes across the enterprise.

IBM DB2 Information Integrator for Content V8.2 (formerly IBM Enterprise Information Portal) is currently available in beta. If you are interested in participating in the beta program, contact your IBM representative for additional information.

Global Financing

IBM Global Financing offers competitive financing to credit-qualified customers to assist them in acquiring IT solutions. Our offerings include financing for IT acquisitions, including hardware, software, and services, both from IBM and other manufacturers or vendors. Offerings (for all customer segments: small, medium, and large enterprise), rates, terms, and availability can vary by country. Contact your local IBM Global Financing organization or visit the Web at:

http://www.ibm.com/financing

Product Positioning

The IBM DB2 Information Integrator family of products strengthens IBM's industry leading WebSphere business integration portfolio.

Recognizing the market requirement for structure and clarity, IBM introduced a framework for complete business integration. Fundamentally, integration revolves around people, processes, applications, and information. Different integration approaches are necessary for different classes of integration problems. For example, online customer orders must be enabled through an application, not a database application programming interface (API). Business rules embedded in application programming logic protect the database from inappropriate use.

Alternatively, the application that responds with a projected delivery date could access correlated information across manufacturing and shipping databases, and could depend on the data management system to handle the complex joins and mask differences between the data sources. As in this example, the best solution often uses several approaches, emphasizing the need for moving easily among technologies. While competitors may provide only niche integration, IBM can deliver complete integration with offerings that work together smoothly. With over 30 years of experience in building and evolving its base offerings for middleware and enabling these offerings to work together in thousands of different business environments, IBM has identified five types of integration based on an open services infrastructure that can be used together or separately to address these issues.

• Information integration enables the integration of diverse forms of business information across and beyond the enterprise. Instead of sequentially accessing individual information sources, information integration enables coherent search, access, replication, and transformation over a unified view of information assets to meet business needs.
• Application connectivity allows applications to ... and leverage information. Business assets are efficiently connected to allow information across disparate systems to be available across the enterprise.
• Process integration takes application connectivity to the next level by allowing the business to change how they operate by customizing the modeling, automation, and monitoring of processes across people and heterogeneous systems, both inside and outside the enterprise.
• User interaction is about creating a single, interactive user experience across applications and devices.
• Build to integrate focuses on building and deploying new integration-ready applications that leverage Web services and existing assets. Instead of traditional silos, new solutions must enable immediate integration with existing software assets.

These approaches may be used together or separately to address business integration challenges. Information integration, as delivered in the IBM DB2 Information Integrator family of products, is an enabling technology for the other approaches, providing integrated, declarative access to diverse data.

Reference Information

Refer to the Statement of General Direction section in these announcements:

Preview Announcement of IBM DB2 Universal Database V8.1
• Software Announcement 202-171, dated July 23, 2002

Worldwide Announcement of IBM DB2 Universal Database V8.1 for Linux, UNIX, and Windows
• Software Announcement 202-214, dated September 17, 2002

The DB2 federated features previously available as DB2 Relational Connect V7.2 and DB2 Life Sciences Data Connect V7.2 have been enhanced and will be reintroduced in IBM DB2 Information Integrator V8.1.

Product Upgrades

IBM DB2 Information Integrator V8.1 is the successor product for IBM DB2 Relational Connect, IBM DB2 Life Sciences Data Connect, and IBM DB2 DataJoiner. Customers who have a software maintenance agreement for IBM DB2 Relational Connect, IBM DB2 Life Sciences Data Connect, or IBM DB2 DataJoiner will be entitled to upgrade to an IBM DB2 Information Integrator V8.1 product.

Trademarks

Informix is a trademark of International Business Machines Corporation in the United States or other countries or both.
DB2, WebSphere, AIX, ImagePlus, DB2 Universal Database, and DataJoiner are registered trademarks of International Business Machines Corporation in the United States or other countries or both.
Intel is a registered trademark of Intel Corporation.
Microsoft is a trademark of Microsoft Corporation.
Windows NT and Windows are registered trademarks of Microsoft Corporation.
Java is a trademark of Sun Microsystems, Inc.
UNIX is a registered trademark of the Open Company in the United States and other countries.
Domino is a trademark of Lotus Development Corporation and/or IBM Corporation.
Lotus and Domino.Doc are registered trademarks of Lotus Development Corporation and/or IBM Corporation.
Other company, product, and service names may be trademarks or service marks of others.
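
Illustration: The projected-delivery-date scenario described under Product Positioning, in which one application reads correlated manufacturing and shipping data through a single integrated view, can be approximated on a small scale. The following Python sketch is an illustration only and is not IBM DB2 Information Integrator syntax. It assumes SQLite's ATTACH DATABASE as a stand-in for a federated data server, and the table names, column names, view name, and sample rows are all hypothetical.

```python
import sqlite3


def open_federated_connection():
    """Return one connection that spans two independent in-memory databases."""
    # "main" stands in for the manufacturing database (hypothetical schema).
    conn = sqlite3.connect(":memory:")
    # A second, separate in-memory database stands in for the shipping system.
    conn.execute("ATTACH DATABASE ':memory:' AS shipping")

    conn.execute(
        "CREATE TABLE main.production "
        "(order_id INTEGER PRIMARY KEY, build_days INTEGER)"
    )
    conn.execute(
        "CREATE TABLE shipping.transit "
        "(order_id INTEGER PRIMARY KEY, transit_days INTEGER)"
    )
    conn.executemany("INSERT INTO main.production VALUES (?, ?)", [(1, 3), (2, 5)])
    conn.executemany("INSERT INTO shipping.transit VALUES (?, ?)", [(1, 2), (2, 4)])

    # An integrated view defined with ordinary SQL. SQLite requires a TEMP
    # view when the definition references more than one attached database.
    conn.execute(
        """
        CREATE TEMP VIEW delivery_estimate AS
        SELECT p.order_id, p.build_days + t.transit_days AS total_days
        FROM main.production AS p
        JOIN shipping.transit AS t ON t.order_id = p.order_id
        """
    )
    return conn


def projected_delivery_days(conn, order_id):
    """Query the integrated view as if it were a single table."""
    row = conn.execute(
        "SELECT total_days FROM delivery_estimate WHERE order_id = ?",
        (order_id,),
    ).fetchone()
    return None if row is None else row[0]


if __name__ == "__main__":
    connection = open_federated_connection()
    print(projected_delivery_days(connection, 1))
```

The application code in projected_delivery_days never names the underlying databases; it depends only on the view, which is the property the announcement attributes to a federated data server. A real deployment would differ in how sources are registered and optimized.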
