Beyond Search: What to Do When Your Enterprise Search System Doesn't Work

Total Page:16

File Type:pdf, Size:1020Kb

Beyond Search: What to Do When Your Enterprise Search System Doesn't Work Research Report Beyond Search: What to do when Your Enterprise Search System Doesn't Work April 2, 2008 by Stephen E. Arnold Gilbane Group Inc. 763 Massachusetts Avenue Cambridge, MA 02139 USA Tel: 617.497.9443 Fax: 617.497.5256 [email protected] http://gilbane.com Table of Contents List of Figures ........................................................................................................vi List of Tables....................................................................................................... viii Preface .............................................................................................. x Executive Summary............................................................................ 1 Fixing a Broken Search System.............................................................................1 How Do You Avoid Common Mistakes?...............................................................1 Beyond Key Words............................................................................................... 2 The Next-Generation Search Market................................................................... 2 The Market Layout............................................................................................... 4 Beyond Search at a Glance....................................................................................5 I. Introduction: Setting the Stage....................................................... 9 Search 2008 ........................................................................................................ 11 Does Your Organization Have Search Flu? ........................................................ 11 Maintaining Legacy Search Systems .................................................14 Fixing a Search System .......................................................................................14 Avoiding Some Common Pitfalls ........................................................................18 Getting to More than Key Words...................................................... 32 The Symbiosis of an Interface and Search Technology..................................... 33 Identifiable Trends............................................................................................. 39 Payoffs and Liabilities of Rich Text Processing................................................. 46 Market Context.................................................................................51 The Superplatforms ........................................................................................... 52 Autonomy, Endeca, and Fast Search (The Big Three) .......................................55 Up-and-Coming Vendors................................................................................... 63 More Choices, More Functionality .................................................................... 68 What’s Next? ...................................................................................................... 70 Google and Dataspaces .....................................................................73 Semantic Technology at Google..........................................................................73 Transformic: A Meta-Meta Approach to Content ..............................................75 Possible Impact of Google’s Dataspace Technology...........................................81 II. Tracking the Players ....................................................................83 A Snapshot of the Beyond-Search Market......................................................... 83 The Beyond Search Market Map........................................................................ 86 III. Vendor Profiles .......................................................................... 91 1. Access Innovations....................................................................... 92 MAIstro Suite ..................................................................................................... 93 Technology ......................................................................................................... 95 Customers............................................................................................................97 Benefits............................................................................................................... 98 Downside............................................................................................................ 99 © 2008 Gilbane Group, Inc. - i - http://gilbane.com Net-Net............................................................................................................... 99 2. Attensity Corporation ................................................................ 100 The System in Action ........................................................................................102 Technology ........................................................................................................103 The SDK - Attensity Integration .......................................................................107 New Features.....................................................................................................107 Upside................................................................................................................107 Downside.......................................................................................................... 108 Net-Net............................................................................................................. 108 3. Bitext SA ..................................................................................... 110 The Bitext Data Suite .........................................................................................111 Technology ........................................................................................................ 113 Upside................................................................................................................ 114 Downside........................................................................................................... 114 Net-Net.............................................................................................................. 114 4. Brainware, Inc............................................................................ 115 How Brainware Works...................................................................................... 116 Associative Memory: The Brainware Innovation............................................. 117 Features.............................................................................................................118 The System in Use.............................................................................................120 Upside................................................................................................................120 Downside...........................................................................................................120 Net-Net.............................................................................................................. 121 5. Cognition Technologies, Inc. .......................................................122 Examples of the System in Use.........................................................................124 The Knowledgebases.........................................................................................125 Key Features......................................................................................................126 Upside................................................................................................................127 Downside...........................................................................................................128 Net-Net..............................................................................................................128 6. Connotate Technologies..............................................................129 A Content Processing Riff .................................................................................129 Connotate’s Agent Approach ............................................................................ 131 Professional Services.........................................................................................133 The System in Action ........................................................................................133 Upside................................................................................................................134 Downside...........................................................................................................135 Net-Net..............................................................................................................135 7. Dieselpoint Inc............................................................................ 137 Key Features......................................................................................................138 Open Pipeline ....................................................................................................139 Technology ........................................................................................................142 Dieselpoint in Use .............................................................................................143 Upside................................................................................................................143 Downside...........................................................................................................144
Recommended publications
  • Out of the Ordinary: Finding Hidden Threats by Analyzing Unusual
    RAND-INITIATED RESEARCH CHILD POLICY This PDF document was made available CIVIL JUSTICE from www.rand.org as a public service of EDUCATION the RAND Corporation. ENERGY AND ENVIRONMENT HEALTH AND HEALTH CARE Jump down to document6 INTERNATIONAL AFFAIRS NATIONAL SECURITY The RAND Corporation is a nonprofit POPULATION AND AGING research organization providing PUBLIC SAFETY SCIENCE AND TECHNOLOGY objective analysis and effective SUBSTANCE ABUSE solutions that address the challenges TERRORISM AND facing the public and private sectors HOMELAND SECURITY TRANSPORTATION AND around the world. INFRASTRUCTURE Support RAND Purchase this document Browse Books & Publications Make a charitable contribution For More Information Visit RAND at www.rand.org Explore RAND-Initiated Research View document details Limited Electronic Distribution Rights This document and trademark(s) contained herein are protected by law as indicated in a notice appearing later in this work. This electronic representation of RAND intellectual property is provided for non- commercial use only. Permission is required from RAND to reproduce, or reuse in another form, any of our research documents. This product is part of the RAND Corporation monograph series. RAND monographs present major research findings that address the challenges facing the public and private sectors. All RAND mono- graphs undergo rigorous peer review to ensure high standards for research quality and objectivity. Out of the Ordinary Finding Hidden Threats by Analyzing Unusual Behavior JOHN HOLLYWOOD, DIANE SNYDER, KENNETH McKAY, JOHN BOON Approved for public release, distribution unlimited This research in the public interest was supported by RAND, using discretionary funds made possible by the generosity of RAND's donors, the fees earned on client-funded research, and independent research and development (IR&D) funds provided by the Department of Defense.
    [Show full text]
  • Private Companiescompanies
    PrivatePrivate CompaniesCompanies Private Company Profiles @Ventures Kepware Technologies Access 360 Media LifeYield Acronis LogiXML Acumentrics Magnolia Solar Advent Solar Mariah Power Agion Technologies MetaCarta Akorri mindSHIFT Technologies alfabet Motionbox Arbor Networks Norbury Financial Asempra Technologies NumeriX Asset Control OpenConnect Atlas Venture Panasas Autonomic Networks Perimeter eSecurity Azaleos Permabit Technology Azimuth Systems PermissionTV Black Duck Software PlumChoice Online PC Services Brainshark Polaris Venture Partners BroadSoft PriceMetrix BzzAgent Reva Systems Cedar Point Communications Revolabs Ceres SafeNet Certeon Sandbridge Technologies Certica Solutions Security Innovation cMarket Silver Peak Systems Code:Red SIMtone ConsumerPowerline SkyeTek CorporateRewards SoloHealth Courion Sonicbids Crossbeam Systems StyleFeeder Cyber-Ark Software TAGSYS DAFCA Tatara Systems Demandware Tradeware Global Desktone Tutor.com Epic Advertising U4EA Technologies ExtendMedia Ubiquisys Fidelis Security Systems UltraCell Flagship Ventures Vanu Fortisphere Versata Enterprises GENBAND Visible Assets General Catalyst Partners VKernel Hearst Interactive Media VPIsystems Highland Capital Partners Ze-gen HSMC ZoomInfo Invention Machine @Ventures Address: 187 Ballardvale Street, Suite A260 800 Menlo Ave, Suite 120 Wilmington, MA 01887 Menlo Park, CA 94025 Phone #: (978) 658-8980 (650) 322-3246 Fax #: ND ND Website: www.ventures.com www.ventures.com Business Overview @Ventures provides venture capital and growth assistance to early stage clean technology companies. Established in 1995, @Ventures has funded more than 75 companies across a broad set of technology sectors. The exclusive focus of the firm's fifth fund, formed in 2004, is on investments in the cleantech sector, including alternative energy, energy storage and efficiency, advanced materials, and water technologies. Speaker Bio: Marc Poirier, Managing Director Marc Poirier has been a General Partner with @Ventures since 1998 and operates out the firm’s Boston-area office.
    [Show full text]
  • Proceedings 1 AGILE Phd School
    REGIONA Regionales Innovationszentrum für nachhaltiges Wirtschaften und Umwelt-/Geoinformation Band 2 Hardy Pundt, Lars Bernard (eds.) Proceedings 1st AGILE PhD School March 13-14, 2012 Wernigerode, Germany Shaker Verlag Aachen 2012 i Preface The Association of Geographic Information Laboratories in Europe (AGILE, http://www.agile- online.org/) seeks to ensure that the views of the geographic information teaching and research community are fully represented in the discussions that take place on future European research agendas. AGILE also provides a permanent scientific forum where geographic information researchers can meet and exchange ideas and experiences at the European level. The activities of AGILE are managed by an eight person council elected by the member organizations. Its main tasks are to develop an organizational structure to release the goals of AGILE, to further develop with the help of the members a European research agenda, to instigate and stimulate AGILE initiatives, to network with other organizations working in the field of Geoinformation and concerning technologies and to organize the yearly AGILE conference on Geographic Information Science. During one of the last meetings, the Council discussed to strengthen the focus on young researchers who are doing challenging scientific work within the more than 80 member laboratories. Therefore, the Council decided to implement a new initiative that is focused specifically on PhD candidates. This decision was the advent of the AGILE PhD School. The first of its kind has now been carried out at the University of Applied Sciences in Wernigerode, Germany. PhD students and senior scientists met in Wernigerode to present their subjects, exchange ideas and to discuss critically experiences, results and challenges.
    [Show full text]
  • Improving the Utility of Social Media with Natural Language Processing
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by University of Melbourne Institutional Repository Improving the Utility of Social Media with Natural Language Processing A thesis presented by Bo Han to The Department of Computing and Information Systems in total fulfillment of the requirements for the degree of Doctor of Philosophy The University of Melbourne Melbourne, Australia February 2014 Produced on archival quality paper Declaration This is to certify that: (i) the thesis comprises only my original work towards the PhD except where indi- cated in the Preface; (ii) due acknowledgement has been made in the text to all other material used; (iii) the thesis is fewer than 100,000 words in length, exclusive of tables, maps, bibliographies and appendices. Signed: Date: ⃝c 2014 - Bo Han All rights reserved. Thesis advisor(s) Author Timothy Baldwin Bo Han Paul Cook Improving the Utility of Social Media with Natural Language Processing Abstract Social media has been an attractive target for many natural language processing (NLP) tasks and applications in recent years. However, the unprecedented volume of data and the non-standard language register cause problems for off-the-shelf NLP tools. This thesis investigates the broad question of how NLP-based text processing can improve the utility (i.e., the effectiveness and efficiency) of social media data. In particular, text normalisation and geolocation prediction are closely examined in the context of Twitter text processing. Text normalisation is the task of restoring non-standard words to their standard forms. For instance, earthquick and 2morrw should be transformed into \earthquake" and \tomorrow", respectively.
    [Show full text]
  • GEOINT 2021 Prospectus
    Discovery and Connections October 5-8, 2021 St. Louis, Missouri PROSPECTUS Sponsor and Exhibitor Dates October 6-8 EXHIBIT HALL FEATURING • Aerial Imaging • Location-Based Services • Artificial Intelligence • Machine/Deep Learning • Big Data Analytics • Mapping Tools • Cloud Computing • Mobile Applications • Commercial Satellite Imaging • Mobile Platforms • Consulting • Open Source Intelligence • Content Managment • Remote Sensing • Cyber Security • Sensors • Geographic Information Systems • Simulation • Global Positioning Hardware & • Small Sats Software • UAVs • IC ITE • Visualization Software • Image Processing • And Much More! • Integration Services and Software GEOINT2021.com ATTRACTING HIGHLY QUALIFIED BUYERS Acquisition Directorate Architect Sensing Portfolio Director AND KEY DECISION MAKERS Admiral Chief Systems Engineer President Analyst Engineer Executive Director Principal Architect Assistant Executive Chief Technology Executive Vice Principal Data Officer Director President Scientist Collection Manager Associate Vice Founder Principal Engineer President Colonel Functional Principal Intelligence Board Member Combat Development Management Analyst Analyst Executive Branch Chief Professor Commandant General Branch Head Program Director Commander General Council Budget Director Publisher Business Development Commanding General General Manager R&D Engineer Executive Congressional Liaison GEOINT Chief R&D Scientist Business Development Contract Officer GEOINT Division Chief Manager Senior All-source Contracting Officer Geospatial
    [Show full text]
  • Geographic Information Retrieval: Classification, Disambiguation
    Imperial College London Department of Computing Geographic Information Retrieval: Classification, Disambiguation and Modelling Simon E Overell Submitted in part fulfilment of the requirements for the degree of Doctor of Philosophy in Computing of the University of London and the Diploma of Imperial College, July 2009 1 2 Abstract This thesis aims to augment the Geographic Information Retrieval process with information extracted from world knowledge. This aim is approached from three directions: classifying world knowledge, disambiguating placenames and modelling users. Geographic information is becoming ubiquitous across the Internet, with a significant proportion of web documents and web searches containing geographic entities, and the proliferation of Internet enabled mobile devices. Traditional information retrieval treats these geographic entities in the same way as any other textual data. In this thesis I augment the retrieval process with geographic information, and show how methods built upon world knowledge outperform methods based on heuristic rules. The source of world knowledge used in this thesis is Wikipedia. Wikipedia has become a phenomenon of the Internet age and needs little introduction. As a linked corpus of semi-structured data, it is unsurpassed. Two approaches to mining information from Wikipedia are rigorously explored: initially I classify Wikipedia articles into broad categories; this is followed by much finer classification where Wikipedia articles are disambiguated as specific locations. The thesis concludes with the proposal of the Steinberg hypothesis: By analysing a range of wikipedias in different languages I demonstrate that a localised view of the world is ubiquitous and inherently part of human nature. All people perceive closer places as larger and more important than distant ones.
    [Show full text]