Features - Content Indexing and Search
Total Page:16
File Type:pdf, Size:1020Kb
Content Indexing and Search Features - Content Indexing and Search TABLE OF CONTENTS OVERVIEW SYSTEM REQUIREMENTS z Content Indexing Engine z Web Search Server z Web Search Client INSTALLATION z Install the Content Indexing Engine - Single Node Installation z Install the Content Indexing Engine - Multi-Node Installation z Install the Web Search Server z Install the Web Search Client CONFIGURATION z Content Indexing Engine z Offline Content Indexing z Web Search Server OFFLINE CONTENT INDEXING DATA DISCOVERY AND SEARCH CONTENT DIRECTOR z Legal Hold z Tagging z Enterprise Records Management (ERM) z Content Director Policy RESTORING DATA FROM SEARCH RESULTS MANAGEMENT - CONTENT INDEXING AND SEARCH Page 1 of 131 Content Indexing and Search Overview - Content Indexing and Search Topics | Support Overview of Content Indexing and Search Content Indexing and Search Components z Content Indexing Engine z Offline Content Indexing z Online Content Indexing z Search { Web-based Search Console z Legal Hold z Tagging z ERM Connectors z Content Director Policy License Requirements Security OVERVIEW OF CONTENT INDEXING AND SEARCH Content Indexing and Search provides the ability to content index and search both your file server/desktop data and protected/archived data for data discovery and other purposes. This product allows Compliance Officers, Administrators and End-Users to search and restore file system and application data. Here is a list of features supported by Content Indexing and Search: z Ability to Content Index offline and online data, which includes data in storage as well as user desktops. z Multi-purpose and flexible search capability using the web-based Search Console. z Search based on User Security which provides the capabilities for: { Compliance Officers to perform data discovery. { Administrators and end-users to search for files or objects that are associated with their security. z Ability to edit and save search queries. z Ability to preview the items returned by the search query. z Ability to restore files/objects discovered by the search operation. z Ability to save search results. Data can also be downloaded and saved as .pst, .cab, or .nsf files. z Ability to Legal Hold discovered items for long term retention for legal purposes. z Ability to create and attach tags to discovered items and later perform search based on the tags. z Ability to submit discovered items to a record management system, using the ERM Connector. z Ability to automate and schedule the data discovery operations using the Content Director Policy. The diagram on the right provides a broad overview of Content Indexing and Search. Contact Professional Services for assistance in designing the Content Indexing Engine and Search in your environment. CONTENT INDEXING AND SEARCH COMPONENTS Content Indexing and Search consists of the following main components. The diagram on the right provides a broad overview of the deployment and configuration of these components. CONTENT INDEXING ENGINE The Content Indexing Engine is the core component for the content indexing and search feature. It is the underlying integrated software application that provides indexing, searching and filtering services for all data - including file server/desktop data and protected/archived data. As the content indexing process is very resource intensive it is recommended that the engine be installed in powerful computer that has extensive memory and hard disk availability at all times. (See System Requirements - Content Indexing Engine for minimum requirements.) The Content Indexing Engine may be installed as a single-node installation where all the components within the Content Indexing Engine are installed in the same computer. Depending on the volume of Page 2 of 131 Content Indexing and Search data that must be content indexed in a CommCell, one or more Content Indexing Engines can be installed and configured. You can also perform a multi-node install to customize the installation of each Content indexing Engine to distribute and harness the capacity of multiple computers. Content Indexing Engine is the first component that must be installed. (See Deployment - Content Indexing and Search for more information on how to install the Content Indexing Engine.) The properties of each Content Indexing Engine in the CommCell is displayed in the CommCell Console under Storage Resources. Once installed, you can configure the content indexing engine to set the maximum number of batch slots and maximum number of documents per batch. You can also specify a staging location where the files to be content indexed will be staged temporarily prior to content indexing. Both offline and online content indexing processes are configured to use a Content Indexing Engine. This is explained in the following sections. OFFLINE CONTENT INDEXING Offline Content Indexing is used to content index the storage data secured by the various data protection/data archive operations. For this reason the configuration of the Offline Content Indexing is associated with a storage policy. Each storage policy must be configured to use a Content Indexing Engine, if content indexing is enabled in the storage policy. (See Configuration - Content indexing and Search for more details.) The MediaAgent associated with the Storage Policy will be used for reading the data associated with the storage policy. Offline content indexing is supported for all types of of data including compressed, deduplicated and encrypted data. OFFLINE CONTENT INDEXING FOR RMS PROTECTED DOCUMENTS You can also perform offline content indexing of documents/emails secured by Rights Management Service (RMS). Rights Management Service (RMS) is a technology that works with RMS enabled applications (such as, Microsoft Office applications, Microsoft Exchange Server, and Microsoft Sharepoint Server) to set usage rights on documents or emails. This is basically used by content authors to set permissions on their documents/emails in order to limit access to other users. For more information on Rights Management Service, refer Microsoft documentation. For more information on content indexing RMS protected content, see Content Indexing RMS Protected Files. OFFLINE CONTENT INDEXING FOR NAS AGENTS Offline content indexing is also supported for NAS backups. See Content Indexing- Support for a list of data types that are supported by offline content indexing. In order to view or restore the content indexed NAS data from the Search Console, install the Deployment - File System NDMP Restore Enabler on the web search server. OFFLINE CONTENT INDEXING FOR VIRTUAL SERVER IDATAAGENT Offline content indexing is also supported for file level backups on VMware virtual servers. See Content Indexing- Support to know the virtual server platforms supported by offline content indexing. OFFLINE CONTENT INDEXING FOR LOTUS NOTES/DOMINO SERVER Offline content indexing is also supported for Lotus Notes email backups. In order to enable Domino Directory Service login or to restore Lotus Notes emails, you need to install the Lotus Notes Client on the Web Search Server on a 32-bit platform. ONLINE CONTENT INDEXING Online Content Indexing operations can be performed using the following agents: ONLINE CONTENT INDEXING FOR FILE SYSTEM AGENT The Online Content Indexing for File System Agent allows you to content index live files residing on Windows computers. The Online Content Indexing Agents must be installed on all the computers in the CommCell that you wish to content index and search. See Deployment - Content Indexing and Search for information on installing the Online Content Indexing agents. See Configuration - Content indexing and Search for information on configuring the Online Content Indexing agents. SEARCH Once the data is content indexed, it can be searched for data discovery and other purposes. Search can be performed using the following components: WEB-BASED SEARCH CONSOLE The web-based Search Console provides a multi-purpose and flexible method to search and if necessary restore data. It has an easy-to-use search interface modeled after popular search engines. In order to perform searches from the Search Console, you need to install the Web Search Server and the Web Search Client. For information on installing the Web Search Server and Web Search Client, see Deployment - Content Indexing and Search. In order to view or restore the content indexed NAS data from the Search Console, install the Deployment - File System NDMP Restore Enabler on the web search server. Once installed, the web-based Search Console and User security must be configured before it is used. See Configuration - Content indexing and Search for more information. The Search Console also has powerful built-in security features that enables both compliance and end-users to search data based on individual security permissions. In addition, it also allows users to restore the appropriate file/data if necessary. The Search Console provides several options and tools to search the data. It also provides following additional advanced search options to further refine your search. z Search on multiple content indexing engines. Page 3 of 131 Content Indexing and Search z Enable/Disable Lemmatization and synonym search. During search, you have the facility to include intra operators against search criteria in the advanced search options window. It also allows users to preview the search results in the same or new window. When performing end-user search, the Search Console also provides options to search for Exchange