Relativity Processing User Guide V9.2
Processing User Guide
February 4, 2019 - Version 9.2

For the most recent version of this document, visit our documentation website.

Table of Contents

1 Processing
  1.1 Application version considerations
  1.2 Basic processing workflow
2 Installing and configuring Processing
  2.1 Upgrade considerations
  2.2 License considerations
  2.3 Importing the Processing application
  2.4 Worker manager server
  2.5 Worker manager server on the resource pool
  2.6 Processing agents
  2.7 Creating a choice for the processing source location
3 Processing custodians
  3.1 Creating a new custodian
    3.1.1 Fields
  3.2 Viewing or editing custodian details
  3.3 Importing custodians through the RDC
4 Supported file types for processing
  4.1 Supported file types
    4.1.1 Multi-part forensic file considerations
    4.1.2 Tracking inline/embedded images
    4.1.3 Native text extraction and OCR
    4.1.4 Microsoft Office child extraction support
  4.2 Notable unsupported file types
  4.3 Supported container file types
    4.3.1 Lotus Notes considerations
  4.4 Container file types supported for Password Bank in Inventory
    4.4.1 Non-container file types supported for Password Bank in Inventory
5 Password bank
  5.1 Password bank in processing workflow
  5.2 Creating or deleting a Password Bank entry
    5.2.1 Fields
    5.2.2 Example password
    5.2.3 Encrypted Collection files
  5.3 Validations, errors, and exceptions
    5.3.1 Special considerations - encrypted file names inside RAR archives
  5.4 Viewing audits
6 Processing profiles
  6.1 Creating or editing a processing profile
    6.1.1 Fields
  6.2 Editing field mappings
  6.3 Invalid OCR language combinations
  6.4 Processing fields
    6.4.1 Default fields
    6.4.2 Optional fields
    6.4.3 Pivot-enabled fields for processing
7 Deduplication considerations
  7.1 Global deduplication
  7.2 Custodial deduplication
  7.3 No deduplication
  7.4 Global deduplication with attachments
  7.5 Global deduplication with document-level errors
  7.6 Technical notes for deduplication
    7.6.1 Calculating MD5/SHA1/SHA256 hashes
    7.6.2 Calculating RPC deduplication hashes for emails
    7.6.3 Calculating the Relativity deduplication hash
8 Processing sets
  8.1 Processing sets default view
  8.2 Creating a processing set
  8.3 Fields
  8.4 Adding a data source
  8.5 Fields
    8.5.1 Order considerations
    8.5.2 Edit considerations for data sources
  8.6 Deleting a processing set
  8.7 Avoiding data loss across sets
9 Inventory
  9.1 Running inventory
    9.1.1 Inventory process
    9.1.2 Monitoring inventory status
    9.1.3 Canceling inventory
  9.2 Filtering files
    9.2.1 Applying a Date range filter
    9.2.2 Applying a File Size filter
    9.2.3 Applying a deNIST filter
    9.2.4 Applying a Location filter
    9.2.5 Applying a File Type filter
    9.2.6 Applying a Sender Domain filter
  9.3 Removing filters
  9.4 Inventory progress
  9.5 Discovering files from Inventory
  9.6 Inventory errors
    9.6.1 Inventory error scenarios
  9.7 Re-inventory
10 Discovering files
  10.1 Running file discovery
    10.1.1 Discovery process
    10.1.2 Container extraction
  10.2 Special considerations - OCR and text extraction
  10.3 Monitoring discovery status
  10.4 Canceling discovery
    10.4.1 Canceling discovery
11 Publishing files
  11.1 Running file publish
    11.1.1 Publish process
  11.2 Monitoring publish status
  11.3 Canceling publishing
  11.4 Republishing files
  11.5 Retrying errors after publish
12 Processing errors
  12.1 All Processing Errors view
    12.1.1 Other error views
    12.1.2 Bulk import error handling
  12.2 Reading individual processing errors
    12.2.1 Non-existent document identifier errors
  12.3 Ignoring errors
  12.4 Un-ignoring errors
  12.5 Retrying errors
    12.5.1 Retrying password protection errors
  12.6 Unresolvable errors
  12.7 Agent errors
  12.8 Linking child errors to parent documents
13 Processing reports
  13.1 Generating a processing report
  13.2 Data Migration
    13.2.1 Excluded Files
    13.2.2 Summary Statistics: Data Migration
    13.2.3 Processing Sets
  13.3 Discovered Files by Custodian
    13.3.1 Discovered Files by Custodian
    13.3.2 File Types Discovered - Processable
    13.3.3 File Types Discovered - Processable (By Custodian)
    13.3.4 File Types Discovered - Unprocessable
    13.3.5 File Types Discovered - Unprocessable (by Custodian)
    13.3.6 Processing Sets
  13.4 Discovered Files by File Type
    13.4.1 Discovered Files by Custodian
    13.4.2 File Types Discovered - Processable
    13.4.3 File Types Discovered - Processable (By File Type)
    13.4.4 File Types Discovered - Unprocessable
    13.4.5 File Types Discovered - Unprocessable (By File Type)
    13.4.6 Processing Sets
  13.5 Document Exception
    13.5.1 Document Level Errors - Discovery
    13.5.2 Document Level Errors - Publishing
    13.5.3 Processing Sets
  13.6 File Size Summary
    13.6.1 Pre-Processed File Size
    13.6.2 Processed File Size
    13.6.3 Published File Size
  13.7 Inventory Details
    13.7.1 Inventory Filter Settings
    13.7.2 Excluded by File Type Filter | Excluded File Count
    13.7.3 Excluded by Location Filter | Excluded File Count
    13.7.4 Excluded by Sender Domain Filter | Excluded File Count
    13.7.5 Processing Sets
  13.8 Inventory Details by Custodian
    13.8.1 Inventory Filter Settings
    13.8.2 Custodian | Excluded by File Type Filter | Excluded File Count
    13.8.3 Custodian | Excluded by File Location Filter | Excluded File Count
    13.8.4 Custodian | Excluded by Sender Domain | Excluded File Count
    13.8.5 Processing Sets
  13.9 Inventory Exclusion Results
    13.9.1 Inventory Filter Settings
    13.9.2 File Type | Excluded File Count
    13.9.3 Location | Excluded File Count
    13.9.4 Sender Domain | Excluded File Count
    13.9.5 Processing Sets
  13.10 Inventory Exclusion Results by Custodian
    13.10.1 Custodian | Excluded by File Type Filter | Excluded File Count
    13.10.2 Custodian | Excluded by File Location Filter | Excluded File Count
    13.10.3 Custodian | Excluded by Sender Domain | Excluded File Count
    13.10.4 Processing Sets
  13.11 Inventory Summary
    13.11.1 Initial Inventory Results
    13.11.2 Filtering Summary
    13.11.3 Final Inventory Results
    13.11.4 Processing Sets
  13.12 Job Exception
    13.12.1 Job Level Errors
    13.12.2 Processing Sets
  13.13 Text Extraction
    13.13.1 Text Extraction by Custodian
    13.13.2 Text Extraction by File Type
    13.13.3 Breakdown by Error Message
    13.13.4 Processing Sets
14 Monitoring the Worker Manager Queue
15 Processing FAQs

1 Processing

Relativity’s processing feature allows you to ingest raw data directly into your workspace for eventual search, review, and production without the need for an external tool. You can use the various processing objects to create custom processing jobs that handle a wide array of information.

Some of the primary goals of processing are to:

- Discern, at an item-level, exactly what data is contained in the universe submitted.
- Record all item-level metadata as it existed prior to processing.
- Enable defensible reduction of data by selecting only appropriate items to move forward to review.

Note: Processing doesn't perform language identification.
For information on how to perform language identification using Analytics, see the Language identification section of the Analytics Guide.

To gain control over more complex processing jobs on a granular level, you can use the Relativity Processing Console desktop application. For more information, see the Relativity Processing Console Guide.

1.1 Application version considerations

All content in this section and its related pages corresponds to the latest version of the Processing application, which is updated on a monthly basis with the release of each Relativity 9.2 product update.

If the processing components in your environment don't match the descriptions in this content exactly, it may be because you're using an older version of the Processing application. To get the newest version of the Processing application, you'll need to upgrade to the new product update of Relativity 9.2. For a list of changes made to processing per product update, see the Relativity 9.2 Release Notes.

Using processing

You’re a litigation support specialist, and the lead attorney hands you a CD containing data on a key custodian. There are about 200,000 files on the disc, and he's only looking for files from an 18-month period.