Relativity Processing User Guide V10.3
Total Page:16
File Type:pdf, Size:1020Kb
Processing User Guide September 23, 2021 | Version 10.3.287.3 For the most recent version of this document, visit our documentation website. Table of Contents 1 Processing 10 1.1 Application version considerations 10 1.2 Basic processing workflow 11 1.3 Logging for processing 12 2 Installing and configuring Processing 13 2.1 Upgrade considerations for Relativity Goatsbeard 13 2.2 Installation process 13 2.3 Throttle settings for distributed publish 14 2.4 License considerations 15 2.5 Importing the Processing application 16 2.6 Worker manager server 16 2.6.1 Designating a worker for processing 16 2.7 Entering the ProcessingWebAPiPath instance setting 17 2.8 Adding the worker manager server to a resource pool 19 2.9 Configuring processing agents 20 2.10 Creating a choice for the processing source location 21 2.11 Logging for processing 23 2.12 Security permissions 23 3 Processing to Data Grid 25 3.1 Enabling processing to Data Grid 25 4 Supported file types for processing 28 4.1 Supported file types 28 4.1.1 Excel file considerations 31 4.1.2 Multi-part forensic file considerations 32 4.1.3 Tracking inline/embedded images 32 4.1.4 Native text extraction and OCR 32 4.1.5 Support for password-protected RAR archives 33 4.1.6 MSG to MHT conversion considerations 34 4.1.7 Microsoft Office child extraction support 34 Processing User Guide 2 4.2 Notable unsupported file types 36 4.3 Supported container file types 39 4.3.1 Lotus Notes considerations 41 4.3.2 Multi-part container considerations 43 4.3.3 ICS/VCF file considerations 43 4.4 Container file types supported for the password bank 48 4.4.1 Non-container file types supported for Password Bank in Inventory 49 5 Password bank 50 5.1 Password bank in processing workflow 50 5.2 Password Bank in imaging workflow 52 5.3 Creating or deleting a Password Bank entry 53 5.3.1 Fields 53 5.3.2 Example password 56 5.4 Validations, errors, and exceptions 56 5.5 Viewing audits 57 6 Mapping processing fields 58 6.1 Mapping fields 58 6.1.1 Processing system field considerations 59 6.1.2 Field mapping validations 59 6.2 System-mapped processing fields 60 6.3 Optional processing fields 66 6.4 Email Store Name details 88 6.5 Virtual path details 90 6.6 Processing folder path details 91 6.7 Email folder path details 91 6.8 Source path details 91 6.9 Message ID considerations 92 6.10 Comments considerations 92 6.11 Deduped custodian and path considerations 93 6.12 Pivot-enabled fields for processing 94 7 Processing profiles 95 Processing User Guide 3 7.1 Creating or editing a processing profile 95 7.1.1 Fields 96 8 Deduplication considerations 111 8.1 Global deduplication 111 8.2 Custodial deduplication 112 8.3 No deduplication 113 8.4 Global deduplication with attachments 114 8.5 Global deduplication with document-level errors 115 8.6 Technical notes for deduplication 116 8.6.1 Calculating MD5/SHA1/SHA256 hashes 116 8.6.2 Calculating deduplication hashes for emails 117 8.6.3 Calculating the Relativity deduplication hash 118 9 Quick-create set(s) 120 9.1 Required security permissions 120 9.2 Using quick-create set(s) 122 9.2.1 Validations and errors 130 10 Processing sets 131 10.1 Processing sets default view 132 10.2 Creating a processing set 134 10.3 Processing Set Fields 134 10.4 Adding a data source 136 10.5 Data Source Fields 137 10.5.1 Order considerations 142 10.5.2 Edit considerations for data sources 143 10.5.3 Processing Data Source view 143 10.5.4 Document Errors view 145 10.5.5 Job Errors View 146 10.6 Processing Data Sources tab 147 10.7 Deleting a processing set 151 10.8 Avoiding data loss across sets 155 10.9 Copying natives during processing 155 Processing User Guide 4 11 Inventory 159 11.1 Running inventory 160 11.1.1 Inventory process 162 11.1.2 Monitoring inventory status 163 11.1.3 Canceling inventory 164 11.2 Filtering files 165 11.2.1 Applying a Date range filter 170 11.2.2 Applying a File Size filter 172 11.2.3 Applying a deNIST filter 173 11.2.4 Applying a Location filter 174 11.2.5 Applying a File Type filter 174 11.2.6 Applying a Sender Domain filter 175 11.3 Removing filters 179 11.4 Inventory progress 179 11.5 Discovering files from Inventory 181 11.6 Inventory errors 181 11.6.1 Inventory error scenarios 182 11.7 Re-inventory 182 12 Discovering files 185 12.1 Running file discovery 186 12.1.1 Discovery process 188 12.1.2 Container extraction 190 12.2 Special considerations - OCR and text extraction 190 12.3 Monitoring discovery status 191 12.4 Canceling discovery 193 12.4.1 Canceling discovery 193 13 Discovered files view 195 13.0.1 Discovered files tab 195 14 Publishing files 198 14.1 Running file publish 199 14.1.1 Publish process 203 Processing User Guide 5 14.2 Monitoring publish status 205 14.3 Canceling publishing 206 14.4 Republishing files 208 14.5 Retrying errors after publish 209 15 Processing error workflow 211 15.1 Required permissions 211 15.2 Processing errors tabs 212 15.2.1 Document Errors tab 213 15.2.2 Pivotable error fields 215 15.2.3 Document Errors layout 216 15.2.4 Job Errors tab 219 15.2.5 Job Error layout 220 15.2.6 Error categories list 223 15.3 Error message list 224 15.4 Identifying and resolving errors by category 225 15.5 Using the processing error console 227 15.6 Viewing error-related audits 233 15.7 Special considerations 234 15.7.1 Bulk import error handling 234 15.7.2 Non-existent document identifier errors 235 15.7.3 Publish Batch ID field 235 15.7.4 Linking child errors to parent documents 236 15.7.5 Agent errors 237 15.7.6 Mass ignoring errors 237 15.7.7 Mass un-ignoring errors 238 15.7.8 Mass retrying errors 239 15.7.9 Mass editing notes 240 16 Reports 242 16.1 Generating a processing report 242 16.2 Data Migration 243 16.2.1 Excluded Files 244 Processing User Guide 6 16.2.2 Summary Statistics: Data Migration 244 16.2.3 Processing Sets 244 16.3 Discovered Files by Custodian 244 16.3.1 Discovered Files by Custodian 245 16.3.2 File Types Discovered - Processable 245 16.3.3 File Types Discovered - Processable(By Custodian) 245 16.3.4 File Types Discovered - Unprocessable 245 16.3.5 File Types Discovered - Unprocessable (by Custodian) 245 16.3.6 Processing Sets 245 16.4 Discovered Files by File Type 246 16.4.1 Discovered Files by Custodian 246 16.4.2 File Types Discovered - Processable 246 16.4.3 File Types Discovered - Processable (By File Type) 246 16.4.4 File Types Discovered - Unprocessable 246 16.4.5 File Types Discovered - Unprocessable (By File Type) 246 16.4.6 Processing Sets 247 16.5 Document Exception 247 16.5.1 Document Level Errors - Discovery 247 16.5.2 Document Level Errors - Publishing 247 16.5.3 Processing Sets 247 16.6 File Size Summary 248 16.6.1 Pre-Processed File Size 248 16.6.2 Processed File Size 248 16.6.3 Published File Size 248 16.7 Inventory Details 248 16.7.1 Inventory Filter Settings 248 16.7.2 Excluded by File Type Filter | Excluded File Count 249 16.7.3 Excluded by Location Filter | Excluded File Count 249 16.7.4 Excluded by Sender Domain Filter | Excluded File Count 249 16.7.5 Processing Sets 249 16.8 Inventory Details by Custodian 249 Processing User Guide 7 16.8.1 Inventory Filter Settings 249 16.8.2 Custodian | Excluded by File Type Filter | Excluded File Count 250 16.8.3 Custodian | Excluded by File Location Filter | Excluded File Count 250 16.8.4 Custodian | Excluded by Sender Domain | Excluded File Count 250 16.8.5 Processing Sets 250 16.9 Inventory Exclusion Results 250 16.9.1 Inventory Filter Settings 250 16.9.2 File Type | Excluded File Count 250 16.9.3 Location | Excluded File Count 251 16.9.4 Sender Domain | Excluded File Count 251 16.9.5 Processing Sets 251 16.10 Inventory Exclusion Results by Custodian 251 16.10.1 Custodian | Excluded by File Type Filter | Excluded File Count 251 16.10.2 Custodian | Excluded by File Location Filter | Excluded File Count 251 16.10.3 Custodian | Excluded by Sender Domain | Excluded File Count 251 16.10.4 Processing Sets 251 16.11 Inventory Summary 252 16.11.1 Initial Inventory Results 252 16.11.2 Filtering Summary 252 16.11.3 Final Inventory Results 252 16.11.4 Processing Sets 253 16.12 Job Exception 253 16.12.1 Job Level Errors 253 16.12.2 Processing Sets 253 16.13 Text Extraction 253 16.13.1 Text Extraction by Custodian 254 16.13.2 Text Extraction by File Type 254 16.13.3 Breakdown by Error Message 254 16.13.4 Processing Sets 254 17 Processing Administration 255 17.1 Security considerations for processing administration 255 Processing User Guide 8 17.2 Monitoring active jobs 257 17.2.1 Active jobs mass operations 260 17.3 Checking worker and thread status 261 17.3.1 Worker mass operations 265 17.3.2 Timeouts for stuck workers 266 17.3.3 Auto refresh options 267 17.3.4 Thread data visibility 267 17.3.5 Errors 268 17.4 Using the Processing History tab 268 17.4.1 Auto refresh options for processing history 270 18 Managing processing jobs in the queue 272 19 Processing FAQs 275 Processing User Guide 9 1 Processing Use Relativity’s processing feature to ingest raw data directly into your workspace for eventual search, review, and production without the need for an external tool.