Trillium Software System™ Batch User’s Guide
Investigator Converter Global Data Router Customer Data Parser Business Data Parser Window Key Generator Matcher Data Reconstructor Create Common Module
Version 7.16
May 2017

This manual, as well as the software described in it, is furnished under license and may be used only in accordance with the terms of such license. The content of this manual is furnished for informational purposes only, is subject to change without notice, and should not be construed as a commitment by Trillium Software. Trillium Software assumes no responsibility or liability for any errors or inaccuracies that may appear in this manual.
The customer shall not disclose, copy, reproduce, distribute, or display any portion of the Trillium Software System or this manual in any form to any third person without the prior written consent of Trillium Software, nor allow third parties to do the same. The customer shall keep the Trillium Software System and all confidential information in the strictest confidence.
Trillium Software System Batch User’s Guide 52617
Trillium Software, Inc. owns all rights in and to the marks "TRILLIUM SOFTWARE" and "TRILLIUM SOFTWARE SYSTEM," which marks are registered in various countries throughout the world (including, without limitation, the United States Patent and Trademark Office).
All other trademarks are the property of their respective owners.
© 2008-2017 Trillium Software, Inc.

Table of Contents
CHAPTER 1 Investigator Data Process Flow ...... 1-2 Investigator Order of Operations...... 1-3 Investigator Parameters...... 1-4 Sample Investigator Parameter File ...... 1-6 Investigator Parameter Descriptions ...... 1-7 Sample Investigator Output Statistics ...... 1-29 Error Messages ...... 1-30 Running the Investigator...... 1-33
CHAPTER 2 Input/Output Resources ...... 2-3 Converter Parameters ...... 2-4 Using Record Select and Bypass Functionality with the Converter ...... 2-8 Rules File Subset Parameters and Syntax Rules...... 2-9 Comparison Operators ...... 2-10 Rule File Entries Examples ...... 2-11 Converter Parameter Details ...... 2-13 Running the Converter on UNIX and 32-bit PCs ...... 2-50 IBM Mainframe Converter Sample JCL ...... 2-51 Converter Error Messages ...... 2-52
CHAPTER 3 Global Data Router Design Flow ...... 3-2 Input/Output Resources ...... 3-2 Parameter Syntax ...... 3-3 Global Data Router Parameters ...... 3-4 Global Data Router Rules File...... 3-8 Rules File Parameters...... 3-9
Running the Global Data Router on UNIX and 32-bit PCs ...... 3-24 IBM Mainframe Execution...... 3-25 Sample Rules File...... 3-27 Sample Log File Output...... 3-29 Testing the Router ...... 3-29 Global Data Router Error Messages ...... 3-30
CHAPTER 4 Customer Data Parsing Logic Flow ...... 4-2 Parser Functional Capabilities...... 4-4 Identifying Business versus Personal Names ...... 4-5 Comma-Reversed Names...... 4-6 Customer Data Parser Process Flow ...... 4-7 Customer Data Parser Input ...... 4-8 Customer Data Parser Output ...... 4-10 About Data Dictionary Language Files ...... 4-11 Special DDL Fields...... 4-12 Customer Data Parser Parameters ...... 4-14 Sample Parser Parameter File ...... 4-38 Sample Parser Parameter File ...... 4-40 pfprsdrv.par Parameters ...... 4-41 *CHANGE_DDNAME (‘change’ functionality) ...... 4-46 **JOIN_LINES (‘join’ functionality)...... 4-47 Name and Record Generation...... 4-49 Running the Customer Data Parser on UNIX and 32-bit PC Platforms ...... 4-52 IBM Mainframe Parser Sample JCL...... 4-53 Customer Data Parser Error Messages ...... 4-55 About Window Keys ...... 4-58 Multiple Customer Data Parsers ...... 4-58 How Multiple Parsers Work ...... 4-58 Line Pattern Identification Codes ...... 4-59 Pattern Leveling...... 4-60 Name Pattern Depth Levels ...... 4-61
Street Pattern Depth Levels ...... 4-63 Customer Data Parser Output ...... 4-64 Customer Data Parser Repository (PREPOS) Record ...... 4-64 Parser Repository (PREPOS) Layout ...... 4-65 Complete PREPOS Layout ...... 4-66 Customer Data Parser Log File ...... 4-92 Sample Sorted Log File ...... 4-92 Bad Name, Street Patterns and City Problem Section...... 4-97 Parser Scrub Report ...... 4-99 Customer Data Parser Display (Scrub) Report File ...... 4-101 Section 1: Record Information ...... 4-101 Section 2: Street Line Information...... 4-102 Section 3: Additional Street Line Information ...... 4-102 Section 4: Geography Line Information ..4-102 Section 5: Name Line Information ...... 4-102 Detail File...... 4-104 Using the Palog Analyzer...... 4-105 Corrective Action Examples ...... 4-106 Name Pattern Addition...... 4-106 Street Pattern Addition ...... 4-106 U.S. City Problem Addition...... 4-106 Post Town Problem Addition ...... 4-107 Word/Phrase Addition ...... 4-107 Using the Parser Display Program ...... 4-107 Rerunning Table Maintenance After Tuning .....4-107 Review Codes and Review Groups ...... 4-108 Review Group Hierarchy...... 4-113 Statistics Report ...... 4-116 Output Display Program (CFPRSDSP) ...... 4-117 Parser Display Program Parameters ...... 4-118 Customer Data Parser Display Report Description ...... 4-119 Line Pattern Identification on the Display Report...... 4-123
CFPRSDSP Program Error Messages...... 4-124 Running the Parser Display Program on UNIX and 32-bit PC Platforms ...... 4-125 IBM Mainframe Sample Parser JCL...... 4-126
CHAPTER 5 Business Data Parsing Logic Flow ...... 5-2 Business Data Parser Functions...... 5-3 Primary Functions ...... 5-3 Business Data Parser Process Flow...... 5-4 Input and Output Resources ...... 5-4 DDL Requirements ...... 5-5 ORG_RECORD Field...... 5-6 Other Special DDL Fields ...... 5-6 DDL Specifics...... 5-7 Driver Parameter File ...... 5-7 Business Data Parser Parameter File ...... 5-10 Parameter File Descriptions...... 5-10 Running the Business Data Parser on UNIX and 32-bit PC Platforms ...... 5-16 IBM Mainframe Sample JCL...... 5-17 Business Data Parser Output...... 5-18 Business Data Parser Repository Record (BPREPOS) ...... 5-18 BPREPOS Fields...... 5-19 BDP Repository Output Record Format .....5-19 Business Data Parser Log File...... 5-21 Sample Business Data Parser Log File (Sorted)...... 5-21 Bad Patterns...... 5-22 Using the Palog Analyzer...... 5-22 Corrective Action ...... 5-23 Miscellaneous Pattern Addition ...... 5-23 Review Codes and Review Groups...... 5-25 Business Data Parser Review Codes ...... 5-25 Error Messages ...... 5-27
CHAPTER 6 Window Key Generator Process Flow...... 6-2 Input and Output Resources ...... 6-2 Window Key Generator Parameters...... 6-3 Sample Parameter File 1...... 6-6 Sample Parameter File 2...... 6-6 Rules File ...... 6-8 Window Key Generator Codes ...... 6-9 Sample Window Key Rules File ...... 6-11 Running the Window Key Generator on UNIX and 32-bit PC Platforms ...... 6-11 Sample IBM Mainframe JCL...... 6-13 Error Messages ...... 6-14
CHAPTER 7 Matcher Driver Programs ...... 7-2 Window Matching and Reference Matching...... 7-3 Window Matching ...... 7-4 Window Matching Input and Output ...... 7-5 Window Keys ...... 7-5 Reference Matching...... 7-6 About Reference Matching ...... 7-7 Reference Matching Input and Output...... 7-9 Candidate Matching Information (CMI) Parameter File ...... 7-10 Sample CMI Parameter File...... 7-11 Sample Transaction File...... 7-11 Matcher Driver Program ...... 7-12 Matcher Parameters ...... 7-13 Handling Large Window Keys ...... 7-18 Creating Unique Window Keys...... 7-18 Using a Window Key Table ...... 7-19 Using Record Select and Bypass Functionality with the Matcher ...... 7-21 Rules File Parameters and Syntax Rules...... 7-21 Comparison Operators ...... 7-23 Rule File Entries Examples ...... 7-23
Sample Matcher Parameter File ...... 7-25 About Matching Levels ...... 7-26 Field Comparison Routine Lists...... 7-26 Defining Field/Comparison Routine List Entries...... 7-27 Matcher Input 2 ...... 7-28 Grade Pattern Lists ...... 7-28 Grade Pattern List Syntax...... 7-29 Business Grade Pattern List Example ...... 7-30 Matching Prevention ...... 7-31 Using the PREVENT Match Routine ...... 7-31 Matching Propagation...... 7-32 Transitivity ...... 7-32 Propagation...... 7-32 Determining the Minimal Occurrence Influence...... 7-33 Match-Testing Early Exit...... 7-34 Matcher Output 1 ...... 7-35 Matcher Return Fields ...... 7-35 Matcher Output 2 ...... 7-36 Matcher Summary Statistics Report...... 7-36 Details for the Summary Statistics Report...... 7-42 Statistics from Matcher for Matcher Windows...... 7-42 Retail and Commercial...... 7-42 Commonizer Function ...... 7-45 Survivorship...... 7-47 Selecting the Surviving Name ...... 7-47 Survivorship Example...... 7-48 Selecting the Commercial Survivor...... 7-49 Identifying a Survivor Record Using CIS- _RANK_KEY ...... 7-49 Standard Common Data...... 7-51 User Common Data...... 7-52 User Common Data Parameter File Entries.....7-53 User Common Data Routines...... 7-56 Error Messages ...... 7-58
Running the Matcher on UNIX and 32-bit PC Platforms ...... 7-66 IBM Mainframe Matcher Sample JCL ...... 7-67 Matcher Display Programs...... 7-68 Display Program DDL Requirements ...... 7-68 CFMATDSP Display Program...... 7-69 Display Program Parameters ...... 7-69 Sample Parameter File for CFMATDSP...... 7-71 Running the Matcher Display Program on UNIX and 32-Bit PC Platforms ...... 7-71 IBM Mainframe Matcher Display Sample JCL .....7-72 Display Program Errors ...... 7-73 Running CFFXMDSP on UNIX and 32-Bit PC Platforms ...... 7-76 IBM Mainframe Sample JCL for cffxmdsp ...... 7-77 Matcher Driver 2 ...... 7-78 Using CFMATCH with CKM ...... 7-78 CFMATCH Driver Parameters...... 7-79 Sample CFMATCH Matcher Driver Parameter File ...... 7-82 IBM Mainframe SAMPLE JCL for CFMATCH...... 7-83 Matcher Driver #3...... 7-84 Using CFWINMAT with CKM ...... 7-84 CFWINMAT Parameters ...... 7-84 Running CFWINMAT on UNIX and 32-bit PC Platforms ...... 7-88 IBM Mainframe Sample JCL for CFWINMAT ...... 7-89 Tuning the Match Results ...... 7-91 Getting Started ...... 7-92 Analyzing the Data...... 7-93 Using “Tie-Breaking” Fields ...... 7-93 Using Parmvals with the Matcher Comparison Routines...... 7-94 Comparison Routine and Parmval Details ...... 7-95 ABSOLUTE Routine ...... 7-95 APTNO Routine ...... 7-96 APTNO When Parmval (01) ...... 7-96
ARRAY1 Routine ...... 7-98 ARRAY2 Routine ...... 7-99 BUSNAME Routine ...... 7-100 BUSNAME When Parmval (COMPACT) .....7-101 BUSNAME When Parmval (SORT)...... 7-102 BUSNAME When Parmval (DI) ...... 7-103 DATE Routine ...... 7-104 DIFFER Routine ...... 7-105 DIFFER Routine Parmvals ...... 7-105 Scoring Values...... 7-106 Example ...... 7-106 FLAG10 Routine ...... 7-106 FLAGFM Routine ...... 7-107 FLAGGN Routine...... 7-108 FLAGMF Routine ...... 7-109 Scoring Values...... 7-110 FLAGYN Routine ...... 7-110 FRSTNAME Routine ...... 7-111 FRSTNAME When Parmval = SYMETRIC ..7-113 FRSTNAME When Parmval = DI ...... 7-113 FRSTNAME When Parmval = TSB...... 7-113 FRSTNAME When Parmval = INITIAL ...... 7-114 GENER Routine...... 7-115 GENER When Parmval (95)...... 7-116 HOUSENO Routine...... 7-116 HOUSENO When Parmval (NORANGE) ....7-118 HOUSENO When Parmval (PARITY)...... 7-118 HOUSENO When Parmval (01)...... 7-119 MXDNAME Routine ...... 7-120 Scoring Values...... 7-121 NYSIIS Routine ...... 7-122 ONECOM Routine ...... 7-124 PARTIAL1 Routine ...... 7-125 PARTIAL1 When Parmval (10) ...... 7-125 PARTIAL1 When Parmval (FM) ...... 7-126 PARTIAL1 When Parmval (GN)...... 7-127 PARTIAL1 When Parmval (MF) ...... 7-128 PARTIAL1 When Parmval (MU)...... 7-129 PARTIAL1 When Parmval (YN) ...... 7-130
Scoring Values...... 7-131 PARTIAL1 When Parmval (ARRAY1,n) .....7-131 PARTIAL1 When Parmval (ARRAY2,n) .....7-132 PARTIAL2 Routine...... 7-133 PARTIAL2 When Parmval (DATE) ...... 7-134 PARTIAL2 When Parmval (SOUNDEX1) ...7-135 PARTIAL2 When Parmval (RSOUNDEX1) .7-137 Scoring Values...... 7-139 PARTIAL2 When Parmval (SOUNDEX2) ...7-140 PARTIAL2 When Parmval (RSOUNDEX2) .7-142 PARTIAL2 When Parmval (STATUS) ...... 7-144 PARTIAL2 When Parmval (NYSIIS) ...... 7-145 “Improved” NYSIIS Algorithm...... 7-145 PARTIAL2 When Parmval (RNYSIIS) ...... 7-148 Scoring Values...... 7-148 POSTCODE Routine ...... 7-148 Scoring Values ...... 7-149 POSTCODE When Parmval = TSB (used in UK only) ...... 7-149 PREFIX Routine ...... 7-150 PREVENT Routine ...... 7-151 RNYSIIS Routine ...... 7-152 SOCSEC Routine ...... 7-152 SOUNDEX1 Routine...... 7-153 SOUNDEX Algorithm ...... 7-153 SOUNDEX2 Routine...... 7-155 SPELLING Routine ...... 7-157 SPELLING When Parmval = DI...... 7-159 SPELLING When Parmval = SQUISH ...... 7-159 STATUS Routine ...... 7-159 STATUS When Parmval (STATUS)...... 7-159 Scoring Values...... 7-160 STREETS Routine ...... 7-160 STREETS When Parmval (DI)...... 7-162 STREETS When Parmval (TYPE)...... 7-162 SUBSTRNG Routine ...... 7-163 SUBSTRNG When Parmval (AND) ...... 7-164 TWORET Routine ...... 7-165 TWORET When Parmval (LO) ...... 7-165
CHAPTER 8 Input/Output Resources...... 8-2 Data Reconstructor Parameters ...... 8-3 Parameter File Syntax ...... 8-5 Rules File ...... 8-6 Rules File Requirements ...... 8-6 Rule Script Language...... 8-7 Precedence and Associativity ...... 8-8 Comments ...... 8-9 Fields ...... 8-9 Input or Output Dictionary ...... 8-10 Selecting a Portion of a Field, field[n:n] & field(n:n)...... 8-11 Literal Values ...... 8-11 Binary Data Strings...... 8-12 Concatenating Literal Values ...... 8-12 BLANKS, ZEROS and NULLS...... 8-13 ‘IF’ Statements ...... 8-14 Conditions ...... 8-15 Logical Operators, AND and OR...... 8-18 Nested ‘IF’ statements ...... 8-19 Action Statements ...... 8-19 String Variables in the Data Reconstructor...... 8-25 Running the Data Reconstructor on UNIX and 32-Bit PC Platforms...... 8-26 IBM Mainframe Sample Data Reconstructor JCL 8-27 Error Messages ...... 8-28 Parameter Echo File Error Messages ...... 8-30 Rules File Error Messages...... 8-31
CHAPTER 9 User Common Data (Commonization) ...... 9-2 Selecting the Surviving Record ...... 9-6 Create Common Parameters ...... 9-8 Decision Routines...... 9-10 IBM Mainframe Sample JCL for cfcrcdrv ...... 9-16
Running Create Common ...... 9-17 On UNIX and 32-bit PC Platforms...... 9-17 Error Messages ...... 9-18
CHAPTER 1 Investigator
The Investigator module is used to investigate and analyze your data before (and sometimes after) you convert it. Unlike the Converter, the Investigator does not perform any action on your data; however, because it works in conjunction with the Converter, it uses many of the same analysis routines.
The Investigator provides parameters that let you identify the data to search for, and it flags the data that it finds. The Investigator produces two output files and a statistics file.
Investigator Data Process Flow
The Investigator uses a driver named CFINVDRV.
The Investigator uses the following input and output files:
File                    Description
Driver parameter file   pfindrv.par
Input DDL file          input.ddl
Input file              User data file
Output file             Output data file, which is typically used as input to the Converter.
In the OUTREC_FNAME output file, each field contains a 1 if the parameter line conditions were met and the specified data was found. If the conditions were not met and the specified data was not found, the field contains a 0. See page 1-23 for more information about the OUTREC_FNAME output file.
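The 1/0 flagging described above can be pictured with a short Python sketch. This is an illustration only, not the Investigator's implementation: the function name flag_fields, the dictionary record, and the two checks are hypothetical stand-ins for parameter lines, and the actual OUTREC_FNAME layout is defined by your DDL.

```python
# Hypothetical illustration: one "1"/"0" flag per parameter-line condition.
# The record layout and the checks below are assumptions, not product behavior.

def flag_fields(record, checks):
    """Return one flag per check: "1" if the condition is met, otherwise "0"."""
    return ["1" if check(record) else "0" for check in checks]


record = {"name": "ACME CORP", "zip": ""}
checks = [
    lambda r: "CORP" in r["name"],  # stands in for one parameter line
    lambda r: r["zip"] != "",       # stands in for another parameter line
]

print(flag_fields(record, checks))  # ['1', '0']
```

Each position in the resulting list corresponds to one condition, in the order the conditions were listed.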
Investigator Order of Operations
The Investigator driver operations occur in the following order:
1. Read input record/buffer
2. Process SINGLE_FIELD_LOOKUP parameter
3. Process MULTI_FIELD_LOOKUP parameter
4. Process FIELD_SCAN parameter
5. Process NUMERIC_RANGE_COMPARE parameter
6. Process FIELD_COMPARE parameter
7. Process ARITHMETIC_COMPARE parameter
8. Process FREQUENCY parameter
9. Write output to record file
10. Write output to field file
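The fixed ordering above can be sketched in Python as a simple dispatch loop. This is a minimal sketch under stated assumptions, not the CFINVDRV driver itself: the names STEP_ORDER, process_record, and run are hypothetical, the handlers are placeholder callables, and skipping a parameter type that has no handler is an assumption made for the example.

```python
# Hypothetical sketch of the per-record processing order (steps 2 through 8).
# Only the parameter names come from the manual; everything else is assumed.

STEP_ORDER = [
    "SINGLE_FIELD_LOOKUP",
    "MULTI_FIELD_LOOKUP",
    "FIELD_SCAN",
    "NUMERIC_RANGE_COMPARE",
    "FIELD_COMPARE",
    "ARITHMETIC_COMPARE",
    "FREQUENCY",
]


def process_record(record, handlers):
    """Apply the supplied handlers to one record in the fixed order."""
    applied = []
    for name in STEP_ORDER:
        handler = handlers.get(name)
        if handler is not None:  # assumption: types without a handler are skipped
            handler(record)
            applied.append(name)
    return applied


def run(records, handlers):
    """Read each record (step 1), process it, and collect a trace of what ran."""
    trace = []
    for record in records:
        trace.append(process_record(record, handlers))  # stands in for steps 9-10
    return trace


# The handlers are supplied out of order on purpose; STEP_ORDER decides the sequence.
handlers = {
    "FIELD_SCAN": lambda rec: rec.setdefault("seen", []).append("FIELD_SCAN"),
    "SINGLE_FIELD_LOOKUP": lambda rec: rec.setdefault("seen", []).append("SINGLE_FIELD_LOOKUP"),
}
print(run([{"id": 1}], handlers))  # [['SINGLE_FIELD_LOOKUP', 'FIELD_SCAN']]
```

The point of the sketch is that the sequence is determined by the driver's order of operations, not by the order in which the handlers were supplied here.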