Cluster Foundation Configuration and Administration Guide
Total Page:16
File Type:pdf, Size:1020Kb
Cluster Foundation PRIMECLUSTER Cluster Foundation (CF) Configuration and Administration Guide 4.3 (Oracle Solaris) Edition August 2015 Copyright and Trademarks Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners. EMC, PowerPath, SRDF, and Symmetrix are registered trademarks of EMC Corporation. PRIMECLUSTER is a registered trademark of Fujitsu Limited. All hardware and software names used are trademarks of their respective manufacturers. Export Controls Exportation/release of this document may require necessary procedures in accordance with the regulations of your resident country and/or US export control laws. Requests Delivery subject to availability; right of technical modifications reserved. All Rights Reserved, Copyright (C) FUJITSU LIMITED 2012-2015. Preface Cluster Foundation CF Registry and Integrity Monitor Cluster resource management GUI administration LEFTCLUSTER state CF topology table Shutdown Facility CF over IP Diagnostics and troubleshooting CF messages and codes Manual pages Glossary Abbreviations Figures Tables Index Contents 1 Preface . 1 1.1 Contents of this manual . 1 1.2 PRIMECLUSTER manuals . 2 1.3 Conventions . 3 1.3.1 Notation . 3 1.3.1.1 Prompts . 4 1.3.1.2 The keyboard . 4 1.3.1.3 Typefaces . 4 1.3.1.4 Example 1 . 4 1.3.1.5 Example 2 . 5 1.3.2 Command syntax . 5 1.4 Important notes and cautions . 5 1.5 Abbreviations . 6 1.6 Revision history . 6 2 Cluster Foundation . 7 2.1 CF, CIP, and CIM configuration . 7 2.1.1 Differences between CIP and CF over IP . 11 2.1.2 cfset . 13 2.1.3 CF security . 15 2.1.4 Example of creating a cluster . 16 2.1.5 Adding a new node to CF . 38 2.2 CIP configuration file . 39 2.3 Cluster Configuration Backup and Restore (CCBR) . 41 3 CF Registry and Integrity Monitor . 47 3.1 CF Registry . 47 3.2 Cluster Integrity Monitor . 48 3.2.1 Configuring CIM . 48 3.2.2 Query of the quorum state . 49 3.2.3 Reconfiguring quorum . 50 4 Cluster resource management . 53 4.1 Overview . 53 4.2 Kernel parameters for Resource Database . 54 4.3 Resource Database configuration . 57 4.4 Registering hardware information . 59 4.4.1 Setup exclusive device list . 59 4.4.2 Exclusive device list for EMC Symmetrix . 60 4.4.2.1 emcpower Devices and native Devices . 61 4.4.2.2 BCV, R2, GateKeeper, CKD . 61 J2S2-1588-01ENZ0(01) Contents 4.4.2.3 VCMDB . 62 4.4.2.4 Simplified setup for exclusive device list - clmakediskinfo, clmkdiskinfo 62 4.4.3 Automatic resource registration . 63 4.5 Start up synchronization . 65 4.5.1 Start up synchronization and the new node . 67 4.6 Adding a new node . 67 4.6.1 Backing up the Resource Database . 69 4.6.2 Reconfiguring the Resource Database . 70 4.6.3 Configuring the Resource Database on the new node . 71 4.6.4 Adjusting StartingWaitTime . 72 4.6.5 Restoring the Resource Database . 72 5 GUI administration . 75 5.1 Overview . 76 5.2 Starting Cluster Admin GUI and logging in . 76 5.3 Main CF table . 79 5.4 CF route tracking . 81 5.5 Node details . 84 5.6 Displaying the topology table . 85 5.7 Starting and stopping CF . 87 5.7.1 Starting CF . 89 5.7.2 Stopping CF . 92 5.8 Marking nodes DOWN . 93 5.9 Using PRIMECLUSTER log viewer . 94 5.9.1 Search based on time filter . 97 5.9.2 Search based on keyword . 98 5.9.3 Search based on severity levels . 99 5.10 Displaying statistics . 100 5.11 Heartbeat monitor . 105 5.12 Adding and removing a node from CIM . 106 5.13 Unconfigure CF . 109 5.14 CIM Override . 110 6 LEFTCLUSTER state . 111 6.1 Description of the LEFTCLUSTER state . 112 6.2 Recovering from LEFTCLUSTER . 114 6.2.1 Caused by a panic/hung node . 114 6.2.2 Caused by staying in the kernel debugger too long . 114 6.2.3 Caused by a cluster partition . 115 6.2.4 Caused by reboot . 117 7 CF topology table . 119 J2S2-1588-01ENZ0(01) Contents 7.1 Basic layout . 121 7.2 Selecting devices . 122 7.3 Examples . 123 8 Shutdown Facility . 127 8.1 Overview . 127 8.2 Available SAs and MAs . 129 8.2.1 RCI . 129 8.2.2 XSCF . 131 8.2.3 ALOM . 133 8.2.4 ILOM . 133 8.2.5 NPS . 134 8.3 SF split-brain handling . 135 8.3.1 Administrative LAN . 135 8.3.2 SF split-brain handling . 136 8.3.2.1 RMS ShutdownPriority attribute . 136 8.3.2.2 Shutdown Facility weight assignment . 137 8.3.2.3 Disabling split-brain handling . 137 8.3.3 Runtime processing . 137 8.3.4 Configuration notes . 138 8.4 Configuring the Shutdown Facility . 140 8.5 SF administration . 140 8.5.1 Starting and stopping SF . 141 8.5.1.1 Starting and stopping SF manually . 141 8.5.1.2 Starting and stopping SF automatically . 141 8.6 Logging . 141 9 CF over IP . 143 9.1 Overview . 143 9.2 Configuring CF over IP . 145 10 Diagnostics and troubleshooting . 147 10.1 Beginning the process . 147 10.2 Symptoms and solutions . 151 10.2.1 Join-related problems . 152 10.3 PCI Hot Plug . 161 10.4 Collecting troubleshooting information . 162 10.4.1 Executing the fjsnap command . 162 10.4.2 System dump . 163 10.4.3 SCF dump . 164 11 CF messages and codes . 165 11.1 cfconfig messages . 166 11.1.1 Usage message . 166 J2S2-1588-01ENZ0(01) Contents 11.1.2 Error messages . 167 11.2 cipconfig messages . 174 11.2.1 Usage message . 175 11.2.2 Error messages . 175 11.3 cftool messages . 177 11.3.1 Usage message . 177 11.3.2 Error messages . 178 11.4 rcqconfig messages . 181 11.4.1 Usage message . 181 11.4.2 Error messages . 182 11.5 rcqquery messages . 193 11.5.1 Usage message . ..