Mapreduce Service
Total Page:16
File Type:pdf, Size:1020Kb
MapReduce Service Troubleshooting Issue 01 Date 2021-03-03 HUAWEI TECHNOLOGIES CO., LTD. Copyright © Huawei Technologies Co., Ltd. 2021. All rights reserved. No part of this document may be reproduced or transmitted in any form or by any means without prior written consent of Huawei Technologies Co., Ltd. Trademarks and Permissions and other Huawei trademarks are trademarks of Huawei Technologies Co., Ltd. All other trademarks and trade names mentioned in this document are the property of their respective holders. Notice The purchased products, services and features are stipulated by the contract made between Huawei and the customer. All or part of the products, services and features described in this document may not be within the purchase scope or the usage scope. Unless otherwise specified in the contract, all statements, information, and recommendations in this document are provided "AS IS" without warranties, guarantees or representations of any kind, either express or implied. The information in this document is subject to change without notice. Every effort has been made in the preparation of this document to ensure accuracy of the contents, but all statements, information, and recommendations in this document do not constitute a warranty of any kind, express or implied. Issue 01 (2021-03-03) Copyright © Huawei Technologies Co., Ltd. i MapReduce Service Troubleshooting Contents Contents 1 Account Passwords.................................................................................................................. 1 1.1 Resetting or Changing the Password of Manager User admin............................................................................... 1 2 Account Permissions............................................................................................................... 2 2.1 When a User Uses the AK/SK to Call the MRS Cluster Host List Interface, the Message "User do not have right to access cluster" Is Displayed..............................................................................................................................2 3 Accessing the Web Pages...................................................................................................... 4 3.1 Failed to Access MRS Manager.......................................................................................................................................... 4 3.2 Accessing MRS Manager on Mac...................................................................................................................................... 5 3.3 Failed to Log In to MRS Manager After the Python Upgrade................................................................................. 8 3.4 Error 502 Is Reported During the Access to MRS Manager..................................................................................... 8 3.5 Failed to Log In to MRS Manager After Changing the Domain Name..............................................................10 4 APIs........................................................................................................................................... 12 4.1 Failed to Call an API to Create a Cluster...................................................................................................................... 12 5 Cluster Management............................................................................................................13 5.1 Failed to Scale In Task Nodes........................................................................................................................................... 13 5.2 OBS Certificate in a Cluster Expired............................................................................................................................... 14 5.3 Adding a New Disk to an MRS Cluster..........................................................................................................................16 5.4 Disk Fault................................................................................................................................................................................. 20 5.5 MRS Backup Failure............................................................................................................................................................. 22 5.6 Inconsistency Between df and du Command Output on the Core Node..........................................................23 5.7 Disassociating a Subnet from the ACL Network........................................................................................................24 5.8 MRS Becomes Abnormal After hostname Modification..........................................................................................25 5.9 DataNode Restarts Unexpectedly................................................................................................................................... 25 5.10 Failed to Configure Cross-Cluster Mutual Trust.......................................................................................................27 5.11 Network Is Unreachable When Using pip3 to Install the Python Package in an MRS Cluster............... 29 5.12 Connecting the Open-Source confluent-kafka-go to the Security Cluster of MRS.....................................30 5.13 Failed to Periodically Back Up an MRS 1.7.2 Cluster............................................................................................. 31 5.14 Failed to Download the MRS Cluster Client..............................................................................................................32 5.15 An Error Is Reported When a Flink Job Is Submitted in a Cluster with Kerberos Authentication Enabled............................................................................................................................................................................................ 33 5.16 Scale-Out Failure................................................................................................................................................................ 35 5.17 Error Occurs When MRS Executes the Insert Command Using Beeline.......................................................... 36 Issue 01 (2021-03-03) Copyright © Huawei Technologies Co., Ltd. ii MapReduce Service Troubleshooting Contents 5.18 How Do I Upgrade EulerOS Vulnerabilities in an MRS Cluster?........................................................................ 37 5.19 Using CDM to Migrate Data to HDFS......................................................................................................................... 39 5.20 Alarms Are Frequently Generated in the MRS Cluster.......................................................................................... 40 5.21 Memory Usage of the PMS Process Is High..............................................................................................................42 5.22 High Memory Usage of the Knox Process................................................................................................................. 43 5.23 It Takes a Long Time to Access HBase from a Client Installed on a Node Outside the Security Cluster ............................................................................................................................................................................................................ 44 5.24 How Do I Locate a Job Submission Failure?............................................................................................................. 45 5.25 Insufficient Space of Partition /var/log on the System Disk Because the .out Log File Is Too Large....50 6 Using Alluixo.......................................................................................................................... 51 6.1 Error Message "Does not contain a valid host:port authority" Is Reported When Alluixo Is in HA Mode ............................................................................................................................................................................................................ 51 7 Using ClickHouse...................................................................................................................52 7.1 ClickHouse Fails to Start Due to Incorrect Data in ZooKeeper.............................................................................52 8 Using DBService..................................................................................................................... 55 8.1 DBServer Instance Is in Abnormal Status..................................................................................................................... 55 8.2 DBServer Instance Remains in the Restoring State.................................................................................................. 56 8.3 Default Port 20050 or 20051 Is Occupied.................................................................................................................... 57 8.4 DBServer Instance Is Always in the Restoring State Because the Incorrect /tmp Directory Permission ............................................................................................................................................................................................................ 58 8.5 DBService Backup Failure................................................................................................................................................... 59 8.6 Components Failed to Connect to DBService in Normal State.............................................................................60