Compaq Server Manager/R
Total Page:16
File Type:pdf, Size:1020Kb
TECH NOTE . [November 1995 . Prepared By . Compaq Server Manager/R: . Systems Division . Compaq Computer . Important Facts and Information Corporation . CONTENTS . EXECUTIVE SUMMARY . Introduction ....................3 . The Compaq 32-bit Server Manager/R (SM/R) board is a server management tool. SM/R Hardware ...............3 . It monitors, sends alerts, and provides information about the hardware, EISA bus, System Hardware . and operating system of the server in which it is installed. This paper provides a Requirements....................... 3 . brief overview of: Monitored Items ................... 3 . Hardware Issues .................. 5 . • Hardware and software of the SM/R board . Hardware Questions and . • SM/R board system requirements Answers............................... 7 . • Recommended SM/R board alert settings SM/R Software ................8 . Server Manager/R . It also answers general questions about the features and capabilities of the SM/R Support Software.................. 8 . board and presents solutions to known issues regarding the board. Server Manager Facility/R..... 8 . Server Manager Collector/R .. 8 . Supported Operating . Systems............................... 9 . Monitored Items ................... 9 . Software Issues ..................14 . Software Questions and . Answers..............................15 . SM/R Alert Tables . 1. Netware Environment......10 . 2. OS/2 Environment ..........11 . 3. Banyon Environment.......12 . 4. SCO UNIX Environment ..13 . Compaq field personnel may obtain more detailed information or submit comments regarding this communication via . internal bmail to this address: tech_com@hw tech@sys hou. 195A/1195 1 TECH NOTE (cont.) . NOTICE . The information in this publication is subject to change without notice. C OMPAQ COMPUTER CORPORATION SHALL NOT BE LIABLE FOR TECHNICAL . OR EDITORIAL ERRORS OR OMISSIONS CONTAINED HEREIN, NOR FOR . INCIDENTAL OR CONSEQUENTIAL DAMAGES RESULTING FROM THE . FURNISHING, PERFORMANCE, OR USE OF THIS MATERIAL. This publication does not constitute an endorsement of the product or products that were tested. The configuration or configurations tested or described may or may not be the only available . solution. This test is not a determination of product quality or correctness, nor does it ensure . compliance with any federal, state or local requirements. Compaq does not warrant products other . than its own strictly as stated in Compaq product warranties. Product names mentioned herein may be trademarks and/or registered trademarks of their . respective companies. Compaq, Deskpro, Compaq Insight Manager, Systempro, Systempro/LT, ProLiant, QVision, and . QuickFind, registered United States Patent and Trademark Office. ProSignia and Systempro/XL are trademarks and/or service marks of Compaq Computer . Corporation. Other product names mentioned herein may be trademarks and/or registered trademarks of their . respective companies. ©1995 Compaq Computer Corporation. Printed in the U.S.A. Microsoft, Windows, Windows NT, Windows NT Advanced Server, SQL Server for Windows NT . are trademarks and/or registered trademarks of Microsoft Corporation. Compaq Server Manager/R: Important Facts and Information . First Edition (November 1995) . Document Number 195A/1195 . 195A/1195 2 TECH NOTE (cont.) . INTRODUCTION . The purpose of this paper is to provide a brief overview of: . • . Hardware and software of the Compaq 32-bit Server Manager/R (SM/R) board . • SM/R board system requirements . • Recommended SM/R board alert settings . In addition, this paper answers general questions about the features and capabilities of the SM/R . board and presents solutions to known issues regarding the board. This paper is not intended to . repeat information provided elsewhere or to cover Power On-Self Test (POST) errors, diagnostic . error codes, or alert messages in detail. Those topics are addressed in the SM/R user manuals and . in on-line documentation for the Server Manager Support Software. This paper is divided into two major sections that discuss SM/R hardware and SM/R software . respectively. If additional information on a topic is available on Compaq QuickFind, a Compaq . document number for that reference is cited in this paper. To locate a document in Compaq . QuickFind, search for the document number. SM/R HARDWARE . The SM/R board is a server management tool that monitors, sends alerts, and provides information . about the hardware and EISA bus of the server in which it is installed. It monitors server hardware . by evaluating the signals and data available on the EISA bus. The board works continuously, even . when system power is off, and sends alerts when a problem occurs. The SM/R board also includes . software components that enable it to monitor the operating system as well as the server hardware. The SM/R board monitors the server hardware even if the SM/R Support Software component is . not installed. System Hardware Requirements . The SM/R board is designed to work in Compaq systems and is supported in the following . Compaq hardware platforms: . • . Compaq ProLiant 1000, Compaq ProLiant 1000R, Compaq ProLiant 1500, Compaq ProLiant . 1500R, Compaq ProLiant 2000, Compaq ProLiant 2000R, Compaq ProLiant 4000, Compaq . ProLiant 4000R, Compaq ProLiant 4500, Compaq ProLiant 4500R . • Compaq ProSignia VS, Compaq ProSignia 300, Compaq ProSignia 500 . • Compaq DeskPro 386/33L, Compaq DeskPro 486/25, Compaq DeskPro 486/33L, Compaq . DeskPro 486/50L . • Compaq DeskPro/M series . • Compaq Systempro, Compaq Systempro/LT, Compaq Systempro 486/33e, Compaq . Systempro/XL . Monitored Items . The SM/R board monitors many aspects of the hardware including power, memory, serial port, . parallel port, EISA bus, temperature, POST failures, the drive array, and the SM/R board itself. When a monitored item falls outside the specified range or set value, an alert is sent to notify the . system administrator that a problem has occurred or a condition exists that may require attention. Alerts can be sent to a pager, to a management PC, or to a phone number using the voice alert . feature. 195A/1195 3 TECH NOTE (cont.) . Power Monitoring . The SM/R board monitors power to the system and sends out an alert when a power outage occurs, . when the power cord is disconnected, or when the power is switched off. In addition it monitors . the +5, −5, +12, and −12 voltage lines and sends an alert if any of the monitored voltage line . readings are outside the specified range. When a power failure occurs, all four voltage lines are at . zero volts, which causes multiple alerts to be sent. When using the Server Manager Facility to . watch the actual voltage, it is normal to see the voltage fluctuate slightly. The normal operating . voltage of one system may be slightly different from that of another system. Refer to the section entitled “Incorrect +5V alerts on ProLiant servers using 5.2V Pentium chips” . located on page 6 of the this paper. Memory Parity and Refresh Monitoring . The SM/R board monitors for memory parity errors and tracks memory refresh cycles. Memory . parity errors occur when the system detects that information has not been transferred correctly . during a memory read or write operation. Memory must be refreshed at regular intervals and . within the specified amount of time; otherwise, data may be lost. If this occurs, a “Lack of . Refresh” alert is reported. Refresh errors may occur while the system is first starting up; that is, . after a reset but before the POST begins. Serial Port Monitoring . The SM/R board can monitor serial ports on a monitored server for parity errors, overrun errors, . framing errors, and carrier detection. At installation, however, serial port monitoring is not . enabled because monitoring tracks only the occurrence of serial port errors, not the occurrence of . serial port failures. A serial port failure is defined as a change in the rate at which errors occur or a . sudden increase in the number of errors being detected. Serial port failures are not considered catastrophic because the server may continue operating after . a serial port failure. SM/R serial port alerts are not enabled at installation and should not be used . for serial port monitoring. The SM/R board will not send serial port alerts unless the system . administrator sets threshold values and enables alerting. Parallel Port Monitoring . The parallel port can be monitored to detect whether a printer is off-line, the printer cables are . disconnected, the printer is out of paper, or there is a defective parallel port. At installation . alerting is not active, but it can be enabled. EISA Bus Monitoring . The EISA bus is monitored for I/O Check counts that occur when an EISA or ISA expansion board . has failed. When a failure occurs, the failing board sends an I/O Check Count error to the system . over the EISA bus. Temperature Monitoring . The SM/R board monitors the temperature inside the system and sends an alert if the temperature . goes outside the specified range. The default range is is 0°C to 65°C (32°F to 149°F); however, . Compaq recommends resetting to the operating temperature range of the system so an alert will be . sent before a temperature change becomes critical. Typically the operating range of a system is . 10°C to 40°C (50°F to 104°F), but certain systems have a maximum operating temperature of . 35°C (95°F). Unless the system is in a non-controlled environment, it would be more meaningful . to set the range to 10°C to 35°C (50°F to 95°F). Besides the temperature of the system, the SM/R . board can also monitor the change in temperature per minute. 195A/1195 4 TECH NOTE (cont.) . Note that the location of the SM/R board and other boards in the system can affect air flow and . temperature around the temperature sensor on the SM/R board. This may result in false alerts and . inaccurate temperature readings. POST Failures . POST alerts are sent if the system has a failure during the Power On-Self Test. Drive Array Monitoring . The SM/R Support Software tracks many parameters of the operating system including the drive . array subsystem. The most serious drive array alert that can occur is a “drive failure” or “logical .