IBM Tivoli Monitoring: Linux OS Agent User's Guide Problem Classification
Total Page:16
File Type:pdf, Size:1020Kb
IBM Tivoli Monitoring Version 6.3 Linux OS Agent User's Guide SC22-5453-00 IBM Tivoli Monitoring Version 6.3 Linux OS Agent User's Guide SC22-5453-00 Note Before using this information and the product it supports, read the information in “Notices” on page 315. This edition applies to version 6, release 3 of IBM Tivoli Monitoring (product number 5724-C04) and to all subsequent releases and modifications until otherwise indicated in new editions. © Copyright IBM Corporation 1994, 2013. US Government Users Restricted Rights – Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. Contents Tables ...............vii Virtual Memory Statistics workspace ......23 Virtual Memory Usage Trends workspace ....24 Chapter 1. Using the monitoring agent . 1 New in this release ............1 Chapter 4. Attributes .........25 Using IBM Tivoli Monitoring .........3 Agent Availability Management Status attributes . 27 Components of the monitoring agent ......3 Agent Active Runtime Status attributes .....28 User interface options ...........4 Alerts Table attributes ...........29 All Users attributes ............30 Chapter 2. Requirements for the Configuration Information attributes ......31 monitoring agent ...........5 CPU attributes .............32 CPU attributes (superseded) .........33 Naming instances.............7 CPU Averages attributes ..........34 Running as a non-Administrator user ......7 CPU Averages attributes (superseded) .....36 Using Agent Management Services .......9 CPU Configuration attributes ........37 Filtering capabilities on the names of processes . 10 Disk attributes .............38 Disk attributes (superseded) .........39 Chapter 3. Workspaces........11 Disk IO attributes ............41 Organization of the predefined workspaces ....11 Disk IO attributes (superseded)........42 Agent Management Services workspace .....12 Disk Usage Trends attributes ........43 Agents' Management Log workspace ......12 Disk Usage Trends attributes (superseded) ....44 All Files workspace............13 File Comparison Group attributes .......46 Capacity Usage Information workspace .....13 File Information attributes .........46 CPU Averages workspace..........14 File Pattern Group attributes ........48 Disk I/O Extended Rate workspace ......14 I/O Ext attributes ............48 Disk I/O Rate workspace..........14 I/O Ext attributes (superseded)........50 Disk Usage workspace...........15 IP Address attributes ...........51 File Information workspace .........15 Linux Group attributes ..........52 Historical Summarized Availability workspace. 15 Linux Host Availability attributes .......52 Historical Summarized Availability Daily workspace 15 Linux TCP Statistics attributes ........53 Historical Summarized Availability Hourly LPAR attributes .............54 workspace ...............16 Machine Information attributes ........55 Historical Summarized Availability Weekly Network attributes ............56 workspace ...............16 Network attributes (superseded) .......60 Historical Summarized Capacity workspace . 16 NFS Statistics attributes ..........63 Historical Summarized Capacity Daily workspace 17 NFS Statistics attributes (superseded) ......68 Historical Summarized Capacity Hourly workspace 17 OS Configuration attributes .........72 Historical Summarized Capacity Weekly workspace 17 Process attributes ............73 Historical Summarized Performance workspace . 18 Process attributes (superseded) ........78 Historical Summarized Performance Daily Process User Info attributes .........82 workspace ...............18 Process User Info attributes (superseded) ....84 Historical Summarized Performance Hourly RPC Statistics attributes ..........86 workspace ...............18 RPC Statistics attributes (superseded)......87 Historical Summarized Performance Weekly Sockets Detail attributes ..........88 workspace ...............19 Sockets Detail attributes (superseded) .....90 Linux workspace ............19 Sockets Status attributes ..........91 Network workspace ...........19 Sockets Status attributes (superseded) .....91 NFS Statistics workspace ..........20 Swap Rate attributes ...........92 Process workspace ............20 Swap Rate attributes (superseded) .......93 Process CPU Usage workspace ........20 System Statistics attributes .........94 Process User Information workspace ......21 System Statistics attributes (superseded) .....96 RPC Statistics workspace ..........21 User Login attributes ...........98 Sockets Information workspace ........21 User Login attributes (superseded) ......98 Specific File Information workspace ......22 VM Stats attributes ............99 System Configuration workspace .......22 VM Stats attributes (superseded) .......102 System Information workspace ........22 Disk capacity planning for historical data ....103 Users Workspace ............23 © Copyright IBM Corp. 1994, 2013 iii Chapter 5. Situations ........107 AMS Start Agent Instance action .......121 Predefined situations ...........107 AMS Stop Agent action ..........122 Linux_AMS_Alert_Critical situation ......109 AMS Start Management action .......122 Linux_BP_AvgCpuBusyPct1h_Critic situation. 109 AMS Stop Management action .......123 Linux_BP_CpuBusyPct_Critical situation ....109 Sample_kill_Process action .........123 Linux_BP_CpuWaitIOPct_Warning situation . 110 Linux_BP_LoadAvg5min_Critical situation....110 Chapter 7. Policies .........125 Linux_BP_NetTotalErrPct_Warning situation . 110 Predefined policies ...........125 Linux_BP_NumberZombies_Warning situation . 110 Linux_BP_ProcHighCpu_Critical situation ....110 Chapter 8. Server dashboards ....127 Linux_BP_ProcMissing_Critical situation ....111 Server dashboards background information . 127 Linux_BP_SpaceUsedPct_Critical situation ....111 Checking the health of your monitored Linux_BP_SpaceUsedPctCustom_Cri situation . 111 environment..............130 Linux_BP_SwapSpaceUsedPct_Criti situation . 111 Displaying situation event results ......131 Linux_Fragmented_File_System situation ....111 Managed System Groups Overview dashboard . 132 Linux_Fragmented_File_System_2 situation . 112 Managed system group dashboard ......133 Linux_High_CPU_Overload situation .....112 Situation Events dashboard.........135 Linux_High_CPU_Overload_2 situation ....112 Situation event results dashboard.......136 Linux_High_CPU_System situation ......112 Linux managed system dashboard ......138 Linux_High_CPU_System_2 situation .....112 Page layout and controls .........140 Linux_High_Packet_Collisions situation ....112 Table controls .............141 Linux_High_Packet_Collisions_2 situation ....113 Copying the URL ............142 Linux_High_RPC_Retransmit situation .....113 Launching to the Tivoli Enterprise Portal ....142 Linux_High_RPC_Retransmit_2 situation ....113 Setting a trace .............143 Linux_High_Zombies situation .......113 Linux_High_Zombies_2 situation .......113 Chapter 9. Tivoli Common Reporting Linux_Low_Pct_Inodes situation .......113 Linux_Low_Pct_Inodes_2 situation ......114 for the monitoring agent ......145 Linux_Low_percent_space situation ......114 Utilization Details for Single Resource report . 150 Linux_Low_percent_space_2 situation .....114 Utilization Details for Multiple Resources report 153 Linux_Low_Space_Available situation .....114 Utilization Comparison for Single Resource report 156 Linux_Low_Space_Available_2 situation ....114 Utilization Comparison for Multiple Resources Linux_Network_Status situation .......114 report ................158 Linux_Network_Status_2 situation ......115 Utilization Heat Chart for Single Resource report 162 Linux_NFS_Buffer_High situation ......115 Memory Utilization for Single Resource report . 165 Linux_NFS_Buffer_High_2 situation ......115 Memory Utilization for Multiple Resources Linux_NFS_Getattr_High situation ......115 Comparison report ...........167 Linux_NFS_Getattr_High_2 situation .....115 Top Resources Utilization report .......169 Linux_NFS_rdlink_high situation .......115 Top Situations by Status report .......173 Linux_NFS_rdlink_high_2 situation ......116 Enterprise Resources List report .......174 Linux_NFS_Read_High situation .......116 Enterprise Daily Utilization Heat Chart report . 174 Linux_NFS_Read_High_2 situation ......116 Enterprise Summary report.........176 Linux_NFS_Writes_High situation ......116 Top Resources by Availability ........178 Linux_NFS_Writes_High_2 situation ......116 Top Resources Utilization Summary Heat Chart Linux_Packets_Error situation ........116 report ................180 Linux_Packets_Error_2 situation .......117 Top Resources by Availability (MTTR/MTBSI) . 182 Linux_Process_High_Cpu situation ......117 Resource Availability Comparison ......184 Linux_Process_High_Cpu_2 situation .....117 Availability Heat Chart for Single Resource . 186 Linux_Process_High_Instant_CPU situation . 117 CPU Utilization Comparison for Multiple Linux_Process_stopped situation .......117 Resources ..............188 Linux_Process_stopped_2 situation ......117 CPU Utilization for Single Resource ......191 Linux_RPC_Bad_Calls situation .......118 Disk Utilization for Single Resource ......192 Linux_RPC_Bad_Calls_2 situation ......118 Disk Utilization Comparison for Multiple Linux_System_Thrashing situation ......118 Resources ..............195 Linux_System_Thrashing_2 situation .....118 Situations History report .........198 Creating custom queries and reports .....200 Chapter 6. Take Action commands 119 Predefined Take Action commands ......119 Chapter 10. Troubleshooting .....207 AMS Recycle