Installing and Using Utilities 5-13

Detection of Server Fault

NEC ESMPRO Manager and NEC ESMPRO Agent detect errors causing faults to occur at an early stage and notify Administrators of fault information real-time.

Early detection of error

If a fault occurs, NEC ESMPRO Agent detects the fault and reports the occurrence of the fault to NEC ESMPRO Manager (alert report). NEC ESMPRO Manager displays the received alert in the alert viewer and also changes the status colors of the server and server component in which the fault occurs. This allows you to identify the fault at a glance. Further, checking the content of the fault and the countermeasures, you can take appropriate action for the fault as soon as possible.

Types of reported faults

The table below lists the typical faults reported by NEC ESMPRO Agent.

Component

Reported information

CPU

CPU load is over the threshold

 

CPU degrading, etc.

Memory

ECC 1-bit error detection, etc.

Power supply

Voltage lowering

 

Power failure, etc.

Temperature

Temperature increase in chassis, etc.

Fan

Fan failure (decrease in the number of revolutions), etc.

 

 

Storage

File system usage rate, etc.

LAN

Line fault threshold over

 

Send retry or send abort threshold over, etc.

Prevention of Server Fault

NEC ESMPRO Agent includes the preventive maintenance function forecasting the occurrence of a fault as countermeasures for preventing faults from occurring.

NEC ESMPRO Manager and NEC ESMPRO Agent can set the threshold for each source in the server. If the value of a source exceeds the threshold, NEC ESMPRO Agent reports the alert to NEC ESMPRO Manager.

The preventive maintenance function can be set for a variety of monitoring items including chassis temperature, and CPU usage rate.