80 RS/6000 43P 7043 Models 150 and 260 Handbook
3.2.3.3 Fault Monitoring Functions
Built-in Self-Test (BIST) and Power-on Self-Test (POST) checks processor,
L2 cache, memory and associated hardware, that are required for proper
booting of the operating system every time the system is powered on. If a
non-critical error is detected, or if the error(s) occur in the resources which
can be removed from the system configuration, the booting process will
proceed to completion. The error(s) are logged in the system non-volatile
RAM.
Disk drive fault tracking that can alert the system administrator of potential
disk failure before it impacts customer operation.
The AIX log facility where hardware and software failures are recorded and
analyzed (by Error Log Analysis routine) to provide warning to the system
administrator on the causes of system problems. This also enables IBM
service representatives to bring along needed replacement hardware
components when a service call is placed, thus minimizing system repair
time.
3.2.3.4 Mutual Surveillance
The service processor can monitor the operation of the firmware during the
boot process, and it can monitor the operating system for loss of control. It
also allows the operating system to monitor for service processor activity. The
service processor can take appropriate action, including calling for service,
when it detects that the firmware or the operating system has lost control.
Likewise, the operating system can request a service processor repair action
if necessary.
3.2.3.5 Environmental Monitoring Functions
The following is a list of the environmental monitoring functions.
Temperature monitoring that increases the fan speed rotation when
ambient temperature is above the normal operating range
Temperature monitoring to warn the system administrator of potential
environmental related problems (for example, air conditioning and air
circulation around the system) so that appropriate corrective actions can
be taken before a critical failure threshold is reached, and to provide
orderly system shutdown when operating temperature exceeds the critical
level
Fan speed monitoring to provide warning and an orderly system shutdown
when the speed is out of operational specification
DC voltages monitoring to provide warning and an orderly system
shutdown when the voltage(s) are out of operational specification