Using the Service Processor and Service Director Features
The service processor and service director features protect users against unnecessary system downtime by keeping support personnel (both internal and external) aware of any unexpected changes in the system environment. In combination, the two features provide a flexible solution to automated system maintenance.
Service Processor
The service processor runs on its own power boundary and continually monitors hardware attributes, the AIX operating system, and the environmental conditions within the system. Any system failure which prevents the system from returning to an operational state (a fully functional AIX operating system) is reported by the service processor. The service processor is controlled by firmware and does not require the AIX operating system to be operational to perform its tasks. If any system failures are detected, the service processor has the ability to take predetermined corrective actions. The methods of corrective actions are:
vSurveillance
vCall Home
vAIX operating system monitoring
Surveillance is a function in which the service processor monitors the system through heartbeat communication with the system firmware. The heartbeat is a periodic signal that the firmware can monitor. During system startup, the firmware surveillance monitor is automatically enabled to check for heartbeats from the firmware. If a heartbeat is not detected within a default period, the service processor cycles the system power and attempts to restart until the system either restarts successfully, or a predetermined retry threshold is reached. In the event the service processor is unsuccessful in bringing the system online (or in the event that the user asked to be alerted to any service
The call home function can be initialized to call either a service center telephone number, a customer administration center, or a digital pager telephone number. The service processor can be configured to stop at the first successful call to any of the numbers listed, or can be configured to call every number provided. If connected to the service center, the service processor transmits the relevant system information (the system's serial number and model type) and service request number (SRN). If connected to a digital pager service, the service processor inputs a customer voice telephone number defined by the customer. An established sequence of digits or the telephone number to a phone near the failed system could be used to signal a system administrator to a potential system failure.
During normal operations, the service processor can also be configured to monitor the AIX operating system. If AIX does not respond to the service processor heartbeat, the service processor assumes the operating system is hung. The service processor can automatically initiate a restart and, if enabled, initiate the call home function to alert the appropriate parties to the system hang. Enabling operating system surveillance also affords AIX the means to detect any service processor failures and report those failures to the service director application.
24pSeries 630 Model 6C4 and Model 6E4 User's Guide