IBM 755 manual Error Handling and Reporting, Service Processor, Call Home

Page 11

components run the risk of becoming loose during shipping or installation, preventing a good electrical connection. Positive retention mechanisms like latches, levers, thumb-screws, pop Nylatches (U-clips), and cables are included to help prevent loose connections and aid in installing (seating) parts correctly. These positive retention items do not require tools.

Error Handling and Reporting

In the unlikely event of system hardware or environmentally induced failure, the system runtime error capture capability systematically analyzes the hardware error signature to determine the cause of failure. The analysis result will be stored in system NVRAM. When the system can be successfully restarted either manually or automatically, the error will be reported to the operating system. Error Log Analysis (ELA) can be used to display the failure cause and the physical location of the failing hardware.

With the integrated Service Processor, the system has the ability to automatically send out an alert via phone line to a pager or call for service in the event of a critical system failure. A hardware fault will also turn on the amber system fault LED located on the system unit to alert the user of an internal hardware problem. The indicator may also be set to blink by the operator as a tool to allow system identification.

For identification, the blue locate LED on the enclosure and at the system level will turn on solid. The amber system fault LED will be on solid when an error condition occurs.

On POWER7 processor-based servers, hardware and software failures are recorded in the system log. When an HMC is attached, an ELA routine analyzes the error, forwards the event to the Service Focal Point (SFP) application running on the HMC, and notifies the system administrator that it has isolated a likely cause of the system problem. The Service Processor event log also records unrecoverable checkstop conditions, forwards them to the SFP application, and notifies the system administrator. Once the information is logged in the SFP application, if the system is properly configured, a call home service request will be initiated and the pertinent failure data with service parts information and part locations will be sent to an IBM Service organization. Customer contact information and specific system-related data such as the machine type, model, and serial number, along with error log data related to the failure are sent to IBM Service.

Service Processor

The Service Processor provides the capability to diagnose, check the status of, and sense the operational conditions of a system. It runs on its own power boundary and does not require resources from a system processor to be operational to perform its tasks.

The Service Processor supports surveillance of the connection to the HMC and to the system firmware (Hypervisor). It also provides several remote power control options, environmental monitoring, reset, restart, remote maintenance, and diagnostic functions, including console mirroring. The Service Processors menus (ASMI) can be accessed concurrently with system operation allowing nondisruptive abilities to change system default parameters.

Call Home

Call Home refers to an automatic or manual call from a customer location to IBM support structure with error log data, server status, or other service-related information. Call Home invokes the service organization in order for the appropriate service action to begin. Call Home can be done through HMC or non-HMC managed systems. While configuring Call Home is optional, clients are encouraged to implement this feature in order to obtain service enhancements such as reduced problem determination and faster and potentially more accurate transmittal of error information. In general, using the Call Home feature can result in increased

system availability. The Electronic Service AgentTM application can be configured for automated call home. Refer to the next section for specific details on this application.

IBM United States Hardware Announcement 110-008

IBM is a registered trademark of International Business Machines Corporation

11

Image 11
Contents Table of contents Key prerequisites For more information, visit Planned availability dateDescription EXP 12S SAS Drawer #5886 Minimum defined configuration, if no choice is made, isReliability, fault tolerance, and data correction Reliability, availability, and serviceability RAS featuresMutual surveillance Memory error correction extensionsFault monitoring functions Redundancy for array self-healingPartition availability priority POWER7 processor functionsPOWER7 single processor checkstopping Availability enhancement functions Serviceability First Failure Data Capture and Error Data Analysis Service InterfaceLocation diagrams Stand-alone diagnosticsService labels Call Home Error Handling and ReportingService Processor Benefits IBM Electronic ServicesStatement of general direction Product number E8C 5774 3m 14-Ft 3PH/24A Power Cord 8236 Inch racks Publications Safety InformationSpecified operating environment ServicesPhysical specifications Technical informationHardware requirements Noise level and sound powerSoftware requirements 110-008 LimitationsSAS AIX Linux DVD-RAM Sata Cable orders IBM Electronic ServicesPlanning information IBM Global Financing Warranty periodWarranty service Terms and conditionsWarranty service upgrades Maintenance Services Graduated program license charges apply Usage plan machine IBM hourly service rate classificationField-installable features Model conversions Machine installationEducational allowance PricesAdptr AA E8C 3687 Both PCI Full Width Keyboard -- USB, Danish, #159 IOP E8C US English Existing features New featuresMeter SAS Cable Virtual USB, Spanish USB, Russian Optional 8XDIMMS Kapalua Terms of use Order nowTrademarks