Memory error correction extensions
The memory has
Memory protection features include scrubbing to detect errors, a means to call for the deallocation of memory pages for a pattern of correctable errors detected, and signaling deallocation of a logical memory block when an error occurs that cannot be corrected by the ECC code.
Redundancy for array self-healing
Although the most likely failure event in a processor is a soft
Caches and directories on the POWER7 chip are manufactured with spare bits in their arrays that can be accessed via programmable steering logic to replace faulty bits in the respective arrays. This is analogous to the redundant bit steering employed in main storage as a mechanism that is designed to help avoid physical repair, and is also implemented in POWER7 systems. The steering logic is activated during processor initialization and is initiated by the
When correctable error cache exceeds a set threshold, systems using the POWER7 processor invoke a dynamic cache line delete function, which enables them to stop using bad cache and eliminates exposure to greater problems.
Fault monitoring functions
•When a
•Disk drive fault tracking is designed to alert the system administrator of an impending disk drive failure before it impacts customer operation.
Mutual surveillance
The Service Processor monitors the operation of the firmware during the boot
process, and also monitors the HypervisorTM for termination. The Hypervisor monitors the Service Processor and will perform a reset/reload if it detects the loss of the Service Processor. If the reset/reload does not correct the problem with the Service Processor, the Hypervisor will notify the operating system and the operating system can take appropriate action, including calling for service.
Environmental monitoring functions
•Temperature monitoring warns the system administrator of potential
•Fan speed is controlled by monitoring actual temperatures on critical components and adjusting accordingly. If internal component temperatures reach critical levels, the system will shut down immediately, regardless of fan speed. When a
IBM United States Hardware Announcement
IBM is a registered trademark of International Business Machines Corporation | 6 |