IBM Z10 BC manual Reliability, Availability, and Serviceability RAS, RAS Design Focus

Models: Z10 BC

1 67
Download 67 pages 26.63 Kb
Page 45
Image 45

Reliability, Availability, and Serviceability (RAS)

In today’s on demand environment, downtime is not only unwelcome—it’s costly. If your applications aren’t consis- tently available, your business suffers. The damage can extend well beyond the fi nancial realm into key areas of customer loyalty, market competitiveness and regulatory compliance. High on the list of critical business require- ments today is the need to keep applications up and run- ning in the event of planned or unplanned disruptions to your systems.

While some servers are thought of offering weeks or even months of up time, System z thinks of this in terms of achieving years. The z10 BC continues our commitment to deliver improvements in hardware Reliability, Availability and Serviceability (RAS) with every new System z server. They include microcode driver enhancements, dynamic segment sparing for memory and fi xed HSA, as well as a new I/O drawer design. The z10 BC is a server that can help keep applications up and running in the event of planned or unplanned disruptions to the system.

The System z10 BC is designed to deliver industry lead- ing reliability, availability and security our customers have come to expect from System z servers. System z10 BC RAS is designed to reduce all sources of outages by reducing unscheduled, scheduled and planned outages. Planned outages are further designed to be reduced

with the introduction of concurrent I/O drawer add and eliminating pre-planning requirements. These features are designed to reduce the need for a Power-on-Reset (POR) and help eliminate the need to deactivate/activate/IPL a logical partition.

RAS Design Focus

High Availability (HA) – The attribute of a system designed to provide service during defi ned periods, at acceptable or agreed upon levels and masks UNPLANNED OUTAGES from end users. It employs fault tolerance, auto- mated failure detection, recovery, bypass reconfi guration, testing, problem and change management.

Continuous Operations (CO) – The attribute of a system designed to continuously operate and mask PLANNED OUTAGES from end users. It employs non-disruptive hard- ware and software changes, non-disruptive confi guration and software coexistence.

Continuous Availability (CA) – The attribute of a system designed to deliver non-disruptive service to the end user 7 days a week, 24 HOURS A DAY (there are no planned or unplanned outages). It includes the ability to recover from a site disaster by switching computing to a second site.

Page 45
Image 45
IBM Z10 BC manual Reliability, Availability, and Serviceability RAS, RAS Design Focus