IBM Z10 EC manual Reliability, Availability, and Serviceability RAS, RAS Design Focus


Reliability, Availability, and Serviceability (RAS)

In today’s on demand environment, downtime is not only unwelcome, it’s costly. If your applications aren’t consistently available, your business suffers. The damage can extend well beyond the financial realm into key areas of customer loyalty, market competitiveness and regulatory compliance. High on the list of critical business requirements today is the need to keep applications up and running in the event of planned or unplanned disruptions to your systems.

While some servers are thought of as offering weeks or even months of uptime, System z thinks of this in terms of achieving years. The z10 EC continues our commitment to deliver improvements in hardware Reliability, Availability and Serviceability (RAS) with every new System z server. These improvements include microcode driver enhancements, dynamic segment sparing for memory and fixed HSA. The z10 EC is a server that can help keep applications up and running in the event of planned or unplanned disruptions to the system.

The System z10 EC is designed to deliver the industry-leading reliability, availability and security our customers have come to expect from System z servers. System z10 EC RAS is designed to reduce all sources of outages: unscheduled, scheduled and planned. Planned outages are designed to be reduced further through the introduction of concurrent I/O drawer add and the elimination of pre-planning requirements. These features are designed to reduce the need for a Power-on-Reset (POR) and help eliminate the need to deactivate/activate/IPL a logical partition.

RAS Design Focus

High Availability (HA) – The attribute of a system designed to provide service during defined periods, at acceptable or agreed-upon levels, and to mask UNPLANNED OUTAGES from end users. It employs fault tolerance, automated failure detection, recovery, bypass reconfiguration, testing, and problem and change management.

Continuous Operations (CO) – The attribute of a system designed to continuously operate and mask PLANNED OUTAGES from end users. It employs non-disruptive hardware and software changes, non-disruptive configuration and software coexistence.

Continuous Availability (CA) – The attribute of a system designed to deliver non-disruptive service to the end user 7 days a week, 24 HOURS A DAY (there are no planned or unplanned outages). It includes the ability to recover from a site disaster by switching computing to a second site.
