HP Smart Array (RAID) Host Bus Adapters Replacing hard drives, Automatic data recovery rebuild

Page 29

Replacing hard drives

The most common reason for replacing a hard drive is that it has failed. However, another reason is to gradually increase the storage capacity of the entire system.

If you insert a hot-pluggable drive into a drive bay while the system power is on, all disk activity in the array pauses for a second or two while the new drive is spinning up. When the drive has achieved its normal spin rate, data recovery to the replacement drive begins automatically (as indicated by the blinking Online/Activity LED on the replacement drive) if the array is in a fault-tolerant configuration.

If you replace a drive belonging to a fault-tolerant configuration while the system power is off, a POST message appears when the system is next powered up. This message prompts you to press the F1 key to start automatic data recovery. If you do not enable automatic data recovery, the logical volume remains in a ready-to-recover condition and the same POST message appears whenever the system is restarted.

Factors to consider before replacing hard drives

Before replacing a degraded drive:

Open Systems Insight Manager, and inspect the Error Counter window for each physical drive in the same array to confirm that no other drives have any errors. (For details, refer to the Systems Insight Manager documentation on the Management CD.)

Be sure that the array has a current, valid backup.

Confirm that the replacement drive is of the same type (SAS or SATA) as the degraded drive.

Use replacement drives that have a capacity at least as great as that of the smallest drive in the array. The controller immediately fails drives that have insufficient capacity.

In systems that use external data storage, be sure that the server is the first unit to be powered down and the last to be powered back up. Taking this precaution ensures that the system does not erroneously mark the drives as failed when the server is powered up.

To minimize the likelihood of fatal system errors, take these precautions when removing failed drives:

Do not remove a degraded drive if any other drive in the array is offline (the Online/Activity LED is off). In this situation, no other drive in the array can be removed without data loss.

The following cases are exceptions:

When RAID 1+0 is used, drives are mirrored in pairs. Several drives can be in a failed condition simultaneously (and they can all be replaced simultaneously) without data loss, as long as no two failed drives belong to the same mirrored pair.

When RAID 6 (ADG) is used, two drives can fail simultaneously (and be replaced simultaneously) without data loss.

If the offline drive is a spare, the degraded drive can be replaced.

Do not remove a second drive from an array until the first failed or missing drive has been replaced and the rebuild process is complete. (The rebuild is complete when the Online/Activity LED on the front of the drive stops blinking.)

The following cases are exceptions:

In RAID 6 (ADG) configurations, any two drives in the array can be replaced simultaneously.

In RAID 1+0 configurations, any drives that are not mirrored to other removed or failed drives can be simultaneously replaced offline without data loss.

Automatic data recovery (rebuild)

When you replace a hard drive in an array, the controller uses the fault-tolerance information on the remaining drives in the array to reconstruct the missing data (the data that was originally on the replaced

Replacing, moving, or adding hard drives 29

Image 29
Contents October 2006 Second Edition Part Number Audience assumptions Contents Electrostatic discharge Controller specifications Hardware featuresMain components on the board This sectionPage Quick installation procedure Windows or Linux Overview of the installation procedureOverview of the installation procedure Installing the controller board Installing the controller hardwareBefore beginning the installation Preparing the server Connecting external storage Connecting storage devicesConnecting internal storage To order additional cables, use the option kit part number SAS cable part numbersMethods for updating the firmware Windows or Linux Updating the firmwareComparing the utilities Configuring an arrayUtilities available for configuring an array Select Create Logical Drive Using OrcaUsing ACU Systems using Linux Installing device drivers and Management AgentsSystems using Microsoft Windows A system that already has Linux installed Upgrading or replacing controller options Replacing or adding a batteryUpgrading or replacing controller options Upgrading or replacing controller options Upgrading or replacing controller options Replacing the cache module or controller Upgrading or replacing controller options Upgrading or replacing controller options Upgrading or replacing controller options Identifying the status of a hard drive Replacing, moving, or adding hard drivesTerminate the current operation and cause data loss Recognizing hard drive failureEffects of a hard drive failure Compromised fault toleranceRecovering from compromised fault tolerance Factors to consider before replacing hard drives Automatic data recovery rebuildReplacing hard drives Time required for a rebuild Case 1 An uncorrectable read error has occurredCase 3 Another drive in the array has failed Case 2 The replacement drive has failedUpgrading hard drive capacity Adding drives Moving drives and arraysReplacing, moving, or adding hard drives Controller board runtime LEDs Diagnosing array problemsBattery pack LEDs Post messages Diagnostic toolsGrounding methods to prevent electrostatic discharge Electrostatic dischargePreventing electrostatic discharge Federal Communications Commission notice Regulatory compliance noticesModifications CablesBsmi notice Japanese class a notice European Union regulatory noticeTaiwan battery recycling notice Korean class a notice Battery replacement noticeAcronyms and abbreviations Smart Array Index Post error messages
Related manuals
Manual 36 pages 19.99 Kb