Dell E08J Series owner manual Troubleshooting a Damaged Array, Controller Failure Conditions

Page 49

Troubleshooting A Damaged Array

CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple repairs as authorized in your product documentation, or as directed by the online or telephone service and support team. Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions that came with the product.

1.Ensure that the following components are properly installed:

Physical disks

RAID controller modules

Power supply modules

Cooling fan module

2.Ensure that all the cables are properly connected and that there are no damaged pins in the connectors.

3.Run the diagnostics available in Dell PowerVault Modular Disk (MD) Storage Manager.

4.In the AMW, select a component in the Hardware pane of the Hardware tab.

5.Select Hardware RAID Controller Module Advanced Run Diagnostics RAID Controller Module.

Controller Failure Conditions

Certain events can cause a RAID controller module to fail and/or shut down. Unrecoverable ECC memory or PCI errors, or critical physical conditions can cause lockdown. If your RAID storage array is configured for redundant access and cache mirroring, the surviving controller can normally recover without data loss or shutdown.

Critical Conditions

The storage array generates a critical event if the RAID controller module detects a critical condition that could cause immediate failure of the array and/or loss of data. The storage array is in a critical condition if one of the following occurs:

More than one fan has failed

Any midplane temperature sensors in the critical range

Midplane/power supply module failure

Two or more temperature sensors are unreadable

Failure to detect or unable to communicate with peer port

NOTE: If both RAID controller modules fail simultaneously, the enclosure cannot issue critical or noncritical event alarms for any enclosure component.

Noncritical Conditions

A noncritical condition is an event or status that does not cause immediate failure, but must be corrected to ensure continued reliability of the storage array. Examples of noncritical events include the following:

One power supply module has failed

One cooling fan module has failed

49

Image 49
Contents Dell PowerVault MD3860f Storage Arrays Page Contents Troubleshooting your system Technical Specifications Getting help Introduction Dell PowerVault Modular Disk Storage ManagerRelated Documentation Front-Panel Features Front-Panel FeaturesFront-Bezel Indicators Front-Panel IndicatorsBlue Back-Panel Features Back-Panel FeaturesCooling Fan Module Indicators Cooling Fan Module LED Indicator CodesPower Supply Module Features and Status Indicators Power Supply Module Features And IndicatorsPhysical-Disk LED Indicators Physical-Disk LED IndicatorsFrom the system RAID Controller Modules Controller ModulesHost Channel LED Link Rate Indications Fibre Optic Cable Connection SFP+ Transceivers Fibre Optic And SAS CablesExpansion Controller Modules MD3060e Expansion Module Features And IndicatorsBattery Backup Unit RAID Controller Module-Additional FeaturesStorage Array Thermal Shutdown System Password Reset Installing The Front Bezel Removing And Installing The Front BezelRecommended Tools Physical-Disk Drawers Service Action Allowed Indicator LEDRemoving The Front Bezel Inside the Physical-Disk Drawer Opening The Physical-Disk DrawerRemoving The Physical-Disk Drawer Closing The Physical-Disk DrawerPage Removing and Installing the Physical-Disk Drawer Installing The Physical-Disk DrawerPhysical Disks Physical Disk Installation GuidelinesRemoving a Physical Disk From a Physical-Disk Carrier Inch physical drive cage Guide pin Release handle Installing a Physical Disk In a Physical-Disk Carrier Removing a Physical Disk From a Physical-Disk Drawer SAS Chain Cables Installing a Physical Disk In a Physical-Disk DrawerRemoving The SAS Chain Cables Removing and Installing the SAS Chain Cables Installing The SAS Chain CablesRemoving a RAID Controller Module Or Expansion Module Installing a RAID Controller Module Or Expansion Module Closing The RAID Controller Module Opening The RAID Controller ModuleReplacing the SFP+ Transceiver Replacing The SFP+ TransceiverRemoving The RAID Controller Module Backup Battery Unit RAID Controller Module Backup Battery UnitInstalling The RAID Controller Module Backup Battery Unit Power SuppliesRemoving a Power Supply Module Removing and Installing the power supply module Installing a Power Supply ModuleRemoving a Cooling Fan Module Cooling Fan ModulesRemoving and Installing the Cooling Fan Module Installing a Cooling Fan ModuleTroubleshooting An SFP+ Transceiver Troubleshooting Loss Of CommunicationRemoving an SFP+ Module Troubleshooting Power Supply Modules Troubleshooting External ConnectionsTroubleshooting Expansion Enclosure Management Modules Troubleshooting Array Cooling ProblemsTroubleshooting Physical Disks Troubleshooting RAID Controller ModulesIf The Link Status LEDs Are Not Green If Both LEDs For Any Given FC in Port Are UnlitTroubleshooting a Wet Storage Array Troubleshooting Array And Expansion Enclosure ConnectionsController Failure Conditions Troubleshooting a Damaged ArrayCritical Conditions Noncritical ConditionsPCI Errors ECC ErrorsInvalid Storage Array Technical Specifications Page Tested Page Contacting Dell Locating your system service tagDocumentation feedback