HP c-Class Solaris manual Monitoring IO Accelerator health, Nand flash and component failure


Monitoring IO Accelerator health

NAND flash and component failure

The IO Accelerator is a highly fault-tolerant storage subsystem that provides many levels of protection against component failure and the lossy nature of solid-state storage. However, as in all storage subsystems, component failures might occur.

By proactively monitoring device age and health, you can ensure reliable performance over the intended product life.

Health metrics

The IO Accelerator manages block retirement using predetermined retirement thresholds. The HP IO Accelerator Management Tool and the fio-status utility show a health indicator that starts at 100 and counts down to 0. As certain thresholds are crossed, various actions are taken.

At the 10% healthy threshold, a one-time warning is issued. For more information, see "Health monitoring techniques."

At 0%, the device is considered unhealthy. It enters write-reduced mode, which somewhat prolongs its lifespan so data can be safely migrated off. In this state the IO Accelerator device behaves normally, except for the reduced write performance.

After the 0% threshold, the device will soon enter read-only mode, and any attempt to write to the IO Accelerator device causes an error. Some filesystems might require special mount options to mount a read-only block device in addition to specifying that the mount must be read-only.

For example, under Linux, ext3 requires the mount options -o ro,noload. The noload option tells the filesystem not to replay the journal, which would otherwise require writing to the device.
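The Linux ext3 case above can be sketched in Python as follows. This is a minimal illustration, not part of the IO Accelerator tooling: the function name, the device path /dev/fioa, and the mount point /mnt/recovery are assumptions made for the example, and the function only builds the command line without running it.

```python
def readonly_mount_command(device, mount_point, fstype="ext3"):
    """Build (but do not run) a Linux mount command for a read-only block device."""
    options = ["ro"]
    # ext3 would otherwise try to replay its journal, which needs write access.
    if fstype == "ext3":
        options.append("noload")
    return ["mount", "-t", fstype, "-o", ",".join(options), device, mount_point]


if __name__ == "__main__":
    # Hypothetical device and mount point, for illustration only.
    print(" ".join(readonly_mount_command("/dev/fioa", "/mnt/recovery")))
```

Other filesystems might need different options; check the documentation for the filesystem in use before relying on this sketch.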

Consider the read-only mode as a final opportunity to migrate data off the device, as device failure is more likely with continued use.

The IO Accelerator device might also enter failure mode. In this case, the device is offline and inaccessible. This can be caused by an internal catastrophic failure, improper firmware upgrade procedures, or device wear-out.
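The percentage thresholds described in this section can be summarized in a short Python sketch. It assumes the health indicator has already been read (for example, from the fio-status output) and is passed in as a number; the function name and the returned labels are illustrative choices, not names used by the IO Accelerator software. Read-only mode and failure mode are not tied to a specific percentage, so the sketch does not try to detect them.

```python
def health_stage(health_percent):
    """Map a health indicator value (100 counting down to 0) to a stage label."""
    if not 0 <= health_percent <= 100:
        raise ValueError("health indicator must be between 0 and 100")
    if health_percent == 0:
        # The device is in write-reduced mode; read-only mode follows at some
        # later point that the percentage alone does not reveal.
        return "unhealthy"
    if health_percent <= 10:
        # The one-time warning is issued at the 10% threshold.
        return "warning"
    return "healthy"


if __name__ == "__main__":
    for value in (100, 10, 0):
        print(value, health_stage(value))
```

Treating exactly 10 as the start of the warning range is an assumption based on the wording "at the 10% healthy threshold."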

