Appendix G High Availability
HPSS Installation Guide September 2002 565
Release 4.5, Revision 2
G.5.4 Setup Error Notification
Even though an HA HPSS system is designed to recover from failures, the recovered system is often
unable to handle subsequent failures. For this reason, it is important that administrators know
immediately when a component fails so that it can be replaced or fixed quickly in order to get the
HA HPSS system back to a highly available state.
This is why the hpss_notify.ksh script is supplied. If you have customized your hpss_notify.ksh
to email your system administrator, all that remains is to direct error messages to the
hpss_notify.ksh script.
This can be done in two ways: through AIX Error Notification and through HACMP Notify Events.
G.5.4.1 AIX Error Notification
When configuring AIX Error Notification, use the following SMIT path:
% smitty hacmp
RAS Support
-> Error Notification
-> Add a Notify Method
The notify method should call the hpss_aix_error.ksh script for all AIX errors. The script selects
the errors that are of interest. These include power supply, disk, and SCSI bus failures.
Unfortunately, there is no easy way to synchronize both nodes’ error notification settings.
Therefore, these steps will have to be performed on each node independently.
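The kind of filtering described above can be pictured with a short sketch. This is not the shipped hpss_aix_error.ksh. The function name should_notify and the list of resource classes are hypothetical, and the argument order assumes the usual AIX notify method convention (sequence number, error ID, error class, error type, alert flag, resource name, resource type, resource class), which should be checked against your AIX documentation.

```shell
#!/bin/sh
# Hypothetical sketch only: NOT the shipped hpss_aix_error.ksh.
# Assumed argument order (AIX notify method convention):
#   $1 sequence number   $2 error ID       $3 error class   $4 error type
#   $5 alert flag        $6 resource name  $7 resource type $8 resource class

# should_notify prints "notify" for hardware-class errors on the resource
# classes listed below and "ignore" for everything else.
# The class list is an assumption chosen for illustration.
should_notify() {
    err_class=$3
    res_class=$8

    # Only hardware-class errors (H) are considered in this sketch.
    if [ "$err_class" != "H" ]; then
        echo ignore
        return 0
    fi

    case "$res_class" in
        disk|adapter|power*) echo notify ;;
        *)                   echo ignore ;;
    esac
}
```

For example, `should_notify 1234 ABCDEF01 H PERM FALSE hdisk3 scsd disk` prints `notify`, while a software-class error prints `ignore`. A real filter would then pass the selected errors on to hpss_notify.ksh.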
The notify method should call the hpss_aix_error.ksh script with the following syntax:
hpss_aix_error.ksh $1 $2 $3 $4 $5 $6 $7 $8
It is also important that the Persist across system restart field be set to Yes.
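For reference, the settings above correspond roughly to a stanza in the AIX errnotify ODM class. The following sketch prints such a stanza. The function name print_errnotify_stanza, the object name, and the script path are hypothetical, and the stanza is an illustration of the SMIT fields rather than a replacement for the SMIT procedure.

```shell
#!/bin/sh
# Hypothetical sketch: print an errnotify stanza matching the settings above.
#   $1  name for the notify object (assumed, any unique name)
#   $2  full path to hpss_aix_error.ksh on this node (site specific)
print_errnotify_stanza() {
    name=$1
    script=$2
    printf 'errnotify:\n'
    printf '        en_name = "%s"\n' "$name"
    # 1 corresponds to "Persist across system restart" = Yes.
    printf '        en_persistenceflg = 1\n'
    # $1..$8 are left literal so that AIX fills them in when an error is logged.
    printf '        en_method = "%s $1 $2 $3 $4 $5 $6 $7 $8"\n' "$script"
}
```

The output could be saved to a file and loaded with odmadd, assuming that fits your site's procedures. As with the SMIT path, this would have to be done on each node independently.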