IBM Hub/Switch - page 80

Chapter 2 HPSS Planning

80 September 2002 HPSS Installation Guide

Release 4.5, Revision 2

requeststo the DMAP Gateway. Migration processes (hpss_hdm_mig) migrate data to HPSS, and

purge processes (hdm_hdm_pur) purge migrated data from DFS and XFS. A set of processes

(hpss_hdm_tcp)accept requests from the DMAP Gateway,and perform the requested operation in

DFS. A destroy process (hpss_hdm_dst) takes care of deleting ﬁles. Finally, XFS HDMs have a

processthat watches for stale events (hpss_hdm_stl) and keeps the HDM from getting bogged own

by them.

There are three types of event handlers based on the type of activity that generates the events:

administrative, name space, and data. Administrative activities include mounting and

dismountingaggregates. Name space activities include creating, deleting, or renaming objects, and

changingan object's attributes. Data activities include reading and writing ﬁle data. The number of

processesallocated to handle events generated by these activities should be large enough to allow

a reasonable mix of these activities to run in parallel.

When the HDM fetches an event from DFS or XFS, it is put on a queue and assigned to an

appropriate event handler when one becomes free. The total number of entries allowed in the

queue is determined by a conﬁguration parameter. If this value is not large enough to handle a

reasonable number of requests, some of the event handlers may be starved. For example, if the

queueﬁlls up with data events, the name space handlers will be starved. Section 7.6.3.3.1: conﬁg.dat

File on page 449 discusses the criteria for selecting the size of an event queue.

HDMlogs outstanding name space events. If the HDM is interrupted, the log is replayed when the

HDM restarts to ensure that the events have been processed to completion and the DFS/XFS and

HPSS name spaces are synchronized. The size of the log is determined by a conﬁguration

parameter, as discussed in Section 7.6.3.3.1:conﬁg.dat File on page 449.

HDMhas two other logs, each containing a list of ﬁles that are candidates for being destroyed. One

ofthe logs, called the zap log, keeps track of ﬁles on archived aggregates and ﬁle systems, while the

other, called thedestroy log, keeps track of ﬁles on mirrored aggregates. Because of restrictions

imposed by the DFS SMR, the HDM cannot take the time to destroy ﬁles immediately, so the logs

serveas a record of ﬁles that need to be destroyed by the destroy process. The size of the zap log is

boundedonly by the ﬁle system where the log is kept, but the size of the destroy log is determined

by a conﬁguration parameter. If the destroy log is too small, the HDM will be forced to wait until

space becomes available.

Since the HDM may be running on a machine where it cannot write error messages to the HPSS

message log, it uses its own log. This HDM log consists of a conﬁgurable number of ﬁles (usually

2)that are written in round-robin fashion. The sizes of these ﬁles are determined by a conﬁguration

parameter.

HDMlogging policy allows the system administrator to determine the type of messages written to

thelog ﬁle: alarm, event, debug, and/or trace messages. Typically, only alarms should be enabled,

although event messages can be useful, and do not add signiﬁcant overhead. If a problem occurs,

activating debug and trace messages may provide additional information to help identify the

problem.However, these messages add overhead, and the system will perform best if messages are

keptto a minimum. The type of messages logged is controlled by a parameter in the conﬁguration

ﬁle and can be dynamically changed using thehdm_admin utility.

HDMmigrates and purges ﬁles based on policies deﬁned in the HDM policy conﬁguration ﬁle. The

administrator can establish different policies for each aggregate in the system. Migration policy

parametersinclude the length of time to wait between migration cycles and the amount of time that

must elapse since a ﬁle was last accessed before it becomes eligible for migration. Purge policy

parametersinclude the length of time to wait between purge cycles, the amount of time that must

elapse since a ﬁle was last accessed, an upper bound specifying the percentage of DFS space that