Abstract
Product Version Supported Release Version Updates (RVUs)
Part Number Published
Document History Part Number Product Version Published
Introduction to Integrity NonStop NS-Series Operations
HP Integrity NonStop NS-Series Operations Guide
Determining Your System Configuration
Monitoring EMS Event Messages
ServerNet Resources: Monitoring and Recovery
Power Failures: Preparation and Recovery
Preventive Maintenance
Related Reading
Converting Numbers
Figures
Examples
Tables
Document History
What’s New in This Manual
Manual Information
New and Changed Information
Who Should Use This Guide
About This Guide
Converting Numbers
What Is in This Guide
ServerNet Cluster Manual
Where to Get More Information
General Syntax Notation
Support and Service Library
Notation Conventions
Hypertext Links
INSPECT { OFF | ON | SAVEABEND }
Maxattach
Interrupts
ALLOWSU { ON | OFF }
Enter RUN Code
Notation for Messages
Code Received
Operator
Backup Up
Event number = number [ Subject = first-subject-value ]
proc-name trapped [ in SQL | in SQL file system ]
Change Bar Notation
Introduction to Integrity NonStop NS-Series Operations
What Are the Operator Tasks?
When to Use This Section
Monitoring the System and Performing Recovery Operations
Understanding the Operational Environment
Performing Preventive Maintenance
Preparing for and Recovering from Power Failures
Stopping and Powering Off the System
Powering On and Starting the System
Updating Firmware
Determining the Cause of a Problem: A Systematic Approach
Responding to Spooler Problems
Problem-Solving Worksheet
Problem Facts Possible Causes
Problem-Solving Worksheet
Situation Facts Escalation Decision
Category Questions to Ask
Task 1a Determine the Facts About the Problem
Task 1 Get the Facts
Task 1b Determine the Facts About the Situation
Task 2a Identify the Most Likely Cause
Task 2 Find and Eliminate the Cause of the Problem
Task 3b Provide Documentation
Task 2b Fix the Most Probable Cause of the Problem
Task 3 Escalate the Problem If Necessary
Task 3a Determine Whether You Need to Escalate the Problem
Logging On to an Integrity NonStop Server
Task 4 Prevent Future Problems
System Consoles
Select Start > Programs > OutsideView32
Opening a TACL Window
Opening a TACL Window Directly From OutsideView
Opening a TACL Window From the Low-Level Link
Launching OSM Applications
Overview of OSM Applications
Service Procedures
Determining Your System Configuration
Integrity NonStop NS16000 Systems
Modular Hardware Components
Integrity NonStop NS1000 Systems
Integrity NonStop NS14000 Systems
System Resource or Object
Recording Your System Configuration
Terms Used to Describe System Hardware Components
Device
SCF Configuration Files
Using SCF to Determine Your System Configuration
SCF System Naming Conventions
Save Configuration
Using SCF to Display Subsystem Configuration Information
SCF Assume WS $L1.#TERM1
SCF LISTDEV: Listing the Devices on Your System
Example 2-1. SCF LISTDEV Command Output
SCF LISTDEV
Specified device
Backup processor number and PIN of the specified device
Subsystem Name Logical Name Device Type Description
Displaying Information for the TCP/IP Subsystem $ZTCO
Displaying SCF Configuration Information for Subsystems
TCP/IP Subsystem
SCF Assume Process $ZTCO
Displaying Information for the Kernel Subsystem $ZZKRN
Kernel Subsystem
Storage Subsystem
SCF Assume Process $ZZKRN
Cbpoollen
Example 2-2. SCF ADD Disk Command Output
Info Disk $SYSTEM,OBEYFORM
Mirrorlocation 11,1,12
SCF Assume Process $ZZLAN
ServerNet LAN Systems Access (SLSA) Subsystem
ADD Adapter $ZZLAN.E0154 Location Type G4SA Accesslist
Info Tape $*,OBEYFORM
Subsystem Objects Controlled by SCF page 1
WAN Subsystem
Additional Subsystems Controlled by SCF
Displaying Information for the WAN Subsystem $ZZWAN
X25AM
Info Proc $ZZKRN.#
Displaying Configuration Information: SCF Examples
Example 2-3. SCF Info Process Command Output
Example 2-4. SCF Info SAC Command Output
Info Line $line-name, Detail
Example 2-5. SCF Info Process $ZZWAN Command Output
Example 2-6. SCF Info Line Command Output
Info Process $ZZWAN
Overview of Monitoring and Recovery
Functions of Monitoring
Working With a Daily Checklist
Monitoring Tasks
Task Operator’s name Date & time
Tools for Checking the Status of System Hardware
Monitoring System Components
Monitored Using These Resource Tools See
ServerNet Cluster 6770 Hardware
Additional Monitoring Tasks
Daily Tasks Checklist
General Tasks Specific Tasks For More Information, See
Top-Down Approach
Monitoring and Resolving Problems: An Approach
Using OSM to Monitor the System
Using the OSM Service Connection
OSM Management System Icons Indicate Problems Within
Expanding the Tree Pane to Locate the Source of Problems
Attributes Tab
Using System Status Icons to Monitor Multiple Systems
Alarm Summary Dialog Box
Using Alarm and Problem Summaries
Monitoring Problem Incident Reports
Using SCF to Monitor the System
Suppressing Problems and Alarms
Recovery Operations for Problems Detected by OSM
Determining Device States
Example 3-1. SCF Status Tape Command
SCF Object States page 1
SCF Object States
State Substate Explanation
Servicing Special
Automating Routine System Monitoring
Example 3-2. System Monitoring Command File
Example 3-3. System Monitoring Output File page 1
Status LEDs and Their Functions page 1
Using the Status LEDs to Monitor the System
Location LED Name Color Function
FC-AL I/O
Task Tool For information, see
Related Reading
Related Reading for Monitoring
What Is the Event Management Service (EMS)?
Monitoring EMS Event Messages
Tools for Monitoring EMS Event Messages
Related Reading for Monitoring EMS Event Messages
OSM Event Viewer
ViewPoint
Web ViewPoint
Types of Processes
Processes: Monitoring and Recovery
System Processes
Generic Processes
Processes IOPs
Monitoring System Processes
Monitoring Processes
Monitoring the Status of All Generic Processes
Monitoring IOPs
Monitoring Generic Processes
Monitoring the Status of $ZZKRN
CLCI-TACL $CLCI Stopped PID
Supported for the subsystem manager processes
Recovery Operations for Processes
Initiates the operation of a generic process
Communications Subsystems
Communications Subsystems: Monitoring and Recovery
Local Area Networks (LANs) and Wide Area Networks (WANs)
Object Connectivity By
LAN Service Provider Subsystems Supported
To monitor the status of an adapter
Monitoring Communications Subsystems and Their Objects
Monitoring the SLSA Subsystem
Monitoring the Status of an Adapter and Its Components
SCF Status PIF pif-name
SCF Status Adapter $ZZLAN
SCF Status SAC sac-name
SCF Status SAC $ZZLAN.G11123
SCF Status LIF $ZZLAN.L11021A , Detail
Monitoring the WAN Subsystem
Monitoring Status for a SWAN Concentrator
SCF Status PIF $ZZLAN.G11123
SCF Status Device $ZZWAN.#device-name
Monitoring Status for a Data Communications Device
The system displays a listing similar to
SCF Status Adapter $ZZWAN
SCF Status Process $ZZWAN
To monitor a single WANBoot process, type
SCF Status Process $ZZWAN.#boot-process
Monitoring WAN Processes
SCF Listdev Tcpip
Monitoring CLIPs
Monitoring the NonStop TCP/IP Subsystem
Monitoring the NonStop TCP/IP Process
SCF Status Route $ZTCO
Monitoring NonStop TCP/IP Routes
Monitoring NonStop TCP/IP Subnets
Monitoring Line-Handler Process Status
Info Process $NCP, Lineset
SCF Status Line $LHCS6S, Detail
Tracing a Communications Line
Related Reading for Communications Lines and Devices page 1
Recovery Operations for Communications Subsystems
For Information About Refer to
WAN Subsystem Configuration and Management Manual
ServerNet Communications Network
ServerNet Resources: Monitoring and Recovery
Integrity NonStop NS16000 System
Integrity NonStop NS16000 ServerNet Connectivity
Integrity NonStop NS14000 System with IOAM Enclosure
Integrity NonStop NS14000 ServerNet Connectivity
Monitoring the Status of the ServerNet Fabrics
System I/O ServerNet Connections
Integrity NonStop NS1000 ServerNet Connectivity
Monitoring the ServerNet Fabrics Using OSM
SCF Status Servernet $ZSNET
Monitoring the ServerNet Fabrics Using SCF
Normal ServerNet Fabric States
Identifying ServerNet Fabric Problems
Recovery Operations for a Down Processor
Recovery Operations for the ServerNet Fabrics
Recovery Operations for a Down Disk Due to a Fabric Failure
Recovery Operations for a Down Path Between Processors
Adapters and Modules: Monitoring and Recovery
Fibre Channel ServerNet Adapter (FCSA)
Adapters and Modules
Gigabit Ethernet 4-Port Adapter (G4SA)
4-Port ServerNet Extender (4PSE)
Monitoring I/O Adapters and Modules
SCF Status Adapter $ZZSTO.#FCSA*, Detail
Monitoring the FCSAs
State Description
SCF Status Adapter $ZZLAN.G1123
Monitoring the G4SAs
Service, Device, and Enabled States for the G4SA page 1
Monitoring the 4PSEs
Recovery Operations for I/O Adapters and Modules
ServerNet/DA Manual
Related Reading for I/O Adapters and Modules
Processors and Components: Monitoring and Recovery
Overview of the NonStop Blade Complex
LSU CPU
Term Description
Monitoring Processors Automatically Using TFDS
Monitoring and Maintaining Processors
In summary, these terms describe the NSAA processor
Processor Status Display
Monitoring Processor Status Using the OSM Low-Level Link
OSM Representation of Processor Complex
ViewSys
Identifying Processor Problems
Monitoring Processor Performance Using ViewSys
Processor or System Hangs
Freeze code = %nnnnnn
OSM Alarms and Attribute Values
Processor Halts
Halt code = %nnnnnn
Recovery Operations for a Processor Halt
Recovery Operations for Processors
Reloading a Single Processor on a Running Server
Halting One or More Processors
Select Processor Actions > Halt. Click Perform action.
Select File > Start Terminal Emulator
Using TACL RELOAD to Perform Reload
Reload / run-option , run-option
Fabric
Noswitch Primenoprime fabric Omitblade ABC
Noswitch
Primenoprime
Select Reload, click Perform action
Using the OSM Service Connection to Perform Reload
Recovery Operations for a System Hang
Enabling/Disabling Processor and System Freeze
Freezing the System and Freeze-Enabled Processors
Dumping a Processor to Disk
See Using RCVDUMP to Dump a Processor to Disk
Before You Begin
Using RCVDUMP to Dump a Processor to Disk
FUP Purgedata dumpfile
CPU n has been dumped to dumpfile
Blade Element Reintegration
FUP Info dumpfile
Replacing Processor Memory
Troubleshooting and Recovery Operations for Disk Dumps
Submitting Information to Your Service Provider
Backing Up a Processor Dump to Tape
Submitting Tapes of Processor Dumps
Submitting Tapes of Configuration and Operations Files
Other Files to Submit to Your Service Provider
Backup $tape, CPU0,$SYSTEM.SYS00.CONFTEXT
Additional Information Required by Your Service Provider
For Information About Tool See
Disk Drives: Monitoring and Recovery
Internal SCSI Disk Drives
Overview of Disk Drives
For information about
Enterprise Storage System (ESS) Disks
M8xxx Fibre Channel Disk Drives
For information about See
Monitoring Disk Drives With OSM
Monitoring Disk Drives
Task See
Status Disk $*, SUB Magnetic
Monitoring Disk Drives With SCF
Status $DATA02-M
Status Disk $DATA09, Detail
To display the status of the disk $DATA01
Status $DATA01
Status Disk $
To display the status of all disks
Status Disk $DATA00
To display the detailed status of the disk $DATA01
Status $DATA01, Detail
To display status of all paths for $DATA00
Monitoring the Size of Database Files
Monitoring the Use of Space on a Disk Volume
Primary and Backup Path States for Disk Drives
Monitoring the State of Disk Drives
FUP Info DATA1.MEMOS, Detail
Monitoring Disk Configuration and Performance
Example
To check the size of the file DATA1.MEMOS
Possible Causes of Common Disk Drive Problems
Identifying Disk Drive Problems
Problems Possible Symptoms
Command Description
Recovery Operations for Disk Drives
These SCF commands control Disk objects
Common Recovery Operations for Disk Drives page 1
Customer Support Center or your service provider
Start Disk $volume
Recovery Operations for a Down Disk or Down Disk Path
Reset Disk $volume
Reset Disk $WD8
FUP Alter MEMOS, Maxextents Info MEMOS, Detail
Recovery Operations for a Nearly Full Database File
A report such as this one is sent to your home terminal
Overview of Tape Drives
Tape Drives: Monitoring and Recovery
Monitoring Tape Drive Status With OSM
Monitoring Tape Drives
OSM Monitoring Tape Drives Connected to an FCSA
OSM Monitoring Tape Drives Connected to an IOMF2
A listing such as this one is sent to your home terminal
Monitoring Tape Drive Status With SCF
A listing similar to this one is sent to your home terminal
SCF Status Tape $TAPE0
MEDIACOM STATUS TAPEDRIVE
Monitoring Tape Drive Status With MEDIACOM
MEDIACOM STATUS TAPEDRIVE $TAPE0
Monitoring the Status of Labeled-Tape Operations
Identifying Tape Drive Problems
Common Tape Drive Problems
Symptom Problem Possible Causes
Performing an OSM Action on Multiple Tape Drives
Recovery Operations for Tape Drives
Recovery Operations Using the OSM Service Connection
Performing an OSM Action on a Tape Drive
SCF Command Description
Recovery Operations Using SCF
Related Reading for Tapes and Tape Drives page 1
Overview of Printers and Terminals
Printers and Terminals: Monitoring and Recovery
Spoolcom DEV $LASER
Monitoring Printer and Collector Process Status
Monitoring Printer Status
Monitoring Collector Process Status
Recovery Operations for a Full Collector Process
Recovery Operations for Printers and Terminals
Monitoring TMF
Applications: Monitoring and Recovery
~ Status TMF
Monitoring Data Volumes
Monitoring the Status of TMF
TMFCOM
TMF States page 1
TMF States
TMFCOM responds with output similar to
The TMF subsystem can be in any of the states listed in Table
= Status Pathway
Monitoring the Status of Pathway
Status *, Prog $*.*.PATHMON
PATHCOM responds with output such as
PATHMON States
= Status Pathmon
Request is waiting for an object that has been locked by another requester
Request is waiting for a RUN Program to finish
Power Failures: Preparation and Recovery
External Devices
System Response to Power Failures
NonStop NS-Series Cabinets Modular Cabinets
NonStop S-Series I/O Enclosures
Air Conditioning
Preparing for Power Failure
Configure OSM Power Fail Support
ESS Cabinets
Maintain Batteries
Power Failure Recovery
Monitor Power Supplies
Monitor Batteries
Setting System Time
Procedure to Recover From a Power Failure
Starting and Stopping the System
Powering On a System
Select Hard Reset. Click Perform Action.
Powering On the System From a Low Power State
Powering On the System From a No Power State
Select Power On System
Normal System Load
Starting a System
Loading the System
Alerts
System Load Disks
System Load to a Specific Processor
Path Group Module Slot
System Load Paths for a Normal System Load
System Load Paths in Order of Use
Disk Drive Enclosure
Data Travels
Configuration File
Performing a System Load
Starting Other System Components
Click Start system
System Load Dialog Box
Performing a System Load From a Specific Processor
Reload 01 15, Prime
Reloading Processors Using the Reload Command
Reloading Processors Using OSM
Reloading Processors
Logical Processor Reload Parameters
Stopping Applications, Devices, and Processes
Minimizing the Frequency of Planned Outages
Anticipating and Planning for Change
RUN Stopscm
= SHUTDOWN2, Mode Orderly
Volume $DSMSCM.ZDSMSCM
Stop DSM/SCM
Tmfcom Stop TMF
Halting All Processors Using OSM
Stopping the System
Spoolcom supervisor-name, SPOOLER, Drain
From the Processors Actions menu, select Halt
Powering Off a System
System Power-Off Using OSM
System Power-Off Using SCF
Emergency Power-Off Procedure
Troubleshooting and Recovery Operations
Fans Are Not Turning
Components Fail When Testing the Power
System Does Not Appear to Be Powered On
Green LED Is Not Lit After POSTs Finish
Info Subsys $ZZKRN
Recovering From a System Load Failure
Recovering From a Reload Failure
Getting a Corrupt System Configuration File Analyzed
Backup $TAPE, $SYSTEM.ZSYSCONF.CONFSAVE, Listall
Opening Startup Event Stream and Startup Tacl Windows
Exiting the OSM Low-Level Link
Related Reading for Starting and Stopping a System
NonStop NS-Series Hardware Installation Guide
Creating Startup and Shutdown Files
Automating System Startup and Shutdown
Managed Configuration Services MCS
Startup
Shutdown
Processes That Represent the System Console
For More Information
$ZHOME Alternative
Example Command Files
CIIN File
Modifying a CIIN File
Establishing a CIIN File
CONFTEXT CIIN Entry
If a CIIN File Is Not Specified or Enabled in OSM
CIIN File Option Results
Reload /TERM $ZHOME, OUT $ZHOME
Example CIIN Files
= Start Term TERM1, TERM2, TERM3, TERM4, TERM5, TERM6
Writing Efficient Startup and Shutdown Command Files
Command File Syntax
= Start Term
Use Parallel Processing
Avoid Manual Intervention
Tips for Startup Files
How Process Persistence Affects Configuration and Startup
Investigate Product-Specific Techniques
System Startup File
Startup File Examples
Obey $SYSTEM.STARTUP.STRTSYS
Obey $SYSTEM.STARTUP.SPLWARM
TCP/IP Stack Configuration and Startup File
Spooler Warm-Start File
TMF Warm-Start File
ATP6100 Lines Startup File
CP6100 Lines Startup File
Lines Startup File
Expand-Over-IP Line Startup File
Printer Line Startup File
Expand Direct-Connect Line Startup File
Shutdown File Examples
Tips for Shutdown Files
Obey $SYSTEM.SHUTDOWN.STOPSYS
System Shutdown File
ATP6100 Lines Shutdown File
CP6100 Lines Shutdown File
Lines Shutdown File
Expand-Over-IP Line Shutdown File
Printer Line Shutdown File
Direct-Connect Line Shutdown File
Tmfcom / in $SYSTEM.SHUTDOWN.TMFSTOP, OUT $ZHOME
Spooler Shutdown File
TMF Shutdown File
Obey $SYSTEM.SHUTDOWN.SPLDRAIN
Checking Air Temperature and Humidity
Preventive Maintenance
Monitoring Physical Facilities
Cleaning System Components
Handling and Storing Cartridge Tapes
Cleaning Tape Drives
HP Integrity NonStop NS-Series Operations Guide-529869-005
Tools and Utilities for Operations
When to Use This Appendix
Disk Compression Program (DCOM)
Event Management Service Analyzer (EMSA)
Disk Space Analysis Program (DSAP)
NSKCOM and the Kernel-Managed Swap Facility (KMSF)
File Utility Program (FUP)
Measure
NonStop NET/MASTER
Subsystem Control Facility (SCF)
Pathcom
Web ViewPoint
HP Tandem Advanced Command Language (TACL)
ViewPoint
ViewSys
Table C-1. Related Reading for Tools and Utilities page 1
Related Reading
Tool Documentation Description
NET/MASTER MS
Management Manual
Recovery Guide
Output
Converting Numbers
Table D-1. Descriptions of Number Systems
Overview of Numbering Systems
Number System Base Description
Binary Value Decimal Value
Binary to Decimal
Octal Value    Decimal Value
%1375          765
Octal to Decimal
Hexadecimal Decimal
Hexadecimal to Decimal
Hexadecimal Value    Decimal Value
%HBA10               47632
Figure D-3. Hexadecimal to Decimal Conversion
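The conversions to decimal listed in this appendix (binary, octal, and hexadecimal) all follow the same positional rule: each digit is multiplied by a power of the base. A minimal Python sketch of that rule is shown here for illustration only; the function name to_decimal and the DIGITS table are hypothetical and are not part of any NonStop tool. The sample values are the ones that appear in this appendix.

```python
# Illustrative sketch only: to_decimal and DIGITS are hypothetical names,
# not part of any NonStop utility.
DIGITS = "0123456789ABCDEF"


def to_decimal(digits: str, base: int) -> int:
    """Convert a digit string in base 2..16 to a decimal integer."""
    if not 2 <= base <= 16:
        raise ValueError("base must be between 2 and 16")
    value = 0
    for ch in digits.upper():
        digit = DIGITS.find(ch)
        if digit < 0 or digit >= base:
            raise ValueError(f"{ch!r} is not a valid base-{base} digit")
        # Shift the running total one position left, then add the new digit.
        value = value * base + digit
    return value


print(to_decimal("1375", 8))        # octal value from this appendix
print(to_decimal("BA10", 16))       # hexadecimal value from this appendix
print(to_decimal("101100010", 2))   # binary value from this appendix
```

Multiplying the running total by the base before adding each digit is equivalent to summing digit times base-to-the-position, and avoids computing powers explicitly.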
Decimal Value    Binary Value
354              %B101100010
Decimal to Binary
Result is
Step Division Quotient Remainder
Step Division Quotient
Decimal to Octal
Decimal Value Octal Value
Decimal Hexadecimal
Decimal to Hexadecimal
Decimal Value Hexadecimal Value
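The decimal-to-binary, decimal-to-octal, and decimal-to-hexadecimal conversions use repeated division: divide by the target base, record the remainder at each step, and read the remainders in reverse order. The following Python sketch illustrates that procedure under stated assumptions; the function name from_decimal and the DIGITS table are hypothetical, and the sample values are the ones that appear in this appendix.

```python
# Illustrative sketch only: from_decimal and DIGITS are hypothetical names,
# not part of any NonStop utility.
DIGITS = "0123456789ABCDEF"


def from_decimal(value: int, base: int) -> str:
    """Convert a non-negative decimal integer to a digit string in base 2..16."""
    if not 2 <= base <= 16:
        raise ValueError("base must be between 2 and 16")
    if value < 0:
        raise ValueError("value must be non-negative")
    if value == 0:
        return "0"
    remainders = []
    while value > 0:
        # Each step yields a new quotient and one remainder digit.
        value, remainder = divmod(value, base)
        remainders.append(DIGITS[remainder])
    # The first remainder is the least significant digit, so reverse the list.
    return "".join(reversed(remainders))


print(from_decimal(354, 2))      # decimal value from this appendix, in binary
print(from_decimal(765, 8))      # in octal
print(from_decimal(47632, 16))   # in hexadecimal
```

The result is a bare digit string; any base prefix used when entering the value on a system is left to the reader.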
Canadian Compliance
Safety and Compliance
Regulatory Compliance Statements
FCC Compliance
Laser Compliance
European Union Notice
Safety Caution
Waste Electrical and Electronic Equipment (WEEE)
Important Safety Information
Index
Numbers
Special Characters