v-nAssumes that the device driver or daemon has flushed the cache.

v-sTakes a soft snap, which does not dump the adapter state. This excludes the col_dump.out. This flag is used for temporary errors (TYPE=TEMP) where the integrity of the adapter is in doubt, or when it is not desirable to corrupt the adapter state by the use of diagnostic routines.

v-aSpecifies a single adapter. When this flag is not used, all the node's configured adapters will be selected. This is the default.

v-pSpecifies a single port on a specified adapter, hence a specific plane. This option must be used together with the -a option. When this flag is not specified, all the adapter's ports are selected. This is the default.

Collect the css.snap information from both the primary node and all nodes that are experiencing SP Switch2 problems. Do not reboot the nodes before running css.snap, because rebooting causes the loss of valuable diagnostic information.

css.snap file structure

The files collected by running the css.snap script are organized into the following hierarchy of directories:

/var/adm/SPlogs/css

vNode (global) level log file

vContains information relevant to all the ports and adapters in the node

/var/adm/SPlogs/cssX (where X=0 or 1)

vAdapter level log file (the 0 in css0 refers to the adapter id)

vContains information relevant to all the ports on a specific adapter

/var/adm/SPlogs/cssX/p0 (where X=0 or 1)

vPort level log file (the 0 in p0 is the port number on the adapter)

vContains information relevant to a specific port

Files collected by css.snap

The css.snap script collects all the files which reside in the /var/adm/SPlogs/css (node level), /var/adm/SPlogs/css0 (adapter level), and /var/adm/SPlogs/css0/p0 (port level) directories, and some additional files from the /tmp directory. Some of the files reside on each node, while others reside only on the primary node or on the control workstation.

Under normal circumstances, the css.snap script will collect the following files:

Table 3-4. SP Switch2 log files

Log File

Information Level

File Location

File Contents

 

 

 

 

adapter.log

adapter

nodes

Adapter status information from the fault

 

 

 

service daemon

 

 

 

 

cable_miswire

port

primary node

Node-to-switch or switch-to-switch plane

 

 

 

miswired connection information.

 

 

 

 

cadd_dump.out

node

nodes

Most recent css.snap's cadd_dump

 

 

 

command dump file. SP Switch2 adapter

 

 

 

device driver trace buffer dump file.

 

 

 

 

chgcss.log

node

nodes

Log file of chgcss, which changes the

 

 

 

adapter device driver's attributes.

 

 

 

 

col_dump.gsd

adapter

nodes

The most recent css.snap's col_dump

 

 

 

command dump file. Microcode dump

 

 

 

information.

 

 

 

 

colad.trace

adapter

nodes

SP Switch2 adapter diagnostics

 

 

 

messages.

 

 

 

 

3-4RS/6000 SP: SP Switch2 Service Guide

Page 62
Image 62
IBM RS/6000 SP manual Css.snap file structure, Var/adm/SPlogs/cssX where X=0 or, Var/adm/SPlogs/cssX/p0 where X=0 or

RS/6000 SP specifications

The IBM RS/6000 SP (Scalable Power) is a high-performance computing system that was developed in the early 1990s and has since become a significant player in the realm of supercomputing and enterprise solutions. Designed primarily for scientific, engineering, and complex data analysis tasks, the RS/6000 SP provides powerful processing capabilities, enhanced scalability, and features to support demanding applications.

One of the key characteristics of the RS/6000 SP is its modular architecture, which allows for the addition of multiple nodes. Each node is based on IBM's Power architecture, leveraging the RISC (Reduced Instruction Set Computing) design to achieve high throughput and efficiency. The system can support configurations ranging from a few nodes to hundreds, enabling organizations to scale their computational power as needed based on workload requirements.

The RS/6000 SP employs advanced interconnect technologies, most notably the SP Switch, which ensures high-speed communication between nodes. This interconnect allows for seamless data transfer, ensuring that the system can handle large datasets and complex computations without bottlenecks. The SP's ability to deliver concurrent processing capabilities makes it ideal for parallel computing tasks, including simulations, modeling, and large-scale computations.

In terms of software, the RS/6000 SP was equipped with AIX, IBM's version of the UNIX operating system, which provides a stable and secure environment for enterprise applications. Furthermore, the system supports a vast array of programming languages and tools, including Fortran, C, and C++, along with libraries and frameworks designed for high-performance computing. This compatibility allows developers to optimize their applications to fully utilize the underlying hardware.

Another notable feature of the RS/6000 SP is its extensive support for various workloads, including database management, multi-user applications, and graphical processing. The system was equipped with high-performance graphics capabilities, making it suitable for tasks that require intensive visualization, such as computer-aided design (CAD) and scientific visualization.

Overall, the IBM RS/6000 SP has solidified its role in the high-performance computing landscape, combining advanced technologies, robust scalability, and flexibility to meet the unique demands of research and enterprise environments. Its legacy continues to influence modern computing architectures, embodying IBM’s commitment to innovation and performance in the pursuit of complex problem-solving capabilities.