IBM RS/6000 SP Restoring a switch assembly to the active configuration, Viewing switch partitions

Page 78

6.The customer can re-initialize the switch using the Estart command. The frame and processor nodes which were removed in this procedure will appear in the out.top file with error messages; however, the remainder of the switch resources are now available for customer use.

7.If switch re-initialization was successful, the customer can start running parallel jobs again.

8.Return to MAPs to continue service action(s).

Restoring a switch assembly to the active configuration

1.Identify the switch assembly which is to be restored to the active configuration.

2.Have customer stop all current parallel jobs and suspend all parallel jobs on the job queue.

3.Make sure that the circuit breaker at the front of the switch assembly in the Off (`0') position.

4.Connect all cables to the switch assembly. Pay attention to the labels on the cables.

5.Put the circuit breaker at the front of the switch assembly in the On (`1') position.

6.Use switch front panel or the Eclock command to select the appropriate clock input for this switch assembly. For SSP code level 1.02 and higher, use the Eclock -rcommand.

7.If the primary processor node was changed during the isolation procedure, the customer can change the primary processor node back to the original selection; however, this is NOT required. The primary node is set by the Eprimary command. Refer to IBM RS/6000 SP: Administration Guide, for more information.

8.If the master clock was changed during the isolation procedure, the customer can select the clocks on all other switches; however, this is not suggested. (The previous clock selection for this switch assembly was recorded for possible use. See ªSelecting appropriate switch clocksº on page 3-6.)

9.The customer can re-initialize the switch using the Estart command. Cables connected to the switch assembly and processor nodes which were restored in this procedure should appear in the out.top file without any error messages.

10.If switch re-initialization was successful, the customer can start running parallel jobs again.

11.Return to MAPs to continue service action(s).

Viewing switch partitions

Open the System Monitor window and click on the ªViewª option to display any partitions.

With switch partitioning, the nodes are grouped under a partition name. Partitioning allows the SP to have different software levels installed, and to isolate groups of nodes from each other. The Estart command, fault service, and applications see only a single partition.

Fencing nodes

1.Click on ªViewº to select one of the partitions, then click on ªSPº to display an options menu.

2.Click on ªGlobal Controlsº to display the frame(s) and node(s) in this partition.

3.Click on the node you want to fence, then click on the ªShutdownº option. Now click on the ªDo Commandº option to display options.

4.Click on ªFenceº and then click on ªShutdownº to shutdown the node.

Note: Selecting ªEnable Autojoinº during this sequence will automatically place the node back into the operations of the partition on successful power on of the Node. It is suggested that ªAutojoinº not be selected at any time when performing a service action.

Unfencing nodes

1.If necessary, follow the first 2 steps under ªFencing Nodesº above.

2.Ensure that the node is selected.

3.Click on ªOnº and then click on ªDo Commandº. This will power on the node and run power-on diagnostics.

4.Click on ªUnfenceº and then click on ªDo Commandº to place the node into operation in the partition.

3-8RS/6000 SP: SP Switch Service Guide

Image 78
Contents IBM Page IBM Fifth Edition April Contents FRU removals and replacements Figures Vi RS/6000 SP SP Switch Service Guide Tables ViiViii RS/6000 SP SP Switch Service Guide Safety and environmental notices Safety noticesRS/6000 SP SP Switch Service Guide SPSFD017 Xii RS/6000 SP SP Switch Service Guide Laser safety information Environmental notices Product recycling and disposalXiv RS/6000 SP SP Switch Service Guide About this book Who should use this bookRelated information How to send your comments Summary of changes Xviii RS/6000 SP SP Switch Service Guide Maintenance Analysis Procedures MAPs Switch description and problem determination MAP Power Green Environment YellowFlashing Errors in /var/adm/SPlogs/css/out.top see Priority Message or condition ActionSP Switch environment MAP Condition ActionSwitch connector resistance values Step SP Switch inner chassis and front chassis cables Priority Component ActionStep 10RS/6000 SP SP Switch Service Guide Step 12RS/6000 SP SP Switch Service Guide SP Switch power MAP SP Switch power MAP PDU-BH-P1 PDU-BH-P2 PDU-BH-P3 PDU-BH-P4 PDU-BH-P5 From From P1 pinSP Switch function MAP EstartEprimary Splstdata -n pg Pg /var/adm/SPlogs/css/out.top NotesSP Switch function MAP Splstdata -s pg Error # Message/condition Description and actionIf the cabling does not match, correct the cabling SDRGetObjects DependentNodeL03-N01 L01-S00-BH-J18E02-S00-BH-J18 E02-N04Priority Message/condition Action Var/adm/SPlogs/css/out.top file haveStep Service Request Number Failing components DescriptionSRN 26RS/6000 SP SP Switch Service Guide Priority Failing component Action 28RS/6000 SP SP Switch Service Guide Step 30RS/6000 SP SP Switch Service Guide Estart Do not perform the cable wrap test Step Do not perform cable wrap test Step Check /var/adm/SPlogs/css/out.top file Locations Naming standard for RS/6000 SP componentsFormat structure Location diagrams of the RS/6000 SP components Major assemblyFront and rear views of RS/6000 SP frame Figure notes2shows a front view of the RS/6000 SP multi-switch frame Front view of 49-inch frame locations. See notes below Frame locations Frame FRAExample E01-FRA-G1 Switch assembly locations Air BaffleConnector details Cable routing10RS/6000 SP SP Switch Service Guide Cable routing in a multi-switch frame F/C 2030/1 12RS/6000 SP SP Switch Service Guide Switch data cables Cable Part Plug from Plug to Number Location14RS/6000 SP SP Switch Service Guide Service procedures Personal ESD requirementsTools and files overview Ifconfig Utility see note Runs on Description FaultserviceWormRTGRc.switch File see note Location Description Rc.switch.logUsing the css.snap script Node supervisor verification Switch supervisor self-testVerification tests using Perspectives Self-test Conditions Pass sequenceSwitch supervisor verification Frame supervisor verificationDetermining the correct switch clock source Selecting appropriate switch clocksSelecting the switch clock source EclockRemoving and restoring switch resources Removing a switch assembly from the active configurationNumber of Logical Master Clock Choice Model Frames Restoring a switch assembly to the active configuration Viewing switch partitionsReplacing a switch assembly from service position Resetting the clock and bootlist after servicing a nodeEfence of primary and primary backup nodes Installing firmware updates on SP nodes Installing adapter microcode packagesAdapter Package Running diagnostics on a switch port FRU removals and replacements Handling static-sensitive devicesProcedures for switch assemblies Removing the switch assemblyReplacing the switch assembly Removing the switch fans Replacing the switch fansRemoving the switch front chassis cable Replacing the switch front chassis cable Removing the switch supervisor cardReplacing the switch supervisor card Removing the switch inner chassis cableReplacing the switch inner chassis cable Removing the switch power cardsReplacing the switch power cards Removing the SP Switch power supply cards8RS/6000 SP SP Switch Service Guide Parts catalog SPS, SPS-8 Switch assembly feature view Assembly Part number Units Description Index 4RS/6000 SP SP Switch Service Guide Inner chassis assembly SPS includes all assembly in shaded Air flow Cable, front chassis Switch cables feature SP Switch Data Cables SPS-8SP Switch External Cables SPS SPS-8 Parts catalog Multi-switch frame F/C 2030/1 Multi-switch frame F/C 2030/1 12RS/6000 SP SP Switch Service Guide Trademarks United Kingdom telecommunications safety requirements European Union EU statementIndustry Canada compliance statement For installations in Japan Electromagnetic interference EMI statement TaiwanRadio protection for Germany 4RS/6000 SP SP Switch Service Guide Index Numerics ESD Switch inner chassis cable, SPS Switch partitioning 4RS/6000 SP SP Switch Service Guide Readers comments ± Wed like to hear from you How satisfied are you that the information in this book isPlease tell us how we can improve this book IBM Page IBM
Related manuals
Manual 108 pages 26.79 Kb

RS/6000 SP specifications

The IBM RS/6000 SP (Scalable Power) is a high-performance computing system that was developed in the early 1990s and has since become a significant player in the realm of supercomputing and enterprise solutions. Designed primarily for scientific, engineering, and complex data analysis tasks, the RS/6000 SP provides powerful processing capabilities, enhanced scalability, and features to support demanding applications.

One of the key characteristics of the RS/6000 SP is its modular architecture, which allows for the addition of multiple nodes. Each node is based on IBM's Power architecture, leveraging the RISC (Reduced Instruction Set Computing) design to achieve high throughput and efficiency. The system can support configurations ranging from a few nodes to hundreds, enabling organizations to scale their computational power as needed based on workload requirements.

The RS/6000 SP employs advanced interconnect technologies, most notably the SP Switch, which ensures high-speed communication between nodes. This interconnect allows for seamless data transfer, ensuring that the system can handle large datasets and complex computations without bottlenecks. The SP's ability to deliver concurrent processing capabilities makes it ideal for parallel computing tasks, including simulations, modeling, and large-scale computations.

In terms of software, the RS/6000 SP was equipped with AIX, IBM's version of the UNIX operating system, which provides a stable and secure environment for enterprise applications. Furthermore, the system supports a vast array of programming languages and tools, including Fortran, C, and C++, along with libraries and frameworks designed for high-performance computing. This compatibility allows developers to optimize their applications to fully utilize the underlying hardware.

Another notable feature of the RS/6000 SP is its extensive support for various workloads, including database management, multi-user applications, and graphical processing. The system was equipped with high-performance graphics capabilities, making it suitable for tasks that require intensive visualization, such as computer-aided design (CAD) and scientific visualization.

Overall, the IBM RS/6000 SP has solidified its role in the high-performance computing landscape, combining advanced technologies, robust scalability, and flexibility to meet the unique demands of research and enterprise environments. Its legacy continues to influence modern computing architectures, embodying IBM’s commitment to innovation and performance in the pursuit of complex problem-solving capabilities.