Q

C – Troubleshooting Kernel and Initialization Issues

C.4.6

InfiniPath ib_ipath Initialization Failure

There may be cases where ib_ipath was not properly initialized. Symptoms of this may show up in error messages from an MPI job or another program. Here is a sample command and error message:

$ mpirun -np 2 -m ~/tmp/mbu13 osu_latency

<nodename>:The link is down

MPIRUN: Node program unexpectedly quit. Exiting.

First, check to be sure that the InfiniPath driver is loaded:

$ lsmod grep ib_ipath

If no output is displayed, the driver did not load for some reason. Try the commands (as root):

#modprobe -v ib_ipath

#lsmod grep ib_ipath

#dmesg grep ipath tail -25

This will indicate whether the driver has loaded. Printing out messages using dmesg may help to locate any problems with ib_ipath.

If the driver loaded, but MPI or other programs are not working, check to see if problems were detected during the driver and InfiniPath hardware initialization with the command:

$ dmesg grep -i ipath

This may generate more than one screen of output. Also, check the link status with the commands:

$ cat /sys/bus/pci/driver/ib_ipath/0?/status_str

These commands are normally executed by the ipathbug-helperscript, but running them separately may help locate the problem.

Refer also to appendix C.9.16 and appendix C.9.8.

C.4.7

MPI Job Failures Due to Initialization Problems

If one or more nodes do not have the interconnect in a usable state, messages similar to the following will occur when the MPI program is started:

userinit: userinit ioctl failed: Network is down [1]: device init failed

userinit: userinit ioctl failed: Fatal Error in keypriv.c(520): device init failed

This could indicate that a cable is not connected, the switch is down, SM is not running, or a hardware error has occurred.

IB6054601-00 D

C-11

Page 85
Image 85
Q-Logic IB6054601-00 D manual InfiniPath ibipath Initialization Failure, MPI Job Failures Due to Initialization Problems

IB6054601-00 D specifications

The Q-Logic IB6054601-00 D is a high-performance InfiniBand adapter card designed for data centers and enterprise applications requiring robust connectivity and low-latency communication. This adapter is part of QLogic's extensive portfolio of networking solutions, catering to the needs of high-performance computing (HPC), cloud computing, and virtualization environments.

One of the standout features of the IB6054601-00 D is its capability to support data transfer rates of up to 56 Gbps. This makes it ideal for applications demanding large bandwidth and quick data processing. The adapter is optimized for RDMA (Remote Direct Memory Access) technology, which allows data to be transferred directly between the memory of different computers without involving the CPU. This reduces latency and CPU overhead, leading to enhanced overall system performance.

The architecture of the IB6054601-00 D includes support for a dual-port design, which offers increased bandwidth, redundancy, and fault tolerance. This dual-port configuration is especially advantageous for environments that require high availability and reliability, such as financial services and mission-critical applications.

The adapter utilizes advanced error detection and correction mechanisms, ensuring that data integrity is maintained during transmission. With features like adaptive routing and congestion management, the IB6054601-00 D is capable of optimizing the handling of data flows, thereby enhancing performance even under heavy loads.

In terms of compatibility, the Q-Logic IB6054601-00 D supports a wide range of operating systems and virtualization technologies, making it easy to integrate into diverse IT environments. It also includes drivers and software packages that facilitate seamless deployment and management.

In addition to high-speed connectivity, the adapter is designed with power efficiency in mind. It adheres to Energy Star regulations, helping organizations lower their operational costs while minimizing their environmental footprint.

Overall, the Q-Logic IB6054601-00 D stands out for its high throughput, low latency, and reliability. Its combination of advanced features and technologies positions it as an excellent choice for organizations looking to enhance their data center performance and maximize the efficiency of their network infrastructure. With the growing demands for faster and more efficient data transfer, solutions like the IB6054601-00 D are essential in meeting the evolving needs of modern enterprises.