Information furnished in this manual is believed to be accurate and reliable. However, QLogic Corporation assumes no
responsibility for its use, nor for any infringements of patents or other rights of third parties which may result from its
use. QLogic Corporation reserves the right to change product specifications at any time without notice. Applications
described in this document for any of these products are for illustrative purposes only. QLogic Corporation makes no
representation nor warranty that such applications are suitable for the specified use without further testing or
modification. QLogic Corporation assumes no responsibility for any errors that may appear in this document.
The QLogic OFED+ Host Software User Guide shows end users how to use the
installed software to setup the fabric. End users include both the cluster
administrator and the Message-Passing Interface (MPI) application programmers,
who have different but overlapping interests in the details of the technology.
For specific instructions about installing the QLogic QLE7340, QLE7342,
QMH7342, and QME7342 PCI Express
InfiniBand
Fabric Software, see the QLogic InfiniBand
®
Adapter Hardware Installation Guide, and the initial installation of the
®
Intended Audience
This guide is intended for end users responsible for administration of a cluster
network as well as for end users who want to use that cluster.
This guide assumes that all users are familiar with cluster computing, that the
cluster administrator is familiar with Linux
programmer is familiar with MPI, vFabrics, SRP, and Distributed SA.
This guide uses the following documentation conventions:
NOTE: provides additional information.
CAUTION!
causing damage to data or equipment.
WARNING!!
causing personal injury.
IB0054606-02 Axv
indicates the presence of a hazard that has the potential of
indicates the presence of a hazard that has the potential of
Preface
License Agreements
Tex t i n blue font indicates a hyperlink (jump) to a figure, table, or section in
this guide, and links to Web sites are shown in underlined blue
example:
Table 9-2 lists problems related to the user interface and remote agent.
See “Installation Checklist” on page 3-6.
. For
For more information, visit www.qlogic.com
Tex t i n bold font indicates user interface elements such as a menu items,
buttons, check boxes, or column headings. For example:
Click the Start button, point to Programs, point to Accessories, and
then click Command Prompt.
Under Notification Options, select the Warning Alarms check box.
Tex t i n Courier font indicates a file name, directory path, or command line
text. For example:
To return to the root directory from anywhere in the file structure:
Type
cd /root and press ENTER.
Enter the following command: sh ./install.bin
Key names and key strokes are indicated with UPPERCASE:
Press CTRL+P.
Press the UP ARROW key.
Tex t i n italics indicates terms, emphasis, variables, or document titles. For
example:
For a complete listing of license agreements, refer to the QLogic
Software End User License Agreement.
.
What are shortcut keys?
To enter the date type mm/dd/yyyy (where mm is the month, dd is the
day, and yyyy is the year).
Topic titles between quotation marks identify related topics either within this
manual or in the online help, which is also referred to as the help system
throughout this document.
License Agreements
Refer to the QLogic Software End User License Agreement for a complete listing
of all license agreements affecting this product.
xviIB0054606-02 A
Technical Support
Customers should contact their authorized maintenance provider for technical
support of their QLogic products. QLogic-direct customers may contact QLogic
Technical Support; others will be redirected to their authorized maintenance
provider. Visit the QLogic support Web site listed in Contact Information for the
latest firmware and software updates.
For details about available service plans, or for information about renewing and
extending your service, visit the Service Program web page at
http://www.qlogic.com/services
Training
QLogic offers training for technical professionals for all iSCSI, InfiniBand® (IB),
and Fibre Channel products. From the main QLogic web page at www.qlogic.com
click the Support tab at the top, and then click Training and Certification on the
left. The QLogic Global Training portal offers online courses, certification exams,
and scheduling of in-person training.
Preface
Technical Support
.
,
Technical Certification courses include installation, maintenance and
troubleshooting QLogic products. Upon demonstrating knowledge using live
equipment, QLogic awards a certificate identifying the student as a certified
professional. You can reach the training professionals at QLogic by e-mail at
training@qlogic.com
Contact Information
QLogic Technical Support for products under warranty is available during local
standard working hours excluding QLogic Observed Holidays. For customers with
extended service, consult your plan for available hours.For Support phone
numbers, see the Contact Support link at support@qlogic.com
Support Headquarters
QLogic Web Site
Technical Support Web Site
Technical Support E-mail
Technical Training E-mail
.
.
QLogic Corporation
4601 Dean Lakes Blvd.
Shakopee, MN 55379 USA
www.qlogic.com
http://support.qlogic.com
support@qlogic.com
training@qlogic.com
IB0054606-02 Axvii
Preface
Technical Support
Knowledge Database
The QLogic knowledge database is an extensive collection of QLogic product
information that you can search for specific solutions. We are constantly adding to
the collection of information in our database to provide answers to your most
urgent questions. Access the database from the QLogic Support Center:
http://support.qlogic.com.
xviiiIB0054606-02 A
1Introduction
How this Guide is Organized
The QLogic OFED+ Host Software User Guide is organized into these sections:
Section 1, provides an overview and describes interoperability.
Section 2, describes how to setup your cluster to run high-performance MPI
jobs.
Section 3, describes the lower levels of the supplied QLogic OFED+ Host
software. This section is of interest to a InfiniBand
Section 4, helps the
best use of the Open MPI implementation. Examples are provided for
compiling and running MPI programs.
Section 5, gives examples for compiling and running MPI programs with
other MPI implementations.
Section 7, describes QLogic Performance Scaled Messaging (PSM) that
provides support for full Virtual Fabric (vFabric) integration, allowing users to
specify InfiniBand
provide a configured Service ID (SID) to target a vFabric.
Section 8, describes dispersive routing in the InfiniBand
congestion hotspots by “sraying” messages across the multiple potential
paths.
Appendix A, describes how to run QLogic’s performance measurement
programs.
Message Passing Interface (MPI) programmer make the
®
Service Level (SL) and Partition Key (PKey), or to
®
cluster administrator.
®
fabric to avoid
(gPXE)
Appendix B, describes SCSI RDMA Protocol (SRP) configuration that allows
the SCSI protocol to run over InfiniBand
usage.
IB0054606-02 A1-1
®
for Storage Area Network (SAN)
1–Introduction
NOTE
Overview
Appendix C, describes two methods the administrator can use to allow users
to submit MPI jobs through batch queuing systems.
Appendix D, provides information for troubleshooting installation, cluster
administration, and MPI.
Appendix E, provides information for troubleshooting the upper layer
protocol utilities in the fabric.
Appendix F, provides instructions for checking write combining and for using
the Page Attribute Table (PAT) and Memory Type Range Registers (MTRR).
Appendix G, contains useful programs and files for debugging, as well as
commands for common tasks.
Appendix H, contains a list of useful web sites and documents for a further
In addition, the QLogic InfiniBand
information on QLogic hardware installation and the QLogic InfiniBand
Software Installation Guide contains information on QLogic software installation.
Overview
The material in this documentation pertains to a QLogic OFED+ cluster. A cluster
is defined as a collection of nodes, each attached to an InfiniBand
through the QLogic interconnect.
The QLogic IB Host Channel Adapters (HCA) are InfiniBand
quad data rate (QDR) adapters (QLE7340, QLE7342, QMH7342, and QME7342)
have a raw data rate of 40Gbps (data rate of 32Gbps). The QLE7340, QLE7342,
QMH7342, and QME7342 adapters can also run in DDR or SDR mode.
The QLogic IB HCA utilize standard, off-the-shelf InfiniBand
cabling. The QLogic interconnect is designed to work with all
InfiniBand
understanding of the InfiniBand
®
Adapter Hardware Installation Guide contains
®
-compliant switches.
®
fabric, and related information.
®
4X adapters. The
®
4X switches and
®
Fabric
®
-based fabric
If you are using the QLE7300 series adapters in QDR mode, a QDR switch
must be used.
QLogic OFED+ software is interoperable with other vendors’ IBTA compliant
InfiniBand
®
adapters running compatible OFED releases. There are several
options for subnet management in your cluster:
1-2IB0054606-02 A
1–Introduction
NOTE
Interoperability
An embedded subnet manager can be used in one or more managed
switches. QLogic offers the QLogic Embedded Fabric Manager (FM) for
both DDR and QDR switch product lines supplied by your IB switch vendor.
A host-based subnet manager can be used. QLogic provides the QLogic
Fabric Manager (FM), as a part of the QLogic InfiniBand
Interoperability
QLogic OFED+ participates in the standard IB subnet management protocols for
configuration and monitoring. Note that:
QLogic OFED+ (including Internet Protocol over InfiniBand
interoperable with other vendors’ InfiniBand
OFED releases.
In addition to supporting running MPI over verbs, QLogic provides a
high-performance InfiniBand
PSM. MPIs run over PSM will not interoperate with other adapters.
See the OpenFabrics web site at www.openfabrics.org for more information
on the OpenFabrics Alliance.
®
Fabric Suite (IFS).
®
®
adapters running compatible
®
-Compliant vendor-specific protocol, known as
(IPoIB)) is
IB0054606-02 A1-3
1–Introduction
Interoperability
1-4IB0054606-02 A
2Step-by-Step Cluster Setup
and MPI Usage Checklists
This section describes how to set up your cluster to run high-performance
Message Passing Interface (MPI) jobs.
Cluster Setup
Perform the following tasks when setting up the cluster. These include BIOS,
adapter, and system settings.
1.Make sure that hardware installation has been completed according to the
instructions in the QLogic InfiniBand
and software installation and driver configuration has been completed
according to the instructions in the QLogic InfiniBand
Installation Guide. To minimize management problems, the compute nodes
of the cluster must have very similar hardware configurations and identical
software installations. See “Homogeneous Nodes” on page 3-37 for more
information.
2.Check that the BIOS is set properly according to the instructions in the
QLogic InfiniBand
3.Set up the Distributed
your virtual fabrics. See “QLogic Distributed Subnet Administration” on
page 3-12
4.Adjust settings, including setting the appropriate MTU size. See “Adapter
and Other Settings” on page 3-38.
5.Remove unneeded services. See “Remove Unneeded Services” on
page 3-39.
6.Disable powersaving features. See “Host Environment Setup for MPI” on
page 3-40.
®
Adapter Hardware Installation Guide.
Subnet Administration (SA) to correctly synchronize
®
Adapter Hardware Installation Guide
®
Fabric Software
7.Check other performance tuning settings. See “Performance Settings and
Management Tips” on page 3-24.
IB0054606-02 A2-1
2–Step-by-Step Cluster Setup and MPI Usage Checklists
Using MPI
8.Set up the host environment to use ssh. Two methods are discussed in
“Host Environment Setup for MPI” on page 3-40.
9.Verify the cluster setup. See “Checking Cluster and Software Status” on
page 3-44.
Using MPI
1.Verify that the QLogic hardware and software has been installed on all the
nodes you will be using, and that ssh is set up on your cluster (see all the
steps in the Cluster Setup checklist).
2.Setup Open MPI. See “Setup” on page 4-2.
3.Compile Open MPI applications. See “Compiling Open MPI Applications” on
page 4-2
4.Create an mpihosts file that lists the nodes where your programs will run.
See “Create the mpihosts File” on page 4-3.
5.Run Open MPI applications. See “Running Open MPI Applications” on
page 4-3.
6.Configure MPI programs for Open MPI. See “Configuring MPI Programs for
Open MPI” on page 4-5
7.To test using other MPIs that run over PSM, such as MVAPICH, MVAPICH2,
Platform MPI, and Intel MPI, see Section 5Using Other MPIs.
8.To switch between multiple versions of MVAPICH, use the mpi-selector.
See “Managing MVAPICH, and MVAPICH2 with the mpi-selector Utility” on
page 5-5.
9.Refer to “Performance Tuning” on page 3-25 to read more about runtime
performance tuning.
10.Refer to Section 5Using Other MPIs to learn about using other MPI
implementations.
2-2IB0054606-02 A
3InfiniBand
InfiniBand®/OpenFabrics
User Verbs
MPI Applications
QLogic OFED+ Driver ib_qib
Kernel Space
uMAD API
User Space
QLogic OFED+
Communication
Library (PSM)
QLogic OFED+
Hardware
TCP/IP
IPoIB
QLogic IB adapter
Platform MPI
MVAPICH
Open MPI
MVAPICH
Open MPI
Intel MPI
QLogic FM
MVAPICH2
Common
Intel MPI
uDAPL
Platform MPI
SRP
MVAPICH2
and Administration
This section describes what the cluster administrator needs to know about the
QLogic OFED+ software and system administration.
Introduction
The IB driver ib_qib, QLogic Performance Scaled Messaging (PSM), accelerated
Message-Passing Interface (MPI) stack, the protocol and MPI support libraries,
and other modules are components of the QLogic OFED+ software. This software
provides the foundation that supports the MPI implementation.
Figure 3-1 illustrates these relationships. Note that HP-MPI, Platform MPI, Intel
MPI, MVAPICH, MVAPICH2, and Open MPI can run either over PSM or
OpenFabrics
®
User Verbs.
®
Cluster Setup
Figure 3-1. QLogic OFED+ Software Structure
IB0054606-02 A3-1
3–InfiniBand® Cluster Setup and Administration
Installed Layout
Installed Layout
This section describes the default installed layout for the QLogic OFED+ software
and QLogic-supplied MPIs.
QLogic-supplied Open MPI, MVAPICH, and MVAPICH2 RPMs with PSM support
and compiled with GCC, PGI, and the Intel compilers are installed in directories
using the following format:
/usr/mpi/<compiler>/<mpi>-<mpi_version>-qlc
For example:
/usr/mpi/gcc/openmpi-1.4-qlc
QLogic OFED+ utility programs, are installed in:
/usr/bin
/sbin
/opt/iba/*
Documentation is found in:
/usr/share/man
/usr/share/doc/infinipath
License information is found only in usr/share/doc/infinipath. QLogic
OFED+ Host Software user documentation can be found on the QLogic web site
on the software download page for your distribution.
Configuration files are found in:
/etc/sysconfig
Init scripts are found in:
/etc/init.d
The IB driver modules in this release are installed in:
The ib_qib module provides low-level QLogic hardware support, and is the base
driver for both MPI/PSM programs and general OpenFabrics protocols such as
IPoIB and sockets direct protocol (SDP). The driver also supplies the Subnet
Management Agent (SMA) component.
The following is a list of the optional configurable OpenFabrics components and
their default settings:
IPoIB network interface. This component is required for TCP/IP networking
for running IP traffic over the IB link. It is not running until it is configured.
OpenSM. This component is disabled at startup. QLogic recommends using
the QLogic Fabric Manager (FM), which is included with the IFS or optionally
available within the QLogic switches. QLogic FM or OpenSM can be
installed on one or more nodes with only one node being the master SM.
SRP (OFED and QLogic modules). SRP is not running until the module is
loaded and the SRP devices on the fabric have been discovered.
MPI over uDAPL (can be used by Intel MPI). IPoIB must be configured
before MPI over uDAPL can be set up.
Other optional drivers can now be configured and enabled, as described in “IPoIB
Network Interface Configuration” on page 3-3.
Complete information about starting, stopping, and restarting the QLogic OFED+
services are in “Managing the ib_qib Driver” on page 3-21.
IPoIB Network Interface Configuration
The following instructions show you how to manually configure your OpenFabrics
IPoIB network interface. QLogic recommends using the QLogic OFED+ Host
Software Installation package or the iba_config tool. For larger clusters,
FastFabric can be used to automate installation and configuration of many nodes.
These tools automate the configuration of the IPoIB network interface. This
example assumes that you are using
QLogic OFED+ and OpenFabric’s RPMs are installed, and your startup scripts
have been run (either manually or at system boot).
sh or bash as your shell, all required
For this example, the IPoIB network is 10.1.17.0 (one of the networks reserved for
private use, and thus not routable on the Internet), with a /8 host portion. In this
case, the netmask must be specified.
IB0054606-02 A3-3
3–InfiniBand® Cluster Setup and Administration
NOTE
IPoIB Network Interface Configuration
This example assumes that no hosts files exist, the host being configured has the
IP address 10.1.17.3, and DHCP is not used.
Instructions are only for this static IP address case. Configuration methods
for using DHCP will be supplied in a later release.
The output of the ping command will be similar to the following, with a line
for each host already configured and connected:
WARNING: pinging broadcast address
PING 10.1.17.255 (10.1.17.255) 517(84) bytes of data.
174 bytes from 10.1.17.3: icmp_seq=0 ttl=174 time=0.022
ms
64 bytes from 10.1.17.1: icmp_seq=0 ttl=64 time=0.070 ms
(DUP!)
64 bytes from 10.1.17.7: icmp_seq=0 ttl=64 time=0.073 ms
(DUP!)
The IPoIB network interface is now configured.
4.Restart (as a root user) by typing:
/etc/init.d/openibd restart
3-4IB0054606-02 A
3–InfiniBand® Cluster Setup and Administration
NOTE
The configuration must be repeated each time the system is rebooted.
IPoIB-CM (Connected Mode) is enabled by default. The setting in
/etc/infiniband/openib.conf is SET_IPOIB_CM=yes. To use
datagram mode, change the setting to
also be changed when asked during initial installation (./INSTALL).
IPoIB Administration
Administering IPoIB
Stopping, Starting and Restarting the IPoIB Driver
IPoIB Administration
SET_IPOIB_CM=no. Setting can
QLogic recommends using the QLogic IFS Installer TUI or iba_config command to
enable autostart for the IPoIB driver. Refer to the QLogic InfiniBand
Software Installation Guide for more information. For using the command line to
stop, start, and restart the IPoIB driver use the following commands.
To stop the IPoIB driver, use the following command:
/etc/init.d/openibd stop
To start the IPoIB driver, use the following command:
/etc/init.d/openibd start
To restart the IPoIB driver, use the following command:
/etc/init.d/openibd restart
Configuring IPoIB
QLogic recommends using the QLogic IFS Installer TUI, FastFabric, or
iba_config command to configure the boot time and autostart of the IPoIB
driver. Refer to the QLogic InfiniBand
information on using the QLogic IFS Installer TUI. Refer to the QLogic FastFabric User Guide for more information on using FastFabric. For using the command line
to configure the IPoIB driver use the following commands.
®
Fabric
®
Fabric Software Installation Guide for more
Editing the IPoIB Configuration File
1.For each IP Link Layer interface, create an interface configuration file,
/etc/sysconfig/network/ifcfg-NAME, where NAME is the value of the
IB0054606-02 A3-5
3–InfiniBand® Cluster Setup and Administration
NOTE
NOTE
IB Bonding
NAME field specified in the CREATE block. The following is an example of the
ifcfg-NAME file:
DEVICE=ib1
BOOTPROTO=static
BROADCAST=192.168.18.255
IPADDR=192.168.18.120
NETMASK=255.255.255.0
ONBOOT=yes
NM_CONTROLLED=no
For IPoIB, the INSTALL script for the adapter now helps the user
create the
2.After modifying the /etc/sysconfig/ipoib.cfg file, restart the IPoIB driver
with the following:
/etc/init.d/openibd restart
ifcfg files.
IB Bonding
IB bonding is a high availability solution for IPoIB interfaces. It is based on the
Linux Ethernet Bonding Driver and was adopted to work with IPoIB. The support
for IPoIB interfaces is only for the active-backup mode, other modes should not be
used. QLogic supports bonding across HCA ports and bonding port 1 and port 2
on the same HCA.
Interface Configuration Scripts
Create interface configuration scripts for the ibX and bondX interfaces. Once the
configurations are in place, perform a server reboot, or a service network restart.
For SLES operating systems (OS), a server reboot is required. Refer to the
following standard syntax for bonding configuration by the OS.
For all of the following OS configuration script examples that set MTU,
MTU=65520 is valid only if all IPoIB slaves operate in connected mode and
are configured with the same value. For IPoIB slaves that work in datagram
mode, use MTU=2044. If the MTU is not set correctly or the MTU is not set
at all (set to the default value), performance of the interface may be lower.
3-6IB0054606-02 A
Loading...
+ 254 hidden pages
You need points to download manuals.
1 point = 1 manual.
You can buy points or you can get point for every manual you upload.