Written by Mark Schwenden
Illustrated by Chrystie Danzer
Production by Mark Schwenden
Engineering contributions by Rich Altmaier, Steve Bowen, Michael T . Brown, Dick Brownell, David Collins, Thomas Crapisi, S teve Dean, Hartmut Gottwald,
Paul Wiley, Gary Meyer, Galen Flunker, Paul Kinyon, Mark Koneazny, William Kellerman, Paul Pedersen, Gregory Thorson, and Gary Spilde.
The software described in this document is "commercial computer software" provided with restricted rights (except as to included open/free source) as specified
in the F AR 52.227-19 and/or the DFAR 227.7202, or successive sections. Use beyond license provis ions is a violation of worldwide intellectual property laws,
treaties and conventions. This document is provided with limited rights as defined in 52.227-14.
The electronic (software) version of this document was developed at private expense; if acquired under an agreement with the USA government or any
contractor thereto, it is acquired as “commercial computer software” subject to the provisions of its applicable license agreement, as specified in (a) 48 CFR
12.212 of the FAR; or, if acquired for Department of Defense units, (b) 48 CFR 227-7202 of the DoD FAR Supplement; or sections succeeding thereto.
Contractor/manufacturer is SGI, 1140 East Arques Avenue Sunnyvale, CA 94085.
TRADEMARKS AND ATTRIBUTIONS
SGI, the SGI logo, and Altix are registered trademarks, and NUMAlink and NUMAflex are trademarks, of SGI, in the United States and/or other countries
worldwide.
Intel and Itanium are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. UNIX is a registered
trademark in the United States and other countries, licensed exclusively through X/Open Company, Ltd.
Linux is a registered trademark of Linus Torvalds.
All other trademarks mentioned herein are the property of their respective owners.
Record of Revision
VersionDescription
-001August 2006,
First release
-002March 2007,
Engineering updates to cover RAID 1, DVD-R/W, 2D graphics and IA2.
007-4857-002iii
Contents
Contents
List of Figures . . . . . . . . . . . . . . . . . . . . . . . . xi
List of Tables . . . . . . . . . . . . . . . . . . . . . . . . xv
Table A-10Pin Assignments for USB Type A Connector . . . . . . . .163
007-4857-002xv
About This Guide
This guide provides an overview of the installation, architecture, general operation and
descriptions of the major components that compose the SGI
provides the standard procedures for powering on and powering off the system, basic
troubleshooting and maintenance information, and important safety and regulatory specifications.
Audience
This guide is written for owners, installers, system administrators, and users of SGI Altix 450
computer systems. It is written with the assumption that the reader has a good working knowledge
of computers and computer systems.
Important Information
W arning: To avoid problems that could void your warranty , your SGI or other approved
system support engineer (SSE) should perform any replacement of parts, or service of your
SGI Altix 450 system not covered in the following list of items that you can perform yourself:
•Unpacking, setting up and connecting the system rack(s) and individual rack units IRU(s).
•Using your system console and your L1 controller to enter commands and perform system
functions such as powering on and powering off, as described in this guide.
®
Altix® 450 family of systems. It also
•Adding and replacing system blades and DIMM memory as described in this guide.
•Adding and replacing PCI, PCI-X and PCI express cards, as described in this guide.
•Replacing fan modules and power supplies in the individual rack unit (IRU) enclosures.
•Adding and replacing DVD or disk drives in the base IA/IA2 (I/O) blade.
•Using the On/Off switch and other switches (the reset and non-maskable interrupt [NMI]
switches) on the system IA or IA2 blade (base I/O).
007-4857-002xvii
About This Guide
•Using the ESI/ops panel (operating panel) on optional mass storage bricks. See the user’s
Chapter Descriptions
The following topics are covered in this guide:
•Chapter 1, “System Installation” guides an installer through the steps of unpacking and
•Chapter 2, “Operation Procedures,” provides instructions for operation of the system
•Chapter 3, “System Overview,” provides environmental and technical information needed to
•Chapter 4, “System Control,” describes the function of the L1 and L2 controllers and
•Chapter 5, “Rack Information,” describes the rack sizes and features.
•Chapter 6, “Maintenance and Upgrade Procedures,” provides instructions for installing or
•Chapter 7, “Troubleshooting and Diagnostics,” provides recommended actions if problems
•Appendix A, “Technical Specifications and Pinouts‚" provides physical, environmental, and
•Appendix B, “Safety Information and Regulatory Specifications‚" lists all regulatory
guide for your optional SGI mass storage enclosure.
positioning the system rack(s). Rack mounting guidelines and basic cabling for the IRUs is
also covered.
including powering on and powering off your system.
properly set up and configure the Altix 450 series system.
provides instructions for operating the controllers.
removing the customer-replaceable components of your system.
occur on your system.
power specifications for your system. Also included are the pinouts for the non-proprietary
connectors.
information related to use of the Altix 450 system in the United States and other countries. It
also provides a list of safety instructions to follow when installing, operating, or servicing
the product.
xviii007-4857-002
Related Publications
The following SGI documents are relevant to the Altix 450 series system:
•SGI L1 and L2 Controller Software User’s Guide
•SGI InfiniteStorage 120 Mass Storage Hardware Topics
(P/N 007-3938-00x)
This guide describes how to use the L1 and L2 controller commands at your system console
to monitor and manage your SGI system.
(P/N 007-4853-00x)
This document provides a hardware overview of the SGI non-RAID 12-drive optional mass
storage enclosure used with Altix 450 systems.
(P/N 007-4718-00x)
This guide discusses the optional SGI reconfigurable application-specific software
computing (RASC) program that delivers scalable, configurable computing elements for the
SGI Altix family of servers and superclusters.
•Man pages (online)
Man pages locate and print the titled entries from the online reference manuals.
You can obtain SGI documentation, release notes, or man pages in the following ways:
•See the SGI Technical Publications Library at http://docs.sgi.com. Various formats are
available. This library contains the most recent and most comprehensive set of online books,
release notes, man pages, and other information.
•The release notes, which contain the latest information about software and documentation in
this release, are in a file named README.SGI in the root directory of the SGI ProPack for
Linux Documentation CD.
•You can also view man pages by typing man <title> on a command line.
SGI systems include a set of Linux man pages, formatted in the standard UNIX “man page” style.
Important system configuration files and commands are documented on man pages. These are
found online on the internal system disk (or on a DVD) and are displayed using the man
command. For example, to display the man page for the xscsidisktest command, type the
following on a command line:
man xscsidisktest
007-4857-002xix
About This Guide
References in the documentation to these pages include the name of the command and the section
number in which the command is found.
For additional information about displaying man pages using the man command, see man(1).
In addition, the apropos command locates man pages based on keywords. For example, to
display a list of man pages that describe disks, type the following on a command line:
apropos disk
For information about setting up and using apropos, see apropos(1).
xx007-4857-002
Conventions
Conventions
The following conventions are used throughout this document:
ConventionMeaning
CommandThis fixed-space font denotes literal items such as commands, files,
routines, path names, signals, messages, and programm in g language
structures.
variableThe italic typeface denotes variable entries and words or concepts being
defined. Italic typeface is also used for book titles.
user inputThis bold fixed-space font denotes literal items that the user enters in
interactive sessions. Output is shown in nonbold, fixed-space font.
[ ]Brackets enclose optional portions of a command or directive line.
...Ellipses indicate that a preceding element can be repeated.
man page(x)Man page section identifiers appear in parentheses after man page names.GUI elementThis font denotes the names of graphical user interface (GUI) elements such
as windows, screens, dialog boxes, menus, toolbars, icons, buttons, boxes,
fields, and lists.
Product Support
SGI provides a comprehensive product support and maintenance program for its products, as
follows:
•If you are in North America, contact the Technical Assistance Center at
+1 800 800 4SGI or contact your authorized service provider.
•If you are outside North America, contact the SGI subsidiary or authorized distributor in
your country.
007-4857-002xxi
About This Guide
Reader Comments
If you have comments about the technical accuracy, content, or organization of this document,
contact SGI. Be sure to include the title and document number of the manual with your comments.
(Online, the document number is located in the front matter of the manual. In printed manuals, the
document number is located at the bottom of each page.)
You can contact SGI in any of the following ways:
•Send e-mail to the following address: techpubs@sgi.com
•Contact your customer service representative and ask that an incident be filed in the SGI
incident tracking system.
•Send mail to the following address:
Technical Publications
SGI
1140 East Arques Avenue, M/S 50-1-946
Sunnyvale, California 94085
SGI values your comments and will respond to them promp tly.
xxii007-4857-002
Chapter 1
1.System Installation
This chapter describes how to unpack and install your Altix 450 system. Specifically, the
following topics are covered:
•“System Installation Overview” on page 2
•“Safety Precautions” on page 3
•“Installing a Rack” on page 6
•“Unpacking and Inspecting Modules” on page 22
•“Installing the IRUs in a Rack” on page 24
•“NUMAlink Cabling the System IRUs” on page 29
•“Cabling the System IRUs to a Power Source” on page 31
If you have arranged to have your system installed and set up by SGI professional services
personnel, you can skip this chapter and go on to Chapter 2, “Operation Procedures”.
007-4857-002 1
1: System Installation
System Installation Overview
In this chapter, different installation instructions are provided for different types of system
installations. Follow the instructions for your type of installation, as follows:
Note: It is assumed that you will be connecting a system console to your system in each
installation case.
•If you have ordered a server system that is already rackmounted, which means that the
modules that compose your system are also already cabled together, follow the instructions
in these sections:
–“Safety Precautions” on page 3
–“Installing a Rack” on page 6
–“Cabling the System IRUs to a Power Source” on page 31
–“Connecting the System Console” on page 36
•If you have ordered un-racked IRU modules with the intention of rackmounting and cabling
your own server system, follow the instructions in these sections:
–“Safety Precautions” on page 3
–“Installing a Rack” on page 6
–“Installing the IRUs in a Rack” on page 24
–“NUMAlink Cabling the System IRUs” on page 29
–“Cabling the System IRUs to a Power Source” on page 31
–“Connecting the System Console” on page 36
2 007-4857-002
Safety Precautions
•If you have ordered either a single- or dual-IRU system that you intend to install on a table
top, follow the instructions in these sections:
–“Safety Precautions” on page 3
–“Unpacking and Inspecting Modules” on page 22
–“Installing the IRUs in a Rack” on page 24
–“Cabling the System IRUs to a Power Source” on page 31
–“Connecting the System Console” on page 36
•If you have your own rack, and have ordered system modules with the intention of
rackmounting and cabling your own system, follow the instructions in these sections:
–“Safety Precautions” on page 3
–“Unpacking and Inspecting Modules” on page 22
–“Installing the IRUs in a Rack” on page 24
–“NUMAlink Cabling the System IRUs” on page 29
–“Cabling the System IRUs to a Power Source” on page 31
Safety Precautions
Before you install an Altix 450 system, you should familiarize yourself with the safety precautions
discussed in the following subsections:
•“Hazard Statements” on page 4
•“ESD Precautions” on page 4
•“Safety Measures” on page 5
–“Connecting the System Console” on page 36
007-4857-002 3
1: System Installation
Hazard Statements
During the installation of your Altix 450 system, be alert for hazard advisory statements with
icons, which signify the following:
•Caution indicates a potentially hazardous situation that, if not avoided, can result in minor
or moderate injury. A caution statement also alerts you to unsafe practices that can result in
equipment damage and/or data corruption. A caution message is accompanied by an icon as
shown in the following example:
ESD Precautions
!
!
Caution:
•Warning indicates a potentially hazardous situation that, if not avoided, could result in
death or serious injury. A warning message is accompanied by icon as shown in the
following example:
Warning:
•Danger indicates an imminently hazardous situation that, if not avoided, will result in death
or serious injury. A danger message is accompanied by the same icon as a warning.
Observe electrostatic discharge (ESD) precautions during the entire installation process to
eliminate possible ESD damage to the equipment. Wear an SGI-approved wrist strap when you
handle an ESD-sensitive device. Connect the wrist strap cord directly to earth ground.
Caution: Observe all ESD precautions. Failure to do so can result in damage to the equipment.
4 007-4857-002
Safety Measures
Safety Precautions
Observe the following safety measures when you install the system:
•Use caution when you remove the system from the shipping crate. Failure to handle the
system carefully can result in personal injury or property damage.
!
Caution: Ensure that the shipping crate is positioned close to its destination before you
unpack the crate.
W arning: Employ a minimum of two people to lift an IRU, to move the IRU(s) from
one location to another , and to install the IRU(s) in a rack. Otherwise, someone could be
seriously injured. Note that IRUs should have blades and power supplies removed to
make the unit easier to lift and position.
•Do not move the system while it is connected to power.
Warning: Keep fingers and conductive tools away from high-voltage areas. Failure
to follow these precautions will result in serious injury or death. Th e high-volt age ar eas
of the system are indicated with high-voltage warning labels.
•Ensure that a qualified electrician has properly installed the power receptacles.
•Set all PDU circuit breakers to the OFF (
cord.
O) position before you plug in the system power
Warning: Use the following guidelines to prevent the rack from toppling over.
Otherwise, people could be seriously injured and/or equipment could be damaged.
•Follow these guidelines to prevent a rack from toppling over:
–Install all equipment in the lowest available position in the rack.
–Use optional seismic tiedowns where appropriate.
–Always site the unit on a level surface.
007-4857-002 5
1: System Installation
Installing a Rack
This section describes how to install a rack that already has your system IRUs rackmounted and
cabled together. The following information is provided:
•“Preinstallation Activities” on page 6
•“Unloading and Moving System Equipment” on page 12
•“Removing a Short Rack from the Shipping Crate” on page 16
•“Unpacking and Installing the Tall Rack” on page 18
•“Positioning and Leveling Racks” on page 21
After you have completed installing your rack, proceed to “Cabling the System IRUs to a Power
Source” on page 31 to continue your installation.
If your system components were not shipped in a rack, skip this section and proceed to
“Unpacking and Inspecting Modules” on page 22 to begin your installation.
Preinstallation Activities
Perform the following preinstallation activities:
•Perform site verifications (see “Site Plan Verification” on page 6)
•Gather appropriate tools to complete the installation (see “Tools Required” on page 7)
•Ensure that the correct power receptacle is installed and properly wired (see “Power
Note: You can perform the preinstallation activities days or weeks before you receive your
system.
Site Plan Verification
Ensure that all site requirements are met before your system arrives. If you have questions about
the site requirements or you would like to order full-size floor templates for your site, contact a
site planning representative by e-mail (site@sgi.com).
Receptacle Verification” on page 8)
6 007-4857-002
Tools Required
Installing a Rack
Table 1-1 lists the tools that you need to complete the installation.
Table 1-1Installation Tools
ToolPart Number Purpose
13-mm wrench7260744Adjust the leveling pads.
13-mm socket (3/8-in. drive)7260726Remove bracket bolts from tall rack shipping crate.
19-mm socket (3/8-in. drive)9470618Remove bolts from short rack shipping crate.
Extension, 6-in. (3/8-in. drive)7260655Used with ratchet and sockets.
Ratchet, reversible (3/8-in. drive)7260755Used with extension and sockets.
Level, 9-in.9470556Level the rack.
007-4857-002 7
1: System Installation
Power Receptacle Verification
Ensure that a qualified technician installs the correct power receptacles. Your Altix 450 system
uses one or two single-phase power receptacles. For North American sites, the single-phase
receptacle is a 30-amp, 200- to 240-volt receptacle that has two phase sockets and one ground
socket. For international sites, the single-phase receptacle is a 32-amp, 200-volt receptacle that has
one phase socket, one neutral socket, and one ground socket. Note that the ground pin is slightly
longer than the other two pins on the North American plug.
Optional three-phase power distribution is available in systems mounted in 42U (tall) racks.
For North American sites, follow these steps to ensure that a single-phase power receptacle is
properly wired:
1.Set the voltmeter to a high AC voltage range.
2.Check the voltage between socket X and socket Y (see Figure 1-1). The meter should read
3.Check the voltage between socket X and the ground socket. The meter should read
between 200 and 240 VAC.
approximately 120 VAC.
4.Check the voltage between socket Y and the ground socket. The meter should read
approximately 120 VAC.
5.Check the voltage between the ground socket and an earth-ground location. The meter
should read 0 VAC.
6.Change the voltmeter to a low-resistance setting.
7.Measure between the ground socket and an appropriate earth-ground location and ensure that
resistance is less than 1 ohm.
8.Repeat steps 1 through 7 for any additional single-phase power receptacles.
Caution: If a voltage reading is incorrect, or if the resistance measured in step 7 is more than 1
!
ohm, contact a site-approved electrician. Do not proceed with the installation.
8 007-4857-002
Pole X
Socket Y
Installing a Rack
Power cord
connector
Ground pole
Pole Y
Ground
socket
Socket X
Receptacle
Figure 1-130-amp Single-phase Power Receptacle for North American Sites
007-4857-002 9
1: System Installation
For international sites, follow these steps to ensure that a single-phase power receptacle is properly
wired:
1.Set the voltmeter to a high AC voltage range.
2.Check the voltage between socket 1 and socket 2 (see Figure 1-2). The meter should read
between 200 and 240 VAC.
3.Check the voltage between socket 1 (line) and the ground socket. The meter should read
between 200 and 240 VAC.
Note: The ground pin is slightly larger in diameter than the other two pins.
4.Check the voltage between socket 2 (neutral) and the ground socket. The meter should read
approximately 0 VAC.
5.Check the voltage between the ground socket and an earth-ground location. The meter
should read 0 VAC.
6.Change the voltmeter to a low-resistance setting.
7.Measure between the ground socket and an appropriate earth-ground location and ensure that
resistance is less than 1 ohm.
8.Repeat steps 1 through 7 for any additional single-phase power receptacles.
Caution: If a voltage reading is incorrect, or if the resistance measured in step 7 is more than 1
!
10 007-4857-002
ohm, contact a site-approved electrician. Do not proceed with the installation.
Pin 1 (line)
Socket 1
(line)
Ground pin
Installing a Rack
Pin 2 (neutral)
Power cord
connector
Ground socket
Socket 2
(neutral)
Receptacle
Figure 1-232-amp Single-phase Power Plug for International Sites
007-4857-002 11
1: System Installation
Unloading and Moving System Equipment
Your Altix 450 system arrives at the site in a pallet-mounted cardboard shipping crate.
For a short rack system, the documentation carton and the accessories carton are packed with the
system. The documentation carton contains the system manuals as well as warranty and licensing
information. The accessories carton contains the I/O, peripheral, and system cables, and any
additional connectors or tools that are required for a specific configuration. Optional storage or
monitors are shipped in a separate carton.
For a tall rack system, the system documentation; accessories; and other optional hardware arrive
in separate cartons.
If you have purchased optional installation support from SGI service skip the following sections
and go on to the next chapter.
This section describes how to unload and transport the rack system to its designated location, as
follows:
•“Unloading the Equipment from the Truck” on page 12
•“Inspecting the Shipping Crate” on page 14
•“Transporting the Shipping Crate” on page 15
Unloading the Equipment from the Truck
If your loading dock is the same height as the transportation vehicle, use a pallet jack to unload
the system from the transportation vehicle. The pallet jack should have 48-in. tines or forks.
Follow any instructions that are printed on the packing crates.
If the loading dock is not the same height as the vehicle, you must provide a forklift or another
approved method to unload the system. You can use a platform or ramp to obtain the desired level
as long as the ramp incline does not exceed a ratio of one unit vertical to six units horizontal. For
more information on site requirements, contact SGI site planning by e-mail (site@sgi.com).
Warning: Use two or more people to prevent computer equipment from rolling off the
transportation vehicle. Failure to do so could result in serious damage to the computer
equipment.
12 007-4857-002
Installing a Rack
If your site does not have a loading dock, arrange for a forklift to unload the system from the
transportation vehicle. Ensure that two or three people are available to help unload the equipment.
Move all crates slowly and carefully.
Figure 1-3 shows the lift openings and dimensions of a tall rack shipping crate. This figure also
shows where to position the pallet jack.
Side view
81.5
(2070.1 mm)
52.0
(1320.8 mm)
Pallet jack position
End view
(1066.8 mm)
Figure 1-3Dimensions of Tall Rack Shipping Crate
42
007-4857-002 13
1: System Installation
(
Figure 1-4 shows the lift openings and dimensions of a short rack shipping crate. This figure also
shows where to position the pallet jack.
1238 mm)
Pallet jack position
Figure 1-4Dimensions of Short Rack Shipping Crate
Inspecting the Shipping Crate
After the system is unloaded from the truck, follow these steps before you unpack it:
1.Ensure that the crates and cartons arrive unopened.
2.Inspect the shipping crate for signs of external damage such as dents, holes, crushed corners,
3.Ensure that the tilt watch has not been tripped.
Side view
48.75
(1068 mm)
and water marks.
42.75
End view
29.25
(743 mm)
4.If the crate is damaged, file a damage claim with the carrier immediately. In addition, notify
your local Customer Support Center (CSC) for any missing, incorrect, or damaged items.
For CSC contact information, see http://www.sgi.com/support/supportcenters.html.
14 007-4857-002
Transporting the Shipping Crate
Use a pallet jack with forks that are 48 in. (122 cm) long or longer to transport the shipping crate
to the designated location. See Figure 1-3 and Figure 1-4 for the crate dimensions and location to
position the pallet jack.
If the crate does not fit through all access doors, you may need to partially disassemble the crate.
Caution: If the system shipping or storage environment is significantly colder than the
!
environment in which it will be installed [40
shipping crate for at least 24 hours at room temperature before you start the installation . This
acclimation prevents damage to the equipment that could result from thermal shock and
condensation.
Installing a Rack
ºF (22 ºC) or greater disparity], leave the rack in its
007-4857-002 15
1: System Installation
Removing a Short Rack from the Shipping Crate
W arning: Be caref ul when you unpack and move the short rack system. Use a pallet jack
to move the rack to the unpacking site. Ensure that the rack remains on a level surface and
that the rack weight remains evenly distributed across the four casters. If you must lift the
casters over an obstacle, such as a door threshold, use proper lifting tech niques and employ
a minimum of two people.
Caution: Do not subject the rack to any unnecessary shocks or vibration while you unpack and
!
install the system.
See Figure 1-5 as you follow these steps; the numbered illustrations correspond to the numbered
steps.
1.Ensure that the temperature of the rack is acclimated to the environment in which you are
installing it and move the crate to the installation location.
2.Remove all external banding and lift the crate’s ramp away from the unit.
3.Remove the cardboard carton top and packing/insulation materials.
4.Prepare to move the rack off the crate using the following steps:
a.Remove the two bolts that hold the pallet’s rear retention bracket in place.
b.Remove the two bolts that hold the rear “gate” of the pallet in place.
c.Remove the two bolts that hold the pallet’s front retention bracket in place.
5.Align the holes in the edge of the ramp with the holes at the rear of the crate and secure the
ramp in place using the two bolts removed from the pallet’s rear gate.
6.Using two people, carefully pull the rack off the crate and roll it down the ramp.
Warning: The maximum weight of the short rack is 750 lbs (340 kg). Use caution when
you roll the rack down the ramp.
16 007-4857-002
Installing a Rack
1
4a4b
Rear mounting
bracket
2
Ramp
3
Cardboard
cover
Foam
cushion
4c
5
6
Front mounting
bracket
Figure 1-5Removing a Short Rack from the Shipping Crate
007-4857-002 17
1: System Installation
Unpacking and Installing the Tall Rack
W arning: In its maximu m configuration, a (42U) tall rack system weighs approximately
1,450 lb (658 kg). Use caution when you unpack and move this rack. A pallet jack is needed
to move the rack prior to unpacking. Ensure that the rack remains on a level surface and
that the rack weight remains evenly distributed across the four casters.
To unpack a tall rack, you will need the following tools:
See Figure 1-6 as you follow these steps; the numbered illustrations correspond to the numbered
steps. Ensure that the temperature of the rack is acclimated to the installation environment and that
the system crate is stable, and in an upright position.
1.Use a sharp cutting tool to remove the bands th at secure the cardboard crate, then remove the
outer packing materials.
2.Remove the cover bag and bungee cushion from the rack.
3.Locate and remove the 8 bolts that secure the wooden frame pieces to the base.
4.The rack’s front and rear doors must be removed before you can remove the rack’s shippi ng
runners.
5.Remove and discard the foam plugs from either end of the rack’s base. At this time you will
need to place a pallet jack under the rack and lift it until the shipping runners are clear of the
floor.
6.Undo and remove the two bolts on the shipping runners at either end of the rack and set the
runners aside.
7.Use the pallet jack to lower the rack to the floor and use at least two people to roll the rack to
the installation site.
Warning: Personal injury and system damage could result if the rack becomes
unbalanced or gains too much momentum when rolled.
Reattach the doors to the front and rear before you move the rack to its designated location (see
Figure 1-7 on page 20).
18 007-4857-002
Installing a Rack
1
2
3
Bungee
cushion
4
COMPACTCOMPACT
5
COMPACT
Foam plug
7
COMPACTCOMPACT
6
COMPACT
COMPACTCOMPACT
COMPACTCOMPACT
Figure 1-6Removing a Tall Rack from the Shipping Crate
007-4857-002 19
1: System Installation
COMPACTCOMPACT
COMPACTCOMPACT
Figure 1-7Reattaching the Tall Rack Doors
20 007-4857-002
Positioning and Leveling Racks
Caution: T o avoid ESD damage to the electronic components, be sure to position the racks before
!
you remove the ESD bags that cover the rack assemblies.
To position and level your system racks, follow these steps:
1.Grasp the rear of each rack and roll the rack to its designated location.
2.Remove the ESD bags.
3.Adjust the leveling bolts of a rack, as shown in Figure 1-8, until the rack is level.
Installing a Rack
Figure 1-8Leveling Bolts
4.If you are installing additional racks, repeat until all of the racks are level.
5.Ensure that the circuit breakers on the power distribution un its are in the OFF (
Then connect the power cords to grounded power outlets. Plugging in the power cords
grounds the racks.
6.It is recommended you secure all racks with optional seismic tie-downs if you are installing
the system in an earthquake zone.
Note: Ta ll and short racks have four threaded holes that are located at the bottom of the rack
(see Figure 1 -9). Use these holes to secure the seismic tie-downs. Note that SGI does not
supply the seismic tie-downs.
007-4857-002 21
O) position.
1: System Installation
T
ts
op view of short rack
Top view of tall rack
Seismic tie-down attachment poin
Casters
Leveling pads
Figure 1-9Seismic Tie-down Attachment Points
Unpacking and Inspecting Modules
This section describes how to unpack and inspect individual IRUs. Follow these instructions if you
plan to install individual modules in a rack or on a table top.
Before unpacking your IRUs, inspect the packaging container for evidence of mishandling during
transit. If the packaging container is damaged, photograph it for reference. After you remove the
contents, keep the damaged container and the packing materials.
Remove the IRU(s) from the packaging container and ensure that all accessories are included.
Inspect the IRU(s) and accessories for damage. If the contents appear damaged, file a damage
claim with the carrier immediately. In addition, notify your local SGI Customer Support Center
(CSC) for any missing, incorrect, or damaged items. For CSC contact information, see
http://www.sgi.com/support/supportcenters.html.
22 007-4857-002
Note: When using the IRU as a stand-alone unit (set on a desk or table top) you must secure the
IRU and its fan module together with connection plates. IRUs mounted in a rack are held in place
by the screws at the front and rear of the unit.
Unpacking and Inspecting Modules
Firmly connect the base module of the IRU to the fan module using these steps:
1.Align the rear connectors of the fan module with the rear of the IRU base module and
carefully slide it in until it fully joins together.
2.Align the right side joining plate screw holes with the eight holes in the two adjacent
modules. See Figure 1-10 for an example.
3.Insert and tighten the eight retention screws that hold the connection plate in place.
4.Repeat the process for mounting the joining plate on the left side of the IRU.
COMPACT
COMPACT
i
n
si
de
TM
inside
TM
Figure 1-10IRU Connection Plate for Desktop Systems
007-4857-002 23
1: System Installation
Installing the IRUs in a Rack
This section describes how to install the IRUs that compose an Altix 450 system in a rack. This
section also describes how to remove the IRUs from a rack. You can use the instructions to
rackmount one or more IRUs in a 20U (short) or 42U (tall) Altix rack.
Caution: Always use at least two people when moving, installing, or removing a rackmounted
!
Determining Space Requirement s
Altix 450 IRU.
This rackmounting information is found in the following sections:
•“Determining Space Requirements” on page 24
•“Determining Where to Attach Rails in a Rack” on page 25
•“Rackmounting IRUs with Shelf Rails” on page 26
Table 1-2 specifies the space requirements when rackmounting an IRU in a 19-inch rack.
Table 1-2IRU Space Requirements
Height(5U) 8.68 inches (22 cm)
Width17.5 inches (44.45 cm)
Depth32.5 inches (82.5 cm) with blower attached
Weight~130 lb (59 kg) max config
Required front clearance for IRU8.25 in. (20.96 cm)
Required rear clearance for IRU10 in. (25.40 cm)
Required side clearance for IRUNo clearance requirement
Required front clearance for rack48 in. (122 cm)
Required rear clearance for rack48 in. (122 cm)
a. Weight will vary slightly depending on the number of blades installed and the amount of DIMMs installed, and
on whether you have one or two disk drives in the IRU.
24 007-4857-002
a
Determining Where to Attach Rails in a Rack
1
U
The base IRU, and any additional IRUs, each require five units (5U) of vertical space within the
rack (one unit is equivalent to 1.75 inches [44.5 mm]). To determine where you should install the
unit’s shelf rails in the rack, you must count mounting holes. Each U contains three mounting
holes; therefore, in the 5U of space that the module occupies, there are 15 mounting holes. The
bottom hole of the 5U space is hole 1. The top mounting hole in the 5U space is hole 15. See
Figure 1-11.
Note: An IRU in the rack is identified by the lowest U number that it occupies. For example, in
Figure 1-11 the IRU’s bottom is in U1 (the first unit within the rack).
Installing the IRUs in a Rack
6th hole
8
7
6
5
4
3
TM
inside
8
7
6
5
4
3
5
2
1
COMPACT
2
1
Figure 1-11Mounting Hole Pattern of Rack Rails
To determine how many mounting holes you must count, use the following formula: 3x(the lowest U number that the module will occupy) - 2. For example, when you want to install the
module in locations U6 through U11, count 16 mounting holes (3 x 6 - 2) starting from the bottom
of the rack. The 16th hole is the first mounting hole of U6.
007-4857-002 25
1: System Installation
Rackmounting IRUs with Shelf Rails
This section describes how to rackmount an IRU using shelf rails. The shelf rails, which are
shipped with the module, support the module within the rack. To attach the shelf rails to the rack,
follow these steps:
1.Locate the slot location in the rack in which you will install the IRU. See “Determining
Where to Attach Rails in a Rack” on page 25 if you are uncertai n.
2.In the slot location from step 1, position the left shelf rail at a 90-degree angle to the front
and rear rack rails (see Figure 1-12).
3.Insert the top-front bracket tab into one of the narrow slots in the front rack rail.
4.Insert the top-rear bracket tab into the narrow slot in the rear rack rail.
Note: Verify that the tabs are the same height to ensure that the shelf is level in the rack.
5.Lower the bottom of the shelf rail until its two lower tabs engage with the wide slots in the
front and rear rack rails. The shelf rail is held in by gravity and the tabs.
6.Repeat these steps for the right shelf rail.
9
8
7
6
5
4
3
2
1
Figure 1-12Installing the Shelf Rails in the Rack
10
9
8
7
6
5
4
3
2
1
26 007-4857-002
Installing the IRUs in a Rack
9
Note: Even with the blades, power supplies and fan module disconnected, the IRU’s base
unit is heavy. Always use two people when installing it into a rack.
7.Working from the front of the rack, set the rear edge of the IRU module on the shelf rails.
Then slide the module into the rack until the module ears are snug against the front rack
rails. See Figure 1-13.
8.Fasten the module to the front rails of the rack with the screws provided. See Figure 1-13.
9.Working from the rear of the rack, align the rear connectors of the fan module with the rear
of the IRU base module and carefully slide it in until it fully connects together.
10. Fasten the fan module to the rear rails of the rack with the screws provided.
8
7
6
5
4
3
2
1
COMPACT
inside
TM
10-32 x 1/2-inch screws
Figure 1-13Securing the Module to the Front of the Rack
007-4857-002 27
1: System Installation
Removing a Rackmounted IRU Module
To remove an IRU that is already mounted in a rack, follow these steps:
1.Power off the module. For instructions on how to power off the module, see “Powering the
System On and Off” on page 38.
2.Disconnect all of the cables at the front of the module.
3.Removing the system blades and power supplies from the IRU is recommended to decrease
the overall weight of the unit. Be sure to place any system blades in a static-free area.
Warning: Components may be hot. To avoid injury, allow the components to cool for
approximately five minutes before you proceed with these instructions.
4.Remove the screws that secure the module to the front rails of the rack.
5.Remove the screws that secure the IRU at the rear of the rack.
Caution: Always use at least two people when installing or removing a rackmounted Altix 450
!
IRU.
6.Use a minimum of two people to slide the unit out the front of the rack. Both lifters need to
place one hand securely under the IRU as it slides out to support the weight of the unit as it
clears the rack.
7.Place the IRU base module on a flat, stable surface.
8.Remove the IRU’s fan module from the rear of the rack and place it nearby.
28 007-4857-002
NUMAlink Cabling the System IRUs
Some configurations of an Altix 450 system will consist of either two or more IRUs. This section
describes how to cable together these multiple units. If your system was shipped with all the cables
pre-installed, go on to the next section.
Note: If your system is made up of a single module, you can skip this section and proceed to
“Cabling the System IRUs to a Power Source” on page 31 for instructions to connect your system
to power.
The primary IRU module (housing the system disk), connects to additional Altix 450 IRUs via the
module’s front NUMAlink connectors. Figure 1-14 shows an example.
Note: Before you connect two Altix 450 IRUs to each other, you must assign unique system ID
numbers to the compute modules, as follows:
1.Designate one of the IRU modules as the master server and the other IRU chassis as the
slave.
NUMAlink Cabling the System IRUs
2.Connect a console to the master IRU (see “Connecting the System Console” on page 36),
and then power on the IRU (see “Powering the System On and Off” on page 38).
3.Use the <slotnumber> L1 command to set the secondary IRU’s ID number to a higher
number than the primary IRU’s ID number . For example, if the master IRU unit’ s L1 prompt
indicates that its ID number is 01 (
should be 02 or higher (
001c01-L1>brick slot 02
brick slot set to 02.
001c02-L1>).
001c01-L1>), then the subsidiary IRU ID number
See the SGI L1 and L2 Controller Software User’s Guide (007-3938-00x) for more
information on L1 commands.
007-4857-002 29
1: System Installation
i
ns
i
de
TM
COMPACTCOMPACT
i
ns
i
de
TM
Figure 1-14NUMAlink Cabling an IRU to an Additional IRU
To connect two Altix 450 compute modules to each other with NUMAlink cables, follow these
steps:
1.Power off both compute modules (see “Powering the System On and Off” on page 38).
2.Connect the NUMAlink cables to the NUMAlink connectors on both IRU modules.
3.Connect a system console to the master IRU (the one with the system disk), as described in
“Connecting the System Console” on page 36.
4.Power on both IRUs as described in “Powering the System On and Off” on page 38.
30 007-4857-002
The console output should indicate that the processors and memory on both IRU modules have
been discovered. If the secondary IRU module has not been discovered, use the reset L1
command to perform a reset of the system.
Note: Use the network L1 command to set the network mode. (See the SGI L1 and L2
Controller Software User’s Guide (007-3938-00x) for more information on L1 commands.)
Cabling the System IRUs to a Power Source
This section describes how to make the following connections to a power source:
•“Connecting an IRU Module to a Power Source (No Rack)” on page 31
•“Connecting System Modules to a Rack Power Source” on page 34
Connecting an IRU Module to a Power Source (No Rack)
Cabling the System IRUs to a Power Source
If you are operating your system on a table top, connect the IRU(s) to an approved electric
receptacle. See Figure 1-15 for the location of the IRU’s power connectors.
007-4857-002 31
1: System Installation
inside
TM
Figure 1-15Power-Supply Connectors on the IRU Module
Figure 1-16 shows the connecting of a dual-IRU system located on either a tabletop or desktop, to
a power source.
32 007-4857-002
P
ower
s
ource
Power
distribution
unit (PDU)
Cabling the System IRUs to a Power Source
COMPACT
i
n
s
i
d
T
M
e
COMPACT
in
s
ide
T
M
Figure 1-16Connecting an Unracked IRU System to a Power Source
007-4857-002 33
1: System Installation
P
s
Connecting System Modules to a Rack Power Source
If your Altix 450 system is shipped from SGI in a rack, there will be at least one power distribution
unit (PDU) installed in the rack. Note that each single-phase PDU can support a maximum of two
Altix 450 IRUs. Plug both power cords from each IRU into the PDU(s) installed in your short or
tall rack.
Figure 1-17 shows an example of cabling the system modules that reside in a rack to a PDU.
Power
distribution
unit (PDU)
COMPACT
i
nsid
T
M
e
COMPACT
i
nside
ower
ource
TM
Components within rack
Figure 1-17Connecting Multiple Modules to a PDU
An optional three-phase PDU (used in 42U system racks only) has 18 outlet connections, of which
up to 16 may be needed to power eight IRUs. Each three-phase rack system houses a single PDU.
34 007-4857-002
Cabling the System IRUs to a Power Source
n
)
Figure 1-18 shows an example of the optional three-phase PDU available in tall racks only.
Power
distributio
unit (PDU
Figure 1-18Optional Three-Phase PDU Example
007-4857-002 35
1: System Installation
Connecting the System Console
This section explains how to attach and establish a connection between a system console and an
Altix 450 system. The system console enables you to perform the following activities:
•Start a Linux console session to set up networking and other configuration parameters.
•Monitor your system by reading the status and error message information that the L1
controller generates.
•Enter L1 controller commands to monitor or change particular system functions. You can,
for example, monitor the speed of fans for a particular module. See the SGI L1 and L2 Controller Software User’s Guide (007-3938-00x) for descriptions of the L1 controller
commands that you can use.
•Power-on or power-off your system.
To attach a PC to your server system, connect a null modem serial cable between the PC and the
Console port of the IRU (see Figure 1-19). To establish a connection between your system and the
system console (PC), follow these steps:
1.Ensure that the L1 controller of the base compute module is powered on. When the base
compute module is connected to a power source, the L1 controller should be powered on.
Note: When powered on, the L1 controller displays L1 running. If it does not display L1
running, check the connection between the base compute module and the power source. In
addition, ensure that the circuit breaker of the power source is on.
2.Power on the system console.
3.Ensure that the system console has the following network settings:
•Baud 38400
•No parity
•8 data bits
•1 stop bit
•Hardware flow control on (RTS/CTS)
4.Type cu -l ttys0 -S38400 for a Linux based console, and then press Enter to
display the L1 prompt. If the console uses another operating system, start a terminal
emulation program as instructed in the console’s user documentation.
36 007-4857-002
Console port
E
Reset button
Connecting the System Console
TM
inside
thernet port
L1 controller
panel
Figure 1-19Location of the L1 Console Port
007-4857-002 37
1: System Installation
Operating Your System
This section describes how to operate your Altix 450 system, as follows:
•“Powering the System On and Off” on page 38
•“L1 Front Panel Controls” on page 44
•“Operating the L1 Controller” on page 45
Powering the System On and Off
This section describes how to power on and power off your system (or an individual IRU) at a
system console, or manually, as follows:
•“Preparing to Power On” on page 38
•“Powering Off at the System Console” on page 42
•“Powering On Manually or With L1 Console” on page 41
•“Powering Off at the System Console” on page 42
•“Powering Off Manually” on page 43
Preparing to Power On
To prepare to power on your system, follow these steps:
1.Confirm that all the cable connections between the IRUs and other system items are
connected securely . If necessary, see “NUMAlink Cabling the System IRUs” on page 29 for
information.
2.Confirm that all the power cables are plugged in securely to a power source. If necessary, see
“Cabling the System IRUs to a Power Source” on page 31 for information.
Note: You must manually power on any optional storage modules by placing the power
switch(es) in the ON (
I) position. Note that you should turn any optional storage module on before
starting the operating system.
38 007-4857-002
Powering On With Optional System Console and L1/L2 Controller
s
To power on and boot your system using the system console, follow these steps:
1.If not yet done so, connect a system console to the server system as described in
“Connecting the System Console” on page 36.
2.If the monitors and other peripherals are equipped with voltage select switches, verify that
they are set for the appropriate AC voltage and plug them in. Note that they are normally
plugged into sources outside the rack system.
3.Turn on the on/off circuit breaker switchs of the PDU as shown in Figure 1-20.
Power
distribution
unit (PDU)
Operating Your System
On/off switche
Power
source
Figure 1-20PDU Circuit Breakers Example
007-4857-002 39
1: System Installation
4.Verify that all of the L1 controllers display “L1 run ning.” If any L1 controllers are not
running, contact your SGI system support engineer (SSE).
5.At the system console, access the L2 controller by entering the following command:
$> /stand/sysco/bin/12term
6.From the L2 prompt, display the system configuration by entering the following command:
L2> cfg
This command lists the IRUs that the L2 controller detects in the system and their system
controller addresses. If a IRU that you want to power on does not appear in the list, it will
not power on. Ensure that the IRU’s L1 controller is running and that the IRU is cabled
properly.
7.From the L2 prompt (L2>), power on an individual IRU by entering the following command.
(If you want to power on the entire system, proceed to Step 8.)
L2> r <rack#> s <slot#> pwr u
For example, to power on a IRU in rack 1, slot 18, enter the following command:
L2> r 1 s 18 pwr u
The slot number is the unit number of the IRU within the rack.
If you want to power on two selected IRUs of a rack at the same time, you must enter the
rack number followed by the slot numbers of the IRUs that you want to power on. For
example, to power on the IRUs in slots 16 and 21, enter the following command:
L2> r 1 s 16,21 pwr u
8.If you want to power on the entire system, enter the following command:
L2> pwr u
(The default setting for the pwr u command is all racks and slots.)
9.When the L2 prompt appears, you will not see the output that is produced during the
power-on procedure unless you redirect the keyboard input from the L2 controller to the
normal console by typing control d.
L2> ctrl d
40 007-4857-002
Powering On Manually or With L1 Console
n
L
If your system does not have a a local system console, you can manually power on and off the
server by using the power button(s).
To power on your system or an individual IRU manually, follow these steps:
1.If the monitors and other peripherals are equipped with voltage select switches, verify that
they are set for the appropriate AC voltage and plug them in. Note that they are normally
plugged into power sources outside the rack system.
2.Turn the circuit breaker switchs of the PDU to the on position, see Figure 1-20 on page 39.
3.Press the power buttons (see Figure 1-21) on each of the IRUs that you want to power on.
Note that any optional storage modules should be powered up first.
inside
Operating Your System
TM
1 controller
display
Power butto
with LED
Figure 1-21Location of the Power Button
T o boot the system from the L1 interface you must have a console connected to the system via the
console port. Use the following commands depending on system type:
1.For a system that contains one IRU, enter the following command at the L1 prompt to power
on the server:
001c01-L1> power up
2.For a system with multiple IRUs, enter the following command at the L1 prompt:
001c01-L1> * power up
007-4857-002 41
1: System Installation
Powering Off at the System Console
To power off your system using the system console, follow these steps:
Caution: If you power off the system before you halt the operating system, you can lose data.
!
1.Shut down the operating system by entering the following command:
# init 0
2.T o access the L2 prompt, direct the keyboard input to the L2 controller by entering Ctrl+T.
$> ctrl t
3.From the L2 prompt (L2>), power off an individual IRU by entering the following
command. (If you want to power off the entire system, proceed to Step 4.)
L2> r <rack#> s <slot#> pwr d
For example, to power off the IRU in rack 1, slot 16, enter the following command:
L2> r 1 s 16 pwr d
The slot number is the unit number of the IRU within the rack.
If you want to power off more than one IRU in the rack at the same time, enter the rack
number followed by the slot numbers of the IRUs that you want to power off. For example,
to power off the IRUs in slots 16 and 21, enter the following command:
L2> r 1 s 16,21 pwr d
4.If you want to power off all of the IRUs within the rack, enter the following command:
L2> pwr d
(The default setting for the pwr d command is all racks and slots.)
“Powered Down” appears on the L1 display when the IRU is powered off. The L1 controller
is still powered on.
5.T o completely power down an L1 controller, unplug the IRU’s power cords from the PDU or
two front power supplies.
42 007-4857-002
Powering Off Manually
!
Operating Your System
To power off your system manually, follow these steps:
Caution: If you power off the system before you halt the operating system, you can lose data.
1.Shut down the operating system by entering the following command:
# init 0
2.Press the power buttons or power switches on each of the IRUs that you want to power off.
You may power off the IRUs in any order:
•To power of f the optional storage units, press the power button(s) on its rear panel to the
OFF (
O) position.
•To power off the IRUs, press the power button with the LED on the front panel of each
unit (see Figure 1-21 on page 41).
007-4857-002 43
1: System Installation
F
L1 Front Panel Controls
The L1 front panel of the Altix 450 IRU provides the following control features, as shown in
Figure 1-22:
•Status LEDs. The front panel has the following LEDs:
•Power button. Normally you power on the system from the system console. Alternatively,
•Reset button. Actuate this button to reset the internal processors and ASICs. Th e reset will
•NMI button. Actuate the NMI (non-maskable interrupt) button to reset the internal
•Power button LED. This LED illuminates green when the internal components are on.
•Service-required LED. This LED illuminates yellow to indicate that an item is not
functioning properly (for example, a fan is off), but the IRU is operating.
•Failure LED. This LED illuminates red to indicate that a failure has occurred and the
module is down.
you can press this button to power on the module.
cause a memory loss. (To perform a reset without losing memory, see the NMI button
information that follows.)
processor(s) and ASICs, and write register data and memory to the /var/adm/crash file.
Use a paperclip or other small object to access the “pin-hole” actuators on the front panel.
L1 controller display
inside
TM
ailure LED
Service-required LED
Power button
with LED
Reset
Figure 1-22L1 Front Panel Functions
44 007-4857-002
NMI
Note: If your system includes an optional storage module, see the user’s guide that comes with
the product for operating instructions.
Operating the L1 Controller
The L1 controller operates in one of the two following modes:
Operating Your System
•L1 mode. The L1 prompt (
001c01-L1>) is visible and all input is directed to the L1
command processor.
•Console mode from L1. Output from the system is visible and all input is directed to the
system.
The L1 controller is ready to accept commands when you see a prompt of the following form:
001c01-L1>
See the SGI L1 and L2 Controller Software User’s Guide (007-3938-00x) for a detailed list of L1
commands.
To enter console mode, press Ctrl+D at the L1 prompt, as follows:
001c01-L1> Ctrl+D
entering console mode 001c01 console, <CTRL-T> to escape to L1
.
<system output appears here>
.
To return to L1 mode, press Ctrl+T, as follows:
Ctrl+T
escaping to L1 system controller
001c01-L1>
At this point, you can enter any L1 command. When the command completes execution, the L1
returns to console mode:
re-entering console mode 001c01 console, <CTRL-T> to escape to L1
T o permanently engage the L1 mode, press Ctrl+T and then enter the l1 command, as follows:
Ctrl+T
007-4857-002 45
1: System Installation
escaping to L1 system controller
001c01-L1> l1
L1 command processor engaged, <CTRL-D> for console mode.
001c01-L1>
46 007-4857-002
Chapter 2
2.Operation Procedures
This chapter explains how to operate your new system in the following sections:
•“Precautions” on page 47
•“System Controller Network” on page 49
•“Powering the System On and Off” on page 51
•“Using Embedded Support Partner (ESP)” on page 56
•“Monitoring Your Server” on pa ge 57
•“Installing Optional Components” on page 59
Precautions
Before operating your system, familiarize yourself with the safety information in the following
sections:
•“ESD Precaution” on page 47
•“Safety Precautions” on page 48
ESD Precaution
Caution: Observe all ESD precautions. Failure to do so can result in damage to the equipment.
W ear an SGI-approved wrist strap when you handle an ESD-sensitive device to eliminate possible
ESD damage to equipment. Connect the wrist strap cord directly to earth ground.
007-4857-00247
2: Operation Procedures
Safety Precautions
!
Warning: Before operating or servicing any part of this product, read the “Safety
Information” on page 16 5.
Danger: Keep fingers and conductive tools away from high-voltage areas. Failure to
follow these precautions will result in serious injury or death. The high-voltage areas of the
system are indicated with high-voltage warning labels.
Caution: Power off the system only after the system software has been shut down in an orderly
manner. If you power of f the system before you halt the operating system, data may be corrupted.
W arning: If a lithium battery is installed in your system as a soldered part, only qualified
SGI service personnel should replace this lithium battery. For a battery of another type,
replace it only with the same type or an equivalent type recommended by the battery
manufacturer, or an explosion could occur. Discard used batteries according to the
manufacturer’s instructions.
48007-4857-002
System Controller Network
All Altix 450 enclosures contain an embedded microprocessor board and display assembly known
as the system controller. This microprocessor runs an embedded version of the Linux operating
system. The system controller runs off standby power and is running as long as the enclosure is
connected to an active power source.
There are two primary applications that run on the system controller. The L1, or Level 1 system
controller, is an application that provides control and monitoring functionality for each individual
rack unit enclosure (IRU), and communication to other L1s in adjacent enclosures connected via
NUMALink 4 cables. The L1 is always resident.
The console mode Level 2 (L2) system controller is an application that provides control over
multiple L1s and communication to other L2s. The L2 is resident and active when the enclosure
is connected by an Ethernet connection to a Local Area Network (LAN).
The system controller network provides the following functionality:
•Powering the entire system on and off.
•Powering individual IRUs on and off.
System Controller Network
•Monitoring the environmental state of the system
•Viewing the system’s status and error message information generated and displayed by the
SGI system’s L1 controller.
•Enter L1 controller commands to monitor or change particular system functions. You can,
for example, monitor the speed of fans for a particular individual rack unit (IRU) enclosure.
See the SGI L1 and L2 Controller Software User’s Guide for a complete list of commands.
•Provides access to the system OS console allowing you to run diagnostics and boot the
system.
007-4857-00249
2: Operation Procedures
Console Connections
A console is defined as a connection to the system that provides access to the system controller
network. A console can be a personal computer (Ethernet connection) or a dumb terminal (serial
connection).
The Altix 450 series supports multiple types of console connections:
•An ethernet connection on the system control board of an IRU - an RJ45 connection is
typically labeled “L2 host”
•A serial connection to the serial console port on the system control board - a DB9 connector
typically labeled “Console”
•A 2D video card installed in the base I/O blade (not available with RHEL 5) - this
low-profile PCI video interface card requires use of a USB keyboard and mouse
Connecting to the System Controller
The ethernet connection is generally the preferred method of accessing the system console.
Depending on the size of the system, there will be one or more ethernet connections utilized.
When an enclosure is connected to the LAN, the system controller will start the ethernet interface
with either the pre-assigned IP address or will attempt to acquire an IP address via DHCP, and will
then spawn the L2 application. See the section on setting up an IP address for further details. Once
the system controller is connected to the LAN, it can be accessed via a simple telnet session. The
serial connection is used to communicate directly with the L1 system controller . This connection
is typically used for service purposes or for system controller and system console access in small
systems where an ethernet connection is not available.
Caution: An L1/L2 network connection must be kept on a private and physically secure network
connection. The System Controller does not provide software security, authentication, or
safeguards against malicious or careless users. The consequences of unauthorized access to the
System Controllers are potential system unavailability, system reconfiguration, and detailed
access to every aspect of the system. The recommended configuration for the L1/L2 network
connection is a separate, private and physically secure network attachment to an equally separate
and physically secure PC. Any Linux system hosting L3 software must be administered to prevent
unauthorized access.Optional L3 software is a package that runs on a Linux based PC. It is mainly
used by service personnel as a diagnostic aid.
50007-4857-002
Communicating with the System Controller
Once a connection to the console is established, the user will be presented with either an L2
prompt (ethernet connection), known as L2 mode, or and L1 prompt (serial connectio n), known
as L1 mode. From either of these prompts various system controller commands can be entered.
001c01-L1> <l1 command>
olympic-101-L2> <l2 command>
To access the system console, known as console mode, a control-d is entered at either the L1
prompt or the L2 prompt. To return to or escape back to the L1 or L2 mode, a control-t is entered.
This escape is only temporary and you will be returned to console mode once the “Return/Enter”
key is pressed. To re-engage L1 or L2 mode, enter control-t followed by either “l1” or “l2”
depending on the original mode. System control commands are always entered in lower case
unless otherwise specified.
Powering the System On and Off
Communicating with the System Controller
This section explains how to power on and power off individual rack units, or your entire Altix
450 system, as follows:
•“Preparing to Power On” on page 52
•“Powering On and Off at the Console” on page 54
For servers with a system console, you can power on and power off individual IRUs or the entire
system at the system console.
You may also monitor and manage your server with tools such as VACM, Console Manager, and
PCP. For details, see the documentation for the particular too l.
The Embedded Support Partner (ESP) program enables you and your SGI system support engineer
(SSE) to monitor your server remotely and resolve issues before they become problems. For
details on this program, see “Using Embedded Support Partner (ESP)” on page 56.
007-4857-00251
2: Operation Procedures
Preparing to Power On
To prepare to power on your system, follow these steps:
1.Check to ensure that the cabling between the rack’s power distribution unit (PDU) and the
wall power-plug receptacle is secure.
2.For each individual IRU that you want to power on, make sure that the power cables are
plugged into all the IRU power supplies correctly, as shown in Figure 2-1. Setting the circuit
breakers on the PDUs to the “On” position will apply power to the IRU and will start the
system controller(s) in the IRUs. Note that the system controller in each IRU stays powered
on as long as there is power coming into the unit.
inside
TM
Figure 2-1IRU Power Supply Cable Location Example
3.If you plan to power on a server that includes optional mass storage enclosures, make sure
that the power switch on the rear of each PSU/cooling module (one or two per enclosure) is
in the
1 (on) position.
4.Make sure that all PDU circuit breaker switches (see the example in Figure 2-2 on page 53)
are turned on to provide power to the server system when it is switched on.
The power-on and of f procedure at a console varies with your server setup, as follows:
•If you have a console connected to a server with a serial interface, you can toggle between
L1 and console mode. This enables you to power on your server with L1 commands and
view the activity by changing to the console mode.
For detailed instructions on using a system console using the L1 mode, see “Operating the
L1” on page 84.
•If you have a system console connected to a server with an ethernet interface, you can toggle
between L2 and console mode, and power on your server with L2 controller commands, and
monitor the power-on activity by changing to the console mode.
Powering On and Off from the L1
Commands issued at the L1 prompt typically only affect the local IRU. The following sections
describe how to power on and power off your system in L1 mode.
The L1 controller display, located on the front of each IRU, should display
L1 running once the
power-on procedure starts (storage modules do not have L1s). The prompt on your console screen
shows the rack and slot number of the IRU to which you have connected your console.
Note: If you have a problem while powering on and an error message appears on your console
display, see “L1 Controller Error Messages” on page 145 to learn what the error message indicates
and how to resolve the problem.
If you want to power on the IRU (001c01 in our example) indicated in the prompt, enter the
following command.
001c01-L1> power up
To power off:
001c01-L1> power down
If you are attempting to power on or power off a system with multiple IRUs from the L1 prompt,
you will need to prepend the command with an asterisk as follows:
oo1c01-L1> * power up
or
54007-4857-002
Powering the System On and Off
oo1c01-L1> * power down
(* indicates all)
From the L1 prompt, display the system configuration information by entering the following
command:
In L1 mode, you can obtain only limited information about the system configuration. An IRU has
information about its internal blades, and also when other IRUs are NUMAlink attached to the
IRU, information about those IRUs.
Note: Verify that the power LED on the L1 display turns on and lights green and that your
controllers display that the system is powered on for each segment of the procedure, which
indicates that the power-on procedure is proceeding properly. If you have a problem while
powering on and an error message appears on the L1 controller, or the system console, see your
online log files and the information in “L1 Controller Error Messages” on page 145 to learn what
the error message indicates and how to best resolve the problem.
007-4857-00255
2: Operation Procedures
Powering-On and Off From the L2
Commands issued at the L2 prompt are typically sent to all NUMAlink connected IRU enclosures
in the system.
To power on a system:
system-101-L2> power up
To power off a system:
system-101-L2> power down
Using Embedded Support Partner (ESP)
Embedded Support Partner (ESP) automatically detects system conditions that indicate potential
future problems and then notifies the appropriate personnel. This enables you and SGI system
support engineers (SSEs) to proactively support systems and resolve issues before they develop
into actual failures.
ESP enables users to monitor one or more systems at a site from a local or remote connection. ESP
can perform the following functions:
•Monitor the system configuration, events, performance, and availability.
•Notify SSEs when specific events occur.
•Generate reports.
ESP also supports the following:
•Remote support and on-site troubleshooting.
•System group management, which enables you to manage an entire group of systems from a
single system.
For additional information on this and other available monitoring services, see the section “SGI
Electronic Support” in Chapter 7.
56007-4857-002
Monitoring Your Server
rt
rt
P
P
You can monitor you r Alt ix 450 server from the following sources:
•On the L1 controller’s display at the front of the IRU as shown in Figure 2-3, you can
monitor system operational status. For example, you can monitor error messages that warn
of power or temperature values that are out of tolerance.
•You can connect and monitor the system via an Ethernet port adapter. You will need to
connect either a local or remote workstation to monitor the system via Ethernet.
These console connections enable you to view the status and error messages generated by
the L1/L2 controllers in your Altix 450 rack. You can also use these consoles to input L1/L2
commands to manage and monitor your system.
•A DVI /VGA video card allows you to interact directly with the system, though not directly
with the system’s L1/L2 interface. See “2D Graphics Video Interface” on page 58 for more
information. SLES 10 or later is required for this card. RHEL 5 is not supported.
Monitoring Your Server
L1 controller
panel
ort
ort
PCI-X 1 PCI-X 0
H
DVDROM
G
HDD0HDD1
ETH0 ETH1
P0PF N0N1
F1F2 F3 F4
A1A2 A3A4
SAS 4XUSB0 USB1 USB2 USB3
TM
inside
Po
G
Po
H
Power
Compute/memory blades
Router
board
IA blade
System control
board
supplies
Router
board
Figure 2-3IRU L1 Controller Display Location Example
007-4857-00257
2: Operation Procedures
2D Graphics Video Interface
If your system was ordered in January 2007 or later, it may come equipped with an optional 2D
graphics board interface (supported with SLES 10 or later system software). This low-profile PCI
interface card is installed in the base I/O blade in the Altix system (see Figure 2-4 on page 59).
Note that this blade is commonly referred to as the IA2 blade. One 2D card is supported per system
or partition; no audio function is supported. The 2D video card has the following features:
•64 MB DDR graphics memory
•DMS-59 display connector for two display support (no synchronization between monitors)
•Simultaneous display to DVI and VGA monitors using an adapter
•Analog resolution up to 2048 x 1536 per display
•DVI resolution up to 1600 x 1200 per display
Note: A single USB keyboard/mouse is supported by the IA2 (base I/O) blade.
The 2D video interface card can be used for all basic interaction with your Altix system. You
should note that it does not provide an interface to the L1/L2 controller and L1/L2 administrative
commands cannot be issued through the 2D video interface.
58007-4857-002
Installing Optional Components
d
DVDROM
PCI-X 0
PCI-X 1
HDD0HDD1
2D graphics car
ETH0 ETH1
P0 PF N0 N1
F1 F2 F3F4
A1 A2 A3 A4
SAS 4XUSB0USB1USB2USB3
Figure 2-42D Graphics Card Location in IA2 (base I/O) Blade
Installing Optional Components
Besides adding a system console, you can add or replace the following hardware items on your
Altix 450 series server:
•Peripheral component interface (PCI) cards into your system I/O blades.
•Disk drives in your IA/IA2 blade (base I/O).
The sections that follow discuss these activities in more detail.
007-4857-00259
2: Operation Procedures
I/O Blade Overview
W arning: You can add or replace only the items listed in this section. For your safety and
for the protection of your server system, contact your SGI system support engineer (SSE) to
install any hardware items not listed in this section.
Warning: Before installing, operating, or servicing any part of this product, read the
“Safety Information” on page 165.
System I/O is primarily accomplished through standard and optional blades in the SGI Altix 450
IRUs. The PCI, PCI-X and PCIe based I/O sub-systems, are industry standard for connecting
peripherals, storage and graphics to a processor blade. These are the primary configurable I/O
system interfaces for the Altix 450 series systems:
•The IA/IA2 blade (base I/O) used in the Altix 450 system includes two half-height
PCI/PCI-X slots. The two option cards install adjacent to the system disk and DVD drives
that are also resident in the IA blade. The IA2 blade differs from the original IA base I/O in
that it supports RAID 1 disk mirroring and DVD-R/W capabilities.
•The optional three-slot PCI/PCI-X double-wide blade holds three sled-mounted PCI/PCI-X
cards for easy insertion and extraction from the system. It is only used in A450 systems that
have more than one IRU.
•The optional single-wide two-slot PCI-X blade supports two PCI or PCI-X option cards.
•The optional double-wide PCI express (PCIe) blade is used only in Altix 450 systems with
two or more IRUs. It houses two PCIe slots and two PCI-X slots. Note that use of two PCIe
(3D) graphics cards in this blade precludes using the two PCI-X slots due to space
limitations.
Not all blades may be available with your system configuration. Check with your SGI sales or
service representative for availability . See Chapter 6, “Maintenance and Upgrade Procedures” for
detailed instructions on installing or removing PCI cards and blades.
60007-4857-002
Adding or Removing Disk Drives
The IA/IA2 blade (base I/O blade) within the IRU supports the system boot functions and contains
one or two low-profile disk drives. See “Installing or Replacing a Disk Drive in the IA Blade” on
page 138 for detailed instructions on installing or removing disk drives.
For information on removing or replacing disk drives in optional mass storage enclosures, see the
hardware document supplied with the storage option.
Reconfigure the System Disk Drive(s) for RAID 1
System disk drive(s) ordered starting in March 2007 may come configured as either a non-RAID
(jbod) or as a RAID 1 (disk mirror) system. Note that while RAID 0 (striping) is supported by the
hardware, SGI recommends using RAID 1 to ensure maximum reliability an d data retention.
The type of disk configuration is ordered by the customer and configured at the SGI factory. If a
requirement exits to reconfigure the system disk(s), use the following information to make the
changes needed. The lsiutil configuration software is the only supported methodology for
modifying the disks at the time this document was published. Use of other software interfaces has
not been verified and cannot be supported.
Installing Optional Components
Important: The example process that follows presumes there are two disks identical in size and
speed installed in your Altix system IA2 (base I/O) blade. Also, note that this example of
re-imaging a disk is done via network access.
The lsiutil file (efi version) resides at /boot/efi
Launch the configuration utility with the command:
fs0:\> lsiutil64.efi
You should see an interactive interface similar to the following:
LSI Logic MPT Configuration Utility, Version 1.xx, November 7, 200X
1 MPT Port found
Port Name Chip Vendor/Type/Rev MPT Rev Firmware Rev IOC
1. 01/00/01/0 LSI Logic SAS1068 B0 105 01100000 0
The SAS1068 is the chip that controls the device that the lsiutil utility was loaded from.
007-4857-00261
Select a device: [1-1 or 0 to quit] 1
1. Identify firmware, BIOS, and/or FCode
2. Download firmware (update the FLASH)
4. Download/erase BIOS and/or FCode (update the FLASH)
8. Scan for devices
10. Change IOC settings (interrupt coalescing)
13. Change SAS IO Unit settings
16. Display attached devices
18. Change WWID
20. Diagnostics
21. RAID actions
22. Reset bus
23. Reset target
24. Clear ACA
39. Force firmware download boot
45. Concatenate SAS firmware and NVDATA files
60. Show non-default settings
61. Restore default settings
97. Reset SAS phy
98. Reset SAS link
99. Reset port
e Enable expert mode in menus
p Enable paged mode in menus
w Enable logging
Main menu, select an option: [1-99 or e/p/w or 0 to quit] 8
Important: After this command is entered, the load device cannot be accessed again to open files
until lsiutil has been exited and restarted.
SAS1068's links are 3.0 G, 3.0 G, down, down, down, down, down, down
B___T___L Type Vendor Product Rev SASAddress PhyNum
0 0 0 Disk SGI ST3146854SS X422 5000c5000002cb45 0
0 1 0 Disk SGI ST3146854SS X422 5000c500000121cd 1
Main menu, select an option: [1-99 or e/p/w or 0 to quit] 0
Port Name Chip Vendor/Type/Rev MPT Rev Firmware Rev IOC
1. 01/00/01/0 LSI Logic SAS1068 B0 105 01100000 0
Select a device: [1-1 or 0 to quit] 1
Installing Optional Components
1. Identify firmware, BIOS, and/or FCode
2. Download firmware (update the FLASH)
4. Download/erase BIOS and/or FCode (update the FLASH)
8. Scan for devices
10. Change IOC settings (interrupt coalescing)
13. Change SAS IO Unit settings
16. Display attached devices
18. Change WWID
20. Diagnostics
21. RAID actions
22. Reset bus
23. Reset target
24. Clear ACA
39. Force firmware download boot
45. Concatenate SAS firmware and NVDATA files
60. Show non-default settings
61. Restore default settings
97. Reset SAS phy
98. Reset SAS link
99. Reset port
e Enable expert mode in menus
p Enable paged mode in menus
w Enable logging
Main menu, select an option: [1-99 or e/p/w or 0 to quit] 21
1. Show volumes
2. Show physical disks
3. Get volume state
23. Replace physical disk
30. Create volume
31. Delete volume
32. Change volume settings
50. Create hot spare
99. Reset port
e Enable expert mode in menus
p Enable paged mode in menus
w Enable logging
RAID actions menu, select an option: [1-99 or e/p/w or 0 to quit] 1
0 volumes are active, 0 physical disks are active
RAID actions menu, select an option: [1-99 or e/p/w or 0 to quit] 30
007-4857-00263
2: Operation Procedures
B___T___L Type Vendor Product Rev Disk Blocks Disk MB
1. 0 0 0 Disk SGI ST3146854SS X422 286749488 140014
2. 0 1 0 Disk SGI ST3146854SS X422 286749488 140014
To create a volume, select two or more of the available targets
Select a target: [1-2 or RETURN to quit] 1
Select a target: [1-2 or RETURN to quit] 2
2 physical disks were created
Select volume type: [0=Mirroring, 1=Striping, default is 0]
Select volume size: [1 to 139898 MB, default is 139898]
Enable write caching: [Yes or No, default is No]
Zero the first and last blocks of the volume? [Yes or No, default is No]
Volume was created
RAID actions menu, select an option: [1-99 or e/p/w or 0 to quit] 1
1 volume is active, 2 physical disks are active
Volume 0 is Bus 0 Target 0, Type IM (Integrated Mirroring)
Volume State: degraded, enabled , resync in progress
Volume Settings: write caching disabled , auto configure
Volume Size 139898 MB, Stripe Size 0 KB, 2 Members
Primary is PhysDisk 0 (Bus 0 Target 56)
Secondary is PhysDisk 1 (Bus 0 Target 1)
RAID actions menu, select an option: [1-99 or e/p/w or 0 to quit] 99
Resetting port...
RAID actions menu, select an option: [1-99 or e/p/w or 0 to quit] 0
Main menu, select an option: [1-99 or e/p/w or 0 to quit] 0
Port Name Chip Vendor/Type/Rev MPT Rev Firmware Rev IOC
1. 01/00/01/0 LSI Logic SAS1068 B0 105 01100000 0
Select a device: [1-1 or 0 to quit] 0
Sas(Pun0,Lun0) LSILOGICLogical Volume 3000
Starting: netboot
ELILO
.
.
.
.
Quietly populating /dev/sda2 and /dev/sda1 with
ia64-propack51-sles10-Jan19
# reboot....
Note: The initial resynchronization of the mirrored disk (depending on the size of the disk) may
take up to 45 minutes or longer.
RAID actions menu, select an option: [1-99 or e/p/w or 0 to quit] 1
007-4857-00265
2: Operation Procedures
1 volume is active, 2 physical disks are active
Volume 0 is Bus 0 Target 0, Type IM (Integrated Mirroring)
Volume State: optimal, enabled
Volume Settings: write caching disabled, auto configure
Volume draws from Hot Spare Pools: 0
Volume Size 139898 MB, Stripe Size 0 KB, 2 Members
Primary is PhysDisk 1 (Bus 0 Target 56)
Secondary is PhysDisk 0 (Bus 0 Target 1)
For information on the two RAID 1 disks, use the lsscsi command as shown in the following
example:
# lsscsi
[1:0:0:0] disk SGI ST3146854SS X422 - /dev/sg2
[1:0:1:0] disk SGI ST3146854SS X422 - /dev/sg3
[1:1:0:0] disk LSILOGIC Logical Volume 3000 /dev/sdc /dev/sg4
This is an example of what you'll see on SLES10 or RHEL5 Linux operating systems. Disk
[1:0:0:0] and [1:0:1:0] are the individual drives in the RAID 1, but you cannot access them through
the disk driver. You access [1:1:0:0], which is /dev/sdc in this case. Caution must be exercised
when entering Linux SCSI generic (sg) commands via the sg devices.
66007-4857-002
Chapter 3
3.System Overview
This chapter provides an overview of the physical and architectural aspects of your SGI Altix 450
system. The major components of the Altix 450 series systems are described and illustrated.
The Altix 450 series is a family of multiprocessor distributed shared memory (DSM) computer
systems that initially scale from 2 to 76 Intel 64-bit processor cores as a cache-coherent single
system image (SSI). Contact your SGI sales or service representative for the most current
information on this topic.
In a DSM system, each processor board contains memory that it shares with the other processors
in the system. Because the DSM system is modular, it combines the advantages of low entry-level
cost with global scalability in processors, memory, and I/O. You can install and operate the Altix
450 series system in your lab or server room. Each 20U SGI rack holds from one to four 5U high
enclosures that support up to five compute/memory and I/O sub modules known as “blades.”
These blades are single printed circuit boards (PCBs) with ASICS, processors, memory
components and I/O chipsets mounted on a mechanical carrier. The blades slide directly into and
out of the Altix 450 IRU enclosures.
This chapter consists of the following sections:
•“System Models” on page 68
•“System Architecture” on page 70
•“System Features” on page 72
•“System Components” on page 76
Figure 3-1 shows the front view of a 20U rack used to house the Altix 450 system.
007-4857-00267
3: System Overview
System Models
Figure 3-1SGI Altix 450 System (Short Rack)
The basic enclosure within the Altix 450 system is the 5U high (8.68 inch or 22 cm) “individual
rack unit” (IRU). The IRU enclosure houses a maximum of four single-wide blades and one
double-wide blade. Each IRU comes with two built-in high-speed routers. The routers connect to
the installed blades via a backplane. Each router has two ports that are brought out to external
NUMAlink 4 connectors. The 20U or 42U rack for this server houses all IRU enclosures, option
modules, and other components; up to a 76-processor core configuration (38 processor sockets) in
a single rack. The Altix 450 server system needs a minimum of one IA/IA2 blade (base I/O).
The rack system requires a minimum of one 20U-high rack with one single-phase power
distribution unit (PDU) installed in the rack. Each single-phase PDU has 5 outlets (to support two
IRUs).
Figure 3 -2 shows an example configuration of a 42U Altix 450 system “tall rack.”
68007-4857-002
System Models
Figure 3-2SGI Altix 450 Server System (Tall Rack)
You can also add additional PCI expansion blades or RAID and non-RAID disk storage to your
server system.
Figure 3-3 shows an individual blade, IRU and Rack.
007-4857-002 69
3: System Overview
Blade
Individual Rack Unit (IRU)
(Contains 5 blades)
Blade slot 0
Figure 3-3Blade, IRU and Rack Components
Blade slot 1
Blade slot 2
Blade slot 3
Blade slot 4
Rack
(Contains 4 IRUs)
TM
inside
TM
inside
TM
inside
TM
inside
TM
inside
System Architecture
The Altix 450 computer system is based on a distributed shared memory (DSM) architecture. The
system uses a global-address-space, cache-coherent multiprocessor that scales up to 76 Intel 64-bit
processor cores in a single rack. Because it is modular, the DSM combines the advantages of lower
entry cost with the ability to scale processors, memory, and I/O independently.
The system architecture for the Altix 450 system is a fourth-generation NUMAflex DSM
architecture known as NUMAlink 4. In the NUMAlink 4 architecture, all processors and memory
are tied together into a single logical system with special crossbar switches (routers). This
combination of processors, memory, and crossbar switches constitute the interconnect fabric
called NUMAlink. There are two internal router switches on each 5U IRU enclosure.
The basic expansion building block for the NUMAlink interconnect is the processor node; each
processor node consists of a Super-Hub (SHub) ASIC and one or two 64-bit processors with three
levels of on-chip secondary caches. The Intel 64-bit processors are connected to the SHub ASIC
via a single high-speed front side bus.
The SHub ASIC is the heart of the processor and memory node blade technology. This specialized
ASIC acts as a crossbar between the processors, local SDRAM memory, and the network
70007-4857-002
System Architecture
interface. The SHub ASIC memory interface enables any processor in the system to access the
memory of all processors in the system.
Another component of the NUMAlink 4 architecture is the router ASIC. The router ASIC is a
custom designed 8-port crossbar ASIC. Using the router ASICs with a highly specialized
backplane and NUMAlink 4 cables provides a high-bandwidth, extremely low-latency
interconnect between all processor, I/O, and other option blades within the system.
Figure 3-4 shows a functional block diagram of the Altix 450 series IRU components.
Blade Slot 0
Left router board
(Slab 0)
NUMAlink
NUMAlink
System control
Figure 3-4Functional Block Diagram of the Individual Rack Unit (IRU)
Blade Slot 1
DEF
rtr
G
H
C
A
BA
Power SupplyPower Supply
Blade Slot 2
Blade Slot 3
Blade Slot 4
rtr
E
B
F
Right router board
BCD
G
HA
(Slab 1)
NUMAlink
NUMAlink
DC
power
bus
007-4857-002 71
3: System Overview
System Features
The main features of the Altix 450 series server systems are introduced in the following sections:
•“Modularity and Scalability” on page 72
•“Distributed Shared Memory (DSM)” on page 72
•“Distributed Shared I/O” on page 73
•“Reliability, Availability, and Serviceability (RAS)” on page 74
Modularity and Scalability
The Altix 450 series systems are modular systems. The components are primarily housed in
building blocks referred to as individual rack units (IRUs). Additional optional mass storage may
be added to the rack along with additional IRUs. You can add different types of blade options to
a system IRU to achieve the desired system configuration. You can easily configure systems
around processing capability, I/O capability, memory size, or storage capacity. You place
individual blades that create the basic functionality (compute/memory, I/O, and power) into IRUs.
The air-cooled IRU enclosure system has redundant, hot-swap fans and redundant, hot-swap
power supplies at the IRU level.
Distributed Shared Memory (DSM)
In the Altix 450 system, memory is physically distributed both within and among the IRU
enclosures (compute/memory/I/O blades); however, it is accessible to and shared by all
NUMAlinked devices within the single-system image. This is to say that all NUMAlinked
components sharing a single Linux operating system operate and share the memory “fabric” of the
system.
Note the following sub-types of memory within a system:
•If a processor accesses memory that is connected to the same SHub ASIC on a compute
node blade, the memory is referred to as the node’s local memory.
•If processors access memory located in other blade nodes within the IRU, (or other
NUMAlinked IRUs) the memory is referred to as remote memory.
•The total memory within the NUMAlinked system is referred to as global memory.
72007-4857-002
Distributed Shared I/O
ccNUMA Architecture
Cache Coherency
System Features
Memory latency is the amount of time required for a processor to retrieve data from me mory.
Memory latency is lowest when a processor accesses local memory.
Like DSM, I/O devices are distributed among the blade nodes within the IRUs (each base I/O
blade node has two NUMAlink ports) and are accessible by all compute nodes within the SSI
through the NUMAlink interconnect fabric.
As the name implies, the cache-coherent non-uniform memory access (ccNUMA) architecture has
two parts, cache coherency and nonuniform memory access, which are discussed in the sections
that follow.
The Altix 450 systems use caches to reduce memory latency. Although data exists in local or
remote memory, copies of the data can exist in various processor caches throughout the system.
Cache coherency keeps the cached copies consistent.
T o keep the copies consistent, the ccNUMA architecture uses directory-based coherence protocol.
In directory-based coherence protocol, each block of memory (128 bytes) has an entry in a table
that is referred to as a directory. Like the blocks of memory that they represent, the directories are
distributed among the compute/memory blade nodes. A block of memory is also referred to as a
cache line.
Each directory entry indicates the state of the memory block that it represents. For example, when
the block is not cached, it is in an unowned state. When only one processor has a copy of the
memory block, it is in an exclusive state. And when more than one processor has a copy of the
block, it is in a shared state; a bit vector indicates which caches contain a copy.
When a processor modifies a block of data, the processors that have the same block of data in their
caches must be notified of the modification. The Altix 450 server series use an invalidation
method to maintain cache coherence. The invalidation method purges all unmodified copies of the
block of data, and the processor that wants to modify the block receives exclusive ownership of
the block.
007-4857-002 73
3: System Overview
Non-uniform Memory Access (NUMA)
In DSM systems, memory is physically located at various distances from the processors. As a
result, memory access times (latencies) are different or “non-uniform.” For example, it takes less
time for a processor blade to reference its locally installed memory than to reference remote
memory.
Reliability, Availability, and Serviceability (RAS)
The Altix 450 server series components have the following features to increase the reliability,
availability, and serviceability (RAS) of the systems.
•Power and cooling:
–IRU power supplies are redundant and can be hot-swapped under most circumstances.
Note that this might not be po ssib le in a “f ully loaded” system. If all the blade positions
are filled, be sure to consult with a service technician before removing a power supply
while the system is running.
–IRUs have overcurrent protection at the blade and power supply level.
–Fans are redundant and can be hot-swapped.
–Fans run at multiple speeds in the IRUs. Speed increases automatically when
temperature increases or when a single fan fails.
•System monitoring:
–System controllers monitor the internal power and temperature of the IRUs, and can
automatically shut down an enclosure to prevent overheating.
–Memory, L2 cache, L3 cache, and all external bus transfers are protected by single-bit
error correction and double-bit error detection (SECDED).
–The NUMAlink interconnect network is protected by cyclic redundancy check (CRC).
–The L1 primary cache is protected by parity.
–Each IRU and each blade/node installed has failure LEDs that indicate the failed part;
LEDs are readable at the front of the IRU or via the system controllers.
–Systems support the optional Embedded Support Partner (ESP), a tool that monitors the
system; when a condition occurs that may cause a failure, ESP notifies the appropriate
SGI personnel.
–Systems support remote console and maintenance activities.
74007-4857-002
System Features
•Power-on and boot:
–Automatic testing occurs after you power on the system. (These power-on self-tests or
POSTs are also referred to as power-on diagnostics or PODs).
–Processors and memory are automatically de-allocated when a self-test failure occurs.
–Boot times are minimized.
•Further RAS features:
–Optional RAID 1 in base I/O (IA2 blade), check with your SGI sales or service
representative for availability. Note that RAID 0 (striping), while supported by the IA2
hardware, is not recommended as a RAS option.
–Systems have a local field-replaceable unit (FRU) analyzer.
–All system faults are logged in files.
–Memory can be scrubbed using error checking code (ECC) when a single-bit error
occurs.
007-4857-002 75
3: System Overview
System Components
The Altix 450 series system features the following major components:
•20U rack. The “short” rack is a custom rack used with the Altix 450 system. It holds up to 4
•42U rack. The “tall” rack is a custom rack used for both the compute and I/O rack in the
•Individual Rack Unit (IRU). This enclosure contains the compute/memory blades, IA
•Compute/Memory blade. Holds up to two IA-64 processor sockets and 4, 8 or 12 memory
•Memory-only blade. This blade acts as a memory expansion node with no processor
•Single-wide PCI-X expansion blade. This two-slo t PCI/PCI-X opt ion blade su pplies an
IRUs or a combination of IRUs and option modules (such as mass storage).
Altix 450 system. Up to 8 IRUs can be installed in each rack. There is also a 2U space
reserved at the top for an option module.
blade (base I/O), standard routers and optional I/O blades for the Altix 450 series systems.
The enclosure is 5U high.
DIMMs.
compute circuitry included on the blade. This blade holds 4, 8, or 12 memory DIMMs.
cards. This three-slot blade features card carriers that allow you to slide PCI/PCI-X boards
directly into and out of the unit.
•Single-wide PCIe expansion blade. The single wide PCI blade supports one or two PCI
Express option cards. Note that when used with a 3D PCIe graphics card, only one can be
inserted due to space constraints.
•Double-wide PCIe/PCI-X expansion blade. This blade supports two PCI Express option
cards and two PCI/PCI-X option cards. Note that when this blade is used with two 3D PCIe
graphics cards the PCI-X slots cannot be used due to space constraints.
•IA/IA2 blade (Base I/O blade). Double-wide I/O blade that supports all base system I/O
functions including one or two disk drives, a DVD drive, two low-profile PCI-X card slots,
two ethernet ports, one SAS/SATAII port, and four USB ports. Optional RAID 1
functionality and DVD-R/W is available with the IA2 version of the base I/O blade only.
Figure 3-5 on page 77 shows the Altix 450 IRU system components.
76007-4857-002
System Components
rt
rt
P
P
L1 controller
panel
ort
ort
Bay (Unit) Numbering
PCI-X 1 PCI-X 0
H
DVDROM
G
HDD0HDD1
ETH0 ETH1
P0PF N0N1
F1F2 F3 F4
A1A2 A3A4
SAS 4XUSB0 USB1 USB2 USB3
TM
inside
Power
Router
board
IA blade
Compute/memory blades
System control
board
supplies
Router
board
Figure 3-5Altix 450 IRU System Components Example
Bays in the racks are numbered using standard units. A standard unit (SU) or unit (U) is equal to
1.75 inches (4.445 cm). Because IRUs occupy multiple standard units, IRU locations within a rack
are identified by the bottom unit (U) in which the IRU resides. For example, in a 42U rack, an IRU
positioned in U01 through U05 is identified as U01.
Po
G
Po
H
Rack Numbering
Each rack is numbered with a three-digit number sequentially beginning with 001. A rack contains
007-4857-002 77
IRU enclosures, optional mass storage enclosures, and potentially other options. In a single
compute rack system, the rack number is always 001.
3: System Overview
Optional System Components
Availability of optional components for the SGI 450 systems may vary based on new prod uct
introductions or end-of-life components. Some options are listed in this manual, others may be
introduced after this document goes to production status. Check with your SGI sales or support
representative for the most current information on available product options not discussed in this
manual.
78007-4857-002
Loading...
+ hidden pages
You need points to download manuals.
1 point = 1 manual.
You can buy points or you can get point for every manual you upload.