BladeCenter HS20 Ty pe 7981 and 1884
Problem Dete rminatio n and Service Guid e
BladeCenter HS20 Ty pe 7981 and 1884
Problem Dete rminatio n and Service Guid e
Note: Before using this information and the product it supports, read the general information in Appendix B, “Notices,” on page 85,
and the Warranty and Support Information document on the IBM BladeCenter Documentation CD.
Second Edition (May 2006)
© Copyright International Business Machines Corporation 2006. All rights reserved.
US Government Users Restricted Rights – Use, duplication or disclosure restricted by GSA ADP Schedule Contract
with IBM Corp.
Contents
Safety . . . . . . . . . . . . . . . . . . . . . . . . . . . . vii
Guidelines for trained service technicians . . . . . . . . . . . . . . . viii
Inspecting for unsafe conditions . . . . . . . . . . . . . . . . . viii
Guidelines for servicing electrical equipment . . . . . . . . . . . . . viii
Safety statements . . . . . . . . . . . . . . . . . . . . . . . .ix
Chapter 1. Introduction . . . . . . . . . . . . . . . . . . . . . .1
Related documentation . . . . . . . . . . . . . . . . . . . . . .1
Notices and statements in this document . . . . . . . . . . . . . . . .2
Features and specifications . . . . . . . . . . . . . . . . . . . . .3
Blade server controls, LEDs, and connectors . . . . . . . . . . . . . .4
Turning on the blade server . . . . . . . . . . . . . . . . . . . . .6
Turning off the blade server . . . . . . . . . . . . . . . . . . . . .6
System-board layouts . . . . . . . . . . . . . . . . . . . . . . .7
System-board connectors . . . . . . . . . . . . . . . . . . . .7
System-board switches . . . . . . . . . . . . . . . . . . . . .7
System-board LEDs . . . . . . . . . . . . . . . . . . . . . .8
Chapter 2. Diagnostics . . . . . . . . . . . . . . . . . . . . . .9
Diagnostic tools . . . . . . . . . . . . . . . . . . . . . . . . .9
POST . . . . . . . . . . . . . . . . . . . . . . . . . . . . .9
POST beep codes . . . . . . . . . . . . . . . . . . . . . . .9
Error logs . . . . . . . . . . . . . . . . . . . . . . . . . .16
POST error codes . . . . . . . . . . . . . . . . . . . . . . .18
Checkout procedure . . . . . . . . . . . . . . . . . . . . . . .24
About the checkout procedure . . . . . . . . . . . . . . . . . .24
Performing the checkout procedure . . . . . . . . . . . . . . . .25
Troubleshooting tables . . . . . . . . . . . . . . . . . . . . . .25
CD or DVD drive problems . . . . . . . . . . . . . . . . . . .26
Diskette drive problems . . . . . . . . . . . . . . . . . . . . .27
General problems . . . . . . . . . . . . . . . . . . . . . . .27
Hard disk drive problems . . . . . . . . . . . . . . . . . . . .28
Intermittent problems . . . . . . . . . . . . . . . . . . . . . .28
Keyboard, mouse, or pointing-device problems . . . . . . . . . . . .29
Memory problems . . . . . . . . . . . . . . . . . . . . . . .30
Microprocessor problems . . . . . . . . . . . . . . . . . . . .30
Monitor or video problems . . . . . . . . . . . . . . . . . . . .31
Network connection problems . . . . . . . . . . . . . . . . . .32
Optional-device problems . . . . . . . . . . . . . . . . . . . .33
Power error messages . . . . . . . . . . . . . . . . . . . . .33
Power problems . . . . . . . . . . . . . . . . . . . . . . .35
ServerGuide problems . . . . . . . . . . . . . . . . . . . . .36
Service processor problems . . . . . . . . . . . . . . . . . . .37
Software problems . . . . . . . . . . . . . . . . . . . . . .37
Universal Serial Bus (USB) port problems . . . . . . . . . . . . . .37
Light path diagnostics . . . . . . . . . . . . . . . . . . . . . .38
Viewing the light path diagnostics LEDs . . . . . . . . . . . . . . .38
Light path diagnostics LEDs . . . . . . . . . . . . . . . . . . .40
Diagnostic programs, messages, and error codes . . . . . . . . . . . .41
Running the diagnostic programs . . . . . . . . . . . . . . . . .41
Diagnostic text messages . . . . . . . . . . . . . . . . . . . .42
Viewing the test log . . . . . . . . . . . . . . . . . . . . . .43
Diagnostic error codes . . . . . . . . . . . . . . . . . . . . .43
© Copyright IBM Corp. 2006 iii
Recovering from a BIOS update failure . . . . . . . . . . . . . . . .48
Service processor (BMC) error codes . . . . . . . . . . . . . . . .49
Solving SAS hard disk drive problems . . . . . . . . . . . . . . . .49
Solving undetermined problems . . . . . . . . . . . . . . . . . . .49
Calling IBM for service . . . . . . . . . . . . . . . . . . . . . .50
Chapter 3. Parts listing, Type 7981 and 1884 . . . . . . . . . . . . .53
Chapter 4. Removing and replacing blade server components . . . . . .55
Installation guidelines . . . . . . . . . . . . . . . . . . . . . .55
System reliability guidelines . . . . . . . . . . . . . . . . . . .56
Handling static-sensitive devices . . . . . . . . . . . . . . . . .56
Returning a device or component . . . . . . . . . . . . . . . . .56
Removing the blade server from a BladeCenter unit . . . . . . . . . . .57
Installing the blade server in a BladeCenter unit . . . . . . . . . . . . .58
Removing and replacing Tier 1 CRUs . . . . . . . . . . . . . . . .59
Removing the blade server cover . . . . . . . . . . . . . . . . .59
Installing the blade server cover . . . . . . . . . . . . . . . . .60
Removing the bezel assembly . . . . . . . . . . . . . . . . . .61
Installing the bezel assembly . . . . . . . . . . . . . . . . . . .61
Removing a SAS hard disk drive . . . . . . . . . . . . . . . . .62
Installing a SAS hard disk drive . . . . . . . . . . . . . . . . . .63
Removing a memory module . . . . . . . . . . . . . . . . . . .63
Installing a memory module . . . . . . . . . . . . . . . . . . .64
Removing a small-form-factor expansion card . . . . . . . . . . . .66
Installing a small-form-factor expansion card . . . . . . . . . . . . .66
Removing a standard-form-factor expansion card . . . . . . . . . . .67
Installing a standard-form-factor expansion card . . . . . . . . . . . .69
Removing the battery . . . . . . . . . . . . . . . . . . . . .70
Installing the battery . . . . . . . . . . . . . . . . . . . . . .70
Removing and replacing FRUs . . . . . . . . . . . . . . . . . . .72
Removing a microprocessor and heat sink . . . . . . . . . . . . . .72
Installing a microprocessor and heat sink . . . . . . . . . . . . . .74
Removing the system board assembly . . . . . . . . . . . . . . .78
Installing the system board assembly . . . . . . . . . . . . . . . .78
Chapter 5. Configuration information and instructions . . . . . . . . .79
Firmware updates . . . . . . . . . . . . . . . . . . . . . . . .79
Using the Configuration/Setup Utility program . . . . . . . . . . . . .79
Configuring the Gigabit Ethernet controllers . . . . . . . . . . . . . .79
Blade server Ethernet controller enumeration . . . . . . . . . . . . . .80
Configuring a SAS RAID . . . . . . . . . . . . . . . . . . . . .81
Appendix A. Getting help and technical assistance . . . . . . . . . .83
Before you call . . . . . . . . . . . . . . . . . . . . . . . . .83
Using the documentation . . . . . . . . . . . . . . . . . . . . .83
Getting help and information from the World Wide Web . . . . . . . . . .83
Software service and support . . . . . . . . . . . . . . . . . . .84
Hardware service and support . . . . . . . . . . . . . . . . . . .84
Appendix B. Notices . . . . . . . . . . . . . . . . . . . . . .85
Edition notice . . . . . . . . . . . . . . . . . . . . . . . . .85
Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . .86
Important notes . . . . . . . . . . . . . . . . . . . . . . . . .86
Product recycling and disposal . . . . . . . . . . . . . . . . . . .87
Battery return program . . . . . . . . . . . . . . . . . . . . . .88
iv BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Electronic emission notices . . . . . . . . . . . . . . . . . . . .89
Federal Communications Commission (FCC) statement . . . . . . . . .89
Industry Canada Class A emission compliance statement . . . . . . . .89
Australia and New Zealand Class A statement . . . . . . . . . . . .89
United Kingdom telecommunications safety requirement . . . . . . . . .89
European Union EMC Directive conformance statement . . . . . . . . .89
Taiwanese Class A warning statement . . . . . . . . . . . . . . .90
Chinese Class A warning statement . . . . . . . . . . . . . . . .90
Japanese Voluntary Control Council for Interference (VCCI) statement . . .90
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . .91
Contents v
vi BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Safety
Before installing this product, read the Safety Information.
Antes de instalar este produto, leia as Informações de Segurança.
Pred instalací tohoto produktu si prectete prírucku bezpecnostních instrukcí.
Læs sikkerhedsforskrifterne, før du installerer dette produkt.
Lees voordat u dit product installeert eerst de veiligheidsvoorschriften.
Ennen kuin asennat tämän tuotteen, lue turvaohjeet kohdasta Safety Information.
Avant d’installer ce produit, lisez les consignes de sécurité.
Vor der Installation dieses Produkts die Sicherheitshinweise lesen.
Prima di installare questo prodotto, leggere le Informazioni sulla Sicurezza.
Les sikkerhetsinformasjonen (Safety Information) før du installerer dette produktet.
Antes de instalar este produto, leia as Informações sobre Segurança.
Antes de instalar este producto, lea la información de seguridad.
Läs säkerhetsinformationen innan du installerar den här produkten.
© Copyright IBM Corp. 2006 vii
Guidelines for trained service technicians
This section contains information for trained service technicians.
Inspecting for unsafe conditions
Use the information in this section to help you identify potential unsafe conditions in
an IBM product on which you are working. Each IBM product, as it was designed
and manufactured, has required safety items to protect users and service
technicians from injury. The information in this section addresses only those items.
Use good judgment to identify potential unsafe conditions that might be caused by
non-IBM alterations or attachment of non-IBM features or options that are not
addressed in this section. If you identify an unsafe condition, you must determine
how serious the hazard is and whether you must correct the problem before you
work on the product.
Consider the following conditions and the safety hazards that they present:
v Electrical hazards, especially primary power. Primary voltage on the frame can
cause serious or fatal electrical shock.
v Explosive hazards, such as a damaged CRT face or a bulging capacitor.
v Mechanical hazards, such as loose or missing hardware.
inspect the product for potential unsafe conditions, complete the following steps:
To
1. Make sure that the power is off and the power cord is disconnected.
2. Make sure that the exterior cover is not damaged, loose, or broken, and
observe any sharp edges.
3. Check the power cord:
v Make sure that the third-wire ground connector is in good condition. Use a
meter to measure third-wire ground continuity for 0.1 ohm or less between
the external ground pin and the frame ground.
v Make sure that the power cord is the correct type, as specified in the
documentation for your BladeCenter unit type.
v Make sure that the insulation is not frayed or worn.
Remove the cover.
4.
5. Check for any obvious non-IBM alterations. Use good judgment as to the safety
of any non-IBM alterations.
6. Check inside the server for any obvious unsafe conditions, such as metal filings,
contamination, water or other liquid, or signs of fire or smoke damage.
7. Check for worn, frayed, or pinched cables.
8. Make sure that the power-supply cover fasteners (screws or rivets) have not
been removed or tampered with.
Guidelines for servicing electrical equipment
Observe the following guidelines when servicing electrical equipment:
v Check the area for electrical hazards such as moist floors, non-grounded power
extension cords, and missing safety grounds.
v Use only approved tools and test equipment. Some hand tools have handles that
are covered with a soft material that does not provide insulation from live
electrical current.
v Regularly inspect and maintain your electrical hand tools for safe operational
condition. Do not use worn or broken tools or testers.
viii BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
v Do not touch the reflective surface of a dental mirror to a live electrical circuit.
The surface is conductive and can cause personal injury or equipment damage if
it touches a live electrical circuit.
v Some rubber floor mats contain small conductive fibers to decrease electrostatic
discharge. Do not use this type of mat to protect yourself from electrical shock.
v Do not work alone under hazardous conditions or near equipment that has
hazardous voltages.
v Locate the emergency power-off (EPO) switch, disconnecting switch, or electrical
outlet so that you can turn off the power quickly in the event of an electrical
accident.
v Disconnect all power before you perform a mechanical inspection, work near
power supplies, or remove or install main units.
v Before you work on the equipment, disconnect the power cord. If you cannot
disconnect the power cord, have the customer power-off the wall box that
supplies power to the equipment and lock the wall box in the off position.
v Never assume that power has been disconnected from a circuit. Check it to
make sure that it has been disconnected.
v If you have to work on equipment that has exposed electrical circuits, observe
the following precautions:
– Make sure that another person who is familiar with the power-off controls is
near you and is available to turn off the power if necessary.
– When you are working with powered-on electrical equipment, use only one
hand. Keep the other hand in your pocket or behind your back to avoid
creating a complete circuit that could cause an electrical shock.
– When using a tester, set the controls correctly and use the approved probe
leads and accessories for that tester.
– Stand on a suitable rubber mat to insulate you from grounds such as metal
floor strips and equipment frames.
Use extreme care when measuring high voltages.
v
v To ensure proper grounding of components such as power supplies, pumps,
blowers, fans, and motor generators, do not service these components outside of
their normal operating locations.
v If an electrical accident occurs, use caution, turn off the power, and send another
person to get medical aid.
Safety statements
Important:
Each caution and danger statement in this documentation begins with a number.
This number is used to cross reference an English-language caution or danger
statement with translated versions of the caution or danger statement in the Safety
Information document.
For example, if a caution statement begins with a number 1, translations for that
caution statement appear in the Safety Information document under statement 1.
Be sure to read all caution and danger statements in this documentation before
performing the instructions. Read any additional safety information that comes with
your server or optional device before you install the device.
Safety ix
Statement 1:
DANGER
Electrical
current from power, telephone, and communication cables is
hazardous.
To avoid a shock hazard:
v Do not connect or disconnect any cables or perform installation,
maintenance, or reconfiguration of this product during an electrical
storm.
v Connect all power cords to a properly wired and grounded electrical
outlet.
v Connect to properly wired outlets any equipment that will be attached to
this product.
v When possible, use one hand only to connect or disconnect signal
cables.
v Never turn on any equipment when there is evidence of fire, water, or
structural damage.
v Disconnect the attached power cords, telecommunications systems,
networks, and modems before you open the device covers, unless
instructed otherwise in the installation and configuration procedures.
v Connect and disconnect cables as described in the following table when
installing, moving, or opening covers on this product or attached
devices.
To Connect: To Disconnect:
1. Turn everything OFF.
2. First, attach all cables to devices.
3. Attach signal cables to connectors.
4. Attach power cords to outlet.
1. Turn everything OFF.
2. First, remove power cords from outlet.
3. Remove signal cables from connectors.
4. Remove all cables from devices.
5. Turn device ON.
x BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Statement 2:
CAUTION:
When replacing the lithium battery, use only IBM Part Number 33F8354 or an
equivalent type battery recommended by the manufacturer. If your system has
a module containing a lithium battery, replace it only with the same module
type made by the same manufacturer. The battery contains lithium and can
explode if not properly used, handled, or disposed of.
Do not:
v Throw or immerse into water
v Heat to more than 100°C (212°F)
v Repair or disassemble
Dispose
Statement 3:
of the battery as required by local ordinances or regulations.
CAUTION:
When laser products (such as CD-ROMs, DVD drives, fiber optic devices, or
transmitters) are installed, note the following:
v Do not remove the covers. Removing the covers of the laser product could
result in exposure to hazardous laser radiation. There are no serviceable
parts inside the device.
v Use of controls or adjustments or performance of procedures other than
those specified herein might result in hazardous radiation exposure.
DANGER
laser products contain an embedded Class 3A or Class 3B laser
Some
diode. Note the following.
Laser radiation when open. Do not stare into the beam, do not view directly
with optical instruments, and avoid direct exposure to the beam.
Safety xi
Statement 4:
≥ 18 kg (39.7 lb) ≥ 32 kg (70.5 lb) ≥ 55 kg (121.2 lb)
CAUTION:
Use safe practices when lifting.
Statement 5:
CAUTION:
The power control button on the device and the power switch on the power
supply do not turn off the electrical current supplied to the device. The device
also might have more than one power cord. To remove all electrical current
from the device, ensure that all power cords are disconnected from the power
source.
2
1
xii BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Statement 8:
CAUTION:
Never remove the cover on a power supply or any part that has the following
label attached.
Hazardous voltage, current, and energy levels are present inside any
component that has this label attached. There are no serviceable parts inside
these components. If you suspect a problem with one of these parts, contact
a service technician.
Statement 10:
CAUTION:
Do not place any object on top of rack-mounted devices.
Safety xiii
xiv BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Chapter 1. Introduction
This Problem Determination and Service Guide contains information to help you
solve problems that might occur in your IBM
1884 blade servers. It describes the diagnostic tools that come with the blade
server, error codes and suggested actions, and instructions for replacing failing
components.
Replaceable components are of three types:
v Tier 1 customer replaceable unit (CRU): Replacement of Tier 1 CRUs is your
responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for
the installation.
v Tier 2 customer replaceable unit: You may install a Tier 2 CRU yourself or
request IBM to install it, at no additional charge, under the type of warranty
service that is designated for your server.
v Field replaceable unit (FRU): FRUs must be installed only by trained service
technicians.
information about the terms of the warranty and getting service and assistance,
For
see the Warranty and Support Information document.
Related documentation
In addition to this document, the following documentation also comes with the
server:
v Installation and User’s Guide
This printed document contains general information about the server, including
how to install supported options and how to configure the server.
v Safety Information
This document is in Portable Document Format (PDF) on the Documentation CD.
It contains translated caution and danger statements. Each caution and danger
statement that appears in the documentation has a number that you can use to
locate the corresponding statement in your language in the Safety Information
document.
v Warranty and Support Information
This document is in PDF on the Documentation CD. It contains information about
the terms of the warranty and about service and assistance.
®
BladeCenter HS20 Type 7981 and
Depending
IBM Documentation CD.
The blade server might have features that are not described in the documentation
that comes with the server. The documentation might be updated occasionally to
include information about those features, or technical updates might be available to
provide additional information that is not included in the blade server
documentation. The most recent versions of all BladeCenter documentation are at
http://www.ibm.com/support/. In addition to the documentation in this library, be sure
to review the IBM BladeCenter Planning and Installation Guide for your BladeCenter
unit type for information to help you prepare for system installation and
configuration. This document is available at http://www.ibm.com/bladecenter/.
© Copyright IBM Corp. 2006 1
on the server model, additional documentation might be included on the
Notices and statements in this document
The caution and danger statements that appear in this document are also in the
multilingual Safety Information document, which is on the IBM Documentation CD.
Each statement is numbered for reference to the corresponding statement in the
Safety Information document.
The following notices and statements are used in this document:
v Note: These notices provide important tips, guidance, or advice.
v Important: These notices provide information or advice that might help you avoid
inconvenient or problem situations.
v Attention: These notices indicate potential damage to programs, devices, or
data. An attention notice is placed just before the instruction or situation in which
damage could occur.
v Caution: These statements indicate situations that can be potentially hazardous
to you. A caution statement is placed just before the description of a potentially
hazardous procedure step or situation.
v Danger: These statements indicate situations that can be potentially lethal or
extremely hazardous to you. A danger statement is placed just before the
description of a potentially lethal or extremely hazardous procedure step or
situation.
2 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Features and specifications
The following table provides a summary of the features and specifications of the
HS20 Type 7981 and 1884 blade servers operating in a non-NEBS environment.
Note: Power, cooling, removable-media drives, external ports, and advanced
system management are provided by the BladeCenter unit.
Microprocessor:
Supports up to two Dual-Core Intel
Xeon Processor LV microprocessors
Note: Use the Configuration/Setup
Utility program to determine the type
and speed of the microprocessors in
your blade server.
Memory:
v Dual channel 400 MHz (DDR2) with
4 dual inline memory module
(DIMM) slots
v Type: 2-way interleaved, DDR2,
PC3200, ECC SDRAM registered
x4 DIMMs
v Supports 512 MB, 1 GB, 2GB, and
4 GB DIMMs (as of the date of this
publication) for up to 16 GB of total
memory.
Drives:
Support for two internal
small-form-factor Serial Attached SCSI
(SAS) drives
Integrated functions:
v Dual-channel gigabit Ethernet
®
controller
v Expansion card interface
v ATI Radeon 7000M video
controller
v LSI 1064 S Serial Attached SCSI
(SAS) controller
v Light path diagnostics
v Local service processor (BMC)
v RS-485 interface for
communication with the
management module
v Automatic server restart (ASR)
v Serial over LAN (SOL)
v Intelligent Platform Management
Interface (IPMI)
v 4 USB buses for communication
with keyboard, mouse, and
removable media drives
Predictive
alerts:
v Microprocessor
v Memory
Failure Analysis
®
(PFA)
Electrical Input: 12 V dc
Environment:
v Air temperature:
– Blade server on: 10° to 35°C (50°
to 95°F). Altitude: 0 to 914 m
(2998.69 ft)
– Blade server on: 10° to 32°C (50°
to 90°F). Altitude: 914 m to 2133
m (3000 ft to 7000 ft)
– Blade server off: -40° to 60°C
(-40° to 140°F)
v
Humidity:
– Blade server on: 8% to 80%
– Blade server off: 5% to 80%
Size:
v Height: 24.5 cm (9.7 inches)
v Depth: 44.6 cm (17.6 inches)
v Width: 2.9 cm (1.14 inches)
v Maximum weight: 5.4 kg (12 lb)
Note: The operating system in the blade server must provide USB support for the
blade server to recognize and use the keyboard, mouse, and removable media
drives. The BladeCenter unit uses USB for internal communications with these
devices.
Chapter 1. Introduction 3
Blade server controls, LEDs, and connectors
This section describes the blade server control panel buttons and LEDs.
Note: The control panel door is shown in the closed (normal) position in the
following illustration. To access the power-control button, you must open the control
panel door.
Activity LED
Location LED
KVM select button
Information LED
Blade-error LED
Media-tray select
buttom
Power-control button
Power-on LED
KVM (keyboard/video/mouse) select button: Press this button to associate the
shared BladeCenter unit keyboard, video, and mouse ports with the blade server.
The LED on this button flashes while the request is being processed then is lit
when the ownership of the KVM has been transferred to the blade server. It can
take approximately 20 seconds to switch the KVM control to the blade server.
Using a keyboard that is directly attached to the management-module, you can
press keyboard keys in the following sequence to switch KVM control between
blade servers:
NumLock NumLock blade_server_number Enter
Where blade_server_number is the two-digit number for the blade bay in which
the blade server is installed.
If there is no response when you press the KVM select button, you can use the
management-module Web interface to determine whether local control has been
disabled on the blade server.
Note:
1. The operating system in the blade server must provide USB support for the
blade server to recognize and use the keyboard and mouse, even if the
keyboard and mouse have PS/2-style connectors.
2. If you install a supported Microsoft Windows operating system on the blade
server while it is not the current owner of the keyboard, video, and mouse, a
delay of up to 1 minute occurs the first time that you switch the keyboard, video,
and mouse to the blade server. During this one-time-only delay, the blade server
device manager enumerates the keyboard, video, and mouse and loads the
device drivers. All subsequent switching takes place in the normal KVM
switching time frame (up to 20 seconds).
4 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Media-tray select button: Press this button to associate the shared BladeCenter
unit media tray (removable-media drives and USB ports) with the blade server. The
LED on the button flashes while the request is being processed, then is lit when the
ownership of the media tray has been transferred to the blade server. It can take
approximately 20 seconds for the operating system in the blade server to recognize
the media tray. If there is no response when you press the media-tray select button,
you can use the management-module Web interface to determine whether local
control has been disabled on the blade server.
Note: The operating system in the blade server must provide USB support for the
blade server to recognize and use the removable-media drives and USB ports.
Activity LED: When this green LED is lit, it indicates that there is activity on the
hard disk drive or network.
Location LED: When this blue LED is lit, it has been turned on by the system
administrator to aid in visually locating the blade server. The location LED on the
BladeCenter unit will be lit also. The location LED can be turned off through the
management-module Web interface or through the management console that is
running IBM Director Console.
Information LED: When this amber LED is lit, it indicates that information about a
system error for the blade server has been placed in the management-module
event log. The information LED can be turned off through the management-module
Web interface or through the management console that is running IBM Director
Console.
Blade-error LED: When this amber LED is lit, it indicates that a system error has
occurred in the blade server. The blade-error LED will turn off only after the error is
corrected.
Power-control button: This button is behind the control panel door. Press this
button to turn on or turn off the blade server.
Note: The power-control button has effect only if local power control is enabled for
the blade server. Local power control is enabled and disabled through the
management-module Web interface.
Power-on LED: This green LED indicates the power status of the blade server in
the following manner:
v Flashing rapidly: The service processor (BMC) on the blade server is initializing
with the management-module.
v Flashing slowly: The blade server has power but is not turned on.
v Lit continuously: The blade server has power and is turned on.
Chapter 1. Introduction 5
Turning on the blade server
After you connect the blade server to power through the BladeCenter unit, the blade
server can start in any of the following ways:
v You can press the power-control button on the front of the blade server (behind
the control panel door, see “Blade server controls, LEDs, and connectors” on
page 4) to start the blade server.
Notes:
1. Wait until the power-on LED on the blade server flashes slowly before
pressing the blade server power-control button. During this time, the
management-module is initializing; therefore, the power-control button on the
blade server does not respond.
2. While the blade server is powering-up, the power-on LED on the front of the
server is lit. See “Blade server controls, LEDs, and connectors” on page 4 for
the power-on LED states.
If a power failure occurs, the BladeCenter unit and then the blade server can
v
start automatically when power is restored (if the blade server is configured
through the management-module to do so).
v You can turn on the blade server remotely by means of the management-module.
v If the operating system supports the Wake on LAN
power-on LED is flashing slowly, the Wake on LAN feature can turn on the blade
server, if the Wake on LAN feature has not been disabled through the
management-module.
®
feature and the blade server
Turning off the blade server
When you turn off the blade server, it is still connected to power through the
BladeCenter unit. The blade server can respond to requests from the service
processor, such as a remote request to turn on the blade server. To remove all
power from the blade server, you must remove it from the BladeCenter unit.
Shut down the operating system before you turn off the blade server. See the
operating-system documentation for information about shutting down the operating
system.
The blade server can be turned off in any of the following ways:
v You can press the power-control button on the blade server (behind the control
panel door, see “Blade server controls, LEDs, and connectors” on page 4). This
also starts an orderly shutdown of the operating system, if this feature is
supported by the operating system.
Note: After turning off the blade server, wait at least 5 seconds before you press
the power-control button to turn on the blade server again.
v If the operating system stops functioning, you can press and hold the
power-control button for more than 4 seconds to turn off the blade server.
v The management-module can turn off the blade server.
6 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
System-board layouts
The following illustrations show the connectors, LEDs, switches, and jumpers on the
system board. The illustrations in this document might differ slightly from your
hardware.
System-board connectors
The following illustration shows the connectors on the system board.
I/O expansion
option
DIMM 1
DIMM 2
DIMM 3
DIMM 4
Microprocessor 1
and heat sink
Microprocessor 2
and heat sink option
System-board switches
The following illustration shows the location of the switch block (SW2) on the
system board.
The following table defines the function of each switch in the switch block (SW2).
Switch Function
SW2-1 BIOS backup
SW2-2 Wake On LAN
SW2-3 Reserved
SW2-4 Power-on override
SW2-5 Reserved
Battery
SAS hard disk drive 0
SAS hard disk drive 1
Control-panel
connector
Switch block (SW2)
Chapter 1. Introduction 7
Switch Function
SW2-6 Clear CMOS
SW2-7 Reserved
SW2-8 Bypass power-on password
System-board LEDs
The following illustration shows the LEDs on the system board. You must remove
the blade server from the BladeCenter unit, open the cover, and press the light path
diagnostics switch to light any error LEDs that were turned on during processing.
DIMM 3 Error LED
(CR5)
DIMM 4 Error LED
(CR6)
I/O expansion
connectors
Light path diagnostics (SW4)
DIMM 1 error LED (CR201)
DIMM 2 error LED (CR4)
Microprocessor 1 error LED (CR12)
SAS 1 hard
Reserved
disk drive LED
(CR262)
Control panel
connector
Microprocessor 2 error LED (CR13)
3V lithium battery
SAS 0 Hard disk drive LED (CR260)
SAS trays/connectors
The following illustration shows the light path diagnostics panel on the system
board.
NMI
MIS
S BRD
TEMP
NMI error LED
Microprocessor mismatch error LED
System-board error LED
Over temperature error LED
Light path diagnostics LED
Light path diagnostics switch
8 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Chapter 2. Diagnostics
This chapter describes the diagnostic tools that are available to help you solve
problems that might occur in the server.
If you cannot locate and correct the problem using the information in this chapter,
see Appendix A, “Getting help and technical assistance,” on page 83 for more
information.
Diagnostic tools
The following tools are available to help you diagnose and solve hardware-related
problems:
v POST beep codes, error messages, and error logs
The power-on self-test (POST) generates beep codes and messages to indicate
successful test completion or the detection of a problem. See “POST” for more
information.
v Troubleshooting tables
These tables list problem symptoms and actions to correct the problems. See
“Troubleshooting tables” on page 25 for more information.
v Light path diagnostics
Use the light path diagnostics to diagnose system errors quickly. See “Light path
diagnostics” on page 38 for more information.
v Diagnostic programs, messages, and error codes
The diagnostic programs are the primary method of testing the major
components of the blade server. These programs are stored in read-only memory
(ROM) on the blade server. See “Diagnostic programs, messages, and error
codes” on page 41 for more information.
POST
When you turn on the blade server, it performs a series of tests to check the
operation of the blade server components and some optional devices in the blade
server. This series of tests is called the power-on self-test, or POST.
If a power-on password is set, you must type the password and press Enter, when
prompted, for POST to run.
If POST is completed without detecting any problems, a single beep sounds, and
the blade server startup is completed.
If POST detects a problem, more than one beep might sound, or an error message
is displayed. See “Beep code descriptions” on page 10 and “POST error codes” on
page 18 for more information.
POST beep codes
A beep code is a combination of short or long beeps or a series of short beeps that
are separated by pauses. For example, a “1-2-3” beep code is one short beep, a
pause, two short beeps, a pause, and three short beeps. A beep code other than
one beep indicates that POST has detected a problem. To determine the meaning
of a beep code, see “Beep code descriptions” on page 10. If no beep code sounds,
see “No-beep symptoms” on page 16.
© Copyright IBM Corp. 2006 9
Beep code descriptions
The following table describes the beep codes and suggested actions to correct the
detected problems.
A single problem might cause more than one error message. When this occurs,
correct the cause of the first error message. The other error messages usually will
not occur the next time POST runs.
Exception: If there are multiple error codes or light path diagnostics LEDs that
indicate a microprocessor error, the error might be in a microprocessor or in a
microprocessor socket. See “Microprocessor problems” on page 30 for information
about diagnosing microprocessor problems.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Beep code Description Action
1-1-2 Microprocessor register test failed
1-1-3 CMOS write/read test failed.
1-1-4 BIOS ROM checksum failed.
1-2-1 Programmable interval timer failed. (Trained service technician only) Replace the
1. Reseat the following components:
a. (Trained service technician only)
Microprocessor 2
b. (Trained service technician only)
Microprocessor 1
Replace the following components one at a
2.
time, in the order shown, restarting the
blade server each time:
a. (Trained service technician only)
Microprocessor 2
b. (Trained service technician only)
Microprocessor 1
c. (Trained service technician only) System
board assembly
1. Reseat the battery
2. Replace the following components one at a
time, in the order shown, restarting the
blade server each time:
a. Battery
b. (Trained service technician only) System
board assembly
1. Update the BIOS code.
2. Reseat the DIMMs.
3. Replace the following components one at a
time, in the order shown, restarting the
blade server each time:
a. DIMMs
b. (Trained service technician only) System
board assembly
system board assembly.
10 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Beep code Description Action
1-2-2 DMA initialization failed. (Trained service technician only) Replace the
system board assembly.
1-2-3 DMA page register write/read failed. (Trained service technician only) Replace the
system board assembly.
1-2-4 RAM refresh verification failed.
1. Reseat the DIMMs.
2. Replace the following components one at a
time, in the order shown, restarting the
blade server each time:
a. DIMMs
b. (Trained service technician only) System
board assembly
1-3-1 First 64K RAM test failed.
1. Reseat the DIMMs.
2. Replace the following components one at a
time, in the order shown, restarting the
blade server each time:
a. DIMMs
b. (Trained service technician only) System
board assembly
1-3-2 First 64K RAM parity test failed.
1. Reseat the DIMMs.
2. Replace the following components one at a
time, in the order shown, restarting the
blade server each time:
a. DIMMs
b. (Trained service technician only) System
board assembly
2-1-1 Secondary DMA register failed. (Trained service technician only) Replace the
system board assembly.
2-1-2 Primary DMA register failed. (Trained service technician only) Replace the
system board assembly.
2-1-3 Primary interrupt mask register failed. (Trained service technician only) Replace the
system board assembly.
2-1-4 Secondary interrupt mask register failed. (Trained service technician only) Replace the
system board assembly.
2-2-1 Interrupt vector loading failed. (Trained service technician only) Replace the
system board assembly.
Chapter 2. Diagnostics 11
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Beep code Description Action
2-2-2 Keyboard controller failed.
1. Reseat the keyboard.
2. Check the management-module
functionality (see the documentation for
your BladeCenter unit).
3. Replace the following components one at a
time, in the order shown, restarting the
blade server each time:
a. Keyboard
b. (Trained service technician only) System
board assembly
2-2-3 CMOS power failure and checksum
checks failed.
1. Reseat the battery.
2. Replace the following components one at a
time, in the order shown, restarting the
blade server each time:
a. Battery
b. (Trained service technician only) System
board assembly
2-2-4 CMOS configuration information
validation failed.
1. Reseat the battery.
2. Replace the following components one at a
time, in the order shown, restarting the
blade server each time:
a. Battery
b. (Trained service technician only) System
board assembly
2-3-1 Screen initialization failed. (Trained service technician only) Replace the
system board assembly.
2-3-2 Screen memory failed. (Trained service technician only) Replace the
system board assembly.
2-3-3 Screen retrace failed. (Trained service technician only) Replace the
system board assembly.
2-3-4 Search for video ROM failed. (Trained service technician only) Replace the
system board assembly.
2-4-1 Video failed; screen believed operable. (Trained service technician only) Replace the
system board assembly.
2-4-4 Unsupported memory configuration.
1. Check the DIMM error LEDs on the blade
server.
2. Replace noncompatible or failing DIMMs in
the blade server.
3-1-1 Timer tick interrupt failed. (Trained service technician only) Replace the
system board assembly.
3-1-2 Interval timer channel 2 failed. (Trained service technician only) Replace the
system board assembly.
12 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Beep code Description Action
3-1-3 RAM test failed above address 0FFFFh.
1. Reseat the DIMMs.
2. Replace the following components one at a
time, in the order shown, restarting the
blade server each time:
a. DIMMs
b. (Trained service technician only) System
board assembly
3-1-4 Time-of-day clock failed.
1. Reseat the battery.
2. Replace the following components one at a
time, in the order shown, restarting the
blade server each time:
a. Battery
b. (Trained service technician only) System
board assembly
3-2-1 Serial port failed. (Trained service technician only) Replace the
system board assembly.
3-2-2 Parallel port failed (Trained service technician only) Replace the
system board assembly.
3-2-3 Math coprocessor test failed
1. (Trained service technician only) Reseat the
microprocessor.
2. Replace the following components one at a
time, in the order shown, restarting the
blade server each time:
a. (Trained service technician only)
Microprocessor
b. (Trained service technician only) System
board assembly
3-2-4 Failure comparing CMOS memory size
against actual
1. Reseat the following components:
a. DIMMs
b. Battery
Replace the following components one at a
2.
time, in the order shown, restarting the
blade server each time:
a. DIMMs
b. Battery
c. (Trained service technician only) System
board assembly
Chapter 2. Diagnostics 13
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Beep code Description Action
3-3-1 Memory size mismatch occurred.
1. Verify that both DIMMs in the bank are of
the same size, speed, type, and technology.
2. Reseat the following components:
a. DIMMs
b. Battery
Replace the following components one at a
3.
time, in the order shown, restarting the
blade server each time:
a. DIMMs
b. Battery
c. (Trained service technician only) System
board assembly
3-3-2 Critical SMBUS error occurred.
1. Power down the blade server and reseat it
in the BladeCenter unit.
2. Reseat the DIMMs.
3. Replace the following components one at a
time, in the order shown, restarting the
blade server each time:
a. DIMMs
b. (Trained service technician only) System
board assembly
3-3-3 No operational memory in system.
Note: In some memory configurations,
the 3-3-3 beep code might sound during
POST followed by a blank display
screen. If this occurs and the Boot Fail
Count feature in the Start Options of the
Configuration/Setup Utility is set to
Enabled (its default setting), you must
restart the server three times to force the
system BIOS code to reset the memory
connector or bank of connectors from
Disabled to Enabled.
Important: In some memory configurations,
the 3-3-3 beep code might sound during POST
followed by a blank display screen. If this
occurs and the Boot Fail Count feature in the
Start Options of the Configuration/Setup Utility
program is set to Enabled (its default setting),
you must restart the blade server three times to
force the system BIOS to reset the CMOS
values to the default configuration (memory
connector or bank of connectors enabled).
1. Install or reseat DIMMS and restart the
server.
2. Replace the following components one at a
time, in the order shown, restarting the
blade server each time:
a. DIMMs
b. (Trained service technician only) System
board assembly
Two short beeps Information only, configuration has
changed.
1. Run diagnostics.
2. Run the Configuration/Setup Utility program.
14 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Beep code Description Action
Three short beeps Memory error.
1. Reseat the DIMMs.
2. Replace the following components one at a
time, in the order shown, restarting the
blade server each time:
a. DIMMs
b. (Trained service technician only) System
board assembly
One continuous beep Microprocessor error.
1. Reseat the following components:
a. (Trained service technician only)
Microprocessor 1
b. (Trained service technician only)
Microprocessor 2
Replace the following components one at a
2.
time, in the order shown, restarting the
blade server each time:
a. (Trained service technician only)
Microprocessor 1
b. (Trained service technician only)
Microprocessor 2
c. (Trained service technician only) System
board assembly
Repeating short beeps Keyboard error.
1. Reseat the keyboard.
2. Replace the following components one at a
time, in the order shown, restarting the
blade server each time:
a. Keyboard
b. (Trained service technician only) System
board assembly
Chapter 2. Diagnostics 15
No-beep symptoms
The following table describes situations in which no beep code sounds when POST
is completed.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
No-beep symptom Action
No beep and the blade server operates correctly (Trained service technician only) Replace the system
board assembly.
No beep and no video (system-error LED is off) See “Solving undetermined problems” on page 49.
No beep and no video (system attention LED is lit) See “Light path diagnostics” on page 38.
Error logs
The BMC log contains all system status messages from the blade server service
processor. The management-module event log in your BladeCenter unit contains
messages that were generated on each blade server during POST and status
messages from the BladeCenter service processor. (See the documentation for your
management-module for more information.)
The following illustration shows an example of a BMC log entry.
BMC System Event Log
---------------------------------------------------------Get Next Entry
Get Previous Entry
Clear BMC SEL
Entry Number= 00005 / 00011
Record ID= 0005
Record Type= 02
Timestamp= 2005/01/25 16:15:17
Entry Details: Generator ID= 0020
Sensor Type= 04
Assertion Event
Fan
Threshold
Lower Non-critical - going high
Sensor Number= 40
Event Direction/Type= 01
Event Data= 52 00 1A
Important:
v A single problem might cause several error messages. When this occurs, work to
correct the cause of the first error message. After you correct the cause of the
first error message, the other error messages usually will not occur the next time
you run the test.
v The management-module event log in your BladeCenter unit lists messages
according to the position of the blade server in the blade bays. If a blade server
is moved from one bay to another, the management-module event log will report
16 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
messages for that blade server using the new bay number; messages for that
blade server that were generated before the move will still be listed using the
previous bay number.
BMC log is limited in size. When the log is full, new entries will not overwrite
The
existing entries; therefore, you must periodically clear the BMC log through the
Configuration/Setup Utility program (the menu choices are described in the
Installation and User’s Guide ). When you are troubleshooting an error, be sure to
clear the BMC log so that you can find current errors more easily.
Entries that are written to the BMC log during the early phase of POST show an
incorrect date and time as the default time stamp; however, the date and time are
corrected as POST continues.
Each BMC log entry appears on its own page. To display all the data for an entry,
use the Up Arrow (↑ ) and Down Arrow (↓ ) keys or the Page Up and Page Down
keys. To move from one entry to the next, select Get Next Entry or Get Previous
Entry .
The BMC log indicates an assertion event when an event has occurred. It indicates
a deassertion event when the event is no longer occurring.
Some of the error codes and messages in the BMC log are abbreviated.
You can view the contents of the BMC log from the Configuration/Setup Utility
program and from the diagnostic programs.
When you are troubleshooting PCI-X slots, note that the error logs report the PCI-X
buses numerically. The numerical assignments vary depending on the configuration.
You can check the assignments by running the Configuration/Setup Utility program
(see the Installation and User’s Guide for more information).
Viewing the BMC log from the Configuration/Setup Utility
program
For complete information about using the Configuration/Setup Utility program, see
the Installation and User’s Guide .
To view the BMC log, complete the following steps:
1. Turn on the blade server.
2. When the prompt Press F1 for Configuration/Setup appears, press F1. If you
have set a power-on password, you must type the password and press Enter to
start the Configuration/Setup Utility program.
3. Select Advanced Settings , select Baseboard Management Controller (BMC)
settings, and then select BMC System Event Log.
Viewing the BMC log from the diagnostic programs
The BMC log contains the same information, whether it is viewed from the
Configuration/Setup Utility program or from the diagnostic programs.
For information about using the diagnostic programs, see “Running the diagnostic
programs” on page 41.
To view the BMC log, complete the following steps:
1. If the blade server is running, turn off the blade server.
2. Turn on the blade server.
Chapter 2. Diagnostics 17
3. When the prompt F2 for Diagnostics appears, press F2.
4. From the top of the screen, select Hardware Info .
5. From the list, select BMC Log .
For complete information about using the Configuration/Setup Utility program, see
the Installation and User’s Guide .
POST error codes
The following table describes the POST error codes and suggested actions to
correct the detected problems.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error code Description Action
062 Three consecutive startup failures
101, 102, 106 System and processor error
1. Run the Configuration/Setup Utility program,
select Load Default Settings , make sure that the
date and time are correct, and save the settings.
2. Reseat the following components one at a time,
in the order shown, restarting the blade server
each time:
a. (Trained service technician only) Battery
b. (Trained service technician only)
Microprocessor 1
c. (Trained service technician only)
Microprocessor 2
Replace the following components one at a time,
3.
in the order shown, restarting the blade server
each time:
a. Battery
b. (Trained service technician only)
Microprocessor 1
c. (Trained service technician only)
Microprocessor 2
d. (Trained service technician only) System
board assembly
1. Reseat the microprocessor and then restart the
blade server.
2. Replace the following components one at a time,
in the order shown, restarting the blade server
each time:
a. (Trained service technician only)
Microprocessor 1
b. (Trained service technician only)
Microprocessor 2
c. (Trained service technician only) System
board assembly
18 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error code Description Action
151 Real time clock error
1. Run the Configuration/Setup Utility program.
2. Reseat the battery.
3. Replace the following components one at a time,
in the order shown, restarting the blade server
each time:
a. Battery
b. (Trained service technician only) System
board assembly
161 Real-time clock battery error
1. Run the Configuration/Setup Utility program.
2. Reseat the battery.
3. Replace the following components one at a time,
in the order shown, restarting the blade server
each time:
a. Battery
b. (Trained service technician only) System
board assembly
162 Invalid configuration information or CMOS
RAM checksum failure.
1. Run the Configuration/Setup Utility program,
select Load Default Settings , and save the
settings.
2. Reseat the battery.
3. Replace the following components one at a time,
in the order shown, restarting the blade server
each time:
a. Battery
b. (Trained service technician only) System
board assembly
163 Time of day not set.
1. Run the Configuration/Setup Utility program,
select Load Default Settings , make sure that the
date and time are correct, and save the settings.
2. Reseat the battery.
3. Replace the following components one at a time,
in the order shown, restarting the blade server
each time:
a. Battery
b. (Trained service technician only) System
board assembly
Chapter 2. Diagnostics 19
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error code Description Action
164 Memory configuration changed.
1. Run the Configuration/Setup Utility program,
verify the memory configuration is correct and
save the settings.
2. Reseat the DIMMs.
3. Replace the following components one at a time,
in the order shown, restarting the blade server
each time:
a. DIMMs
b. (Trained service technician only) System
board assembly
175 Bad EEPROM CRC #1 (Trained service technician only) Replace the system
board assembly.
177 Bad checksum (Trained service technician only) Replace the system
board assembly.
178 EEPROM not functional (Trained service technician only) Replace the system
board assembly.
184 Power-on password damaged.
1. Run the Configuration/Setup Utility program,
select Load Default Settings , and save the
settings.
2. (Trained service technician only) Replace the
system board assembly.
185 Corrupted boot sequence
1. Run the Configuration/Setup Utility program,
select Load Default Settings , and save the
settings.
2. (Trained service technician only) Replace the
system board assembly.
186 Security hardware control logic error (Trained service technician only) Replace the system
board assembly.
187 VPD Serial Number not set
1. Run the Configuration/Setup Utility program,
select Load Default Settings , and save the
settings.
2. (Trained service technician only) Replace the
system board assembly.
188 Bad EEPROM CRC #2 (Trained service technician only) Replace the system
board assembly.
189 An attempt was made to access the blade
server with invalid passwords.
198 Processor speed mismatch detected. The
two installed microprocessors are not the
same frequency.
Restart the blade server, run the Configuration/Setup
Utility program, and change the power-on password.
Replace one of the microprocessors with a
microprocessor of the same type and speed as the
remaining installed microprocessor.
20 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error code Description Action
201 Memory test error.
1. Update the BIOS code and run the diagnostics
again.
2. Reseat the DIMMs.
3. Replace the following components one at a time,
in the order shown, restarting the blade server
each time:
a. DIMMs
b. (Trained service technician only) System
board assembly
229 Cache error.
1. Reseat the following components:
a. (Trained service technician only)
Microprocessor 1
b. (Trained service technician only)
Microprocessor 2
Replace the components listed in step 1 one at a
2.
time, in the order shown, restarting the blade
server each time:
301 Keyboard or keyboard controller error.
1. If you have installed a USB keyboard, run the
Configuration/Setup Utility program and enable
keyboardless operation to prevent the POST error
message 301 from being displayed during startup.
2. Reseat the keyboard.
3. Replace the following components one at a time,
in the order shown, restarting the blade server
each time:
a. Keyboard
b. (Trained service technician only) System
board assembly
303 Keyboard controller error. (Trained service technician only) Replace the system
board assembly.
602 Invalid diskette boot record.
1. Remove and reinsert the diskette, and restart the
blade server; or restart the blade server by using
another bootable diskette.
2. Reseat the following components:
a. Diskette drive
b. Diskette drive cable
Replace the following components one at a time,
3.
in the order shown, restarting the blade server
each time:
a. Diskette drive
b. Diskette drive cable
c. (Trained service technician only) System
board assembly
Chapter 2. Diagnostics 21
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error code Description Action
604 Diskette drive error.
1. Run the Configuration/Setup Utility program.
2. Reseat the following components:
a. Diskette drive
b. Diskette drive cable
Replace the following components one at a time,
3.
in the order shown, restarting the blade server
each time:
a. Diskette drive
b. Diskette drive cable
c. (Trained service technician only) System
board assembly
662 Diskette drive configuration error.
1. Run the Configuration/Setup Utility program and
verify that the diskette drive is enabled.
2. Reseat the following components:
a. Diskette drive
b. Diskette drive cable
Replace the following components one at a time,
3.
in the order shown, restarting the blade server
each time:
a. Diskette drive
b. Diskette drive cable
c. (Trained service technician only) System
board assembly
1200 Processor machine check.
1. Reseat the following components:
a. (Trained service technician only)
Microprocessor 1
b. (Trained service technician only)
Microprocessor 2
Replace the following components one at a time,
2.
in the order shown, restarting the blade server
each time:
a. (Trained service technician only)
Microprocessor 1
b. (Trained service technician only)
Microprocessor 2
c. (Trained service technician only) System
board assembly
18xx Unavailable PCI hardware interrupt.
1. Run the Configuration/Setup Utility program and
adjust the adapter settings.
2. Remove each adapter one at a time, restarting
the blade server each time, until the problem is
isolated.
22 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error code Description Action
1962 A drive does not contain a valid boot sector.
1. Make sure that a bootable operating system is
installed.
2. Run the SAS Attached Disk diagnostic test.
3. Reseat the hard disk drive.
4. Replace the following components one at a time,
in the order shown, restarting the blade server
each time:
a. Hard disk drive
b. (Trained service technician only) System
board assembly
2462 Video configuration error.
1. Verify that the keyboard/video/mouse select
button LED on the front of the blade server is lit,
indicating that the blade server is connected to
the shared BladeCenter monitor.
2. Verify that a working monitor is connected
correctly to the BladeCenter unit.
3. (Trained service technician only) Replace the
system board assembly.
5962 IDE CD or DVD drive configuration error.
1. Run the Configuration/Setup Utility program,
select Load Default Settings , and save the
settings.
2. Reseat the following components:
a. CD or DVD drive
b. Battery
Replace the following components one at a time,
3.
in the order shown, restarting the blade server
each time:
a. CD or DVD drive
b. Battery
c. (Trained service technician only) System
board assembly
8603 Pointing-device error.
1. Reseat the pointing device.
2. Replace the following components one at a time,
in the order shown, restarting the blade server
each time:
a. Pointing device
b. (Trained service technician only) System
board assembly
Chapter 2. Diagnostics 23
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error code Description Action
19990301 Hard disk sector error.
1. Reseat the hard disk drive.
2. Replace the following components one at a time,
in the order shown, restarting the blade server
each time:
a. Hard disk drive
b. (Trained service technician only) System
board assembly
Checkout procedure
The checkout procedure is the sequence of tasks that you should follow to
diagnose a problem in the blade server.
About the checkout procedure
Before performing the checkout procedure for diagnosing hardware problems,
review the following information:
v Read the safety information that begins on page vii.
v The diagnostic programs provide the primary methods of testing the major
components of the blade server. If you are not sure whether a problem is caused
by the hardware or by the software, you can use the diagnostic programs to
confirm that the hardware is working correctly.
v When you run the diagnostic programs, a single problem might cause more than
one error message. When this happens, correct the cause of the first error
message. The other error messages usually will not occur the next time you run
the diagnostic programs.
Exception: If there are multiple error codes or light path diagnostics LEDs that
indicate a microprocessor error, the error might be in a microprocessor or in a
microprocessor socket. See “Microprocessor problems” on page 30 for
information about diagnosing microprocessor problems.
v If the blade server is halted and a POST error code is displayed, see “Error logs”
on page 16. If the blade server is halted and no error message is displayed, see
“Troubleshooting tables” on page 25 and “Solving undetermined problems” on
page 49.
v For intermittent problems, check the error log; see “Error logs” on page 16 and
“Diagnostic programs, messages, and error codes” on page 41.
v If no LEDs are lit on the blade server front panel, verify the blade server status
and errors in the management-module Web interface; also see “Solving
undetermined problems” on page 49.
v If device errors occur, see “Troubleshooting tables” on page 25.
24 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Performing the checkout procedure
To perform the checkout procedure, complete the following steps:
1. If the blade server is running, turn off the blade server.
2. Turn on the blade server. Make sure that the blade server has control of the
video (the keyboard/video/mouse button is lit). If the blade server does not start,
see “Troubleshooting tables.”
3. Record any POST beep codes that sound or POST error messages that are
displayed on the monitor. If an error is displayed, look up the first error in the
“POST error codes” on page 18.
4. Check the control panel blade-error LED; if it is lit, check the light path
diagnostics LEDs (see “Light path diagnostics” on page 38).
5. Check for the following results:
v Successful completion of POST, indicated by a single beep
v Successful completion of startup, indicated by a readable display of the
operating-system desktop
Did a single beep sound and are there readable instructions on the main menu?
6.
v No: Find the failure symptom in “Troubleshooting tables”; if necessary, see
“Solving undetermined problems” on page 49.
v Yes: Run the diagnostic programs (see “Running the diagnostic programs” on
page 41).
– If you receive an error, see “Diagnostic error codes” on page 43.
– If the diagnostic programs were completed successfully and you still
suspect a problem, see “Solving undetermined problems” on page 49.
Troubleshooting tables
Use the troubleshooting tables to find solutions to problems that have identifiable
symptoms.
If you cannot find the problem in these tables, see “Running the diagnostic
programs” on page 41 for information about testing the blade server.
If you have just added new software or a new optional device and the blade server
is not working, complete the following steps before using the troubleshooting tables:
1. Remove the software or device that you just added.
2. Run the diagnostic tests to determine whether the blade server is running
correctly.
3. Reinstall the new software or new device.
Chapter 2. Diagnostics 25
CD or DVD drive problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Symptom Action
The CD or DVD drive is not
recognized.
A CD or DVD is not working
correctly.
The CD or DVD drive tray is not
working.
The CD or DVD drive is
detected as /dev/sr0 by SUSE
Linux. (If the SUSE Linux
operating system is installed
remotely onto a blade server
that is not the current owner of
the media tray [CD or DVD
drive, diskette drive, and USB
port], SUSE Linux detects the
CD or DVD drive as /dev/sr0
instead of /dev/cdrom.)
The CD or DVD drive is not
recognized after being switched
back to the blade server running
Windows
®
2000 Advanced
Server with SP3 applied. (When
the CD or DVD drive owned by
blade server x is switched to
another blade server, then is
switched back to blade server x,
the operating system in blade
server x no longer recognizes
the CD or DVD drive. This
happens when you have not
safely stopped the drives before
switching ownership of the
media tray [CD or DVD drive,
diskette drive, and USB port].)
1. Make sure that:
v All cables and jumpers are installed correctly.
v The correct device driver is installed for the CD or DVD drive.
Reseat the CD or DVD drive.
2.
3. Replace the CD or DVD drive.
1. Clean the CD or DVD.
2. Reseat the CD or DVD drive.
3. Replace the CD or DVD drive.
Note: The blade server must have ownership of the CD or DVD drive.
1. Insert the end of a straightened paper clip into the manual tray-release
opening.
2. Reseat the CD or DVD drive.
3. Replace the CD or DVD drive.
Establish a link between /dev/sr0 and /dev/cdrom as follows:
1. Enter the following command:
rm /dev/cdrom; ln -s /dev/sr0 /dev/cdrom
2. Insert the following line in the /etc/fstab file:
/dev/cdrom /media/cdrom auto ro,noauto,user,exec 0 0
Note: Because the BladeCenter unit uses USB to communicate with the media
tray devices, switching ownership of the media tray to another blade server is the
same as disconnecting a USB device. Before you switch ownership of the CD or
DVD drive (media tray) to another blade server, safely stop the media tray devices
on the blade server that currently owns the media tray, as follows:
1. Double-click the Unplug/Eject Hardware icon in the Windows taskbar at the
bottom right of the desktop.
2. Select USB Floppy and click Stop .
3. Select USB Mass Storage Device and click Stop .
4. Click Close .
You
can now safely switch ownership of the media tray to another blade server.
26 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Diskette drive problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Symptom Action
Diskette drive activity LED stays
on, or the system bypasses the
diskette drive.
1. If there is a diskette in the drive, verify that:
v The diskette is inserted correctly in the drive.
v The diskette is good and not damaged. (Try another diskette if you have
one.) The drive LED lights (flashes once per second) when the diskette is
inserted.
v The diskette contains the necessary files to start the blade server.
v The diskette drive is enabled in the Configuration/Setup utility program.
v The software program is working properly.
v The cable is installed correctly (in the proper orientation).
To prevent diskette drive read/write errors, make sure that the distance
2.
between monitors and diskette drives is at least 76 mm (3 in.).
3. Reseat the following components:
a. Diskette drive cable
b. Diskette drive
c. Media tray
Replace the components listed in step 3 one at a time, in the order shown,
4.
restarting the blade server each time.
General problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Symptom Action
A cover lock is broken, an LED
is not working, or a similar
problem has occurred.
If the part is a CRU, replace it. If the part is a FRU, the part must be replaced by a
trained service technician.
Chapter 2. Diagnostics 27
Hard disk drive problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Symptom Action
Not all drives are recognized by
the Fixed Disk or SAS Attached
Disk diagnostic test.
The blade server stops
responding during the Fixed
Disk or SAS Attached Disk
diagnostic test.
A hard disk drive passes the
Fixed Disk or SAS Attached
Disk diagnostics test, but the
problem remains.
Remove the drive that is indicated by the diagnostic tests; then, run the Fixed Disk
or SAS Attached Disk diagnostic test again. If the remaining drives are recognized,
replace the drive that you removed with a new one.
Remove the hard disk drive that was being tested when the blade server stopped
responding, and run the diagnostic test again. If the Fixed Disk or SAS Attached
Disk diagnostic test runs successfully, replace the drive that you removed with a
new one.
Run the SAS Fixed Disk or SAS Attached Disk diagnostic test again. If the
diagnostics continue to pass but the drive continues to have a problem, replace the
drive with a new one.
Intermittent problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Symptom Action
A problem occurs only
occasionally and is difficult to
diagnose.
1. Make sure that:
v When the blade server is turned on, air is flowing from the rear of the blade
server at the blower grille. If there is no airflow, the blower is not working.
This causes the blade server to overheat and shut down.
v The SAS hard disk drives are configured correctly.
Check the BMC log (see “Error logs” on page 16).
2.
28 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Keyboard, mouse, or pointing-device problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Symptom Action
One or more keys on the
keyboard do not work.
The mouse or pointing device
does not work.
1. Make sure that:
v The keyboard cable is securely connected to the BladeCenter
management-module.
v Both the blade server and the monitor are turned on.
v The keyboard/video/mouse select button LED on the front of the blade
server is lit, indicating that the blade server is connected to the shared
BladeCenter monitor.
v The keyboard works with another blade server.
Replace the keyboard.
2.
3. Replace the management-module on the BladeCenter unit; see the Hardware
Maintenance Manual and Troubleshooting Guide or Problem Determination and
Service Guide for your BladeCenter unit.
1. Make sure that:
v The keyboard/video/mouse select button LED on the front of the blade
server is lit, indicating that the blade server is connected to the shared
BladeCenter monitor.
v The mouse or pointing-device cable is securely connected to the
BladeCenter management-module, and the keyboard and mouse cables are
not reversed.
v The mouse works correctly with other blade servers.
v The mouse device drivers are installed correctly.
v Both the blade server and the monitor are turned on.
v The mouse is recognized as a USB, not PS/2, device by the blade server.
Although the mouse may be a PS/2-style device, communication with the
mouse is through USB in the BladeCenter unit. Some operating systems
allow you to select the type of mouse during installation of the operating
system. Select USB.
Replace the mouse or pointing device.
2.
3. Replace the management-module in the BladeCenter unit; see the Hardware
Maintenance Manual and Troubleshooting Guide or Problem Determination and
Service Guide for your BladeCenter unit.
Chapter 2. Diagnostics 29
Memory problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Symptom Action
The amount of system memory
that is displayed is less than the
amount of installed physical
memory.
1. Make sure that:
v The memory modules are seated correctly.
v Yo u have installed the correct type of memory.
v If you changed the memory, you updated the memory configuration in the
Configuration/Setup Utility program.
v All banks of memory are enabled. The blade server might have automatically
disabled a memory bank when it detected a problem, or a memory bank
might have been manually disabled.
Check BMC log for error message 289:
2.
v If a DIMM was disabled by a system-management interrupt (SMI), replace
the DIMM.
v If a DIMM was disabled by the user or by POST, run the Configuration/Setup
Utility program and enable the DIMM.
Reseat the DIMM.
3.
4. Replace the DIMM.
5. (Trained service technician only) Replace the system board assembly.
Microprocessor problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Symptom Action
The blade server emits a
continuous beep during POST,
indicating that the startup (boot)
microprocessor is not working
correctly.
1. (Trained service technician only) Reseat microprocessor 1.
2. (Trained service technician only) Replace microprocessor 1.
30 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Monitor or video problems
Some IBM monitors have their own self-tests. If you suspect a problem with your
monitor, see the documentation that comes with the monitor for instructions for
testing and adjusting the monitor. If you cannot diagnose the problem, call for
service.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Symptom Action
The screen is blank.
Only the cursor appears. Make sure that the keyboard, video, and mouse have not been switched to another
The monitor goes blank when
you direct it to a working blade
server, or it goes blank when
you start some application
programs in the blade servers.
1. Make sure that:
v The keyboard/video/mouse select button LED on the front of the blade
server is lit, indicating that the blade server is connected to the shared
BladeCenter monitor.
Note: You can test the monitor functionality by enabling the
keyboard/video/mouse select button on another blade server.
v The system power cord is connected to the BladeCenter power module and
a working electrical outlet.
v The monitor cables are connected correctly.
v The monitor is turned on and the brightness and contrast controls are
adjusted correctly.
v Damaged BIOS code is not affecting the video; see “Recovering from a
BIOS update failure” on page 48.
Important:
In some memory configurations, the 3-3-3 beep code might sound
during POST followed by a blank monitor screen. If this occurs and the Boot
Fail Count option in the Start Options of the Configuration/Setup Utility
program is enabled, you must restart the blade server three times to reset the
configuration settings to the default configuration (the memory connector or
bank of connectors enabled).
2. Replace the following components:
a. Monitor
b. Management-module in the BladeCenter (see the Hardware Maintenance
Manual and Troubleshooting Guide or Problem Determination and Service
Guide for your BladeCenter unit).
c. (Trained service technician only) System board assembly
blade server. If the problem remains, see “Solving undetermined problems” on
page 49.
Make sure that the monitor cable is connected to the video connector on the
BladeCenter management-module. Some IBM monitors have their own self-tests. If
you suspect a problem with the monitor, see the information that comes with the
monitor instructions for adjusting and testing the monitor.
If you still cannot find the problem, try using the monitor with another blade server.
If the problem remains, see the Hardware Maintenance Manual and
Troubleshooting Guide or Problem Determination and Service Guide for your
BladeCenter unit.
Chapter 2. Diagnostics 31
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Symptom Action
The monitor has screen jitter, or
the screen image is wavy,
unreadable, rolling, or distorted.
1. If the monitor self-tests show that the monitor is working correctly, consider the
location of the monitor. Magnetic fields around other devices (such as
transformers, appliances, fluorescent lights, and other monitors) can cause
screen jitter or wavy, unreadable, rolling, or distorted screen images. If this
happens, turn off the monitor.
Attention: Moving a color monitor while it is turned on might cause screen
discoloration.
Move the device and the monitor at least 305 mm (12 in.) apart. Turn on the
monitor.
Notes:
a. To prevent diskette drive read/write errors, make sure that the distance
between the monitor and any diskette drive is at least 76 mm (3 in.).
b. Non-IBM monitor cables might cause unpredictable problems.
Replace the monitor.
2.
3. Replace the management-module in the BladeCenter unit (see the Hardware
Maintenance Manual and Troubleshooting Guide or Problem Determination and
Service Guide for your BladeCenter unit).
4. (Trained service technician only) Replace the system board assembly.
Wrong characters appear on the
screen.
1. If the wrong language is displayed, update the firmware or operating system
with the correct language in the blade server that has ownership of the monitor.
2. Replace the monitor.
3. (Trained service technician only) Replace the system board assembly.
Network connection problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Symptom Action
One or more blade servers are
unable to communicate with the
network.
32 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Make sure that:
v The switch modules for the network interface that is being used are installed in
the correct BladeCenter bays and are configured and operating correctly. See
the Hardware Maintenance Manual and Troubleshooting Guide or Problem
Determination and Service Guide for your BladeCenter unit for details.
v The settings in the switch module are correct for the blade server (settings in the
switch module are specific to each blade server).
If
the problem remains, see “Solving undetermined problems” on page 49.
Optional-device problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Symptom Action
An IBM optional device that was
just installed does not work.
1. Make sure that:
v The device is designed for the blade server (see http://www.ibm.com/eserver/
serverproven/compat/us).
v Yo u followed the installation instructions that came with the device and the
device is installed correctly.
v Yo u have not loosened any other installed devices or cables.
v Yo u updated the configuration information in the Configuration/Setup Utility
program. Whenever memory or any other device is changed, you must
update the configuration.
If the device comes with its own test instructions, use those instructions to test
2.
the device.
3. Reseat the device that you just installed.
4. Replace the device that you just installed.
Power error messages
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Message Action
System Power Good fault
VRM Power Good fault
System over recommended voltage for +12v.
1. Check the BladeCenter unit power. (See the Hardware
Maintenance Manual and Troubleshooting Guide or Problem
Determination and Service Guide for your BladeCenter unit for
details.)
2. Reseat the blade server.
3. Replace the blade server.
1. Check the BladeCenter unit power. (See the Hardware
Maintenance Manual and Troubleshooting Guide or Problem
Determination and Service Guide for your BladeCenter unit for
details.)
2. Reseat the blade server.
3. Replace the blade server.
1. Check the BladeCenter unit power. (See the Hardware
Maintenance Manual and Troubleshooting Guide or Problem
Determination and Service Guide for your BladeCenter unit for
details.)
2. Reseat the blade server.
3. Replace the blade server.
Chapter 2. Diagnostics 33
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Message Action
System over recommended voltage for
+1.2v.
System over recommended voltage for
+1.25v.
System over recommended voltage for
+1.5v.
System over recommended voltage for
+2.5v.
System over recommended voltage for
+3.3v.
System over recommended 5V fault.
1. Reseat the blade server.
2. Replace the blade server.
1. Reseat the blade server.
2. Replace the blade server.
1. Reseat the blade server.
2. Replace the blade server.
1. Reseat the blade server.
2. Replace the blade server.
1. Reseat the blade server.
2. Replace the blade server.
1. Reseat the blade server.
2. Replace the blade server.
VRM voltage over recommended tolerance.
1. Reseat the blade server.
2. Replace the blade server.
System under recommended voltage for
+12v.
1. Check the BladeCenter unit power. (See the Hardware
Maintenance Manual and Troubleshooting Guide or Problem
Determination and Service Guide for your BladeCenter unit for
details.)
2. Reseat the blade server.
3. Replace the blade server.
System under recommended voltage for
+1.2v.
System under recommended voltage for
+1.25v.
System under recommended voltage for
+1.5v.
System under recommended voltage for
+2.5v.
System under recommended voltage for
+3.3v.
System under recommended 5V fault.
1. Reseat the blade server.
2. Replace the blade server.
1. Reseat the blade server.
2. Replace the blade server.
1. Reseat the blade server.
2. Replace the blade server.
1. Reseat the blade server.
2. Replace the blade server.
1. Reseat the blade server.
2. Replace the blade server.
1. Reseat the blade server.
2. Replace the blade server.
34 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Power problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Symptom Action
Power switch does not work,
and the reset button, does work.
The blade server does not turn
on.
The blade server does not start
and the following conditions are
present:
v The amber system-error LED
on the BladeCenter unit
system LED panel is lit.
v The amber blade error LED
on the blade server control
panel is lit.
v The management-module
event log contains the
message Processor speed
mismatch .
The blade server turns off for no
apparent reason.
1. Reseat the control-panel connector.
2. Replace the bezel assembly.
3. (Trained service technician only) Replace the system board assembly.
1. Make sure that:
a. The power LED on the front of the BladeCenter unit is lit.
b. The LEDs on all the BladeCenter power modules are lit.
c. The BladeCenter unit has the correct power configuration to operate the
blade bay where your blade server is installed (see the documentation for
your BladeCenter unit).
d. The power-on LED on the blade server control panel is flashing slowly.
v If the power LED is flashing rapidly and continues to do so, the blade
server is not communicating with the management-module; reseat the
blade server and go to step 3
v If the power LED is off, the blade bay is not receiving power, the blade
server is defective, or the LED information panel is loose or defective.
e.
Local power control for the blade server is enabled (use the BladeCenter
management-module Web interface to verify), or the blade server was
instructed through the management-module (Web interface or IBM Director)
to start.
If you just installed a device in the blade server, remove it, and restart the
2.
blade server. If the blade server now starts, you might have installed more
devices than the power to that blade bay supports.
3. Try another blade server in the blade bay; if it works, replace the faulty blade
server.
4. See “Solving undetermined problems” on page 49.
Make sure that microprocessors 1 and 2 are identical (number of cores, cache size
and type, clock speed, internal and external clock frequencies).
(Trained service technician only) If microprocessors are not identical, remove the
microprocessor with the incorrect specifications and replace with a microprocessor
that has the correct specifications.
1. Make sure that each blade bay has a blade server, expansion unit, or filler
blade correctly installed. If these components are missing or incorrectly
installed, an over-temperature condition might result in shutdown.
2. (Trained service technician only) If the microprocessor error LED is lit, replace
the microprocessor.
Chapter 2. Diagnostics 35
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Symptom Action
The blade server does not turn
off.
1. Verify whether you are using an Advanced Configuration and Power Interface
(ACPI) or non-ACPI operating system. If you are using a non-ACPI operating
system, complete the following steps:
a. Press Ctrl+Alt+Delete.
b. Turn off the blade server by pressing the power-control button for 4
seconds.
c. If the blade server fails during POST and the power-control button does not
work, remove the blade server from the bay and reseat it.
If the problem remains or if you are using an ACPI-aware operating system,
2.
complete the following steps:
a. Check the power-management policies in the operating system for the
blade server.
b. Suspect the system board.
ServerGuide problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Symptom Action
™
The ServerGuide
Installation CD will not start.
The operating-system
installation program
continuously loops.
The ServerGuide program will
not start the operating-system
CD.
The operating system cannot be
installed; the option is not
available.
Setup and
1. Make sure that the CD or DVD drive is associated with the blade server that
you are configuring.
2. Make sure that the blade server supports the ServerGuide program.
3. If the startup (boot) sequence settings have been changed, make sure that the
CD or DVD drive is first in the startup sequence.
Make more space available on the hard disk.
Make sure that the operating-system CD is supported by the ServerGuide program.
See the ServerGuide Setup and Installation CD label for a list of supported
operating-system versions.
Make sure that the blade server supports the operating system. If it does, either no
logical drive is defined (SAS RAID servers), or the ServerGuide System Partition is
not present. Run the ServerGuide program and make sure that setup is complete.
36 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Service processor problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Symptom Action
The management-module
reports a general monitor
failure.
Disconnect the BladeCenter unit from all electrical sources, wait for 30 seconds,
reconnect the BladeCenter unit to the electrical sources, and restart the blade
server. If the problem remains, see “Solving undetermined problems” on page 49,
and the Hardware Maintenance Manual and Troubleshooting Guide or Problem
Determination and Service Guide for your BladeCenter unit.
Software problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Symptom Action
You suspect a software
problem.
1. To determine whether the problem is caused by the software, make sure that:
v the blade server has the minimum memory that is needed to use the
software. For memory requirements, see the information that comes with the
software.
Note: If you have just installed an adapter or memory, the blade server
might have a memory-address conflict.
v The software is designed to operate on the blade server.
v Other software works on the blade server.
v The software works on another server.
If you received any error messages when using the software, see the
2.
information that comes with the software for a description of the messages and
suggested solutions to the problem.
3. Contact your place of purchase of the software.
Universal Serial Bus (USB) port problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Symptom Action
A USB device does not work. Make sure that:
v The correct USB device driver is installed.
v The operating system supports USB devices.
Chapter 2. Diagnostics 37
Light path diagnostics
Light path diagnostics is a system of LEDs on the control panel and on various
internal components of the blade server. When an error occurs, LEDs can be lit
throughout the blade server to help identify the source of the error.
After you remove the blade server, you can press and hold the light path
diagnostics switch for a maximum of 25 seconds to light the LEDs and locate the
failing component. The following components have this feature:
v Hard disk drives
v Light path diagnostics panel
v Microprocessors
v Memory modules (DIMMs)
Viewing the light path diagnostics LEDs
Before working inside the blade server to view light path diagnostics LEDs, read the
safety information that begins on page vii and “Handling static-sensitive devices” on
page 56.
If an error occurs, view the light path diagnostics LEDs in the following order:
1. Look at the control panel on the front of the blade server (see “Blade server
controls, LEDs, and connectors” on page 4).
v If the information LED is lit, it indicates that information about a suboptimal
condition in the blade server is available in the BMC log or in the
management-module event log.
v If the blade-error LED is lit, it indicates that an error has occurred; go to step
2.
To view the light path diagnostics panel and LEDs, complete the following steps:
2.
a. Remove the blade server from the BladeCenter unit.
b. Place the blade server on a flat, static-protective surface.
c. Remove the cover from the blade server.
d. Press and hold the light path diagnostics switch to light the LEDs of the
failing components in the blade server. The LEDs will remain lit for as long
as you press the switch, to a maximum of 25 seconds.
38 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
The following illustration shows the locations of the system board error LEDs.
DIMM 3 Error LED
(CR5)
DIMM 4 Error LED
(CR6)
I/O expansion
connectors
Light path diagnostics (SW4)
DIMM 1 error LED (CR201)
DIMM 2 error LED (CR4)
Microprocessor 1 error LED (CR12)
SAS 1 hard
disk drive LED
(CR262)
Microprocessor 2 error LED (CR13)
3V lithium battery
SAS 0 Hard disk drive LED (CR260)
SAS trays/connectors
Reserved
Control panel
connector
The following illustration shows LEDs on the light path diagnostics panel on the
system board.
NMI
MIS
S BRD
TEMP
NMI error LED
Microprocessor mismatch error LED
System-board error LED
Over temperature error LED
Light path diagnostics LED
Light path diagnostics switch
When you press the light path diagnostics switch, note which LEDs are lit on the
system board and the light path diagnostics panel. Using this information and the
information in “Light path diagnostics LEDs” on page 40 can often provide enough
information to diagnose the error.
Chapter 2. Diagnostics 39
Light path diagnostics LEDs
The following table describes the LEDs on the light path diagnostics panel and on
the system board, and suggested actions to correct the detected problems.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Lit light path
diagnostics LED Description Action
None An error has occurred and cannot be isolated,
or the service processor has failed.
DIMM x error A memory error occurred.
Microprocessor x
The microprocessor has failed.
error
MIS
The microprocessors do not match. Make sure that microprocessors 1 and 2 are
(Microprocessor
mismatch)
NMI (NMI error) The system board has failed.
S BRD (System
The system board has failed (Trained service technician only) Replace the
board error)
1. Make sure that the light path diagnostics
LED is lit to ensure that there is enough
power in the blade server to light the rest
of the LEDs.
2. Check the BMC log for information about
an error that is not represented by a light
path diagnostics LED.
1. Make sure that the DIMM indicated by the
lit LED is supported.
2. Reseat the DIMM that is indicated by the lit
LED.
3. Replace the DIMM that is indicated by the
lit LED.
Multiple DIMM LEDs do not necessarily
Note:
indicate multiple DIMM failures. If more than
one DIMM LED is lit, reseat or replace one
DIMM at a time until the error is corrected.
1. (Trained service technician only) Make sure
that the microprocessor that is indicated by
the lit LED is installed correctly. (See
“Removing a microprocessor and heat sink”
on page 72 for installation instructions.)
2. (Trained service technician only) Replace
the microprocessor that is indicated by the
lit LED.
identical (number of cores, cache size and
type, clock speed, internal and external clock
frequencies); also, see “Troubleshooting tables”
on page 25.
1. Replace the blade server cover, reinsert
the blade server in the BladeCenter unit,
and then restart the blade server. Check
the BMC log for information about the error.
2. (Trained service technician only) Replace
the system board assembly.
system board assembly.
40 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Lit light path
diagnostics LED Description Action
TEMP (Over
temperature error)
SAS hard disk
drive-LED
The system temperature has exceeded a
threshold level.
Reserved Reserved
1. Check to see whether a blower on the
BladeCenter unit has failed. If it has,
replace the blower (see the Hardware
Maintenance Manual and Troubleshooting
Guide or Problem Determination and
Service Guide for your BladeCenter unit for
more information).
2. Make sure that the room temperature is not
too high. (See “Features and specifications”
on page 3 for temperature information.)
Diagnostic programs, messages, and error codes
The diagnostic programs are the primary method of testing the major components
of the blade server. As you run the diagnostic programs, text messages and error
codes are displayed on the screen and are saved in the test log. A diagnostic text
message or error code indicates that a problem has been detected; to determine
what action you should take as a result of a message or error code, see the table in
“Diagnostic error codes” on page 43.
Running the diagnostic programs
To run the diagnostic programs, complete the following steps:
1. If the blade server is running, turn off the blade server.
2. Turn on the blade server.
3. When the prompt F2 for Diagnostics appears, press F2.
4. From the top of the screen, select either Extended or Basic .
5. From the menu, select the test that you want to run, and follow the instructions
on the screen.
For help with the diagnostic programs, press F1. You also can press F1 from within
a help screen to obtain online documentation from which you can select different
categories. To exit from the help information, press Esc.
To determine what action you should take as a result of a diagnostic text message
or error code, see the table in “Diagnostic error codes” on page 43.
If the diagnostic programs do not detect any hardware errors but the problem
remains during normal server operations, a software error might be the cause. If
you suspect a software problem, see the information that comes with your software.
Chapter 2. Diagnostics 41
A single problem might cause more than one error message. When this happens,
correct the cause of the first error message. The other error messages usually will
not occur the next time you run the diagnostic programs.
Exception: If there are multiple error codes or light path diagnostics LEDs that
indicate a microprocessor error, the error might be in a microprocessor or in a
microprocessor socket. See “Microprocessor problems” on page 30 for information
about diagnosing microprocessor problems.
If the blade server stops during testing and you cannot continue, restart the blade
server and try running the diagnostic programs again. If the problem remains,
replace the component that was being tested when the blade server stopped.
The diagnostic programs assume that a keyboard and mouse are attached to the
BladeCenter unit and that the blade server controls them. If you run the diagnostic
programs with either no mouse or a mouse attached to the BladeCenter unit that is
not controlled by the blade server, you cannot use the Next Cat and Prev Cat
buttons to select categories. All other mouse-selectable functions are available
through function keys.
To view server configuration information (such as system configuration, memory
contents, interrupt request (IRQ) use, direct memory access (DMA) use, device
drivers, and so on), select Hardware Info from the top of the screen.
Diagnostic text messages
Diagnostic text messages are displayed while the tests are running. A diagnostic
text message contains one of the following results:
Passed: The test was completed without any errors.
Failed: The test detected an error.
User Aborted: Yo u stopped the test before it was completed.
Not Applicable: Yo u attempted to test a device that is not present in the blade
server.
Aborted: The test could not proceed because of the blade server configuration.
Warning: The test could not be run. There was no failure of the hardware that was
being tested, but there might be a hardware failure elsewhere, or another problem
prevented the test from running; for example, there might be a configuration
problem, or the hardware might be missing or is not being recognized.
The result is followed by an error code or other additional information about the
error.
42 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Viewing the test log
To view the test log when the tests are completed, select Utility from the top of the
screen and then select View Test Log. The test-log data is maintained only while
you are running the diagnostic programs. When you exit from the diagnostic
programs, the test log is cleared.
To save the test log to a file on a diskette or to the hard disk, select Save Log on
the diagnostic programs screen and specify a location and name for the saved log
file.
Note: To save the test log to a diskette, you must use a diskette that you have
formatted yourself; this function does not work with preformatted diskettes. If the
diskette has sufficient space for the test log, the diskette can contain other data.
Diagnostic error codes
The following table describes the error codes that the diagnostic programs might
generate and suggested actions to correct the detected problems.
If the diagnostic programs generate error codes that are not listed in the table,
make sure that the latest level of the BIOS code is installed.
In the error codes, x can be any numeral or letter. However, if the three-digit
number in the central position of the code is 000, 195, or 197, do not replace a
CRU or FRU. These numbers appearing in the central position of the code have the
following meanings:
000 The blade server passed the test. Do not replace a CRU or FRU.
195 The Esc key was pressed to end the test. Do not replace a CRU or FRU.
197 This is a warning error, but it does not indicate a hardware failure; do not
replace a CRU or FRU. Take the action that is indicated in the Action
column, but do not replace a CRU or a FRU . See the description for
Warning in the section “Diagnostic text messages” on page 42 for more
information.
Chapter 2. Diagnostics 43
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error code Description Action
001-250-000 Failed processor board ECC.
1. Reseat the following components:
a. DIMMs
b. (Trained service technician only)
Microprocessor 1
c. (Trained service technician only)
Microprocessor 2
Replace the following components one at a time,
2.
in the order shown, restarting the blade server
each time:
a. DIMMs
b. (Trained service technician only)
Microprocessor
c. (Trained service technician only)
Microprocessor 2
d. (Trained service technician only) System
board assembly
001-292-000 Core system: failed/CMOS checksum failed. Load the BIOS default settings by using the
Configuration/Setup Utility program and run the test
again (see the Installation and User’s Guide for your
blade server).
001-xxx-000 Failed core tests. (Trained service technician only) Replace the system
board assembly.
001-xxx-001 Failed core tests. (Trained service technician only) Replace the system
board assembly.
005-xxx-000 Failed video test. (Trained service technician only) Replace the system
board assembly.
030-xxx-000 Failed internal SAS interface test. (Trained service technician only) Replace the system
board assembly.
035-xxx-099 No adapters were found. Reseat the adapter (if installed).
075-xxx-000 Failed power supply test. Replace the power supply (see the Hardware
Maintenance Manual and Troubleshooting Guide or
Problem Determination and Service Guide for your
BladeCenter unit).
44 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error code Description Action
089-xxx-00n Failed microprocessor test.
1. (Trained service technician only) Reseat
microprocessor 1 if n = 0 or 6.
2. (Trained service technician only) Reseat
microprocessor 2 if n = 1 or 7.
3. Replace the following components one at a time,
in the order shown, restarting the blade server
each time:
a. (Trained service technician only)
Microprocessor 1 if n = 0 or 6
b. (Trained service technician only)
Microprocessor 2 if n = 1 or 7
c. (Trained service technician only) System
board assembly
165-060-000 Service Processor: ASM may be busy
1. Rerun the diagnostics test again.
2. Correct other error conditions that might be
keeping the ASM busy. See the error log and
diagnostic panel.
3. Turn off the blade server and reseat it in the
BladeCenter unit.
4. (Trained service technician only) Replace the
system board assembly.
165-198-000 Service Processor: Aborted
1. Rerun the diagnostic test.
2. Correct other error conditions that might be
keeping the ASM busy. See the error log and
diagnostic panel.
3. Turn off the blade server and reseat it in the
BladeCenter unit.
4. (Trained service technician only) Replace the
system board assembly.
165-201-000 Service Processor: Failed
1. Power down blade server and reseat it in the
chassis.
2. (Trained service technician only) Replace the
system board assembly.
165-330-000 Service Processor: Failed Update the ROM diagnostic to the latest level and
run the diagnostics again.
165-342-000 Service Processor: Failed
1. Make sure that latest firmware levels for ASM and
BIOS are installed.
2. Turn off the blade server and reseat it in the
BladeCenter unit.
3. (Trained service technician only) Replace the
system board assembly.
Chapter 2. Diagnostics 45
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error code Description Action
166-406-001 BMC indicates failure in I2C bus test.
1. Remove the blade server from the BladeCenter
unit, wait 30 seconds, reseat it in the BladeCenter
unit, and run the test again.
2. Update the BMC firmware (see “Firmware
updates” on page 79).
3. (Trained service technician only) Replace the
system board assembly.
166-407-001 System Management: Failed. BMC indicates
failure in I2C bus test.
1. Remove the blade server from the BladeCenter
unit, wait 30 seconds, reseat it in the BladeCenter
unit, and run the test again.
2. Update the BMC firmware (see “Firmware
updates” on page 79).
3. (Trained service technician only) Replace the
system board assembly.
166-nnn-001 System Management: Failed. BMC indicates
failure in self test.
Note: nnn = 300 to 320.
1. Remove the blade server from the BladeCenter
unit, wait 30 seconds, reseat it in the BladeCenter
unit, and run the test again.
2. Update the BMC firmware (see “Firmware
updates” on page 79).
3. (Trained service technician only) Replace the
system board assembly.
166-nnn-001 System Management: Failed. BMC indicates
failure in self test.
Note: nnn = 400 to 420, excluding 406 and
407.
1. Remove the blade server from the BladeCenter
unit, wait 30 seconds, reseat it in the BladeCenter
unit, and run the test again.
2. Update the BMC firmware (see “Firmware
updates” on page 79).
3. (Trained service technician only) Replace the
system board assembly.
180-xxx-000 Diagnostics LED failure. Run the LED test in the diagnostics program.
180-xxx-001 Failed front LED panel test.
1. Reseat the control-panel connector.
2. Replace the bezel assembly.
3. (Trained service technician only) Replace the
system board assembly.
180-xxx-002 Failed diagnostics LED panel test. (Trained service technician only) Replace the system
board assembly.
180-xxx-003 Failed system board LED test. (Trained service technician only) Replace the system
board assembly.
46 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error code Description Action
201-xxx-0nn Failed memory test.
Note: nn = DIMM slot number, 01 to 04
1. Reseat the DIMM in the indicated slot.
2. Replace the following components one at a time,
in the order shown, restarting the blade server
each time:
a. DIMM in indicated slot
b. (Trained service technician only) System
board assembly
201-xxx-n99 Multiple DIMM failure, see error text.
Note: n = number of failing pair; see
“Installing a memory module” on page 64.
202-xxx-00n Failed system cache test.
Note: n = microprocessor 1 or
microprocessor 2.
1. See the error text for failing DIMMs.
2. (Trained service technician only) Replace the
system board assembly.
1. Reseat microprocessor n .
2. Replace the following components one at a time,
in the order shown, restarting the blade server
each time:
a. (Trained service technician only)
Microprocessor n
b. (Trained service technician only) System
board assembly
217-198--xxx Could not establish drive parameters.
1. Reseat hard disk drive x.
2. Update the BIOS code.
3. Replace the following components one at a time,
in the order shown, restarting the blade server
each time:
a. Hard disk drive x
b. (Trained service technician only) System
board assembly
217-xxx-000 Failed hard disk drive test.
Note: If RAID is configured, the SAS
Attached Disk number refers to the RAID
1. Reseat hard disk drive 1.
2. Replace hard disk drive 1.
logical drive.
217-xxx-001 Failed hard disk drive test.
Note: If RAID is configured, the SAS
Attached Disk number refers to the RAID
1. Reseat hard disk drive 2.
2. Replace hard disk drive 2.
logical drive.
405-xxx-000 Failed Ethernet test on controller on the
system board.
1. Make sure that Ethernet is not disabled in the
Configuration/Setup Utility program.
2. (Trained service technician only) Replace the
system board assembly.
Chapter 2. Diagnostics 47
Recovering from a BIOS update failure
The blade server has an advanced recovery feature that will automatically switch to
a backup BIOS page if the BIOS code in the blade server has become damaged,
such as from a power failure during an update.
The flash memory of the blade server consists of a primary page and a backup
page. If the BIOS code in the primary page is damaged, the baseboard
management controller will detect the error and automatically switch to the backup
page to start the blade server. If this happens, a POST message Booted from
backup POST/BIOS image is displayed. The backup page version may not be the
same version as the primary image.
You can then recover or restore the original primary page BIOS by using a BIOS
flash diskette.
To recover the BIOS code and restore the blade server operation to the primary
page, complete the following steps:
1. Download the latest version of the BIOS code from http://www.ibm.com/support/.
2. Update the BIOS code, following the instructions that come with the update file
that you downloaded. This will automatically restore and update the primary
page.
3. Restart the blade server.
If that procedure fails, the blade server might not restart correctly or might not
display video. To manually restore the BIOS code, complete the following steps:
1. Read the safety information that begins on page vii and “Handling
static-sensitive devices” on page 56.
2. Turn off the blade server.
3. Remove the blade server from the BladeCenter unit (see “Removing the blade
server from a BladeCenter unit” on page 57).
4. Remove the cover (see “Removing the blade server cover” on page 59).
5. Locate switch block SW2 on the system board (see “System-board switches”
on page 7).
6. Move the BIOS backup page switch (SW2-1) to the ON position to enable the
backup page.
7. Replace the cover and reinstall the blade server in the BladeCenter unit,
making sure that the media tray is selected by the relevant blade server.
8. Insert the BIOS flash diskette into the diskette drive.
9. Restart the blade server. The system begins the power-on self-test (POST).
10. Select 1 - Update POST/BIOS from the menu that contains various flash
(update) options.
Attention: Do not type Y when you are prompted to back up the ROM
location; doing so causes the damaged BIOS to be copied into the secondary
page.
11. When you are prompted whether you want to move the current POST/BIOS
image to the backup ROM location, type N.
12. When you are prompted whether you want to save the current code to a
diskette, type N.
13. Select Update the BIOS .
Attention: Do not restart the blade server at this time.
48 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
14. When the update is complete, remove the flash diskette from the diskette
drive.
15. Turn off the blade server, remove it from the BladeCenter unit, and remove the
cover of the blade server.
16. Move switch SW2-1 to OFF to return to the normal startup mode.
17. Replace the cover and reinstall the blade server in the BladeCenter unit; then
restart the blade server.
Service processor (BMC) error codes
The baseboard management controller (BMC) log contains up to 512 of the most
recent service processor errors in IPMI format. These messages are a combination
of plain text and error code numbers. You can view the BMC log from the
Configuration/Setup Utility menu by selecting Advanced Setup > Baseboard
Management Controller (BMC) Settings > BMC System Event Log .
You can view additional information and error codes in plain text by viewing the
management-module event log in your BladeCenter unit.
Solving SAS hard disk drive problems
For any SAS error message, one or more of the following devices might be causing
the problem:
v A failing SAS device (adapter, drive, or controller)
v An improper SAS configuration
For any SAS error message, make sure that the SAS devices are configured
correctly.
Solving undetermined problems
Note: When you are diagnosing a problem in the blade server, you must determine
whether the problem is in the blade server or in the BladeCenter unit.
v If all of the blade servers have the same symptom, it is probably a BladeCenter
unit problem; for more information, see the Hardware Maintenance Manual and
Troubleshooting Guide or Problem Determination and Service Guide for your
BladeCenter unit.
v If the BladeCenter unit contains more than one blade server and only one of the
blade servers has the problem, troubleshoot the blade server that has the
problem.
the diagnostic tests did not diagnose the failure or if the blade server is
If
inoperative, use the information in this section.
If you suspect that a software problem is causing failures (continuous or
intermittent), see “Software problems” on page 37.
Damaged data in CMOS memory or damaged BIOS code can cause undetermined
problems. To reset the CMOS data, remove and replace the battery to override the
power-on password and clear the CMOS memory; see “Removing the battery” on
page 70. If you suspect that the BIOS code is damaged, see “Recovering from a
BIOS update failure” on page 48.
Chapter 2. Diagnostics 49
Check the LEDs on all the power supplies of the BladeCenter unit in which the
blade server is installed. If the LEDs indicate that the power supplies are working
correctly and reseating the blade server does not correct the problem, complete the
following steps:
1. Make sure that the control panel connector is correctly seated on the system
board (see “System-board connectors” on page 7 for the location of the
connector).
2. If no LEDs on the control panel are working, replace the bezel assembly; then,
try to turn on the blade server from the management-module (see the
BladeCenter documentation for more information).
3. Turn off the blade server.
4. Remove the blade server from the BladeCenter unit and remove the cover.
5. Make sure that the blade server is cabled correctly.
6. Remove or disconnect the following devices, one at a time, until you find the
failure. Reinstall, turn on, and reconfigure the blade server each time.
v I/O-expansion card.
v Drives.
v Memory modules. The minimum configuration requirement is 1 GB (two 512
MB DIMMs).
The following minimum configuration is required for the blade server to start:
v System board
v One microprocessor
v Two 512 MB DIMMs
v A functioning BladeCenter unit
Install and turn on the blade server. If the problem remains, suspect the
7.
following components in the following order:
a. DIMM
b. System board
c. Microprocessor
the problem is solved when you remove an I/O-expansion card from the blade
If
server but the problem recurs when you reinstall the same adapter, suspect the
I/O-expansion card; if the problem recurs when you replace the adapter with a
different one, suspect the system board.
If you suspect a networking problem and the blade server passes all the system
tests, suspect a network cabling problem that is external to the system.
Calling IBM for service
See Appendix A, “Getting help and technical assistance,” on page 83 for information
about calling IBM for service.
When you call for service, have as much of the following information available as
possible:
v Machine type and model
v Microprocessor and hard disk drive upgrades
v Failure symptoms
– Does the blade server fail the diagnostic programs? If so, what are the error
codes?
– What occurs? When? Where?
– Is the failure repeatable?
– Has the current server configuration ever worked?
– What changes, if any, were made before it failed?
50 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
– Is this the original reported failure, or has this failure been reported before?
Diagnostic program type and version level
v
v Hardware configuration (print screen of the system summary)
v BIOS code level
v Operating-system type and version level
can solve some problems by comparing the configuration and software setups
You
between working and nonworking servers. When you compare servers to each
other for diagnostic purposes, consider them identical only if all the following factors
are exactly the same in all the blade servers:
v Machine type and model
v BIOS level
v Adapters and attachments, in the same locations
v Address jumpers, terminators, and cabling
v Software versions and levels
v Diagnostic program type and version level
v Configuration option settings
v Operating-system control-file setup
Chapter 2. Diagnostics 51
52 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Chapter 3. Parts listing, Type 7981 and 1884
The following replaceable components are available for the HS20 Type 7981 and
1884 blade servers. To check for an updated parts listing on the Web, complete the
following steps:
1. Go to http://www.ibm.com/support.
2. Under Search technical support , type 7981 or 1884 and click Search .
3. Under Document type , select Parts information and click Go .
The illustrations in this document might differ slightly from your hardware.
Note:
8
1
2
3
7
6
4
5
Replaceable components are of three types:
v Tier 1 customer replaceable unit (CRU): Replacement of Tier 1 CRUs is your
responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for
the installation.
v Tier 2 customer replaceable unit: You may install a Tier 2 CRU yourself or
request IBM to install it, at no additional charge, under the type of warranty
service that is designated for your server.
v Field replaceable unit (FRU): FRUs must be installed only by trained service
technicians.
information about the terms of the warranty and getting service and assistance,
For
see the Warranty and Support Information document.
Index System, Type 8843 and 1884.
1 Heat sink, microprocessor (all models) 40K5867
2 Microprocessor 1.6 GHz (model 2Fx) 39Y6598
2 Microprocessor 2.0 GHz (model 3Fx) 39Y6599
Memory, 512 MB PC3200 ECC DDR (non-Chipkill™, RoHS) (all
3
models)
CRU part
number
(Tier 1)
39M5817
CRU part
number
(Tier 2)
FRU part
number
© Copyright IBM Corp. 2006 53
Index System, Type 8843 and 1884.
CRU part
number
(Tier 1)
CRU part
number
(Tier 2)
/3 Memory, 512 MB PC3200 ECC DDR (non-Chipkill, non-RoHS) 13N1424
3 Memory, 512 MB PC3200 ECC DDR (Chipkill, RoHS) (option) 39M5820
3 Memory, 1 GB PC3200 ECC DDR (Chipkill, RoHS) (option) 39M5808
3 Memory, 2 GB PC3200 ECC DDR (Chipkill, RoHS) (option) 39M5811
3 Memory, 4 GB PC3200 ECC DDR (Chipkill, RoHS) (option) 30R5146
4 Front bezel with LEDs and switches (all models) 40K5866
5 Filler, microprocessor heat sink (all models) 40K5863
6 System board assembly (all models) 40K0060
7 Hard disk drive, 73 GB SAS (option) 26K5779
8 Cover and label (all models) 26K5967
Alcohol wipe (all models) 59P4739
Adapter, iSCSI DC (option) 32R1926
Battery, 3.0 volt (all models) 33F8354
Fibre channel expansion card (option) 26R0836
Fibre channel expansion card (option) 59P6624
Fibre channel expansion card, SFF (option) 26K4859
Fibre channel expansion card, 4 GB - SFF (option) 26R0893
Fibre channel expansion card, 4 GB - SFF - Intel (option) 32R1842
Infiniband daughter card assembly (option) 26K6459
Gigabit Ethernet expansion card (RoHS) (option) 39M4630
Gigabit Ethernet expansion card, SFF (option) 13N2306
Kit, miscellaneous parts (all models)
40K5868
v Screw, M3.5 x 6 Phillips pan head, system board
v Screw, M3 x 3 Phillips (large head)
v Screw, 4-40 x 4.76 mm flat head
v Guide, blade top edge
v Socket, alignment
v Pin, alignment
v Flyer, ID stickers with instructions
v Assembly, system board lightbox with transparency
v Cover, connector plug
v Tray, daughter card mount, new form factor
Label, FRU list (all models) 40K5870
Label, system service (all models) 40K5869
Myrinet cluster expansion card (option) 32R1845
Tray, I/O-expansion card (standard form factor) (all models) 40K5865
Tray, SAS hard disk drive with left hand side lever release
(Channel 0) (all models)
Tray, SAS hard disk drive with right hand side lever release
(Channel 1) (all models)
31R2239
40K5859
FRU part
number
54 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Chapter 4. Removing and replacing blade server components
Replaceable components are of three types:
v Tier 1 customer replaceable unit (CRU): Replacement of Tier 1 CRUs is your
responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for
the installation.
v Tier 2 customer replaceable unit: You may install a Tier 2 CRU yourself or
request IBM to install it, at no additional charge, under the type of warranty that
is designated for your server.
v Field replaceable unit (FRU): FRUs must be installed only by trained service
technicians.
Chapter 3, “Parts listing, Type 7981 and 1884,” on page 53 to determine
See
whether a component is a Tier 1 CRU, Tier 2 CRU, or FRU.
For information about the terms of the warranty and getting service and assistance,
see the Warranty and Support Information document.
Installation guidelines
Before you install options, read the following information:
v Read the safety information that begins on page vii and the guidelines in
“Handling static-sensitive devices” on page 56. This information will help you
work safely.
v When you install your new blade server, take the opportunity to download and
apply the most recent firmware updates. This step will help to ensure that any
known issues are addressed and that your blade server is ready to function at
maximum levels of performance. To download firmware updates for your blade
server, go to http://www.ibm.com/support and click Downloads and drivers .
v Observe good housekeeping in the area where you are working. Place removed
covers and other parts in a safe place.
v Back up all important data before you make changes to disk drives.
v Before you remove a hot-swap blade server from the BladeCenter unit, you must
shut down the operating system and turn off the blade server. Yo u do not have to
shut down the BladeCenter unit itself.
v Blue on a component indicates touch points, where you can grip the component
to remove it from or install it in the blade server, open or close a latch, and so
on.
v Orange on a component or an orange label on or near a component indicates
that the component can be hot-swapped, which means that if the server and
operating system support hot-swap capability, you can remove or install the
component while the server is running. (Orange can also indicate touch points on
hot-swap components.) See the instructions for removing or installing a specific
hot-swap component for any additional procedures that you might have to
perform before you remove or install the component.
v For a list of supported options for the blade server, see http://www.ibm.com/
servers/eserver/serverproven/compat/us/.
© Copyright IBM Corp. 2006 55
System reliability guidelines
To help ensure proper cooling and system reliability, observe the following
guidelines:
v Make sure that processor socket 2 always contains either a microprocessor heat
sink-filler or a microprocessor and heat sink. If the blade server has only one
microprocessor, it must be installed in microprocessor socket 1.
v To maintain proper system cooling, do not operate the BladeCenter unit without a
blade server, expansion unit, or blade filler installed in each blade bay. See the
documentation for your BladeCenter unit for additional information.
Handling static-sensitive devices
Attention: Static electricity can damage the blade server and other electronic
devices. To avoid damage, keep static-sensitive devices in their static-protective
packages until you are ready to install them.
To reduce the possibility of damage from electrostatic discharge, observe the
following precautions:
v When working on the BladeCenter unit, use an electrostatic discharge (ESD)
wrist strap, especially when you will be handling modules, options, and blade
servers. To work properly, the wrist strap must have a good contact at both ends
(touching your skin at one end and firmly connected to the front or back of the
BladeCenter unit).
v Limit your movement. Movement can cause static electricity to build up around
you.
v Handle the device carefully, holding it by its edges or its frame.
v Do not touch solder joints, pins, or exposed circuitry.
v Do not leave the device where others can handle and damage it.
v While the device is still in the static-protective package, touch it to an unpainted
metal part of the BladeCenter unit or any unpainted metal surface on any other
grounded rack component in the rack you are installing the device in for at least
2 seconds. This drains static electricity from the package and from your body.
v Remove the device from the package and install it directly into the blade server
without setting down the device. If it is necessary to set down the device, put it
back into the static-protective package. Do not place the device on the blade
server cover or on a metal surface.
v Take additional care when handling devices during cold weather. Heating reduces
indoor humidity and increases static electricity.
Returning a device or component
If you are instructed to return a device or component, follow all packaging
instructions, and use any packaging materials for shipping that are supplied to you.
56 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Removing the blade server from a BladeCenter unit
Attention:
v To maintain proper system cooling, do not operate the BladeCenter unit without a
blade server, expansion unit, or blade filler installed in each blade bay.
v Note the bay number. Reinstalling a blade server into a different bay than the
one from which it was removed could have unintended consequences. Some
configuration information and update options are established according to bay
number; if you reinstall the blade server into a different bay, you might have to
reconfigure the blade server.
remove the blade server from a BladeCenter unit, complete the following steps.
To
Release handles
(open)
1. Read the safety information that begins on page vii and “Installation guidelines”
on page 55.
2. If the blade server is running, shut down the operating system; then, press the
power-control button (behind the blade server control panel door) to turn off the
blade server (see “Turning off the blade server” on page 6 for more information).
Attention: Wait at least 30 seconds, until the hard disk drives stop spinning,
before proceeding to the next step.
3. Open the two release handles as shown in the illustration. The blade server
moves out of the bay approximately 0.6 cm (0.25 inch).
4. Pull the blade server out of the bay. Spring-loaded doors farther back in the bay
move into place to cover the bay temporarily.
5. Place either a blade filler or another blade server in the bay within 1 minute.
The recessed spring-loaded doors move out of the way as you insert the blade
server or blade filler.
Chapter 4. Removing and replacing blade server components 57
Installing the blade server in a BladeCenter unit
To install a blade server in a BladeCenter unit, complete the following steps.
Release handles
(open)
Statement 21:
CAUTION:
Hazardous energy is present when the blade server is connected to the power
source. Always replace the blade cover before installing the blade server.
1. Read the safety information that begins on page vii and “Installation guidelines”
on page 55.
2. Make sure that the release handles on the blade server are in the open position
(perpendicular to the blade server).
3. If you installed a blade filler or another blade server in the bay from which you
removed the blade server, remove it from the bay.
Attention: Yo u must install the blade server in the same blade bay from which
you removed it. Some blade server configuration information and update options
are established according to bay number. Reinstalling a blade server into a
different blade bay from the one from which it was removed could have
unintended consequences, and you might have to reconfigure the blade server.
4. Slide the blade server into the blade bay from which you removed it until it
stops. The spring-loaded doors farther back in the bay that cover the bay
opening move out of the way as you insert the blade server.
5. Push the release handles on the front of the blade server closed.
6. Turn on the blade server (see “Turning on the blade server” on page 6 for
instructions).
7. Make sure that the power-on LED on the blade server control panel is lit
continuously, indicating that the blade server is receiving power and is turned
on.
8. (Optional) Write identifying information on one of the labels that come with the
blade servers and place the label on the BladeCenter unit bezel.
Important: Do not place the label on the blade server or in any way block the
ventilation holes on the blade server. See the documentation for your
BladeCenter unit for information about the label placement.
58 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
If you have changed the configuration of the blade server or if you are installing a
different blade server from the one that you removed, you must configure the blade
server through the Configuration/Setup Utility, and you might have to install the
blade server operating system. Detailed information about these tasks is available
in the Installation and User’s Guide .
Removing and replacing Tier 1 CRUs
Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at
your request, you will be charged for the installation.
The illustrations in this document might differ slightly from your hardware.
Removing the blade server cover
To remove the blade server cover, complete the following steps.
Cover
Pins
Blade-Cover
Release
Blade-Cover
Release
1. Read the safety information that begins on page vii and “Installation guidelines”
on page 55.
2. If the blade server is installed in a BladeCenter unit, remove it (see “Removing
the blade server from a BladeCenter unit” on page 57 for instructions).
3. Carefully lay the blade server down on a flat, static-protective surface, with the
cover side up.
4. Press the blade-cover release on each side of the blade server and lift the
cover open.
5. Lift the cover from the blade server and store it for future use.
Chapter 4. Removing and replacing blade server components 59
Installing the blade server cover
To install the blade server cover, complete the following steps.
Statement 21:
CAUTION:
Hazardous energy is present when the blade server is connected to the power
source. Always replace the blade cover before installing the blade server.
Cover
Pins
Blade-Cover
Release
Blade-Cover
Release
Important: The blade server cannot be inserted into the BladeCenter unit until the
cover is installed and closed. Do not attempt to override this protection.
1. Read the safety information that begins on page vii and “Installation guidelines”
on page 55.
2. Lower the cover so that the slots at the rear slide down onto the pins at the rear
of the blade server. Before closing the cover, check that all components are
installed and seated correctly and that you have not left loose tools or parts
inside the blade server.
3. Pivot the cover to the closed position until it clicks into place.
4. Install the blade server into the BladeCenter unit. See “Installing the blade
server in a BladeCenter unit” on page 58 for instructions.
60 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Removing the bezel assembly
To remove the bezel assembly, complete the following steps.
Bezel-assembly
release
Bezel-assembly
release
Control panel
connector
1. Read the safety information that begins on page vii and “Installation guidelines”
on page 55.
2. Open the blade server cover (see “Removing the blade server cover” on page
59 for instructions).
3. Press the bezel-assembly release on each side of the blade server and pull the
bezel assembly away from the blade server approximately 1.2 cm (0.5 inch).
4. Disconnect the control panel cable from the control panel connector.
5. Pull the bezel assembly away from the blade server.
6. Store the bezel assembly in a safe place.
Installing the bezel assembly
To install the bezel assembly, complete the following steps.
Control-panel
cable
Bezel-assembly
release
1. Read the safety information that begins on page vii and “Installation guidelines”
on page 55.
2. Connect the control panel cable to the control panel connector on the system
board.
3. Carefully slide the bezel assembly onto the blade server until it clicks into place.
Bezel-assembly
release
Control panel
connector
Control-panel
cable
Chapter 4. Removing and replacing blade server components 61
Removing a SAS hard disk drive
To remove a SAS hard disk drive, complete the following steps:
Hard
drive
release
lever
SAS ID 1
SAS ID 0
Hard drive
release
1. Read the safety information that begins on page vii and “Installation guidelines”
on page 55.
2. Shut down the operating system, turn off the blade server, and remove the
blade server from the BladeCenter unit. See “Removing the blade server from a
BladeCenter unit” on page 57 for instructions.
3. Remove the blade server cover (see “Removing the blade server cover” on
page 59 for instructions).
4. Locate the hard disk drive that is to be removed (SAS ID 0 or SAS ID 1).
5. While pulling the blue release lever at the front of the hard disk drive tray, slide
the drive out of the SAS connector and disengage it from the drive tray.
6. The SAS drive tray can be removed by removing the four screws that secure it
to the system board and then lifting the tray from the system board.
7. If you are instructed to return the SAS drive, follow all packaging instructions,
and use any packaging materials for shipping that are supplied to you.
lever
62 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Installing a SAS hard disk drive
To install a SAS hard disk drive, complete the following steps.
Hard
drive
release
lever
SAS ID 1
SAS ID 0
Hard drive
release
1. Identify the location (SAS ID 0 or SAS ID 1) in which the hard disk drive will be
installed.
lever
Note: If you will be installing the SAS hard disk drive in to the SAS ID 1
location, you may need to remove a standard-form-factor expansion card tray
(see “Removing a standard-form-factor expansion card” on page 67) and then
install the SAS hard disk drive tray with the four screws you removed from the
expansion card tray.
Attention: Do not press on the top of the drive. Pressing the top might
damage the drive.
2. Place the drive into the hard disk drive tray and push it toward the rear of the
drive, into the connector until the drive moves past the lever at the front of the
tray.
3. Install the blade server cover (see “Installing the blade server cover” on page 60
for instructions).
4. Install the blade server into the BladeCenter unit. See “Installing the blade
server in a BladeCenter unit” on page 58 for instructions.
Removing a memory module
The following illustration shows the locations of the DIMM sockets on the system
board.
DIMM 2
DIMM 1
DIMM slot 4
DIMM slot 3
Chapter 4. Removing and replacing blade server components 63
DIMM
Retaining clip
Complete the following steps to remove a DIMM:
1. Read the safety information that begins “Safety” on page vii and “Installation
guidelines” on page 55
2. If the blade server is installed in a BladeCenter unit, remove it (see “Removing
the blade server from a BladeCenter unit” on page 57).
3. Remove the blade server cover (see “Removing the blade server cover” on
page 59).
4. Locate the DIMMs on the system board. Determine which DIMM you want to
remove from the blade server.
5. Move the DIMM retaining clips on the side of the DIMM socket to the open
position by pressing the retaining clips away from the center of the DIMM
socket.
Attention: To avoid breaking the retaining clips or damaging the DIMM
connectors, handle the clips gently.
6. Using your fingers, pull the DIMM out of the DIMM socket.
7. If you are instructed to return the DIMM, follow all packaging instructions, and
use any packaging materials for shipping that are supplied to you.
Installing a memory module
Note: If you are replacing a faulty DIMM, make sure that the replacement DIMM is
the correct type of memory. If you are adding DIMMs, install the DIMMs in the
sequence shown in the following table. See the Installation and User ’s Guide for
additional information about the type of memory that is compatible with the server.
Pair DIMM connectors
First 1 (J113) and 2 (J111)
Second 3 (J112) and 4 (J110)
64 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
The following illustration shows the locations of the DIMM sockets on the system
board.
DIMM 2
DIMM 1
DIMM slot 4
DIMM slot 3
Complete the following steps to install a DIMM:
DIMM
Retaining clip
1. Read the documentation that comes with the DIMM.
2. Locate the DIMM connectors on the system board. Determine the connectors
into which you will install the DIMMs.
3. Touch the static-protective package that contains the DIMM to any unpainted
metal surface on the BladeCenter unit or any unpainted surface on any other
grounded rack component for at least 2 seconds. Then, remove the DIMM from
the package.
4. Make sure that both of the connector retaining clips are in the fully open
position.
5. Turn the DIMM so that the DIMM keys align correctly with the connector on the
system board.
Attention: To avoid breaking the DIMM retaining clips or damaging the DIMM
connectors, open and close the clips gently.
6. Insert the DIMM by pressing the DIMM along the guides into the connector.
Make sure that the retaining clips snap into the closed positions.
Attention: If there is a gap between the DIMM and the retaining clips, the
DIMM has not been correctly installed. In this case, open the retaining clips and
remove the DIMM; then, reinsert the DIMM.
7. Install the blade server cover (see “Installing the blade server cover” on page
60).
8. Install the blade server into the BladeCenter unit (see “Installing the blade
server in a BladeCenter unit” on page 58).
Chapter 4. Removing and replacing blade server components 65
Removing a small-form-factor expansion card
To remove a small-form-factor expansion card, complete the following steps:
INSTALLING CARD
1. Read the safety information that begins on page “Safety” on page vii and
“Installation guidelines” on page 55.
2. If the blade server is installed in a BladeCenter unit, remove it (see “Removing
the blade server from a BladeCenter unit” on page 57).
3. Remove the blade server cover (see “Removing the blade server cover” on
page 59 for instructions).
4. Gently pivot the wide end of the card out of the expansion card connectors;
then, slide the notched end of the card out of the raised hook on the tray and lift
the card out of the blade server.
Expansion
card
PRESS HEREWHEN
Installing a small-form-factor expansion card
To install a small-form-factor expansion card, complete the following steps:
INSTALLING CARD
Expansion
card
PRESS HEREWHEN
66 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
1. Touch the static-protective package that contains the expansion card to any
unpainted metal surface on the BladeCenter unit or any unpainted metal surface
on any other grounded rack component; then, remove the expansion card from
the package.
2. Orient the expansion card over the system board.
3. Slide the notch in the narrow end of the card into the raised hook on the tray;
then, gently pivot the card into the expansion card connector.
Note: For device-driver and configuration information to complete the
installation of the expansion card, see the documentation that comes with the
expansion card.
4. Install the blade server cover (see “Installing the blade server cover” on page
60).
5. Install the blade server into the BladeCenter unit (see “Installing the blade
server in a BladeCenter unit” on page 58).
6. If you are instructed to return the expansion card, follow all packaging
instructions, and use any packaging materials for shipping that are supplied to
you.
Removing a standard-form-factor expansion card
To remove a standard-form-factor expansion card, complete the following steps.
Expansion
card
PRESS HEREWHEN
INSTALLING CARD
Expansion
card tray
Hard disk
drive tray
1. Read the safety information that begins on page vii and “Installation guidelines”
on page 55.
2. If the blade server is installed in a BladeCenter unit, remove it (see “Removing
the blade server from a BladeCenter unit” on page 57).
3. Remove the blade server cover (see “Removing the blade server cover” on
page 59 for instructions).
4. Gently pivot the wide end of the card out of the expansion card connectors;
then, slide the notched end of the card out of the raised hook on the tray and lift
the card out of the blade server.
Chapter 4. Removing and replacing blade server components 67
5. If you will be installing a SAS hard disk drive into the SAS ID 1 location, you will
need to remove the expansion card tray. Remove the screws that secure the
expansion card tray to the system board; then, lift the tray from the system
board.
6. Install the blade server cover (see “Installing the blade server cover” on page
60).
7. Install the blade server into the BladeCenter unit (see “Installing the blade
server in a BladeCenter unit” on page 58).
8. If you are instructed to return the expansion card, follow all packaging
instructions, and use any packaging materials for shipping that are supplied to
you.
68 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Installing a standard-form-factor expansion card
To install a replacement standard-form-factor expansion card, complete the
following steps.
Expansion
card
Expansion
card tray
Hard disk
drive tray
PRESS HEREWHEN
INSTALLING CARD
1. If the expansion card tray was removed, secure the tray to the system board
with the screws from the option kit or from the removed drive tray. If a SAS hard
disk or SAS hard disk drive tray is installed in SAS hard disk drive bay 1,
remove the SAS hard disk drive and drive tray (see “Removing a SAS hard disk
drive” on page 62).
2. Touch the static-protective package that contains the expansion card to any
unpainted metal surface on the BladeCenter unit or any unpainted metal surface
on any other grounded rack component; then, remove the expansion card from
the package.
3. Orient the expansion card and slide the narrow end of the card into the raised
hook on the tray; then, gently pivot the wide end of the card into the expansion
card connectors.
4. Install the blade server cover (see “Installing the blade server cover” on page
60).
5. Install the blade server into the BladeCenter unit (see “Installing the blade
server in a BladeCenter unit” on page 58).
Chapter 4. Removing and replacing blade server components 69
Removing the battery
To remove the battery, complete the following steps.
1. Read the safety information that begins on page vii and “Installation guidelines”
on page 55
2. If the blade server is installed in a BladeCenter unit, remove it (see “Removing
the blade server from a BladeCenter unit” on page 57 for instructions).
3. Remove the blade server cover (see “Removing the blade server cover” on
page 59 for instructions).
4. Locate the battery on the system board.
5. Use one fingernail to press the top of the battery clip away from the battery. The
battery pops up when it is released.
6. Use your thumb and index finger to lift the battery from the socket.
Installing the battery
The following notes describe information that you must consider when you are
replacing the battery in the blade server.
v When you are replacing the battery, you must replace it with a lithium battery of
the same type from the same manufacturer.
v To order replacement batteries, call 1-800-426-7378 within the United States, and
1-800-465-7999 or 1-800-465-6666 within Canada. Outside the U.S. and
Canada, call your IBM marketing representative or authorized reseller.
v After you replace the battery, you must reconfigure the server and reset the
system date and time.
v To avoid possible danger, read and follow the following safety statement.
Battery
70 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Statement 2:
CAUTION:
When replacing the lithium battery, use only IBM Part Number 33F8354 or an
equivalent type battery recommended by the manufacturer. If your system has
a module containing a lithium battery, replace it only with the same module
type made by the same manufacturer. The battery contains lithium and can
explode if not properly used, handled, or disposed of.
Do not:
v Throw or immerse into water
v Heat to more than 100°C (212°F)
v Repair or disassemble
Dispose
To install the battery, complete the following steps.
of the battery as required by local ordinances or regulations.
1. Follow any special handling and installation instructions that come with the
battery.
2. Insert the battery:
a. Tilt the battery so that you can insert it into the socket on the side opposite
the battery clip.
b. Press the battery down into the socket until it clicks into place. Make sure
that the battery clip holds the battery securely.
Install the blade server cover (see “Installing the blade server cover” on page
3.
60).
4. Install the blade server into the BladeCenter unit (see “Installing the blade
server in a BladeCenter unit” on page 58).
5. Turn on the blade server and run the Configuration/Setup Utility program. Set
configuration parameters as needed (see “Using the Configuration/Setup Utility
program” on page 79 for information).
Chapter 4. Removing and replacing blade server components 71
Removing and replacing FRUs
FRUs must be installed only by trained service technicians.
The illustrations in this document might differ slightly from your hardware.
Removing a microprocessor and heat sink
Read the following important guidelines before removing a microprocessor that is
not faulty (for example, when you are replacing the system board assembly).
Attention: Do not use an alcohol wipe with the microprocessor.
If you are not replacing a defective heat sink or microprocessor, the thermal
material on the heat sink and microprocessor will remain effective if you complete
the following steps:
1. Carefully handle the heat sink and microprocessor when removing or installing
these components. Do not touch the thermal material or otherwise allow it to
become contaminated.
2. In a dual-microprocessor blade server, the microprocessor and the heat sink are
a matched set. First transfer the heat sink and microprocessor from one socket
to the new system board; then, transfer the other heat sink and microprocessor.
(This will ensure that the thermal material remains evenly distributed between
each heat sink and microprocessor.)
Notes:
v The heat-sink FRU is packaged with the thermal material applied to the
underside. This thermal material is not available as a separate FRU. The heat
sink must be replaced when new thermal material is required, such as when a
defective microprocessor is replaced or if the thermal material is contaminated or
has come in contact with another object other than its paired microprocessor.
v The microprocessor FRU for this system board includes a heat sink.
v A heat-sink FRU can be ordered separately if the thermal material becomes
contaminated.
To remove a microprocessor, complete the following steps:
1. Read the safety information that begins on page vii, the “Installation guidelines”
on page 55 and “Handling static-sensitive devices” on page 56.
2. Remove the blade server from the BladeCenter unit (see “Removing the blade
server from a BladeCenter unit” on page 57).
3. Remove the blade server cover (see “Removing the blade server cover” on
page 59 for instructions).
4. Remove the bezel assembly (see “Removing the bezel assembly” on page 61
for instructions).
5. Identify the microprocessor that is to be removed.
Note: If you are replacing a failed microprocessor, make sure that you have
selected the correct microprocessor for replacement (see “Light path
diagnostics” on page 38).
6. Remove the heat sink.
72 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Attention: Do not touch the thermal material on the bottom of the heat sink.
Touching the thermal material will contaminate it. If the thermal material on the
microprocessor or heat sink becomes contaminated, you must replace the heat
sink.
a. Press firmly on the captive screws and loosen them with a screwdriver.
Follow the pattern that is shown on the following label. Rotate each screw
two full turns each time. Repeat the sequence until the screws are loose.
REPEAT SEQUENCE
UNTIL TIGHT.
720°
720°
b. Use your fingers to gently pull the heat sink from the processor.
7. Use your fingers to release the microprocessor locking lever by moving it away
from the microprocessor socket and around the locking lever retainer, tab then,
rotate it upward to the fully open position (approximately 90° angle).
Microprocessorlocking lever
Locking lever
retainer tab
Attention: Make sure that the locking lever on the microprocessor socket is in
the fully open position before you remove the microprocessor from or insert the
microprocessor into the socket. Failure to do so might result in permanent
damage to the microprocessor, microprocessor socket, and system board.
Chapter 4. Removing and replacing blade server components 73
8. Use your fingers to pull the microprocessor out of the socket.
Microprocessorlocking lever
Installing a microprocessor and heat sink
To install a microprocessor and heat sink, complete the following steps
1. If there is a sticker on the microprocessor socket, use your fingers to remove it.
ATTENTION!
Microprocessor
Microprocessor
socket
90°
Lever must be fully open before
Lever must be fully open before
installing the microprocessor
installing the microprocessor
2. If the microprocessor locking lever is closed, release it by moving it away from
the microprocessor socket and around the locking lever retainer tab; then, rotate
it upward to the fully open position.
Microprocessorlocking lever
Locking lever
retainer tab
74 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Attention:
v Do not use any tools or sharp objects to lift the locking lever on the
microprocessor socket. Doing so might result in permanent damage to the
system board.
v Make sure that the locking lever on the microprocessor socket is in the fully
open position before you install the microprocessor into the microprocessor
socket. Failure to do so might result in permanent damage to the
microprocessor, microprocessor socket, and system board.
Microprocessor
Microprocessor-
socket
locking lever
v Avoid touching the components and gold pins on the microprocessor. Make
sure that the microprocessor is completely and correctly seated in the socket.
Incomplete insertion might cause damage to the system board or to the
microprocessor.
Touch the static-protective package that contains the replacement
3.
microprocessor to any unpainted metal surface on the blade server or any
unpainted metal surface on any other grounded rack component for at least 2
seconds; then, remove the microprocessor from the package.
Attention: Do not use excessive force when you are pressing the
microprocessor into the socket.
4. Center the microprocessor over the microprocessor socket. Align the triangle on
the corner of the microprocessor with the triangle on the corner of the socket
and make sure that the pin patterns of the microprocessor and microprocessor
socket match; then, use your fingers to carefully press the microprocessor into
the socket.
Microprocessor
Microprocessorlocking lever
Microprocessor
socket
Chapter 4. Removing and replacing blade server components 75
Attention: Make sure that the microprocessor is oriented and aligned correctly
in the socket before you try to close the lever.
5. Carefully close the lever to secure the microprocessor in the socket.
6. Install a heat sink on the microprocessor.
Note: The microprocessor FRU comes with a new heat sink, do not install a
used heat sink with a new microprocessor.
Heat Sink
Microprocessor 2
Microprocessor 1
and Heat Sink
Attention:
v Do not set down the heat sink after you remove the plastic cover.
v Do not touch the thermal material on the bottom of the heat sink. Touching
the thermal material will contaminate it. If the thermal material on the
microprocessor or heat sink becomes contaminated, you must replace the
heat sink.
a. Remove the plastic protective cover from the thermal material on the bottom
of the heat sink.
b. Make sure that the thermal material is still on the bottom of the heat sink;
then, align and place the heat sink on top of the microprocessor in the
retention bracket, thermal material side down. Press firmly on the heat sink.
c. Align the four screws on the heat sink with the holes on the heat-sink
retention module.
d. Press firmly on the captive screws and tighten them with a screwdriver.
Follow the pattern that is shown on the following label. Rotate each screw
two full rotations each time. Repeat the sequence until the screws are tight.
Do not overtighten the screws by using excessive force.
720°
7. Install the bezel assembly (see “Installing the bezel assembly” on page 61).
8. Install the blade server cover (see “Installing the blade server cover” on page
60).
76 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
REPEAT SEQUENCE
UNTIL TIGHT.
720°
9. Install the blade server into the BladeCenter unit (see “Installing the blade
server in a BladeCenter unit” on page 58).
Chapter 4. Removing and replacing blade server components 77
Removing the system board assembly
When replacing the system board, you will replace the system board and blade
base as one assembly. After replacement, you must either update the blade server
with the latest firmware or restore the pre-existing firmware that the customer
provides on a diskette or CD image.
Note: See “System-board layouts” on page 7 for more information on the locations
of the connectors, jumpers and LEDs on the system board.
To remove the system board assembly, complete the following steps:
1. Read the safety information that begins on page vii, the “Installation guidelines”
on page 55, and “Handling static-sensitive devices” on page 56.
2. Remove the blade server from the BladeCenter (see “Removing the blade
server from a BladeCenter unit” on page 57).
3. Remove the blade server cover (see “Removing the blade server cover” on
page 59).
4. Remove the blade server bezel assembly (see “Removing the bezel assembly”
on page 61).
5. Remove any of the installed components in the following list from the system
board assembly; then, place them on a non-conductive surface or install them
on the new system board assembly.
v I/O expansion card. See “Removing a small-form-factor expansion card” on
page 66 and “Removing a standard-form-factor expansion card” on page 67.
v Hard disk drives. See “Removing a SAS hard disk drive” on page 62.
v Microprocessors and heat sinks. See “Removing a microprocessor and heat
sink” on page 72.
v DIMMs. See “Removing a memory module” on page 63.
Installing the system board assembly
To install the system board assembly, complete the following steps:
1. Install any of the components in the following list that were removed from the
old system board assembly onto the new system board assembly.
v I/O expansion card. See “Installing a small-form-factor expansion card” on
page 66 and “Installing a standard-form-factor expansion card” on page 69.
v Hard disk drives. See “Installing a SAS hard disk drive” on page 63.
v Microprocessors and heat sinks. See “Installing a microprocessor and heat
sink” on page 74.
v DIMMs. See “Installing a memory module” on page 64.
2. Install the bezel assembly (see “Installing the bezel assembly” on page 61).
3. Install the blade server cover (see “Installing the blade server cover” on page
60).
4. Install the blade server into the BladeCenter unit. See “Installing the blade
server in a BladeCenter unit” on page 58.
78 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Chapter 5. Configuration information and instructions
This chapter provides information about updating the firmware and using the
configuration utilities.
Firmware updates
IBM periodically makes BIOS, service processor (BMC), and diagnostic firmware
updates available for the blade server. Go to http://www.ibm.com/support/ to
download the latest firmware for the blade server. Install any updates, using the
instructions that are included with the downloaded file.
Important: To avoid problems and to maintain proper system performance, always
ensure that the blade server BIOS, service processor, and diagnostic firmware
levels are consistent for all blade servers of the same type within the BladeCenter
unit.
Using the Configuration/Setup Utility program
The Configuration/Setup Utility program is part of the BIOS. You can use it to:
v Change interrupt request (IRQ) settings
v Change the startup drive sequence
v Configure serial-port assignments
v Enable USB keyboard and mouse support
v Resolve configuration conflicts
v Set the date and time
v Set passwords and security settings
start the Configuration/Setup Utility program, complete the following steps:
To
1. Turn on the server.
2. When the message Press F1 for Configuration/Setup appears, press F1. If
an administrator password has been set, you must type the administrator
password to access the full Configuration/Setup Utility menu.
3. Follow the instructions on the screen.
Configuring the Gigabit Ethernet controllers
Two Ethernet controllers are integrated on the blade server system board. Each
controller provides a 1-Gbps full-duplex interface for connecting to one of the
Ethernet-compatible switch modules in I/O-module bays 1 and 2, which enables
simultaneous transmission and reception of data on the Ethernet local area network
(LAN). Each Ethernet controller on the system board is routed to a different switch
module in I/O module bay 1 or bay 2. See “Blade server Ethernet controller
enumeration” on page 80 for information about how to determine the routing from
Ethernet controller to I/O-module bay for your blade server.
You do not have to set any jumpers or configure the controllers for the blade server
operating system. However, you must install a device driver to enable the blade
server operating system to address the Ethernet controllers. For device drivers and
information about configuring the Ethernet controllers, see the Broadcom NetXtreme
Gigabit Ethernet Software CD that comes with the blade server. To find updated
information about configuring the controllers, complete the following steps.
© Copyright IBM Corp. 2006 79
Note: Changes are made periodically to the IBM Web site. The actual procedure
might vary slightly from what is described in this document.
1. Go to http://www.ibm.com/support.
2. Under Search technical support , type 7981 or 1884, and click Search .
3. In the Additional search terms field, type ethernet, and click Go .
The Ethernet controllers support failover, which provides automatic redundancy for
the Ethernet controllers. Without failover, you can have only one Ethernet controller
from each server attached to each virtual LAN or subnet. With failover, you can
configure more than one Ethernet controller from each server to attach to the same
virtual LAN or subnet. Either one of the integrated Ethernet controllers can be
configured as the primary Ethernet controller. If you have configured the controllers
for failover and the primary link fails, the secondary controller takes over. When the
primary link is restored, the Ethernet traffic switches back to the primary Ethernet
controller. (See your operating system device driver documentation for information
about configuring for failover.)
Important: To support failover on the blade server Ethernet controllers, the Ethernet
switch modules in the BladeCenter unit must have identical configurations.
Blade server Ethernet controller enumeration
The enumeration of the Ethernet controllers in a blade server is operating-system
dependent. Through the operating-system settings, you can verify the Ethernet
controller designations that a blade server uses.
The routing of an Ethernet controller to a particular I/O-module bay depends on the
type of blade server. Yo u can verify which Ethernet controller is routed to which
I/O-module bay by using the following test:
1. Install only one Ethernet switch module or pass-thru module in I/O-module bay
1.
2. Make sure that the ports on the switch module or pass-thru module are enabled
(I/O Module Tasks → Management → Advanced Management in the
management-module Web interface).
3. Enable only one of the Ethernet controllers on the blade server. Note the
designation that the blade server operating system has for the controller.
4. Ping an external computer on the network that the switch module is connected
to. If you can ping the external computer, the Ethernet controller that you
enabled is associated with the switch module in I/O-module bay 1. The other
Ethernet controller in the blade server is associated with the switch module in
I/O-module bay 2.
If you have installed an expansion card in a blade server, communication from the
expansion card is routed to I/O-module bays 3 and 4, if these bays are supported
by your BladeCenter unit. You can verify which controller on the card is routed to
which I/O-module bay by performing the same test and using a controller on the
expansion card and a compatible switch module or pass-thru module in I/O-module
bay 3 or 4.
80 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Configuring a SAS RAID
Note: Configuring a SAS RAID array applies only to a blade server in which two
SAS hard disk drives are installed.
You can configure a SAS redundant array of independent disks (RAID) array for
your blade server. Yo u can use two SAS hard disk drives in the blade server to
implement and manage a RAID level-0 (striping) or RAID level-1 (mirroring) array
under an operating system that is listed at http://www.ibm.com/eserver/
serverproven/compat/us.
Guide .
Important: Yo u must create the RAID array before you install the operating system
on the blade server.
You can use the LSI Logic Configuration Utility program to configure the SAS hard
disk drives and SAS controller. To start the LSI Logic Configuration Utility, complete
the following steps:
1. Turn on the blade server (make sure that the blade server is the owner of the
keyboard, video, and mouse) and watch the monitor screen.
2. When the
Press Ctrl-C to start LSI Logic Configuration Utility
For more information, see the Installation and User’s
prompt appears, press Ctrl+C.
3. Follow the instructions on the screen to modify the SAS hard disk drive and
SAS controller settings.
Chapter 5. Configuration information and instructions 81
82 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide
Appendix A. Getting help and technical assistance
If you need help, service, or technical assistance or just want more information
about IBM products, you will find a wide variety of sources available from IBM to
assist you. This appendix contains information about where to go for additional
information about IBM and IBM products, what to do if you experience a problem
with your BladeCenter
is necessary.
Before you call
Before you call, make sure that you have taken these steps to try to solve the
problem yourself:
v Check all cables to make sure that they are connected.
v Check the power switches to make sure that the system and any optional
devices are turned on.
v Use the troubleshooting information in your system documentation, and use the
diagnostic tools that come with your system. Information about diagnostic tools is
in the Hardware Maintenance Manual and Troubleshooting Guide or Problem
Determination and Service Guide on the IBM Documentation CD that comes with
your system.
v Go to http://www.ibm.com/bladecenter/ and click Support to check for information
to help you solve the problem.
®
product or optional device, and whom to call for service, if it
can solve many problems without outside assistance by following the
You
troubleshooting procedures that IBM provides in the online help or in the
documentation that is provided with your IBM product. The documentation that
comes with BladeCenter systems also describes the diagnostic tests that you can
perform. Most BladeCenter systems, operating systems, and programs come with
documentation that contains troubleshooting procedures and explanations of error
messages and error codes. If you suspect a software problem, see the
documentation for the software.
Using the documentation
Information about your IBM BladeCenter system and preinstalled software, if any, or
optional device is available in the documentation that comes with the product. That
documentation can include printed documents, online documents, readme files, and
help files. See the troubleshooting information in your system documentation for
instructions for using the diagnostic programs. The troubleshooting information or
the diagnostic programs might tell you that you need additional or updated device
drivers or other software. IBM maintains pages on the World Wide Web where you
can get the latest technical information and download device drivers and updates.
To access these pages, go to http://www.ibm.com/bladecenter/, click Support, and
follow the instructions. Also, some documents are available through the IBM
Publications Center at http://www.ibm.com/shop/publications/order/.
Getting help and information from the World Wide Web
On the World Wide Web, the IBM Web site has up-to-date information about IBM
BladeCenter systems, optional devices, services, and support at
http://www.ibm.com/bladecenter/. For service information, click Support .
© Copyright IBM Corp. 2006 83
Software service and support
Through IBM Support Line, you can get telephone assistance, for a fee, with usage,
configuration, and software problems with BladeCenter products. For information
about which products are supported by Support Line in your country or region, see
http://www.ibm.com/services/sl/products/.
For more information about Support Line and other IBM services, see
http://www.ibm.com/services/, or see http://www.ibm.com/planetwide/ for support
telephone numbers. In the U.S. and Canada, call 1-800-IBM-SERV
(1-800-426-7378).
Hardware service and support
You can receive hardware service through IBM Services or through your IBM
reseller, if your reseller is authorized by IBM to provide warranty service. See
http://www.ibm.com/planetwide/ for support telephone numbers, or in the U.S. and
Canada, call 1-800-IBM-SERV (1-800-426-7378).
In the U.S. and Canada, hardware service and support is available 24 hours a day,
7 days a week. In the U.K., these services are available Monday through Friday,
from 9 a.m. to 6 p.m.
84 BladeCenter HS20 Type 7981 and 1884: Problem Determination and Service Guide