IBM LS21, BladeCenter LS21 Type 7971, BladeCenter LS41 Type 7972 Service Manual

BladeCenter LS2 1 Ty pe 7971 and LS4 1 Ty p e 7972

Problem Dete rminatio n and Service Guid e
BladeCenter LS2 1 Ty p e 7971 and LS4 1 Ty p e 7972

Problem Dete rminatio n and Service Guid e
Note: Before using this information and the product it supports, read the general information in Appendix B, “Notices,” on page
103, and the Warranty and Support Information document on the IBM BladeCenter Documentation CD.
Second Edition (February 2007)
US Government Users Restricted Rights Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
Contents
Safety . . . . . . . . . . . . . . . . . . . . . . . . . . . . vii
Guidelines for trained service technicians . . . . . . . . . . . . . . . viii
Inspecting for unsafe conditions . . . . . . . . . . . . . . . . . viii
Guidelines for servicing electrical equipment . . . . . . . . . . . . . viii
Safety statements . . . . . . . . . . . . . . . . . . . . . . . .ix
Chapter 1. Introduction . . . . . . . . . . . . . . . . . . . . . .1
Related documentation . . . . . . . . . . . . . . . . . . . . . .1
Notices and statements in this document . . . . . . . . . . . . . . . .2
Features and specifications . . . . . . . . . . . . . . . . . . . . .3
Blade server controls and LEDs . . . . . . . . . . . . . . . . . . .4
Turning on the blade server . . . . . . . . . . . . . . . . . . . . .6
Turning off the blade server . . . . . . . . . . . . . . . . . . . . .6
System board layouts . . . . . . . . . . . . . . . . . . . . . . .7
System board connectors . . . . . . . . . . . . . . . . . . . .7
System board switches . . . . . . . . . . . . . . . . . . . . .8
System board LEDs . . . . . . . . . . . . . . . . . . . . . .9
Chapter 2. Diagnostics . . . . . . . . . . . . . . . . . . . . .11
Diagnostic tools . . . . . . . . . . . . . . . . . . . . . . . . .11
POST . . . . . . . . . . . . . . . . . . . . . . . . . . . .11
POST beep codes . . . . . . . . . . . . . . . . . . . . . .12
POST error codes . . . . . . . . . . . . . . . . . . . . . . .15
Error logs . . . . . . . . . . . . . . . . . . . . . . . . . .22
Checkout procedure . . . . . . . . . . . . . . . . . . . . . . .24
About the checkout procedure . . . . . . . . . . . . . . . . . .24
Performing the checkout procedure . . . . . . . . . . . . . . . .24
Troubleshooting tables . . . . . . . . . . . . . . . . . . . . . .25
General problems . . . . . . . . . . . . . . . . . . . . . . .25
Hard disk drive problems . . . . . . . . . . . . . . . . . . . .26
Intermittent problems . . . . . . . . . . . . . . . . . . . . . .26
Keyboard or mouse problems . . . . . . . . . . . . . . . . . .27
Memory problems . . . . . . . . . . . . . . . . . . . . . . .28
Microprocessor problems . . . . . . . . . . . . . . . . . . . .28
Monitor or video problems . . . . . . . . . . . . . . . . . . . .29
Network connection problems . . . . . . . . . . . . . . . . . .30
Optional-device problems . . . . . . . . . . . . . . . . . . . .30
Power error messages . . . . . . . . . . . . . . . . . . . . .31
Power problems . . . . . . . . . . . . . . . . . . . . . . .33
Removable-media drive problems . . . . . . . . . . . . . . . . .35
ServerGuide problems . . . . . . . . . . . . . . . . . . . . .36
Service processor problems . . . . . . . . . . . . . . . . . . .36
Software problems . . . . . . . . . . . . . . . . . . . . . .37
Universal Serial Bus (USB) port problems . . . . . . . . . . . . . .37
Light path diagnostics . . . . . . . . . . . . . . . . . . . . . .38
Viewing the light path diagnostics LEDs . . . . . . . . . . . . . . .38
Light path diagnostics LEDs . . . . . . . . . . . . . . . . . . .41
Diagnostic programs, messages, and error codes . . . . . . . . . . . .42
Running the diagnostic programs . . . . . . . . . . . . . . . . .43
Diagnostic text messages . . . . . . . . . . . . . . . . . . . .43
Viewing the test log . . . . . . . . . . . . . . . . . . . . . .44
Diagnostic error codes . . . . . . . . . . . . . . . . . . . . .44
Recovering from a BIOS update failure . . . . . . . . . . . . . . . .50
© Copyright IBM Corp. 2006 iii
Service processor (BMC) error codes . . . . . . . . . . . . . . . .51
Solving SAS hard disk drive problems . . . . . . . . . . . . . . . .52
Solving shared BladeCenter resource problems . . . . . . . . . . . . .52
Keyboard or mouse problems . . . . . . . . . . . . . . . . . .52
Media tray problems . . . . . . . . . . . . . . . . . . . . . .53
Network connection problems . . . . . . . . . . . . . . . . . .54
Power problems . . . . . . . . . . . . . . . . . . . . . . .55
Video problems . . . . . . . . . . . . . . . . . . . . . . . .55
Solving undetermined problems . . . . . . . . . . . . . . . . . . .56
Calling IBM for service . . . . . . . . . . . . . . . . . . . . . .58
Chapter 3. Parts listing, Types 7971 and 7972 . . . . . . . . . . . .59
Chapter 4. Removing and replacing blade server components . . . . . .63
Installation guidelines . . . . . . . . . . . . . . . . . . . . . .63
System reliability guidelines . . . . . . . . . . . . . . . . . . .64
Handling static-sensitive devices . . . . . . . . . . . . . . . . .64
Returning a device or component . . . . . . . . . . . . . . . . .64
Removing the blade server from a BladeCenter unit . . . . . . . . . . .65
Installing the blade server in a BladeCenter unit . . . . . . . . . . . . .66
Removing and replacing Tier 1 CRUs . . . . . . . . . . . . . . . .68
Removing the blade server cover . . . . . . . . . . . . . . . . .68
Installing the blade server cover . . . . . . . . . . . . . . . . .69
Removing an expansion unit . . . . . . . . . . . . . . . . . . .70
Installing an expansion unit . . . . . . . . . . . . . . . . . . .71
Removing the bezel assembly . . . . . . . . . . . . . . . . . .72
Installing the bezel assembly . . . . . . . . . . . . . . . . . . .73
Removing a SAS hard disk drive . . . . . . . . . . . . . . . . .74
Installing a SAS hard disk drive . . . . . . . . . . . . . . . . . .75
Removing a memory module . . . . . . . . . . . . . . . . . . .76
Installing a memory module . . . . . . . . . . . . . . . . . . .76
Removing and installing an I/O expansion card . . . . . . . . . . . .78
Removing the battery . . . . . . . . . . . . . . . . . . . . .84
Installing the battery . . . . . . . . . . . . . . . . . . . . . .84
Removing and replacing Tier 2 CRUs . . . . . . . . . . . . . . . .86
Removing a Concurrent KVM Feature Card . . . . . . . . . . . . .86
Installing a Concurrent KVM Feature Card . . . . . . . . . . . . . .87
Removing the hard disk drive tray . . . . . . . . . . . . . . . . .88
Installing the hard disk drive tray . . . . . . . . . . . . . . . . .89
Removing and replacing FRUs . . . . . . . . . . . . . . . . . . .90
Removing a microprocessor and heat sink . . . . . . . . . . . . . .90
Installing a microprocessor and heat sink . . . . . . . . . . . . . .93
Removing the system board assembly . . . . . . . . . . . . . . .95
Installing the system board assembly . . . . . . . . . . . . . . . .96
Chapter 5. Configuration information and instructions . . . . . . . . .97
Firmware updates . . . . . . . . . . . . . . . . . . . . . . . .97
Using the Configuration/Setup Utility program . . . . . . . . . . . . .97
Configuring the Gigabit Ethernet controllers . . . . . . . . . . . . . .97
Blade server Ethernet controller enumeration . . . . . . . . . . . . . .98
Configuring a SAS RAID array . . . . . . . . . . . . . . . . . . .99
Appendix A. Getting help and technical assistance . . . . . . . . . . 101
Before you call . . . . . . . . . . . . . . . . . . . . . . . . 101
Using the documentation . . . . . . . . . . . . . . . . . . . . . 101
Getting help and information from the World Wide Web . . . . . . . . . 101
iv BladeCenter LS21 Type 7971 and LS41 Type 7972: Problem Determination and Service Guide
Software service and support . . . . . . . . . . . . . . . . . . . 102
Hardware service and support . . . . . . . . . . . . . . . . . . . 102
Appendix B. Notices . . . . . . . . . . . . . . . . . . . . . . 103
Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . . 104
Important notes . . . . . . . . . . . . . . . . . . . . . . . . 104
Product recycling and disposal . . . . . . . . . . . . . . . . . . 105
Battery return program . . . . . . . . . . . . . . . . . . . . . 106
Electronic emission notices . . . . . . . . . . . . . . . . . . . . 107
Federal Communications Commission (FCC) statement . . . . . . . . 107
Industry Canada Class A emission compliance statement . . . . . . . . 107
Australia and New Zealand Class A statement . . . . . . . . . . . . 107
United Kingdom telecommunications safety requirement . . . . . . . . 107
European Union EMC Directive conformance statement . . . . . . . . 108
Taiwanese Class A warning statement . . . . . . . . . . . . . . . 108
Chinese Class A warning statement . . . . . . . . . . . . . . . . 108
Japanese Voluntary Control Council for Interference (VCCI) statement 108
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109
Contents v
vi BladeCenter LS21 Type 7971 and LS41 Type 7972: Problem Determination and Service Guide
Safety
Before installing this product, read the Safety Information.
Antes de instalar este produto, leia as Informações de Segurança.
Pred instalací tohoto produktu si prectete prírucku bezpecnostních instrukcí.
Læs sikkerhedsforskrifterne, før du installerer dette produkt.
Lees voordat u dit product installeert eerst de veiligheidsvoorschriften.
Ennen kuin asennat tämän tuotteen, lue turvaohjeet kohdasta Safety Information.
Avant d’installer ce produit, lisez les consignes de sécurité.
Vor der Installation dieses Produkts die Sicherheitshinweise lesen.
Prima di installare questo prodotto, leggere le Informazioni sulla Sicurezza.
Les sikkerhetsinformasjonen (Safety Information) før du installerer dette produktet.
Antes de instalar este produto, leia as Informações sobre Segurança.
Antes de instalar este producto, lea la información de seguridad.
Läs säkerhetsinformationen innan du installerar den här produkten.
© Copyright IBM Corp. 2006 vii
Guidelines for trained service technicians
This section contains information for trained service technicians.
Inspecting for unsafe conditions
Use the information in this section to help you identify potential unsafe conditions in an IBM product on which you are working. Each IBM product, as it was designed and manufactured, has required safety items to protect users and service technicians from injury. The information in this section addresses only those items. Use good judgment to identify potential unsafe conditions that might be caused by non-IBM alterations or attachment of non-IBM features or options that are not addressed in this section. If you identify an unsafe condition, you must determine how serious the hazard is and whether you must correct the problem before you work on the product.
Consider the following conditions and the safety hazards that they present:
v Electrical hazards, especially primary power. Primary voltage on the frame can
cause serious or fatal electrical shock.
v Explosive hazards, such as a damaged CRT face or a bulging capacitor.
v Mechanical hazards, such as loose or missing hardware.
inspect the product for potential unsafe conditions, complete the following steps:
To
1. Make sure that the power is off and the power cord is disconnected.
2. Make sure that the exterior cover is not damaged, loose, or broken, and
observe any sharp edges.
3. Check the power cord:
v Make sure that the third-wire ground connector is in good condition. Use a
meter to measure third-wire ground continuity for 0.1 ohm or less between the external ground pin and the frame ground.
v Make sure that the power cord is the correct type, as specified in the
documentation for your BladeCenter unit type.
v Make sure that the insulation is not frayed or worn.
Remove the cover.
4.
5. Check for any obvious non-IBM alterations. Use good judgment as to the safety
of any non-IBM alterations.
6. Check inside the server for any obvious unsafe conditions, such as metal filings,
contamination, water or other liquid, or signs of fire or smoke damage.
7. Check for worn, frayed, or pinched cables.
8. Make sure that the power-supply cover fasteners (screws or rivets) have not
been removed or tampered with.
Guidelines for servicing electrical equipment
Observe the following guidelines when servicing electrical equipment:
v Check the area for electrical hazards such as moist floors, non-grounded power
extension cords, and missing safety grounds.
v Use only approved tools and test equipment. Some hand tools have handles that
are covered with a soft material that does not provide insulation from live
electrical current.
v Regularly inspect and maintain your electrical hand tools for safe operational
condition. Do not use worn or broken tools or testers.
viii BladeCenter LS21 Type 7971 and LS41 Type 7972: Problem Determination and Service Guide
v Do not touch the reflective surface of a dental mirror to a live electrical circuit.
The surface is conductive and can cause personal injury or equipment damage if it touches a live electrical circuit.
v Some rubber floor mats contain small conductive fibers to decrease electrostatic
discharge. Do not use this type of mat to protect yourself from electrical shock.
v Do not work alone under hazardous conditions or near equipment that has
hazardous voltages.
v Locate the emergency power-off (EPO) switch, disconnecting switch, or electrical
outlet so that you can turn off the power quickly in the event of an electrical accident.
v Disconnect all power before you perform a mechanical inspection, work near
power supplies, or remove or install main units.
v Before you work on the equipment, disconnect the power cord. If you cannot
disconnect the power cord, have the customer power-off the wall box that supplies power to the equipment and lock the wall box in the off position.
v Never assume that power has been disconnected from a circuit. Check it to
make sure that it has been disconnected.
v If you have to work on equipment that has exposed electrical circuits, observe
the following precautions:
Make sure that another person who is familiar with the power-off controls is
near you and is available to turn off the power if necessary.
When you are working with powered-on electrical equipment, use only one
hand. Keep the other hand in your pocket or behind your back to avoid creating a complete circuit that could cause an electrical shock.
When using a tester, set the controls correctly and use the approved probe
leads and accessories for that tester.
Stand on a suitable rubber mat to insulate you from grounds such as metal
floor strips and equipment frames.
Use extreme care when measuring high voltages.
v
v To ensure proper grounding of components such as power supplies, pumps,
blowers, fans, and motor generators, do not service these components outside of their normal operating locations.
v If an electrical accident occurs, use caution, turn off the power, and send another
person to get medical aid.
Safety statements
Important:
Each caution and danger statement in this documentation begins with a number.
This number is used to cross reference an English-language caution or danger
statement with translated versions of the caution or danger statement in the Safety
Information document.
For example, if a caution statement begins with a number 1, translations for that
caution statement appear in the Safety Information document under statement 1.
Be sure to read all caution and danger statements in this documentation before
performing the instructions. Read any additional safety information that comes with
your server or optional device before you install the device.
Safety ix
Statement 1:
DANGER
Electrical
current from power, telephone, and communication cables is
hazardous.
To avoid a shock hazard: v Do not connect or disconnect any cables or perform installation,
maintenance, or reconfiguration of this product during an electrical storm.
v Connect all power cords to a properly wired and grounded electrical
outlet.
v Connect to properly wired outlets any equipment that will be attached to
this product.
v When possible, use one hand only to connect or disconnect signal
cables.
v Never turn on any equipment when there is evidence of fire, water, or
structural damage.
v Disconnect the attached power cords, telecommunications systems,
networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures.
v Connect and disconnect cables as described in the following table when
installing, moving, or opening covers on this product or attached devices.
To Connect: To Disconnect:
1. Turn everything OFF.
2. First, attach all cables to devices.
3. Attach signal cables to connectors.
4. Attach power cords to outlet.
1. Turn everything OFF.
2. First, remove power cords from outlet.
3. Remove signal cables from connectors.
4. Remove all cables from devices.
5. Turn device ON.
x BladeCenter LS21 Type 7971 and LS41 Type 7972: Problem Determination and Service Guide
Statement 2:
CAUTION:
When replacing the lithium battery, use only IBM Part Number 33F8354 or an
equivalent type battery recommended by the manufacturer. If your system has
a module containing a lithium battery, replace it only with the same module
type made by the same manufacturer. The battery contains lithium and can
explode if not properly used, handled, or disposed of.
Do not:
v Throw or immerse into water
v Heat to more than 100°C (212°F)
v Repair or disassemble
Dispose
Statement 3:
of the battery as required by local ordinances or regulations.
CAUTION:
When laser products (such as CD-ROMs, DVD drives, fiber optic devices, or
transmitters) are installed, note the following:
v Do not remove the covers. Removing the covers of the laser product could
result in exposure to hazardous laser radiation. There are no serviceable parts inside the device.
v Use of controls or adjustments or performance of procedures other than
those specified herein might result in hazardous radiation exposure.
DANGER
laser products contain an embedded Class 3A or Class 3B laser
Some diode. Note the following.
Laser radiation when open. Do not stare into the beam, do not view directly with optical instruments, and avoid direct exposure to the beam.
Safety xi
Statement 4:
18 kg (39.7 lb) 32 kg (70.5 lb) 55 kg (121.2 lb)
CAUTION: Use safe practices when lifting.
Statement 5:
CAUTION: The power control button on the device and the power switch on the power supply do not turn off the electrical current supplied to the device. The device also might have more than one power cord. To remove all electrical current from the device, ensure that all power cords are disconnected from the power source.
1 2
xii BladeCenter LS21 Type 7971 and LS41 Type 7972: Problem Determination and Service Guide
Statement 8:
CAUTION:
Never remove the cover on a power supply or any part that has the following
label attached.
Hazardous voltage, current, and energy levels are present inside any
component that has this label attached. There are no serviceable parts inside
these components. If you suspect a problem with one of these parts, contact
a service technician.
Statement 10:
CAUTION:
Do not place any object on top of rack-mounted devices.
Statement 21:
CAUTION:
Hazardous energy is present when the blade is connected to the power
source. Always replace the blade cover before installing the blade.
Safety xiii
xiv BladeCenter LS21 Type 7971 and LS41 Type 7972: Problem Determination and Service Guide
Chapter 1. Introduction
This Problem Determination and Service Guide contains information to help you
solve problems that might occur in your IBM® BladeCenter® LS21 Type 7971 and
LS41 Type 7972 blade server. It describes the diagnostic tools that come with the
blade server, error codes and suggested actions, and instructions for replacing
failing components.
Replaceable components are of three types:
v Tier 1 customer replaceable unit (CRU): Replacement of Tier 1 CRUs is your
responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation.
v Tier 2 customer replaceable unit: You may install a Tier 2 CRU yourself or
request IBM to install it, at no additional charge, under the type of warranty service that is designated for your server.
v Field replaceable unit (FRU): FRUs must be installed only by trained service
technicians.
information about the terms of the warranty and getting service and assistance,
For
see the Warranty and Support Information document.
Related documentation
In addition to this document, the following documentation also comes with the
server:
v Installation and User’s Guide
This printed document contains general information about the server, including how to install supported options and how to configure the server.
v Safety Information
This document is in Portable Document Format (PDF) on the IBM Documentation CD. It contains translated caution and danger statements. Each caution and danger statement that appears in the documentation has a number that you can use to locate the corresponding statement in your language in the Safety
Information document.
v Warranty and Support Information
This document is in PDF on the IBM Documentation CD. It contains information about the terms of the warranty and about service and assistance.
Depending
IBM Documentation CD.
The blade server might have features that are not described in the documentation
that comes with the server. The documentation might be updated occasionally to
include information about those features, or technical updates might be available to
provide additional information that is not included in the blade server
documentation. The most recent versions of all BladeCenter documentation are at
http://www.ibm.com/bladecenter/. In addition to the documentation in this library, be
sure to review the IBM BladeCenter Planning and Installation Guide for your
BladeCenter unit type for information to help you prepare for system installation and
configuration. This document is available at http://www.ibm.com/bladecenter/.
on the server model, additional documentation might be included on the
© Copyright IBM Corp. 2006 1
Notices and statements in this document
The caution and danger statements that appear in this document are also in the multilingual Safety Information document, which is on the IBM Documentation CD. Each statement is numbered for reference to the corresponding statement in the
Safety Information document.
The following notices and statements are used in this document:
v Note: These notices provide important tips, guidance, or advice. v Important: These notices provide information or advice that might help you avoid
inconvenient or problem situations.
v Attention: These notices indicate potential damage to programs, devices, or
data. An attention notice is placed just before the instruction or situation in which
damage could occur.
v Caution: These statements indicate situations that can be potentially hazardous
to you. A caution statement is placed just before the description of a potentially
hazardous procedure step or situation.
v Danger: These statements indicate situations that can be potentially lethal or
extremely hazardous to you. A danger statement is placed just before the
description of a potentially lethal or extremely hazardous procedure step or
situation.
2 BladeCenter LS21 Type 7971 and LS41 Type 7972: Problem Determination and Service Guide
Features and specifications
The following table provides a summary of the features and specifications of the
blade server.
Notes:
v Power, cooling, removable-media drives, external ports, and advanced system
management are provided by the BladeCenter unit.
v The operating system in the blade server must provide USB support for the blade
server to recognize and use the removable-media drives and front-panel USB ports. The BladeCenter unit uses USB for internal communications with these devices.
Microprocessor: Supports dual-core
AMD Opteron® microprocessors: Up to two in single-width models or four in double-width models
Note: Use the Configuration/Setup
Utility program to determine the type and speed of the microprocessors in your blade server.
Memory:
v Dual-channel DIMMs: 8 DIMM slots
in single-width models or 16 DIMM slots in double-width models
v Type: Very-low profile (VLP),
double-data rate 2 (DDR2), ECC SDRAM registered x4 (Chipkill) or x8 (non-Chipkill) DIMMs
v Supports 512 MB, 1 GB, and 2 GB
(as of the date of this publication): Total memory of up to 16 GB in single-width models or 32 GB in double-width models
Drives:
Supports small-form-factor,
Serial Attached SCSI (SAS) drives: One in single-width models or two in double-width models
Integrated functions:
v Dual Gigabit Ethernet controllers v Expansion card interface v Local service processor:
Baseboard management controller (BMC) with Intelligent Platform Management Interface (IPMI) firmware
v ATI RN-50 video controller v LSI 1064 Serial Attached SCSI
(SAS) controller
v Light path diagnostics v RS-485 interface for
communication with the management module
v Automatic server restart (ASR) v Serial over LAN (SOL) v Four USB buses for
communication with keyboard, mouse, and removable media drives
Predictive alerts:
v Microprocessor
v Memory
v Hard disk drives
Electrical
Failure Analysis
Input: 12 V dc
®
(PFA)
Environment:
v Air temperature:
Blade server on: 10° to 35° C
(50° to 95° F). Altitude: 0 to 914 m (0 to 3000 ft)
Blade server on: 10° to 32° C
(50° to 90° F). Altitude: 914 to 2134 m (3000 to 7000 ft)
Blade server off: -40° to 60° C
(-40° to 140° F)
Humidity:
v
Blade server on: 8% to 80% Blade server off: 5% to 80%
Size:
v Height: 24.5 cm (9.7 inches)
v Depth: 44.6 cm (17.6 inches)
v Width:
Single-width models: 2.9 cm (1.14
inches)
Double-width models: 5.8 cm
(2.28 inches)
Maximum weight:
v
Single-width models: 4.9 kg (10.8
lb)
Double-width models: 11.2 kg
(24.7 lb)
Chapter 1. Introduction 3
Blade server controls and LEDs
This section describes the controls and LEDs on the blade server.
Note: The control panel door is shown in the closed (normal) position in the
following illustration. To access the power-control button, you must open the control panel door.
Activity LED
Location LED
KVM select button
Information LED
Blade-error LED
Media-tray select button
Power-control button
Power-on LED
KVM select button: Press this button to associate the shared BladeCenter unit
keyboard, video, and mouse port (KVM) with the blade server. The LED on this button flashes while the request is being processed, and then is lit when the ownership of the keyboard, video, and mouse has been transferred to the blade server. It can take approximately 20 seconds to switch the keyboard, video, and mouse control to the blade server.
Using a keyboard that is directly attached to the management-module, you can press keyboard keys in the following sequence to switch KVM control between blade servers:
NumLock NumLock blade_server_number Enter
Where blade_server_number is the two-digit number for the blade bay in which the blade server is installed. A blade server that occupies more than one blade bay is identified by the lowest bay number that it occupies.
If there is no response when you press the KVM select button, you can use the management-module Web interface to determine whether local control has been disabled on the blade server.
Notes:
1. The operating system in the blade server must provide USB support for the
blade server to recognize and use the keyboard and mouse, even if the keyboard and mouse have PS/2-style connectors.
2. If you install a supported Microsoft Windows operating system on the blade
server while it is not the current owner of the KVM, a delay of up to 1 minute occurs the first time that you switch the KVM to the blade server. All subsequent switching takes place in the normal KVM switching time frame (up to 20 seconds).
4 BladeCenter LS21 Type 7971 and LS41 Type 7972: Problem Determination and Service Guide
Activity LED: When this green LED is lit, it indicates that there is activity on the
hard disk drive or network.
Location LED: When this blue LED is lit, it has been turned on by the system
administrator to aid in visually locating the blade server. The location LED on the
BladeCenter unit is lit also. The location LED can be turned off through the
management-module Web interface or through IBM Director Console.
Information LED: When this amber LED is lit, it indicates that information about a
system error for the blade server has been placed in the management-module
event log. The information LED can be turned off through the management-module
Web interface or through IBM Director Console.
Blade-error LED: When this amber LED is lit, it indicates that a system error has
occurred in the blade server. The blade-error LED will turn off only after the error is
corrected.
Media-tray select button: Press this button to associate the shared BladeCenter
unit media tray (removable-media drives and front-panel USB ports) with the blade
server. The LED on the button flashes while the request is being processed, and
then is lit when the ownership of the media tray has been transferred to the blade
server. It can take approximately 20 seconds for the operating system in the blade
server to recognize the media tray.
If there is no response when you press the media-tray select button, you can use
the management-module Web interface to determine whether local control has been
disabled on the blade server.
Note: The operating system in the blade server must provide USB support for the
blade server to recognize and use the removable-media drives and front-panel USB
ports.
Power-control button: This button is behind the control panel door. Press this
button to turn on or turn off the blade server.
Note: The power-control button has effect only if local power control is enabled for
the blade server. Local power control is enabled and disabled through the
management-module Web interface.
Power-on LED: This green LED indicates the power status of the blade server in
the following manner:
v Flashing rapidly: The service processor (BMC) on the blade server is
communicating with the management module.
v Flashing slowly: The blade server has power but is not turned on.
v Lit continuously: The blade server has power and is turned on.
Chapter 1. Introduction 5
Turning on the blade server
After you connect the blade server to power through the BladeCenter unit, the blade server can start in any of the following ways:
v You can press the power-control button on the front of the blade server (behind
the control panel door, see “Blade server controls and LEDs” on page 4) to start
the blade server.
Notes:
1. Wait until the power-on LED on the blade server flashes slowly before pressing the power-control button. While the service processor in the management module is initializing, the power-on LED does not flash, and the power-control button on the blade server does not respond.
2. While the blade server is starting, the power-on LED on the front of the blade server is lit. See “Blade server controls and LEDs” on page 4 for the power-on LED states.
If a power failure occurs, the BladeCenter unit and then the blade server can
v
start automatically when power is restored, if the blade server is configured through the management module to do so.
v You can turn on the blade server remotely by using the management module.
v If the blade server is connected to power (the power-on LED is flashing slowly),
the operating system supports the Wake on LAN® feature, and the Wake on LAN feature has not been disabled through the management module, the Wake on LAN feature can turn on the blade server.
Turning off the blade server
When you turn off the blade server, it is still connected to power through the BladeCenter unit. The blade server can respond to requests from the service processor, such as a remote request to turn on the blade server. To remove all power from the blade server, you must remove it from the BladeCenter unit.
Shut down the operating system before you turn off the blade server. See the operating-system documentation for information about shutting down the operating system.
The blade server can be turned off in any of the following ways:
v You can press the power-control button on the blade server (behind the control
panel door, see “Blade server controls and LEDs” on page 4). This starts an orderly shutdown of the operating system, if this feature is supported by the operating system.
v If the operating system stops functioning, you can press and hold the
power-control button for more than 4 seconds to turn off the blade server.
v The management module can turn off the blade server.
If the system is not operating correctly, the management module will
automatically turn off the blade server.
Through the management-module Web interface, you can also configure the
management module to turn off the blade server. For additional information, see the IBM BladeCenter Management Module User’s Guide.
6 BladeCenter LS21 Type 7971 and LS41 Type 7972: Problem Determination and Service Guide
System board layouts
The following illustrations show the connectors, LEDs, and switches on the blade server system board and the Multiprocessor Expansion (MPE) unit. The illustrations in this document might differ slightly from your hardware.
System board connectors
The following illustration shows the connectors on the blade server system board.
Blade expansion
Microprocessor 1
DIMM 4 DIMM 3 DIMM 2 DIMM 1
Microprocessor 2
Concurrent KVM
I/O-expansion card
Battery
SAS hard disk drive
DIMM 8 DIMM 7 DIMM 6 DIMM 5
The following illustration shows the connectors on the MPE unit.
Blade expansion
Microprocessor 1
DIMM 4 DIMM 3 DIMM 2 DIMM 1
Microprocessor 2
MPE unit
Control panel
I/O-expansion card
SAS hard disk drive
DIMM 8 DIMM 7 DIMM 6 DIMM 5
Chapter 1. Introduction 7
System board switches
The following illustration shows the location of the switches on the blade server system board.
Switch block 4 (SW4)
The following table defines the function of each switch on the blade server system board.
Light path diagnostics switch (SW1)
Switch number Description
SW1 Light path diagnostics switch Press and hold to
relight the system-board LEDs that were lit before you removed the blade server from the BladeCenter unit.
SW4 Switch block 4 has four switches.
v 1 Reserved
v 2 BIOS backup page
Open: BIOS starts from the primary BIOS
page (default)
Closed: BIOS starts from the backup BIOS
page
3 Password override enable
v
Open: Disabled (default)
Closed: Enabled Bypass password during
next server start
4 Wake On LAN enable
v
Open: Disabled
Closed: Enabled (default)
8 BladeCenter LS21 Type 7971 and LS41 Type 7972: Problem Determination and Service Guide
The following illustration shows the location of the light path diagnostics switch on the MPE unit.
Light path diagnostics switch (SW1)
The following table defines the function of the light path diagnostics switch on the MPE unit.
Switch number Description
SW1 Light path diagnostics switch Press and hold to
System board LEDs
The following illustration shows the LEDs on the blade server system board. You must remove the blade server from the BladeCenter unit, open the cover or remove any optional expansion units, and press the light path diagnostics switch to light any error LEDs that were turned on during processing.
Microprocessor 1 error LED
relight the MPE-unit LEDs that were lit before you removed the blade server from the BladeCenter unit.
DIMM 4 error LED DIMM 3 error LED DIMM 2 error LED DIMM 1 error LED
Light path diagnostics panel
SAS hard disk drive error LED
DIMM 8 error LED DIMM 7 error LED DIMM 6 error LED DIMM 5 error LED
Microprocessor 2 error LED
Chapter 1. Introduction 9
The following illustration shows the light path diagnostics panel on the blade server system board.
NMI
NMI error LED
MIS S BRD
TEMP
LP 1
Microprocessor mismatch error LED
System-board error LED
Over temperature error LED
Light path diagnostics LED
Light path diagnostics switch
The following illustration shows the LEDs on the MPE unit. You must remove the blade server from the BladeCenter unit, open the cover, and press the light path diagnostics switch to light any error LEDs that were turned on during processing.
Microprocessor 1 error LED
DIMM 4 error LED DIMM 3 error LED DIMM 2 error LED DIMM 1 error LED
Light path diagnostics panel
SAS hard disk drive error LED
DIMM 8 error LED DIMM 7 error LED DIMM 6 error LED DIMM 5 error LED
Microprocessor 2 error LED
The following illustration shows the light path diagnostics panel on the MPE unit.
MIS
S BRD
LP 1
LP 2
Microprocessor mismatch error LED
MPE-board error LED
Light path diagnostics LED (for blade server)
Light path diagnostics LED (for MPE)
Light path diagnostics switch
10 BladeCenter LS21 Type 7971 and LS41 Type 7972: Problem Determination and Service Guide
Chapter 2. Diagnostics
This chapter describes the diagnostic tools that are available to help you solve problems that might occur in the blade server.
Note: The blade server uses shared resources that are installed in the BladeCenter
unit. Problems with these shared resources might appear to be in the blade server (see “Solving shared BladeCenter resource problems” on page 52 for information about isolating problems with these resources). See the Problem Determination and
Service Guide or the Hardware Maintenance Manual and Troubleshooting Guide for
your BladeCenter unit and other BladeCenter component documentation for diagnostic procedures for shared BladeCenter components.
If you cannot locate and correct the problem using the information in this chapter, see Appendix A, “Getting help and technical assistance,” on page 101 for more information.
Diagnostic tools
The following tools are available to help you diagnose and solve hardware-related problems:
v POST beep codes, error messages, and error logs
The power-on self-test (POST) generates beep codes and messages to indicate successful test completion or the detection of a problem. See “POST” for more information.
v Troubleshooting tables
These tables list problem symptoms and actions to correct the problems. See “Troubleshooting tables” on page 25 for more information.
v Light path diagnostics
Use the light path diagnostics to diagnose system errors quickly. See “Light path diagnostics” on page 38 for more information.
v Diagnostic programs, messages, and error codes
The diagnostic programs are the primary method of testing the major components of the blade server. These programs are stored in read-only memory (ROM) on the blade server. See “Diagnostic programs, messages, and error codes” on page 42 for more information.
POST
When you turn on the blade server, it performs a series of tests to check the operation of the blade server components and some optional devices in the blade server. This series of tests is called the power-on self-test, or POST.
If a power-on password is set, you must type the password and press Enter, when prompted, for POST to run.
If POST is completed without detecting any problems, a single beep sounds, and the blade server startup is completed.
If POST detects a problem, more than one beep might sound, or an error message is displayed. See “Beep code descriptions” on page 12 and “POST error codes” on page 15 for more information.
© Copyright IBM Corp. 2006 11
POST beep codes
A beep code is a combination of short or long beeps or a series of short beeps that are separated by pauses. For example, a “1-2-3” beep code is one short beep, a pause, two short beeps, a pause, and three short beeps. A beep code other than one beep indicates that POST has detected a problem. To determine the meaning of a beep code, see “Beep code descriptions.” If no beep code sounds, see “No-beep symptoms” on page 15.
Beep code descriptions
The following table describes the beep codes and suggested actions to correct the detected problems.
A single problem might cause more than one error message. When this occurs, correct the cause of the first error message. The other error messages usually will not occur the next time POST runs.
Exception: If there are multiple error codes or light path diagnostics LEDs that
indicate a microprocessor error, the error might be in a microprocessor or in a microprocessor socket. See “Microprocessor problems” on page 28 for information about diagnosing microprocessor problems.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Types 7971 and 7972,” on page 59 to determine which components are CRUs
and which components are FRUs.
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Beep code Description Action
1-1-3 CMOS write/read test failed.
1-1-4 BIOS ROM checksum failed.
1-2-1 Programmable interval timer failed. (Trained service technician only) Replace the
1-2-2 DMA initialization failed. (Trained service technician only) Replace the
1-2-3 DMA page register write/read failed. (Trained service technician only) Replace the
1. Reseat the battery
2. Replace the following components one at a time, in the order shown, restarting the blade server each time:
a. Battery
b. (Trained service technician only) System
board assembly
1. Update the BIOS code.
2. Reseat the DIMMs.
3. Replace the following components one at a time, in the order shown, restarting the blade server each time:
a. DIMMs
b. (Trained service technician only) System
board assembly
system board assembly.
system board assembly.
system board assembly.
12 BladeCenter LS21 Type 7971 and LS41 Type 7972: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Types 7971 and 7972,” on page 59 to determine which components are CRUs
and which components are FRUs.
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Beep code Description Action
1-2-4 RAM refresh verification failed.
1. Reseat the following components:
a. DIMMs
b. MPE unit (if one is installed)
Replace the following components one at a
2. time, in the order shown, restarting the blade server each time:
a. DIMMs
b. (Trained service technician only) System
board assembly
c. (Trained service technician only) MPE
board assembly (if one is installed)
1-3-1 First 64K RAM test failed.
1. Reseat the DIMMs.
2. Replace the following components one at a time, in the order shown, restarting the blade server each time:
a. DIMMs
b. (Trained service technician only) System
board assembly
2-1-1 Secondary DMA register test failed. (Trained service technician only) Replace the
system board assembly.
2-1-2 Primary DMA register test failed. (Trained service technician only) Replace the
system board assembly.
2-1-3 Primary interrupt mask register test
failed.
2-1-4 Secondary interrupt mask register test
failed.
2-2-2 Keyboard controller test failed.
(Trained service technician only) Replace the system board assembly.
(Trained service technician only) Replace the system board assembly.
1. Check the function of the shared BladeCenter unit resources (see “Solving shared BladeCenter resource problems” on page 52).
2. (Trained service technician only) Replace the system board assembly.
2-3-1 Screen initialization failed. (Trained service technician only) Replace the
system board assembly.
3-1-1 Timer tick interrupt failed. (Trained service technician only) Replace the
system board assembly.
3-1-2 Interval timer channel 2 failed. (Trained service technician only) Replace the
system board assembly.
Chapter 2. Diagnostics 13
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, “Parts listing, Types 7971 and 7972,” on page 59 to determine which components are CRUs
and which components are FRUs.
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Beep code Description Action
3-1-4 Time-of-day clock failed.
1. Reseat the battery.
2. Replace the following components one at a time, in the order shown, restarting the blade server each time:
a. Battery
b. (Trained service technician only) System
board assembly
3-3-2 I2C error occurred.
1. Power down the blade server and reseat it in the BladeCenter unit.
2. Reseat the following components:
a. DIMMs
b. MPE unit (if one is installed)
Replace the following components one at a
3. time, in the order shown, restarting the blade server each time:
a. DIMMs
b. (Trained service technician only) System
board assembly
3-3-3 No operational memory in system. Important: In some memory configurations,
the 3-3-3 beep code might sound during POST followed by a blank display screen. If this occurs and the Boot Fail Count feature in the Start Options of the Configuration/Setup Utility program is set to Enabled (its default setting), you must restart the blade server three times to force the system BIOS to reset the memory connector or bank of connectors from Disabled to Enabled.
1. Install or reseat DIMMs and restart the blade server three times.
2. Replace the following components one at a time, in the order shown, restarting the blade server each time:
a. DIMMs
b. (Trained service technician only) System
board assembly
14 BladeCenter LS21 Type 7971 and LS41 Type 7972: Problem Determination and Service Guide
Loading...
+ 100 hidden pages