Note: Before using this information and the product it supports, read the general information in Appendix B, “Notices,” on page 281,
the IBM Safety Information and IBM Environmental Notices and User's Guide on the IBM System x Documentation CD, and the
IBM Warranty Information document that comes with your server.
Returning a device or component................172
Internal cable routing and connectors ...............172
Removing and replacing consumable parts and Tier 1 CRUs.......176
Removing the cover.....................177
Installing the cover ......................178
Removing the microprocessor 2 air baffle..............178
Installing the microprocessor 2 air baffle ..............180
Removing the DIMM air baffle ..................180
Installing the DIMM air baffle ..................182
Removing the fan bracket...................182
Installing the fan bracket ....................184
Removing an IBM virtual media key................185
Installing an IBM virtual media key ................186
Removing a USB hypervisor memory key.............186
Installing a USB hypervisor memory key ..............187
Removing a PCI riser-card assembly ...............188
Installing a PCI riser-card assembly ................189
Removing a PCI adapter from a PCI riser-card assembly ........190
Installing a PCI adapter in a PCI riser-card assembly .........191
Removing the optional two-port Ethernet adapter ...........193
Installing the optional two-port Ethernet adapter ...........194
Storing the full-length-adapter bracket ...............196
Removing the SAS riser-card and controller assembly .........197
Installing the SAS riser-card and controller assembly.........198
Removing a ServeRAID SAS controller from the SAS riser card .....201
Installing a ServeRAID SAS controller on the SAS riser card......202
Removing an optional ServeRAID adapter advanced feature key .....204
ivIBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Installing an optional ServeRAID adapter advanced feature key.....205
Removing a ServeRAID SAS controller battery from the remote battery tray207
Installing a ServeRAID SAS controller battery on the remote battery tray208
Removing a hot-swap hard disk drive ...............210
Installing a hot-swap hard disk drive...............210
Removing a simple-swap hard disk drive ..............211
Installing a simple-swap hard disk drive ..............212
Removing an optional CD-RW/DVD drive..............213
Installing an optional CD-RW/DVD drive ..............214
Removing a tape drive....................214
Installing a tape drive .....................216
Removing a memory module (DIMM) ...............217
Installing a memory module ...................217
Installing a DIMM......................222
Removing a hot-swap fan...................223
Installing a hot-swap fan ....................224
Removing a hot-swap ac power supply..............225
Installing a hot-swap ac power supply ...............226
Removing the battery .....................229
Installing the battery.....................231
Removing the operator information panel assembly ..........233
Installing the operator information panel assembly ..........233
Removing and replacing Tier 2 CRUs ................234
Removing the bezel.....................234
Installing the bezel ......................234
Removing the SAS hard disk drive backplane ............235
Installing the SAS hard disk drive backplane............235
Removing the simple-swap hard disk drive backplate .........236
Installing the simple-swap hard disk drive backplate ..........237
Removing and replacing FRUs..................238
Removing a microprocessor and heat sink .............238
Installing a microprocessor and heat sink..............240
Thermal grease .......................244
Removing a heat-sink retention module ..............246
Installing a heat-sink retention module ...............246
Removing the system board..................247
Installing the system board ...................248
Removing the 240 VA safety cover ................250
Installing the 240 VA safety cover................251
Chapter 6. Configuration information and instructions........253
Updating the firmware ......................253
Configuring the server ......................253
Using the Setup utility .....................255
Using the Boot Selection Menu program ..............260
Starting the backup server firmware................260
Using the ServerGuide Setup and Installation CD...........260
Using the integrated management module .............263
Using the USB memory key for VMware hypervisor ..........264
Using the remote presence capability and blue-screen capture ......265
Using the embedded hypervisor .................266
Enabling the Broadcom Gigabit Ethernet Utility program ........267
Configuring the Gigabit Ethernet controller .............267
Using the LSI Configuration Utility program .............268
IBM Advanced Settings Utility program...............270
Updating IBM Systems Director ..................270
Contentsv
Updating the Universal Unique Identifier (UUID) ............271
Updating the DMI/SMBIOS data ..................274
Appendix A. Getting help and technical assistance ..........279
Before you call........................279
Using the documentation.....................279
Getting help and information from the World Wide Web.........279
Software service and support ...................280
Hardware service and support ...................280
IBM Taiwan product service ....................280
Bu ürünü kurmadan önce güvenlik bilgilerini okuyun.
Guidelines for trained service technicians
This section contains information for trained service technicians.
Inspecting for unsafe conditions
Use the information in this section to help you identify potential unsafe conditions in
an IBM product that you are working on. Each IBM product, as it was designed and
manufactured, has required safety items to protect users and service technicians
from injury. The information in this section addresses only those items. Use good
judgment to identify potential unsafe conditions that might be caused by non-IBM
alterations or attachment of non-IBM features or optional devices that are not
addressed in this section. If you identify an unsafe condition, you must determine
how serious the hazard is and whether you must correct the problem before you
work on the product.
Consider the following conditions and the safety hazards that they present:
v Electrical hazards, especially primary power. Primary voltage on the frame can
cause serious or fatal electrical shock.
v Explosive hazards, such as a damaged CRT face or a bulging capacitor.
v Mechanical hazards, such as loose or missing hardware.
To inspect the product for potential unsafe conditions, complete the following steps:
1. Make sure that the power is off and the power cord is disconnected.
2. Make sure that the exterior cover is not damaged, loose, or broken, and
observe any sharp edges.
3. Check the power cord:
v Make sure that the third-wire ground connector is in good condition. Use a
meter to measure third-wire ground continuity for 0.1 ohm or less between
the external ground pin and the frame ground.
v Make sure that the power cord is the correct type, as specified in “Power
cords” on page 165.
v Make sure that the insulation is not frayed or worn.
4. Remove the cover.
5. Check for any obvious non-IBM alterations. Use good judgment as to the safety
of any non-IBM alterations.
viiiIBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
6. Check inside the server for any obvious unsafe conditions, such as metal filings,
contamination, water or other liquid, or signs of fire or smoke damage.
7. Check for worn, frayed, or pinched cables.
8. Make sure that the power-supply cover fasteners (screws or rivets) have not
been removed or tampered with.
Guidelines for servicing electrical equipment
Observe the following guidelines when you service electrical equipment:
v Check the area for electrical hazards such as moist floors, nongrounded power
extension cords, and missing safety grounds.
v Use only approved tools and test equipment. Some hand tools have handles that
are covered with a soft material that does not provide insulation from live
electrical currents.
v Regularly inspect and maintain your electrical hand tools for safe operational
condition. Do not use worn or broken tools or testers.
v Do not touch the reflective surface of a dental mirror to a live electrical circuit.
The surface is conductive and can cause personal injury or equipment damage if
it touches a live electrical circuit.
v Some rubber floor mats contain small conductive fibers to decrease electrostatic
discharge. Do not use this type of mat to protect yourself from electrical shock.
v Do not work alone under hazardous conditions or near equipment that has
hazardous voltages.
v Locate the emergency power-off (EPO) switch, disconnecting switch, or electrical
outlet so that you can turn off the power quickly in the event of an electrical
accident.
v Disconnect all power before you perform a mechanical inspection, work near
power supplies, or remove or install main units.
v Before you work on the equipment, disconnect the power cord. If you cannot
disconnect the power cord, have the customer power-off the wall box that
supplies power to the equipment and lock the wall box in the off position.
v Never assume that power has been disconnected from a circuit. Check it to
make sure that it has been disconnected.
v If you have to work on equipment that has exposed electrical circuits, observe
the following precautions:
– Make sure that another person who is familiar with the power-off controls is
near you and is available to turn off the power if necessary.
– When you are working with powered-on electrical equipment, use only one
hand. Keep the other hand in your pocket or behind your back to avoid
creating a complete circuit that could cause an electrical shock.
– When you use a tester, set the controls correctly and use the approved probe
leads and accessories for that tester.
– Stand on a suitable rubber mat to insulate you from grounds such as metal
floor strips and equipment frames.
v Use extreme care when you measure high voltages.
v To ensure proper grounding of components such as power supplies, pumps,
blowers, fans, and motor generators, do not service these components outside of
their normal operating locations.
v If an electrical accident occurs, use caution, turn off the power, and send another
person to get medical aid.
Safetyix
Safety statements
Important:
Each caution and danger statement in this document is labeled with a number. This
number is used to cross reference an English-language caution or danger
statement with translated versions of the caution or danger statement in the SafetyInformation document.
For example, if a caution statement is labeled “Statement 1,” translations for that
caution statement are in the Safety Information document under “Statement 1.”
Be sure to read all caution and danger statements in this document before you
perform the procedures. Read any additional safety information that comes with the
server or optional device before you install the device.
Attention:Use No. 26 AWG or larger UL-listed or CSA certified
telecommunication line cord.
xIBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Statement 1:
DANGER
Electrical current from power, telephone, and communication cables is
hazardous.
To avoid a shock hazard:
v Do not connect or disconnect any cables or perform installation,
maintenance, or reconfiguration of this product during an electrical
storm.
v Connect all power cords to a properly wired and grounded electrical
outlet.
v Connect to properly wired outlets any equipment that will be attached to
this product.
v When possible, use one hand only to connect or disconnect signal
cables.
v Never turn on any equipment when there is evidence of fire, water, or
structural damage.
v Disconnect the attached power cords, telecommunications systems,
networks, and modems before you open the device covers, unless
instructed otherwise in the installation and configuration procedures.
v Connect and disconnect cables as described in the following table when
installing, moving, or opening covers on this product or attached
devices.
To Connect:To Disconnect:
1. Turn everything OFF.
2. First, attach all cables to devices.
3. Attach signal cables to connectors.
4. Attach power cords to outlet.
5. Turn device ON.
1. Turn everything OFF.
2. First, remove power cords from outlet.
3. Remove signal cables from connectors.
4. Remove all cables from devices.
Safetyxi
Statement 2:
CAUTION:
When replacing the lithium battery, use only IBM Part Number 33F8354 or an
equivalent type battery recommended by the manufacturer. If your system has
a module containing a lithium battery, replace it only with the same module
type made by the same manufacturer. The battery contains lithium and can
explode if not properly used, handled, or disposed of.
Do not:
v Throw or immerse into water
v Heat to more than 100°C (212°F)
v Repair or disassemble
Dispose of the battery as required by local ordinances or regulations.
xiiIBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Statement 3:
CAUTION:
When laser products (such as CD-ROMs, DVD drives, fiber optic devices, or
transmitters) are installed, note the following:
v Do not remove the covers. Removing the covers of the laser product could
result in exposure to hazardous laser radiation. There are no serviceable
parts inside the device.
v Use of controls or adjustments or performance of procedures other than
those specified herein might result in hazardous radiation exposure.
DANGER
Some laser products contain an embedded Class 3A or Class 3B laser
diode. Note the following.
Laser radiation when open. Do not stare into the beam, do not view directly
with optical instruments, and avoid direct exposure to the beam.
Class 1 Laser Product
Laser Klasse 1
Laser Klass 1
Luokan 1 Laserlaite
Appareil A Laser de Classe 1
`
Safetyxiii
Statement 4:
≥ 18 kg (39.7 lb)≥ 32 kg (70.5 lb)≥ 55 kg (121.2 lb)
CAUTION:
Use safe practices when lifting.
Statement 5:
CAUTION:
The power control button on the device and the power switch on the power
supply do not turn off the electrical current supplied to the device. The device
also might have more than one power cord. To remove all electrical current
from the device, ensure that all power cords are disconnected from the power
source.
2
1
xivIBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Statement 8:
CAUTION:
Never remove the cover on a power supply or any part that has the following
label attached.
Hazardous voltage, current, and energy levels are present inside any
component that has this label attached. There are no serviceable parts inside
these components. If you suspect a problem with one of these parts, contact
a service technician.
Statement 12:
CAUTION:
The following label indicates a hot surface nearby.
Statement 26:
CAUTION:
Do not place any object on top of rack-mounted devices.
This server is suitable for use on an IT power-distribution system whose maximum
phase-to-phase voltage is 240 V under any distribution fault condition.
Important: This product is not suitable for use with visual display workplace
devices according to Clause 2 of the German Ordinance for Work with Visual
Display Units.
Safetyxv
xviIBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Chapter 1. Start here
You can solve many problems without outside assistance by following the
troubleshooting procedures in this Problem Determination and Service Guide and
on the IBM Web site. This document describes the diagnostic tests that you can
perform, troubleshooting procedures, and explanations of error messages and error
codes. The documentation that comes with your operating system and software
also contains troubleshooting information.
Diagnosing a problem
Before you contact IBM or an approved warranty service provider, follow these
procedures in the order in which they are presented to diagnose a problem with
your server:
1. Determine what has changed.
Determine whether any of the following items were added, removed, replaced,
or updated before the problem occurred:
v BIOS code
v Device drivers
v Firmware
v Hardware components
v Software
If possible, return the server to the condition it was in before the problem
occurred.
2. Collect data.
Thorough data collection is necessary for diagnosing hardware and software
problems.
a. Document error codes and system-board LEDs.
v System error codes: See “Viewing the test log” on page 73 for
information about error codes.
v Software or operating-system error codes: See the documentation for
the software or operating system for information about a specific error
code. See the manufacturer's Web site for documentation.
v Light path diagnostics LEDs: See “Light path diagnostics LEDs” on
page 65 for information about light path diagnostics LEDs that are lit.
v System-board LEDs: See “System-board LEDs” on page 20 for
information about system-board LEDs that are lit.
b. Collect system data.
Run Dynamic System Analysis (DSA) to collect information about the
hardware, firmware, software, and operating system. Have this information
available when you contact IBM or an approved warranty service provider.
For instructions for running the DSA program, see “Running the diagnostic
programs” on page 72.
If you have to download the latest version of DSA, go to
http://www.ibm.com/systems/support/supportsite.wss/
docdisplay?brandind=5000008&lndocid=SERV-DSA or complete the
following steps.
Note: Changes are made periodically to the IBM Web site. The actual
procedure might vary slightly from what is described in this document.
1) Go to http://www.ibm.com/systems/support/.
2) Under Product support, click System x.
3) Under Popular links, click Software and device drivers.
4) Under Related downloads, click Dynamic System Analysis (DSA).
For information about DSA command-line options, go to
http://publib.boulder.ibm.com/infocenter/toolsctr/v1r0/index.jsp?topic=/
com.ibm.xseries.tools.doc/erep_tools_dsa.html or complete the following
steps:
1) Go to http://publib.boulder.ibm.com/infocenter/toolsctr/v1r0/index.jsp.
2) In the navigation pane, click IBM System x and BladeCenter ToolsCenter.
3) Click Tools reference > Error reporting and analysis tools > IBMDynamic System Analysis.
3. Follow the problem-resolution procedures.
The four problem-resolution procedures are presented in the order in which they
are most likely to solve your problem. Follow these procedures in the order in
which they are presented:
a. Check for and apply code updates.
Most problems that appear to be caused by faulty hardware are actually
caused by BIOS code, system firmware, device firmware, or device drivers
that are not at the latest levels.
Important: Some cluster solutions require specific code levels or
coordinated code updates. If the device is part of a cluster solution, verify
that the latest level of code is supported for the cluster solution before you
update the code.
1) Determine the existing code levels.
In DSA, click Firmware/VPD to view system firmware levels, or click
Software to view operating-system levels.
2) Download and install updates of code that is not at the latest level.
To display a list of available updates for your server, go to
http://www.ibm.com/systems/support/supportsite.wss/
docdisplay?brandind=5000008&lndocid=MIGR-4JTS2T or complete the
following steps.
Note: Changes are made periodically to the IBM Web site. The actual
procedure might vary slightly from what is described in this document.
a) Go to http://www.ibm.com/systems/support/.
b) Under Product support, click System x.
c) Under Popular links, click Software and device drivers.
d) Click System x3650 M3 to display the list of downloadable files for
the server.
You can install code updates that are packaged as an UpdateXpress
System Pack or UpdateXpress CD image. An UpdateXpress System
Pack contains an integration-tested bundle of online firmware and
device-driver updates for your server.
Be sure to separately install any listed critical updates that have release
dates that are later than the release date of the UpdateXpress System
Pack or UpdateXpress image.
2IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
When you click an update, an information page is displayed, including a
list of the problems that the update fixes. Review this list for your
specific problem; however, even if your problem is not listed, installing
the update might solve the problem.
b. Check for and correct an incorrect configuration.
If the server is incorrectly configured, a system function can fail to work
when you enable it; if you make an incorrect change to the server
configuration, a system function that has been enabled can stop working.
1) Make sure that all installed hardware and software are supported.
See http://www.ibm.com/servers/eserver/serverproven/compat/us/ to
verify that the server supports the installed operating system, optional
devices, and software levels. If any hardware or software component is
not supported, uninstall it to determine whether it is causing the problem.
You must remove nonsupported hardware before you contact IBM or an
approved warranty service provider for support.
2) Make sure that the server, operating system, and software are
installed and configured correctly.
Many configuration problems are caused by loose power or signal
cables or incorrectly seated adapters. You might be able to solve the
problem by turning off the server, reconnecting cables, reseating
adapters, and turning the server back on. For information about
performing the checkout procedure, see “Checkout procedure” on page
41.
If the problem is associated with a specific function (for example, if a
RAID hard disk drive is marked offline in the RAID array), see the
documentation for the associated controller and management or
controlling software to verify that the controller is correctly configured.
Problem determination information is available for many devices such as
RAID and network adapters.
For problems with operating systems or IBM software or devices,
complete the following steps.
Note: Changes are made periodically to the IBM Web site. The actual
procedure might vary slightly from what is described in this document.
a) Go to http://www.ibm.com/systems/support/.
b) Under Product support, click System x.
c) From the Product family list, select System x3650 M3.
d) Under Support & downloads, click Documentation, Install, and
Use to search for related documentation.
c. Check for troubleshooting procedures and RETAIN tips.
Troubleshooting procedures and RETAIN tips document known problems
and suggested solutions. To search for troubleshooting procedures and
RETAIN tips, complete the following steps.
Note: Changes are made periodically to the IBM Web site. The actual
procedure might vary slightly from what is described in this document.
1) Go to http://www.ibm.com/systems/support/.
2) Under Product support, click System x.
3) From the Product family list, select System x3650 M3.
4) Under Support & downloads, click Troubleshoot.
Chapter 1. Start here3
5) Select the troubleshooting procedure or RETAIN tip that applies to your
problem:
v Troubleshooting procedures are under Diagnostic.
v RETAIN tips are under Troubleshoot.
d. Check for and replace defective hardware.
If a hardware component is not operating within specifications, it can cause
unpredictable results. Most hardware failures are reported as error codes in
a system or operating-system log. For more information, see
“Troubleshooting tables” on page 43 and Chapter 5, “Removing and
replacing server components,” on page 169. Hardware errors are also
indicated by light path diagnostics LEDs.
A single problem might cause multiple symptoms. Follow the troubleshooting
procedure for the most obvious symptom. If that procedure does not
diagnose the problem, use the procedure for another symptom, if possible.
If the problem remains, contact IBM or an approved warranty service
provider for assistance with additional problem determination and possible
hardware replacement. To open an online service request, go to
http://www.ibm.com/support/electronic/. Be prepared to provide information
about any error codes and collected data.
Undocumented problems
If you have completed the diagnostic procedure and the problem remains, the
problem might not have been previously identified by IBM. After you have verified
that all code is at the latest level, all hardware and software configurations are valid,
and no light path diagnostics LEDs or log entries indicate a hardware component
failure, contact IBM or an approved warranty service provider for assistance. To
open an online service request, go to http://www.ibm.com/support/electronic/. Be
prepared to provide information about any error codes and collected data and the
problem determination procedures that you have used.
4IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Chapter 2. Introduction
This Problem Determination and Service Guide contains information to help you
solve problems that might occur in your IBM
or 7949 server. It describes the diagnostic tools that come with the server, error
codes and suggested actions, and instructions for replacing failing components.
Replaceable components are of four types:
v Consumable Parts: Purchase and replacement of consumable
parts(components, such as batteries and printer cartridges, that have depletable
life) is your responsibility. If IBM acquires or installs a consumable part at your
request, you will be charged for the service.
v Tier 1 customer replaceable unit (CRU): Replacement of Tier 1 CRUs is your
responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for
the installation.
v Tier 2 customer replaceable unit: You may install a Tier 2 CRU yourself or
request IBM to install it, at no additional charge, under the type of warranty
service that is designated for your server.
v Field replaceable unit (FRU): FRUs must be installed only by trained service
technicians.
For information about the terms of the warranty and getting service and assistance,
see the Warranty and Support Information document on the IBM Documentation
CD.
®
System x3650 M3 Type 4255, 7945,
Related documentation
In addition to this document, the following documentation also comes with the
server:
v Installation and User's Guide
This document is in Portable Document Format (PDF) on the IBM Documentation
CD. It provides general information about setting up and cabling the server,
including information about features, and how to configure the server. It also
contains detailed instructions for installing, removing, and connecting optional
devices that the server supports.
v Rack Installation Instructions
This printed document contains instructions for installing the server in a rack.
v Safety Information
This document is in PDF on the IBM Documentation CD. It contains translated
caution and danger statements. Each caution and danger statement that appears
in the documentation has a number that you can use to locate the corresponding
statement in your language in the Safety Information document.
v Warranty and Support Information
This printed document contains information about the terms of the warranty.
v Environmental Notices and User Guide
This document is in PDF on the IBM Documentation CD. It contains translated
environmental notices.
v IBM License Agreement for Machine Code
This document is in PDF on the IBM Documentation CD. It provides translated
versions of the IBM License Agreement for Machine code for your product.
This document is in PDF. It contains information about the open-source notices.
Depending on the server model, additional documentation might be included on the
IBM Documentation CD.
®
The System x
and xSeries®Tools Center is an online information center that
contains information about tools for updating, managing, and deploying firmware,
device drivers, and operating systems. The System x and xSeries Tools Center is at
http://publib.boulder.ibm.com/infocenter/toolsctr/v1r0/index.jsp.
The server might have features that are not described in the documentation that
you received with the server. The documentation might be updated occasionally to
include information about those features, or technical updates might be available to
provide additional information that is not included in the server documentation.
These updates are available from the IBM Web site. To check for updated
documentation and technical updates, complete the following steps.
Note: Changes are made periodically to the IBM Web site. The actual procedure
might vary slightly from what is described in this document.
1. Go to http://www.ibm.com/systems/support/.
2. Under Product support, click System x.
3. Under Popular links, click Publications lookup.
4. From the Product family menu, select System x3650 M3 and click Continue.
Notices and statements in this document
The caution and danger statements in this document are also in the multilingual
Safety Information document, which is on the Documentation CD. Each statement is
numbered for reference to the corresponding statement in your language in the
Safety Information document.
The following notices and statements are used in this document:
v Note: These notices provide important tips, guidance, or advice.
v Important: These notices provide information or advice that might help you avoid
inconvenient or problem situations.
v Attention: These notices indicate potential damage to programs, devices, or
data. An attention notice is placed just before the instruction or situation in which
damage might occur.
v Caution: These statements indicate situations that can be potentially hazardous
to you. A caution statement is placed just before the description of a potentially
hazardous procedure step or situation.
v Danger: These statements indicate situations that can be potentially lethal or
extremely hazardous to you. A danger statement is placed just before the
description of a potentially lethal or extremely hazardous procedure step or
situation.
6IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Features and specifications
The following information is a summary of the features and specifications of the
server. Depending on the model, some features might not be available, or some
specifications might not apply.
Racks are marked in vertical increments of 4.45 cm (1.75 inches). Each increment
is referred to as a unit, or “U.” A 1-U-high device is 1.75 inches tall.
Notes:
1. Power consumption and heat output vary depending on the number and type of
optional features that are installed and the power-management optional features
that are in use.
2. The sound levels were measured in controlled acoustical environments
according to the procedures specified by the American National Standards
Institute (ANSI) S12.10 and ISO 7779 and are reported in accordance with ISO
9296. Actual sound-pressure levels in a given location might exceed the
average values stated because of room reflections and other nearby noise
sources. The declared sound-power levels indicate an upper limit, below which
a large number of computers will operate.
Chapter 2. Introduction7
Table 1. Features and specifications
Microprocessor:
v Supports up to two Intel Xeon
multi-core microprocessors (one
installed)
v Level-3 cache
v QuickPath Interconnect (QPI) links
speed up to 6.4 GT per second
Note:
v Do not install an Intel Xeon
series microprocessor and an Xeon
5600 series microprocessor in the same
server.
v Use the Setup utility to determine the
type and speed of the microprocessors.
v For a list of supported microprocessors,
see http://www.ibm.com/servers/eserver/
serverproven/compat/us/.
Memory:
v Minimum: 2 GB
v Maximum: 288 GB
– 48 GB using unbuffered DIMMs
(UDIMMs)
– 288 GB using registered DIMMs
(RDIMMs)
v Type: PC3-10600R-999, 800, 1067, and
1333 MHz, ECC, DDR3 registered or
unbuffered SDRAM DIMMs
v Slots: 18 dual inline
v Supports (depending on the model):
– 2 GB and 4 GB unbuffered DIMMs
– 2 GB, 4 GB, 8 GB, and 16 GB
registered DIMMs
SATA optical drives (optional):
v DVD-ROM
v Multi-burner
Hard disk drive expansion bays
(depending on the model:
v Eight 2.5-inch SAS hot-swap bays for
hard disk drive bays with option to add
eight more 2.5-inch SAS hot-swap hard
disk drive bays
v Four 2.5-inch simple-swap, solid state
SATA hard disk drive bays
PCI expansion slots:
v Two PCI Express riser cards with two
PCI Express x8 slots (x8 lanes) each,
standard
v Support for the following optional riser
cards:
– Two 133 MHz/64-bit PCI-X 1.0a slots
– One PCI Express x16 slot (x16
lanes)
™
™
5500
™
Size (2U):
v Height: 85.2 mm (3.346 in.)
v Depth: EIA flange to rear - 698 mm
(27.480 in.), Overall - 729 mm (28.701
in.)
v Width: With top cover - 443.6 mm
(17.465 in.), With front bezel - 482.0
mm (18.976 in.)
v Weight: approximately 21.09 kg (46.5
lb) to 25 kg (55 lb) depending upon
configuration
Integrated functions:
v Integrated management module (IMM),
which provides service processor
control and monitoring functions, video
controller, and (when the optional
virtual media key is installed) remote
keyboard, video, mouse, and remote
hard disk drive capabilities
v Dedicated or shared management
network connections
v Serial over LAN (SOL) and serial
redirection over Telnet or Secure Shell
(SSH)
v One systems-management RJ-45 for
connection to a dedicated
systems-management network
v Support for remote management
presence through an optional virtual
media key
v Broadcom BCM5709 Gb Ethernet
controller with TCP/IP Offload Engine
(TOE) and Wake on LAN support
v Four Ethernet ports (two on system
board and two additional ports when
the optional IBM Dual-Port 1 Gb
Ethernet Daughter Card is installed)
v One serial port, shared with the
integrated management module (IMM)
v Four Universal Serial Bus (USB) ports
(two on front, two on rear of server),
v2.0 supporting v1.1, plus one or more
dedicated internal USB ports on the
SAS riser card
v Two video ports (one on front and one
on rear of server)
v One SATA tape connector, one USB
tape connector, and one tape power
connector on SAS riser card (some
models)
v Support for hypervisor function through
an optional USB flash device on the
SAS riser card (not available on
simple-swap models)
Note: In messages and documentation,
the term service processor refers to the
integrated management module (IMM).
Video controller (integrated into IMM):
v Matrox G200eV (two analog ports - one
front and one rear that can be connected
at the same time)
Note: The maximum video resolution is
1600 x 1200 at 75 Hz.
– SVGA compatible video controller
– DDR2 250 MHz SDRAM video
memory controller
– Avocent Digital Video Compression
– 16 MB of video memory (not
expandable)
ServeRAID controller (depending on the
model):
v A ServeRAID-BR10il v2 SAS/SATA
adapter that provides RAID levels 0, 1,
and 1E (comes standard on some
hot-swap models).
v An optional ServeRAID-BR10il SAS/SATA
adapter that provides RAID levels 0, 1,
and 1E can be ordered.
v An optional ServeRAID-MR10i SAS/SATA
adapter that provides RAID levels 0, 1, 5,
6, 10, 50, and 60 can be ordered.
v An optional ServerRAID M1015
SAS/SATA adapter that provides RAID
levels 0, 1, and 10 with optional RAID
5/50 and SED (Self Encrypting Drive)
upgrade.
v An optional ServeRAID-M5014 SAS/SATA
adapter that provides RAID levels 0, 1, 5,
10 and 50 with optional battery and RAID
6/60 and SED upgrade.
v An optional ServeRAID-M5015 SAS/SATA
adapter with battery that provides RAID
levels 0, 1, 5, 10, and 50 with optional
RAID 6/60 and SED upgrade
Notes:
1. RAID is supported in hot-swap models
only.
2. The ServeRAID controllers are installed
in a PCI Express x8 mechanical slot (x4
electrical); however, the controllers run
at x4 bandwidth.
8IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 1. Features and specifications (continued)
Electrical input with hot-swap ac power
supplies:
v Sine-wave input (47 - 63 Hz) required
v Input voltage range automatically
selected
v Input voltage low range:
– Minimum: 100 V ac
– Maximum: 127 V ac
v Input voltage high range:
– Minimum: 200 V ac
– Maximum: 240 V ac
v Input kilovolt-amperes (kVA)
approximately:
– Minimum: 0.090 kVA
– Maximum: 0.700 kVA
Environment:
v Air temperature:
– Server on: 10°C to 35°C (50.0°F to
95.0°F); altitude: 0 to 914.4 m (3000
ft). Decrease system temperature
by 1°C for every 1000-foot increase
in altitude.
– Server off: 5°C to 45°C (41.0°F to
113.0°F); maximum altitude: 3048 m
(10000 ft)
– Shipment: -40°C to +60°C (-40°F to
140°F); maximum altitude: 3048 m
(10000 ft)
v Humidity:
– Server on: 20% to 80%; maximum
dew point: 21°C; maximum rate of
change 5°C per hour.
– Server off: 8% to 80%; maximum
dew point: 27°C
– Shipment: 5% to 100%
v Particulate contamination:
Attention: Airborne particulates and
reactive gases acting alone or in
combination with other environmental
factors such as humidity or
temperature might pose a risk to the
server. For information about the limits
for particulates and gases, see
“Particulate contamination” on page
283.
Hot-swap fans:
Three - provide redundant cooling.
Power supply:
v Up to two hot-swap power supplies for
redundancy support
– 460-watt ac
– 675-watt ac
– 675-watt high-efficiency ac
– 675-watt dc
Note: You cannot mix 460-watt and
675-watt power supplies, or high-efficiency
and non-high-efficiency power supplies, or
ac and dc power supplies in the server.
Acoustical noise emissions:
v Declared sound power, idle: 6.3 bel
v Declared sound power, operating: 6.5 bel
Heat output:
Approximate heat output:
v Minimum configuration: 662 Btu per hour
(194 watts)
v Maximum configuration: 2302 Btu per
hour (675 watts)
Server controls, LEDs, and connectors
This section describes the controls, light-emitting diodes (LEDs), and connectors.
Front view
The following illustration shows the controls, connectors, and hard disk drive bays
on the front of the server.
Hard disk drive
activity LED (green)
Hard disk drive
status LED (amber)
Rack
release
latch
Hard disk
drive bays
Hard disk drive activity LED: Each hard disk drive has an activity LED. When this
LED is flashing, it indicates that the drive is in use.
Hard disk drive status LED: Each hard disk drive has a status LED. When this
LED is lit, it indicates that the drive has failed. When this LED is flashing slowly
(one flash per second), it indicates that the drive is being rebuilt as part of a RAID
Video
connector
USB 1
connector
Bay 15Bay 0
CD/DVD drive
activity LED
USB 2
connector
CD/DVD
eject button
Operator
information panel
Rack
release
latch
CD/DVD drive
(optical drive)
Chapter 2. Introduction9
configuration. When the LED is flashing rapidly (three flashes per second), it
indicates that the controller is identifying the drive.
Video connector: Connect a monitor to this connector. The video connectors on
the front and rear of the server can be used simultaneously.
USB connectors: Connect a USB device, such as USB mouse, keyboard, or other
USB device, to either of these connectors.
Operator information panel: This panel contains controls, light-emitting diodes
(LEDs), and connectors. For information about the controls and LEDs on the
operator information panel, see “Operator information panel.”
Rack release latches: Press these latches to release the server from the rack.
Optional CD/DVD-eject button: Press this button to release a CD or DVD from the
CD-RW/DVD drive.
Optional CD/DVD drive activity LED: When this LED is lit, it indicates that the
CD-RW/DVD drive is in use.
Operator information panel
The following illustration shows the controls and LEDs on the operator information
panel.
The following controls and LEDs are on the operator information panel:
v Power-control button and power-on LED: Press this button to turn the server
on and off manually or to wake the server from a reduced-power state. The
states of the power-on LED are as follows:
– Off: AC power is not present, or the power supply or the LED itself has failed.
– Flashing rapidly (4 times per second): The server is turned off and is not
ready to be turned on. The power-control button is disabled. This will last
approximately 20 to 40 seconds.
Note: Approximately 40 seconds after the server is connected to ac power,
the power-control button becomes active.
– Flashing slowly (once per second): The server is turned off and is ready to
be turned on. You can press the power-control button to turn on the server.
– Lit: The server is turned on.
– Fading on and off: The server is in a reduced-power state. To wake the
server, press the power-control button or use the IMM Web interface.
10IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Note: If this LED is off, it does not mean that there is no electrical power in the
server. The LED might be burned out. To remove all electrical power from the
server, you must disconnect the power cord from the electrical outlet.
v Ethernet icon LED: This LED lights the Ethernet icon.
v Ethernet activity LEDs: When any of these LEDs is lit, it indicates that the
server is transmitting to or receiving signals from the Ethernet LAN that is
connected to the Ethernet port that corresponds to that LED.
v Information LED: When this LED is lit, it indicates that a noncritical event has
occurred. An LED on the light path diagnostics panel is also lit to help isolate the
error.
v System-error LED: When this LED is lit, it indicates that a system error has
occurred. An LED on the light path diagnostics panel is also lit to help isolate the
error.
v Release latch: Slide this latch to the left to access the light path diagnostics
panel, which is behind the operator information panel.
v Locator button and locator LED: Use this LED to visually locate the server
among other servers. It is also used as the physical presence for Trusted
Platform Module (TPM). Press this button to turn on or turn off this LED locally.
You can use IBM Systems Director to light this LED remotely.
Light path diagnostics panel
The light path diagnostics panel is on the top of the operator information panel.
To access the light path diagnostics panel, slide the blue release button on the
operator information panel to the left. Pull forward on the operator information panel
until the hinge of the panel is free of the server chassis. Then pull down on the
operator information panel, so that you can view the light path diagnostics panel
information.
Operator information
panel
Light path
diagnostics LEDs
Release latch
The following illustration shows the controls and LEDs on the light path diagnostics
panel.
Notes:
1. Do not run the server for an extended period of time while the light path
diagnostics panel is pulled out of the server.
2. Light path diagnostics LEDs remain lit only while the server is connected to
power.
Chapter 2. Introduction11
Rear view
Checkpoint
code display
v Remind button: This button places the system-error LED on the front panel into
Remind mode. In Remind mode, the system-error LED flashes rapidly until the
problem is corrected, the server is restarted, or a new problem occurs.
By placing the system-error LED indicator in Remind mode, you acknowledge
that you are aware of the last failure but will not take immediate action to correct
the problem. The remind function is controlled by the IMM.
v NMI button: Press this button to force a nonmaskable interrupt to the
microprocessor, if you are directed by IBM service and support.
v Reset button: Press this button to reset the server and run the power-on
self-test (POST). You might have to use a pen or the end of a straightened paper
clip to press the button. The reset button is in the lower-right corner of the light
path diagnostics panel.
The following illustration shows the connectors on the rear of the server.
Ethernet connectors: Use any of these connectors to connect the server to a
network.
Power-cord connector: Connect the power cord to this connector.
USB connectors: Connect a USB device, such as USB mouse, keyboard, or other
USB device, to any of these connectors.
Serial connector: Connect a 9-pin serial device to this connector. The serial port is
shared with the integrated management module (IMM). The IMM can take control of
the shared serial port to perform text console redirection and to redirect serial
traffic, using Serial over LAN (SOL).
Video connector: Connect a monitor to this connector. The video connectors on
the front and rear of the server can be used simultaneously.
12IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Note: The maximum video resolution is 1600 x 1200 at 75 Hz.
Systems-management Ethernet connector: Use this connector to connect the
server to a network for systems-management information control. This connector is
used only by the IMM.
The following illustration shows the LEDs on the rear of the server.
Ethernet
activity LED
Ethernet
link LED
Power-on
LED (green)
Locator LED (blue)
AC power
LED (green)
DC power
LED (green)
Power-supply
error LED (amber)
System-error
LED (amber)
Ethernet activity LEDs: When any of these LEDs is lit, it indicates that the server
is transmitting to or receiving signals from the Ethernet LAN that is connected to the
Ethernet port that corresponds to that LED.
Ethernet link LEDs: When these LEDs are lit, they indicate that there is an active
link connection on the 10BASE-T, 100BASE-TX, or 1000BASE-TX interface for the
Ethernet port.
AC power LED: Each hot-swap power supply has an ac power LED and a dc
power LED. When the ac power LED is lit, it indicates that sufficient power is
coming into the power supply through the power cord. During typical operation, both
the ac and dc power LEDs are lit. For any other combination of LEDs, see
“Power-supply LEDs” on page 69.
IN OK power LED: Each hot-swap dc power supply has an IN OK power LED and
an OUT OK power LED. When the IN OK power LED is lit, it indicates that sufficient
power is coming into the power supply through the power cord. During typical
operation, both the IN OK and OUT OK power LEDs are lit. For any other
combination of LEDs, see “Power-supply LEDs” on page 69.
DC power LED: Each hot-swap power supply has a dc power LED and an ac
power LED. When the dc power LED is lit, it indicates that the power supply is
supplying adequate dc power to the system. During typical operation, both the ac
and dc power LEDs are lit. For any other combination of LEDs, see “Power-supply
LEDs” on page 69.
OUT OK power LED: Each hot-swap dc power supply has an IN OK power LED
and an OUT OK power LED. When the OUT OK power LED is lit, it indicates that
the power supply is supplying adequate dc power to the system. During typical
operation, both the IN OK and OUT OK power LEDs are lit. For any other
combination of LEDs, see “Power-supply LEDs” on page 69.
Power-supply error LED: When the power-supply error LED is lit, it indicates that
the power supply has failed.
Note: Power supply 1 is the default/primary power supply. If power supply 1 fails,
you must replace the power supply immediately.
Chapter 2. Introduction13
System-error LED: When this LED is lit, it indicates that a system error has
occurred. An LED on the light path diagnostics panel is also lit to help isolate the
error. This LED is the same as the system-error LED on the front of the server.
Locator LED: Use this LED to visually locate the server among other servers. You
can use IBM Systems Director to light this LED remotely. This LED is the same as
the system-locator LED on the front of the server.
Power-on LED: The states of the power-on LED are as follows:
v Off: AC power is not present, or the power supply or the LED itself has failed.
v Flashing rapidly (4 times per second): The server is turned off and is not
ready to be turned on. The power-control button is disabled. This will last
approximately 20 to 40 seconds.
Note: Approximately 40 seconds after the server is connected to ac power, the
power-control button becomes active.
v Flashing slowly (once per second): The server is turned off and is ready to be
turned on. You can press the power-control button to turn on the server.
v Lit: The server is turned on.
v Fading on and off: The server is in a reduced-power state. To wake the server,
press the power-control button or use the IMM Web interface.
14IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Internal connectors, LEDs, and jumpers
The illustrations in this section show the LEDs, connectors, and jumpers on the
internal boards. The illustrations might differ slightly from your hardware.
System-board internal connectors
The following illustration shows the internal connectors on the system board.
Chapter 2. Introduction15
System-board external connectors
The following illustration shows the external input/output connectors on the system
board.
16IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
System-board switches and jumpers
The following illustration shows the location and description of the switches and
jumpers.
Note: If there is a clear protective sticker on the top of the switch blocks, you must
remove and discard it to access the switches.
The default positions for the UEFI and the IMM recovery jumpers are pins 1 and 2.
UEFI boot recovery
jumper (J29)
IMM recovery jumper
(J147)
1
3
2
1
2
3
The following table describes the jumper settings for J29 and J147 on the system
board.
Table 2. System board jumpers
Jumper
number
J29UEFI boot
SW3 switch blockSW4 switch block
Jumper
nameJumper setting
recovery
jumper
v Pins 1 and 2: Normal (default) Loads the primary server
firmware ROM page.
v Pins 2 and 3: Loads the secondary (backup) server firmware
ROM page.
Chapter 2. Introduction17
Table 2. System board jumpers (continued)
Jumper
number
J147IMM
Notes:
1. If no jumper is present, the server responds as if the pins are set to 1 and 2.
2. Changing the position of the UEFI boot recovery jumper from pins 1 and 2 to pins 2 and
3 before the server is turned on alters which flash ROM page is loaded. Do not change
the jumper pin position after the server is turned on. This can cause an unpredictable
problem.
Jumper
nameJumper setting
recovery
jumper
v Pins 1 and 2: Normal (default) Loads the primary IMM
firmware ROM page.
v Pins 2 and 3: Loads the secondary (backup) IMM firmware
ROM page.
The following illustration shows the jumper settings for switch blocks SW3 and SW4
on the system board.
Table 3 on page 19 and Table 4 on page 19 describe the function of each switch on
SW3 and SW4 switch blocks on the system board.
18IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
1234 1234
OffOff
SW3 switch blockSW4 switch block
Table 3. System board switch block 3, switches1-4
Switch
numberDefault valueSwitch description
1OffClear CMOS memory. When this switch is toggled to On, it clears the data in
CMOS memory.
2OffTrusted Platform Module (TPM) physical presence. Turning this switch to the on
position indicates a physical presence to the TPM.
3OffReserved.
4OffReserved.
Table 4. System board switch block 4, switches1-4
Switch
numberDefault valueSwitch description
1OffPower-on password override. Changing the position of this switch bypasses the
power-on password check the next time the server is turned on and starts the
Setup utility so that you can change or delete the power-on password. You do not
have to move the switch back to the default position after the password is
overridden.
Changing the position of this switch does not affect the administrator password
check if an administrator password is set.
See “Passwords” on page 259 for additional information about the power-on
password.
2OffPower-on override. When this switch is toggled to On and then to Off, you force a
power-on which overrides the power-on and power-off button on the server and
they become nonfunctional.
3OffForced power permission overrides the IMM power-on checking process. (Trained
service technician only)
4OffReserved.
Notes:
1. Before you change any switch settings or move any jumpers, turn off the server;
then, disconnect all power cords and external cables. (Review the information in
“Safety” on page vii, “Installation guidelines” on page 169, and “Handling
static-sensitive devices” on page 171.)
2. Any system-board switch or jumper blocks that are not shown in the illustrations
in this document are reserved.
Chapter 2. Introduction19
System-board LEDs
The following illustration shows the light-emitting diodes (LEDs) on the system
board.
Note: Error LEDs remain lit only while the server is connected to power.
Table 5. System-board LEDs
LEDDescription
Error LEDsThe associated component has failed.
12-volt power (A, B, C, D, E
and AUX) error LEDs
20IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
If any of these LEDs are lit, there is a failure in the
associated system-board power channel (see “Power
problems” on page 55).
Table 6. System Pulse LEDs
LEDDescriptionAction
Enclosure manager heartbeat Indicates the status of
power-on and power-off
sequencing.
(Trained service technician
only) If the server is
connected to power and the
LED is not flashing, replace
When the server is
the system board.
connected to power, this LED
flashes slowly to indicate that
the enclosure manager is
working correctly.
IMM heartbeatIndicates the status of the
boot process of the IMM.
If the LED does not begin
flashing within 30 seconds of
when the server is connected
When the server is
connected to power this LED
flashes quickly to indicate
that the IMM code is loading.
When the loading is
complete, the LED stops
flashing briefly and then
flashes slowly to indicate that
the IMM if fully operational
and you can press the
power-control button to start
the server.
to power, complete the
following steps:
1. (Trained service
technician only) Use the
IMM recovery jumper to
recover the firmware (see
“System-board switches
and jumpers” on page
17).
2. (Trained service
technician only) Replace
the system board.
Chapter 2. Introduction21
System-board optional device connectors
The following illustration shows the connectors for user-installable options.
PCI riser-card adapter connectors
The following illustration shows the connectors on the PCI riser card for
user-installable PCI adapters.
PCI riser-card assembly LEDs
The following illustration shows the light-emitting diodes (LEDs) on the PCI
riser-card assembly.
22IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Note: Error LEDs remain lit only while the server is connected to power.
Upper PCI slot error LED
(Adaptor card error LED)
Lower PCI slot
error LED
Chapter 2. Introduction23
SAS riser-card connectors and LEDs
The following illustrations show the connectors and LEDs on the SAS riser cards.
Note: Error LEDs remain lit only while the server is connected to power.
A 12-drive-capable model server contains the riser card that is shown in the
following illustration.
USB hypervisor
connector
PCI Express SAS
controller connector
SAS controller
error LED
SAS riser
card
A tape-enabled model server contains the riser card that is shown in the following
illustration.
PCI Express SAS
USB hypervisor
connector
USB
tape
signal
controller connector
SATA tape signal
Tape drive
power
SAS controller
error LED
SAS riser card
(tape-enabled model server)
24IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Chapter 3. Diagnostics
This chapter describes the diagnostic tools that are available to help you solve
problems that might occur in the server.
If you cannot locate and correct a problem by using the information in this chapter,
see Appendix A, “Getting help and technical assistance,” on page 279 for more
information.
Diagnostic tools
The following tools are available to help you diagnose and solve hardware-related
problems:
v Troubleshooting tables
These tables list problem symptoms and actions to correct the problems. See
“Troubleshooting tables” on page 43.
v Light path diagnostics
Use the light path diagnostics to diagnose system errors quickly. See “Light path
diagnostics” on page 62 for more information.
v Dynamic System Analysis Preboot (DSA) diagnostic programs
The DSA Preboot diagnostic programs provide problem isolation, configuration
analysis, and error log collection. The diagnostic programs are the primary
method of testing the major components of the server and are stored in
integrated USB memory. The diagnostic programs collect the following
information about the server:
– System configuration
– Network interfaces and settings
– Installed hardware
– Light path diagnostics status
– Service processor status and configuration
– Vital product data, firmware, and IBM's implementation of UEFI configuration
– Hard disk drive health
– RAID controller configuration
– ServeRAID controller and service processor event logs, including the following
information:
- System-event logs
- Temperature, voltage, and fan speed information
- Tape drive presence and read/write test results
- Systems management analysis and reporting technology (SMART) data
- USB information
- Monitor configuration information
- PCI slot information
The diagnostic programs create a merged log that includes events from all
collected logs. The information is collected into a file that you can send to IBM
service and support. Additionally, you can view the server information locally
through a generated text report file. You can also copy the log to removable
media and view the log from a Web browser. See “Running the diagnostic
programs” on page 72 for more information.
v IBM Electronic Service Agent
IBM Electronic Service Agent is a software tool that monitors the server for
hardware error events and automatically submits electronic service requests to
IBM service and support. Also, it can collect and transmit system configuration
information on a scheduled basis so that the information is available to you and
your support representative. It uses minimal system resources, is available free
of charge, and can be downloaded from the Web. For more information and to
download Electronic Service Agent, go to http://www.ibm.com/support/electronic/.
When you turn on the server, it performs a series of tests to check the operation of
the server components and some optional devices in the server. This series of tests
is called the power-on self-test, or POST. This server does not use beep codes for
server status.
If a power-on password is set, you must type the password and press Enter, when
you are prompted, for POST to run.
Error codes and messages are displayed in the following types of event logs:
v POST event log: This log contains the three most recent error codes and
messages that were generated during POST. You can view the POST event log
through the Setup utility.
v System-event log: This log contains all BMC, POST, and system management
interrupt (SMI) events. You can view the system-event log from the Setup utility
and through the Dynamic System Analysis (DSA) program (as the IPMI event
log).
The system-event log is limited in size. When it is full, new entries will not
overwrite existing entries; therefore, you must periodically save and then clear
the system-event log through the Setup utility when the IMM logs an event that
indicates that the log is more than 75% full. When you are troubleshooting, you
might have to save and then clear the system-event log to make the most recent
events available for analysis.
Messages are listed on the left side of the screen, and details about the selected
message are displayed on the right side of the screen. To move from one entry
to the next, use the Up Arrow (↑) and Down Arrow (↓) keys.
Some IMM sensors cause assertion events to be logged when their setpoints are
reached. When a setpoint condition no longer exists, a corresponding
deassertion event is logged. However, not all events are assertion-type events.
v Integrated management module (IMM) event log: This log contains a filtered
subset of all IMM, POST, and system management interrupt (SMI) events. You
can view the IMM event log through the IMM Web interface and through the
Dynamic System Analysis (DSA) program (as the ASM event log).
v DSA log: This log is generated by the Dynamic System Analysis (DSA) program,
and it is a chronologically ordered merge of the system-event log (as the IPMI
event log), the IMM event log (as the ASM event log), and the operating-system
event logs. You can view the DSA log through the DSA program.
Viewing event logs from the Setup utility
To view the POST event log or system-event log, complete the following steps:
1. Turn on the server.
2. When the prompt <F1> Setup is displayed, press F1. If you have set both a
power-on password and an administrator password, you must type the
administrator password to view the event logs.
3. Select System Event Logs and use one of the following procedures:
v To view the POST event log, select POST Event Viewer.
26IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
v To view the system-event log, select System Event Log.
Viewing event logs without restarting the server
If the server is not hung, methods are available for you to view one or more event
logs without having to restart the server.
If you have installed Portable or Installable Dynamic System Analysis (DSA), you
can use it to view the system-event log (as the IPMI event log), the IMM event log
(as the ASM event log), the operating-system event logs, or the merged DAA log.
You can also use DSA Preboot to view these logs, although you must restart the
server to use DSA Preboot. To install Portable DSA, Installable DSA, or DSA
Preboot or to download a DSA Preboot CD image, go to http://www.ibm.com/
systems/support/supportsite.wss/docdisplay?lndocid=SERV-DSA
&brandind=5000008 or complete the following steps.
Note: Changes are made periodically to the IBM Web site. The actual procedure
might vary slightly from what is described in this document.
1. Go to http://www.ibm.com/systems/support/.
2. Under Product support, click System x.
3. Under Popular links, click Software and device drivers.
4. Under Related downloads, click Dynamic System Analysis (DSA) to display
the matrix of downloadable DSA files.
If IPMItool is installed in the server, you can use it to view the system-event log.
Most recent versions of the Linux operating system come with a current version of
IPMItool. For information about IPMItool, see http://publib.boulder.ibm.com/
infocenter/toolsctr/v1r0/index.jsp?topic=/com.ibm.xseries.tools.doc/
config_tools_ipmitool.html or complete the following steps.
Note: Changes are made periodically to the IBM Web site. The actual procedure
might vary slightly from what is described in this document.
1. Go to http://publib.boulder.ibm.com/infocenter/toolsctr/v1r0/index.jsp.
2. In the navigation pane, click IBM System x and BladeCenter Tools Center.
For an overview of IPMI, go to http://publib.boulder.ibm.com/infocenter/toolsctr/v1r0/
index.jsp?topic=/com.ibm.xseries.tools.doc/co... or complete the following steps:
1. Go to http://publib.boulder.ibm.com/infocenter/systems/index.jsp.
2. In the navigation pane, click IBM Systems Information Center.
3. Expand Operating systems, expand Linux information, expand Blueprints
for Linux on IBM systems, and click Using Intelligent Platform Management
Interface (IPMI) on IBM Linux platforms.
You can view the IMM event log through the Event Log link in the integrated
management module (IMM) Web interface.
The following table describes the methods that you can use to view the event logs,
depending on the condition of the server. The first two conditions generally do not
require that you restart the server.
Chapter 3. Diagnostics27
Table 7. Methods for viewing event logs
ConditionAction
The server is not hung and is connected to a
network.
The server is not hung and is not connected
to a network.
The server is hung.
Use any of the following methods:
v Run Portable or Installable DSA to view
the event logs or create an output file that
you can send to IBM service and support.
v Type the IP address of the IMM and go to
the Event Log page.
v Use IPMItool to view the system-event log.
Use IPMItool locally to view the system-event
log.
v If DSA Preboot is installed, restart the
server and press F2 to start DSA Preboot
and view the event logs.
v If DSA Preboot is not installed, insert the
DSA Preboot CD and restart the server to
start DSA Preboot and view the event
logs.
v Alternatively, you can restart the server
and press F1 to start the Setup utility and
view the POST event log or system-event
log. For more information, see “Viewing
event logs from the Setup utility” on page
26.
POST error codes
The following table describes the POST error codes and suggested actions to
correct the detected problems. These errors can appear as severe, warning, or
informational.
28IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error codeDescriptionAction
0010002Microprocessor not supported
1. Reseat the following components one at a time,
in the order shown, restarting the server each
time:
a. (Trained service technician only)
Microprocessor 1
b. (Trained service technician only)
Microprocessor 2 (if installed)
2. (Trained service technician only) Remove
microprocessor 2 and restart the server.
3. (Trained service technician only) Remove
microprocessor 1 and install microprocessor 2 in
the microprocessor 1 connector. Restart the
server. If the error is corrected, microprocessor 1
is bad and must be replaced.
4. Replace the following components one at a time,
in the order shown, restarting the server each
time:
a. (Trained service technician only)
Microprocessor 1
b. (Trained service technician only)
Microprocessor 2
c. (Trained service technician only) System
board
0011000Invalid microprocessor type
1. Update the firmware to the latest level (see
“Updating the firmware” on page 253).
2. (Trained service technician only) Remove and
replace the affected microprocessor (error LED is
lit) with a supported type.
0011002Microprocessor mismatch
1. Run the Setup utility and view the microprocessor
information to compare the installed
microprocessor specifications.
2. (Trained service technician only) Remove and
replace one of the microprocessors so that they
both match.
Chapter 3. Diagnostics29
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error codeDescriptionAction
0011004Microprocessor failed BIST
001100AMicrocode update failed
0050001DIMM disabledNote: Each time you install or remove a DIMM, you
1. Update the firmware to the latest level (see
“Updating the firmware” on page 253).
2. (Trained service technician only) Reseat
microprocessor 2.
3. Replace the following components one at a time,
in the order shown, restarting the server each
time:
a. (Trained service technician only)
Microprocessor
b. (Trained service technician only) System
board
1. Update the server firmware to the latest level (see
“Updating the firmware” on page 253).
2. (Trained service technician only) Replace the
microprocessor.
must disconnect the server from the power source;
then, wait 10 seconds before restarting the server.
1. Make sure the DIMM is installed correctly (see
“Installing a memory module” on page 217).
2. If the DIMM was disabled because of a memory
fault, follow the suggested actions for that error
event and restart the server.
3. Check the IBM support website for an applicable
retain tip or firmware update that applies to this
memory event. If no memory fault is recorded in
the logs and no DIMM connector error LED is lit,
you can re-enable the DIMM through the Setup
utility or the Advanced Settings Utility (ASU).
30IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error codeDescriptionAction
0051003Uncorrectable DIMM errorNote: Each time you install or remove a DIMM, you
must disconnect the server from the power source;
then, wait 10 seconds before restarting the server.
1. Check the IBM support website for an applicable
retain tip or firmware update that applies to this
memory error.
2. Manually re-enable all affected DIMMs if the
server firmware version is older than UEFI v1.10.
If the server firmware version is UEFI v1.10 or
newer, disconnect and reconnect the server to the
power source and restart the server.
3. If the problem remains, replace the failing DIMM
(see “Removing a memory module (DIMM)” on
page 217 and “Installing a memory module” on
page 217).
4. (Trained service technician only) If the problem
occurs on the same DIMM connector, check the
DIMM connector. If the connector contains any
foreign material or is damaged, replace the
system board (see “Removing the system board”
on page 247 and “Installing the system board” on
page 248).
5. (Trained service technician only) Remove the
affected microprocessor and check the
microprocessor socket pins for any damaged
pins. If a damage is found, replace the system
board (see “Removing the system board” on page
247 and “Installing the system board” on page
248).
6. (Trained Service technician only) Replace the
affected microprocessor (see “Removing a
microprocessor and heat sink” on page 238 and
“Installing a microprocessor and heat sink” on
page 240).
0051004DIMM presence detected read/write failureNote: Each time you install or remove a DIMM, you
must disconnect the server from the power source;
then, wait 10 seconds before restarting the server.
1. Update the server firmware to the latest level (see
“Updating the firmware” on page 253).
2. Reseat the DIMMs.
3. Install DIMMs in the correct sequence (see
“Installing a memory module” on page 217).
4. Replace the failing DIMM.
5. (Trained service technician only) Replace the
system board.
Chapter 3. Diagnostics31
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error codeDescriptionAction
0051006DIMM mismatch detectedNote: Each time you install or remove a DIMM, you
must disconnect the server from the power source;
then, wait 10 seconds before restarting the server.
Make sure that the DIMMs match and are installed in
the correct sequence (see “Installing a memory
module” on page 217).
0051009No memory detectedNote: Each time you install or remove a DIMM, you
must disconnect the server from the power source;
then, wait 10 seconds before restarting the server.
1. Make sure one or more DIMMs are installed in
the server.
2. Reseat the DIMMs and restart the server (see
“Removing a memory module (DIMM)” on page
217 and “Installing a memory module” on page
217).
3. Make sure that the DIMMs match and are
installed in the correct sequence (see “Installing a
memory module” on page 217).
4. (Trained service technician only) Replace the
microprocessor that controls the failing DIMMs
(see “Removing a microprocessor and heat sink”
on page 238 and “Installing a microprocessor and
heat sink” on page 240).
5. (Trained service technician only) Replace the
system board (see “Removing the system board”
on page 247 and “Installing the system board” on
page 248).
005100ANo usable memory detectedNote: Each time you install or remove a DIMM, you
must disconnect the server from the power source;
then, wait 10 seconds before restarting the server.
1. Make sure one or more DIMMs are installed in
the server.
2. Reseat the DIMMs and restart the server (see
“Removing a memory module (DIMM)” on page
217 and “Installing a memory module” on page
217).
3. Make sure that the DIMMs match and are
installed in the correct sequence (see “Installing a
memory module” on page 217).
4. Clear CMOS memory to ensure that all DIMM
connectors are enabled (see “Removing the
battery” on page 229 and “Installing the battery”
on page 231). Note that all firmware settings will
be reset to the default settings.
32IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error codeDescriptionAction
0058001PFA threshold exceededNote: Each time you install or remove a DIMM, you
must disconnect the server from the power source;
then, wait 10 seconds before restarting the server.
1. Check the IBM support website for an applicable
retain tip or firmware update that applies to this
memory error.
2. Swap the affected DIMMs (as indicated by the
error LEDs on the system board or the event
logs) to a different memory channel or
microprocessor (see “Installing a memory module”
on page 217).
3. If the error still occurs on the same DIMM,
replace the affected DIMM.
4. (Trained service technician only) If the problem
occurs on the same DIMM connector, check the
DIMM connector. If the connector contains any
foreign material or is damaged, replace the
system board (see “Removing the system board”
on page 247 and “Installing the system board” on
page 248).
5. (Trained service technician only) Remove the
affected microprocessor and check the
microprocessor socket pins for any damaged
pins. If a damage is found, replace the system
board (see “Removing the system board” on page
247 and “Installing the system board” on page
248).
6. (Trained Service technician only) Replace the
affected microprocessor (see “Removing a
microprocessor and heat sink” on page 238 and
“Installing a microprocessor and heat sink” on
page 240).
0058007DIMM population is unsupportedNote: Each time you install or remove a DIMM, you
must disconnect the server from the power source;
then, wait 10 seconds before restarting the server.
1. Reseat the DIMMs and restart the server (see
“Removing a memory module (DIMM)” on page
217 and “Installing a memory module” on page
217).
2. Make sure that the DIMMs are installed in the
proper sequence (see “Installing a memory
module” on page 217).
Chapter 3. Diagnostics33
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error codeDescriptionAction
0058008DIMM failed memory testNote: Each time you install or remove a DIMM, you
must disconnect the server from the power source;
then, wait 10 seconds before restarting the server.
1. Check the IBM support website for an applicable
retain tip or firmware update that applies to this
memory error.
2. Manually re-enable all affected DIMMs if the
server firmware version is older than UEFI v1.10.
If the server firmware version is UEFI v1.10 or
newer, disconnect and reconnect the server to the
power source and restart the server.
3. Swap the affected DIMMs (as indicated by the
error LEDs on the system board or the event
logs) to a different memory channel or
microprocessor (see “Installing a memory module”
on page 217 for memory population).
4. If the problem is related to a DIMM, replace the
failing DIMM (see “Removing a memory module
(DIMM)” on page 217 and “Installing a memory
module” on page 217).
5. (Trained service technician only) If the problem
occurs on the same DIMM connector, check the
DIMM connector. If the connector contains any
foreign material or is damaged, replace the
system board (see “Removing the system board”
on page 247 and “Installing the system board” on
page 248).
6. (Trained service technician only) Remove the
affected microprocessor and check the
microprocessor socket pins for any damaged
pins. If a damage is found, replace the system
board (see “Removing the system board” on page
247 and “Installing the system board” on page
248).
7. (Trained service technician only) If the problem is
related to microprocessor socket pins, replace the
system board (see “Removing the system board”
on page 247 and “Installing the system board” on
page 248).
8. (Trained Service technician only) Replace the
affected microprocessor (see “Removing a
microprocessor and heat sink” on page 238 and
“Installing a microprocessor and heat sink” on
page 240).
0058015Start to Activate Spare Memory ChannelInformation only. A failed DIMM has been detected to
activate the memory online-spare feature. Check the
event log for uncorrected DIMM failure events.
34IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error codeDescriptionAction
00580A1Invalid DIMM population for mirroring modeNote: Each time you install or remove a DIMM, you
must disconnect the server from the power source;
then, wait 10 seconds before restarting the server.
1. If a fault LED is lit, resolve the failure.
2. Install the DIMMs in the correct sequence (see
“Installing a memory module” on page 217).
00580A4Memory population changedInformation only. Memory has been added, moved, or
changed.
00580A5Mirror failover completeInformation only. Memory redundancy has been lost.
Check the event log for uncorrected DIMM failure
events (See “Event logs” on page 26 for more
information).
00580A6Spare Memory Channel ActivatedInformation only. Memory online-spare channel has
been activated to backup a failed DIMM. Check the
event log for uncorrected DIMM failure events.
0068002CMOS battery cleared
1. Reseat the battery.
2. Clear the CMOS memory (see “System-board
switches and jumpers” on page 17).
3. Replace the following components one at a time,
in the order shown, restarting the server each
time:
a. Battery
b. (Trained service technician only) System
board
2011000PCI-X PERR
1. Check the riser-card LEDs.
2. Reseat all affected adapters and riser cards.
3. Update the PCI device firmware.
4. Remove both adapters from the riser card.
5. Replace the following components one at a time,
in the order shown, restarting the server each
time:
a. Riser card
b. (Trained service technician only) System
board
Chapter 3. Diagnostics35
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error codeDescriptionAction
2011001PCI-X SERR
2018001PCI Express uncorrected or uncorrected
error
1. Check the riser-card LEDs.
2. Reseat all affected adapters and riser cards.
3. Update the PCI device firmware.
4. Remove the adapter from the riser card.
5. Replace the following components one at a time,
in the order shown, restarting the server each
time:
a. Riser card
b. (Trained service technician only) System
board
1. Check the riser-card LEDs.
2. Reseat all affected adapters and riser cards.
3. Update the PCI device firmware.
4. Remove both adapters from the riser card.
5. Replace the following components one at a time,
in the order shown, restarting the server each
time:
a. Riser card
b. (Trained service technician only) System
board
36IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error codeDescriptionAction
2018002Option ROM resource allocation failureInformational message that some devices might not
be initialized.
1. If possible, rearrange the order of the adapters in
the PCI slots to change the load order of the
optional-device ROM code.
2. Run the Setup utility, select Start Options, and
change the boot priority to change the load order
of the optional-device ROM code.
3. Run the Setup utility and disable some other
resources, if their functions are not being used, to
make more space available:
a. Select Start Options and Planar Ethernet
(PXE/DHCP) to disable the integrated
Ethernet controller ROM.
b. Select Advanced Functions, then PCI Bus
Control, then PCI ROM Control Execution
to disable the ROM of adapters in the PCI
slots.
c. Select Devices and I/O Ports to disable any
of the integrated devices.
4. Replace the following components one at a time,
in the order shown, restarting the server each
time:
a. Each adapter
b. (Trained service technician only) System
board
3xx0007 (xx
can be 00 - 19)
Firmware fault detected, system halted
1. Recover the server firmware to the latest level.
2. Undo any recent configuration changes, or clear
CMOS memory to restore the settings to the
default values.
3. Remove any recently installed hardware.
3038003Firmware corrupted
1. Run the Setup utility, select Load DefaultSettings, and save the settings to recover the
server firmware.
2. (Trained service technician only) Replace the
system board.
3048005Booted secondary (backup) server firmware
image
Information only. The backup switch was used to boot
the secondary bank.
Chapter 3. Diagnostics37
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error codeDescriptionAction
3048006Booted secondary (backup) server firmware
image because of ABR
305000ARTC date/time is incorrect
3058001System configuration invalid
1. Run the Setup utility, select Load DefaultSettings, and save the settings to recover the
primary server firmware settings.
2. Turn off the server and remove it from the power
source.
3. Reconnect the server to the power source, and
then turn on the server.
1. Adjust the date and time settings in the Setup
utility, and then restart the server.
2. Reseat the battery.
3. Replace the following components one at a time,
in the order shown, restarting the server each
time:
a. Battery
b. (Trained service technician only) System
board
1. Run the Setup utility, and select Save Settings.
2. Run the Setup utility, select Load DefaultSettings, and save the settings.
3. Reseat the following components one at a time in
the order shown, restarting the server each time:
a. Battery
b. Failing device (if the device is a FRU, then it
must be reseated by a trained service
technician only)
4. Replace the following components one at a time,
in the order shown, restarting the server each
time:
a. Battery
b. Failing device (if the device is a FRU, then it
must be replaced by a trained service
technician only)
c. (Trained service technician only) System
board
38IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error codeDescriptionAction
3058004Three boot failures
1. Undo any recent system changes, such as new
settings or newly installed devices.
2. Make sure that the server is attached to a reliable
power source.
3. Remove all hardware that is not listed on the
ServerProven Web site.
4. Make sure that the operating system is not
corrupted.
5. Run the Setup utility, save the configuration, and
then restart the server.
6. See “Problem determination tips” on page 155.
3108007System configuration restored to default
settings
3138002Boot configuration error
Information only. This is message is usually
associated with the CMOS battery clear event.
1. Remove any recent configuration changes that
you made in the Setup utility.
2. Run the Setup utility, select Load DefaultSettings, and save the settings.
3808000IMM communication failure
1. Shut down the system and remove the power
cords from the server for 30 seconds; then,
reconnect the server to power and restart it.
2. Update the IMM firmware.
3. Make sure that the virtual media key is seated
and not damaged.
4. (Trained service technician only) Replace the
system board.
3808002Error updating system configuration to IMM
1. Remove power from the server, and then
reconnect the server to power and restart it.
2. Run the Setup utility and select Save Settings.
3. Update the firmware.
3808003Error retrieving system configuration from
IMM
1. Remove power from the server, and then
reconnect the server to power and restart it.
2. Run the Setup utility and select Save Settings.
3. Update the IMM firmware.
3808004IMM system event log full
v When out-of-band, use the IMM Web interface or
IPMItool to clear the logs from the operating
system.
v When using the local console:
1. Run the Setup utility.
2. Select System Event Log.
3. Select Clear System Event Log.
4. Restart the server.
Chapter 3. Diagnostics39
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error codeDescriptionAction
3818001Core Root of Trust Measurement (CRTM)
update failed
3818002Core Root of Trust Measurement (CRTM)
update aborted
3818003Core Root of Trust Measurement (CRTM)
flash lock failed
3818004Core Root of Trust Measurement (CRTM)
system error
3818005Current Bank Core Root of Trust
Measurement (CRTM) capsule signature
invalid
3818006Opposite bank CRTM capsule signature
invalid
3818007CRTM update capsule signature invalid
3828004AEM power capping disabled
1. Run the Setup utility, select Load DefaultSettings, and save the settings.
2. (Trained service technician only) Replace the
system board.
1. Run the Setup utility, select Load DefaultSettings, and save the settings.
2. (Trained service technician only) Replace the
system board.
1. Run the Setup utility, select Load DefaultSettings, and save the settings.
2. (Trained service technician only) Replace the
system board.
1. Run the Setup utility, select Load DefaultSettings, and save the settings.
2. (Trained service technician only) Replace the
system board.
1. Run the Setup utility, select Load DefaultSettings, and save the settings.
2. (Trained service technician only) Replace the
system board.
1. Switch the firmware bank to the backup bank.
2. Run the Setup utility, select Load DefaultSettings, and save the settings.
3. Switch the bank back to the current bank.
4. (Trained service technician only) Replace the
system board.
1. Run the Setup utility, select Load DefaultSettings, and save the settings.
2. (Trained service technician only) Replace the
system board.
1. Check the settings and the event logs.
2. Make sure that the Active Energy Manager
feature is enabled in the Setup utility. Select
System Settings, Power, Active Energy, and
Capping Enabled.
3. Update the server firmware.
4. Update the IMM firmware.
40IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Checkout procedure
The checkout procedure is the sequence of tasks that you should follow to
diagnose a problem in the server.
About the checkout procedure
Before you perform the checkout procedure for diagnosing hardware problems,
review the following information:
v Read the safety information that begins on page vii.
v The diagnostic programs provide the primary methods of testing the major
components of the server, such as the system board, Ethernet controller,
keyboard, mouse (pointing device), serial ports, and hard disk drives. You can
also use them to test some external devices. If you are not sure whether a
problem is caused by the hardware or by the software, you can use the
diagnostic programs to confirm that the hardware is working correctly.
v When you run the diagnostic programs, a single problem might cause more than
one error message. When this happens, correct the cause of the first error
message. The other error messages usually will not occur the next time you run
the diagnostic programs.
Exception: If multiple error codes or light path diagnostics LEDs indicate a
microprocessor error, the error might be in the microprocessor or in the
microprocessor socket. See “Microprocessor problems” on page 51 for
information about diagnosing microprocessor problems.
v Before you run the diagnostic programs, you must determine whether the failing
server is part of a shared hard disk drive cluster (two or more servers sharing
external storage devices). If it is part of a cluster, you can run all diagnostic
programs except the ones that test the storage unit (that is, a hard disk drive in
the storage unit) or the storage adapter that is attached to the storage unit. The
failing server might be part of a cluster if any of the following conditions is true:
– You have identified the failing server as part of a cluster (two or more servers
sharing external storage devices).
– One or more external storage units are attached to the failing server and at
least one of the attached storage units is also attached to another server or
unidentifiable device.
– One or more servers are located near the failing server.
Important: If the server is part of a shared hard disk drive cluster, run one test
at a time. Do not run any suite of tests, such as “quick” or “normal” tests,
because this might enable the hard disk drive diagnostic tests.
v If the server is halted and a POST error code is displayed, see “Event logs” on
page 26. If the server is halted and no error message is displayed, see
“Troubleshooting tables” on page 43 and “Solving undetermined problems” on
page 154.
v For information about power-supply problems, see “Solving power problems” on
page 153.
v For intermittent problems, check the error log; see “Event logs” on page 26 and
“Diagnostic programs, messages, and error codes” on page 72.
Chapter 3. Diagnostics41
Performing the checkout procedure
To perform the checkout procedure, complete the following steps:
1. Is the server part of a cluster?
v No: Go to step 2.
v Yes: Shut down all failing servers that are related to the cluster. Go to step 2.
2. Complete the following steps:
a. Check the power supply LEDs, see “Power-supply LEDs” on page 69.
b. Turn off the server and all external devices.
c. Check all internal and external devices for compatibility at
http://www.ibm.com/servers/eserver/serverproven/compat/us/.
d. Make sure the server is cabled correctly.
e. Check all cables and power cords.
f. Set all display controls to the middle positions.
g. Turn on all external devices.
h. Turn on the server. If the server does not start, see “Troubleshooting tables”
on page 43.
i. Check the system-error LED on the operator information panel. If it is
flashing, check the light path diagnostics LEDs (see “Light path diagnostics”
on page 62).
j. Check for the following results:
v Successful completion of POST (see “POST” on page 26 for more
information).
v Successful completion of startup, which is indicated by a readable display
of the operating-system desktop.
42IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Troubleshooting tables
Use the troubleshooting tables to find solutions to problems that have identifiable
symptoms.
If you cannot find a problem in these tables, see “Running the diagnostic programs”
on page 72 for information about testing the server.
If you have just added new software or a new optional device and the server is not
working, complete the following steps before you use the troubleshooting tables:
1. Check the system-error LED on the operator information panel; if it is lit, check
the LEDs on the system board (see “System-board LEDs” on page 20).
2. Remove the software or device that you just added.
3. Run the diagnostic tests to determine whether the server is running correctly.
4. Reinstall the new software or new device.
DVD drive problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
SymptomAction
The optional DVD drive is not
recognized.
The CD or DVD drive is not
working correctly.
1. Make sure that:
v The SATA channel to which the DVD drive is attached (primary) is enabled in
the Setup utility.
v All cables and jumpers are installed correctly (see “Internal cable routing and
connectors” on page 172).
v The signal cable and connector are not damaged and the connector pins are
not bent.
v All damaged parts are repaired or replaced.
v The correct device driver is installed for the DVD drive.
2. Run the DVD drive diagnostic programs and select the optical drive test. See
“Running the diagnostic programs” on page 72.
3. Reseat the following components:
a. DVD drive
b. DVD drive cable
4. Replace the components listed in step 3 one at a time, in the order shown,
restarting the server each time.
1. Clean the CD or DVD.
2. Replace the CD or DVD with new CD or DVD media
3. Run the DVD drive diagnostic programs.
4. Reseat the DVD drive.
5. Replace the DVD drive.
Chapter 3. Diagnostics43
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
SymptomAction
The DVD drive tray is not
working.
1. Make sure that the server is turned on.
2. Insert the end of a straightened paper clip into the manual tray-release
opening.
3. Reseat the DVD drive.
4. Replace the DVD drive.
General problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
SymptomAction
A cover latch is broken, an LED
is not working, or a similar
problem has occurred.
The server is hung while the
screen is on. Cannot start the
Setup utility by pressing F1.
If the part is a CRU, replace it. If the part is a FRU, the part must be replaced by a
trained service technician.
1. See “Nx boot failure” on page 113 for more information.
2. See “Recovering the server firmware” on page 109 for more information.
Hard disk drive problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
SymptomAction
A hard disk drive has failed, and
the associated amber hard disk
drive status LED is lit.
44IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Replace the failed hard disk drive (see “Removing a hot-swap hard disk drive” on
page 210 and “Installing a hot-swap hard disk drive” on page 210).
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
SymptomAction
An installed hard disk drive is
not recognized.
1. Observe the associated amber hard disk drive status LED. If the LED is lit, it
indicates a drive fault.
2. If the LED is lit, remove the drive from the bay, wait 45 seconds, and reinsert
the drive, making sure that the drive assembly connects to the hard disk drive
backplane.
3. Observe the associated green hard disk drive activity LED and the amber
status LED:
v If the green activity LED is flashing and the amber status LED is not lit, the
drive is recognized by the controller and is working correctly. Run the DSA
hard disk drive test to determine whether the drive is detected.
v If the green activity LED is flashing and the amber status LED is flashing
slowly, the drive is recognized by the controller and is rebuilding.
v If neither LED is lit or flashing, check the hard disk drive backplane (go to
step 4).
v If the green activity LED is flashing and the amber status LED is lit, replace
the drive. If the activity of the LEDs remains the same, go to step 4. If the
activity of the LEDs changes, return to step 1.
4. Make sure that the hard disk drive backplane is correctly seated. When it is
correctly seated, the drive assemblies correctly connect to the backplane
without bowing or causing movement of the backplane.
5. Move the hard disk drives to different bays to determine if the drive or the
backplane is not functioning.
6. Reseat the backplane power cable and repeat steps 1 through 3.
7. Reseat the backplane signal cable and repeat steps 1 through 3.
8. Suspect the backplane signal cable or the backplane:
v If the server has eight hot-swap bays:
a. Replace the affected backplane signal cable.
b. Replace the affected backplane.
v If the server has 12 hot-swap bays:
a. Replace the backplane signal cable.
b. Replace the backplane.
c. Replace the SAS expander card.
9. See “Problem determination tips” on page 155.
Multiple hard disk drives fail.Make sure that the hard disk drive, SAS RAID controller, and server device drivers
and firmware are at the latest level.
Important: Some cluster solutions require specific code levels or coordinated
code updates. If the device is part of a cluster solution, verify that the latest level of
code is supported for the cluster solution before you update the code.
Multiple hard disk drives are
offline.
1. Review the storage subsystem logs for indications of problems within the
storage subsystem, such as backplane or cable problems.
2. See “Problem determination tips” on page 155.
Chapter 3. Diagnostics45
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
SymptomAction
A replacement hard disk drive
does not rebuild.
A green hard disk drive activity
LED does not accurately
represent the actual state of the
associated drive.
An amber hard disk drive status
LED does not accurately
represent the actual state of the
associated drive.
1. Make sure that the hard disk drive is recognized by the controller (the green
hard disk drive activity LED is flashing).
2. Review the SAS RAID controller documentation to determine the correct
configuration parameters and settings.
1. If the green hard disk drive activity LED does not flash when the drive is in use,
run the DSA Preboot diagnostic programs to collect error logs (see “Running
the diagnostic programs” on page 72.
2. Use one of the following procedures:
v If there is a hard disk error log, replace the affect hard disk drive.
v If there is a no hard disk error log, replace the affect backplane.
1. If the amber hard disk drive LED and the RAID controller software do not
indicate the same status for the drive, complete the following steps:
a. Turn off the server.
b. Reseat the SAS controller.
c. Reseat the backplane signal cable, backplane power cable, and SAS
expander card (if the server has 12 drive bays).
d. Reseat the hard disk drive.
e. Turn on the server and observe the activity of the hard disk drive LEDs.
2. See “Problem determination tips” on page 155.
Hypervisor problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
SymptomAction
If an optional hypervisor device
is not listed in the expected
boot order, doesn't appear in
the list of boot devices at all, or
a similar problem has occurred.
46IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
1. Make sure that the optional hypervisor device is selected on the boot menu (in
the Setup utility and in F12).
2. Make sure that the hypervisor internal flash memory device is seated in the
connector correctly (see “Removing a USB hypervisor memory key” on page
186 and “Installing a USB hypervisor memory key” on page 187).
3. See the documentation that comes with your optional hypervisor device for
setup and configuration information.
4. Make sure that other software works on the server.
Intermittent problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
SymptomAction
A problem occurs only
occasionally and is difficult to
diagnose.
The server resets (restarts)
occasionally.
1. Make sure that:
v All cables and cords are connected securely to the rear of the server and
attached devices.
v When the server is turned on, air is flowing from the fan grille. If there is no
airflow, the fans are not working. This can cause the server to overheat and
shut down.
2. Check the system event log or IMM event log (see “Event logs” on page 26).
3. Make sure that the server and IMM firmware has been updated to the most
recent code levels.
4. Review the operating system logs.
5. Contact your operating-system vendor to set up any available tools that are
capable of monitoring the server.
6. If an error occurs, run the DSA program and forward the results to IBM service
and support for analysis.
7. See “Solving undetermined problems” on page 154.
1. If the reset occurs during POST and the POST watchdog timer is enabled (click
Advanced Setup --> Integrated Management Module (IMM) Setting --> IMM
Post Watchdog in the Setup utility to see the POST watchdog setting), makesure that sufficient time is allowed in the watchdog timeout value (IMM POST
Watchdog Timeout). See the Installation and User’s Guide for information
about the settings in the Setup utility.
If the server continues to reset during POST, see “POST” on page 26 and
“Diagnostic programs, messages, and error codes” on page 72.
2. If the reset occurs after the operating system starts, disable any automatic
server restart (ASR) utilities, such as the IBM Automatic Server Restart IPMI
Application for Windows, or any ASR devices that are installed.
Note: ASR utilities operate as operating-system utilities and are related to the
IPMI device driver.
If the reset continues to occur after the operating system starts, the operating
system might have a problem; see “Software problems” on page 61.
3. If neither condition applies, check the system-event log (see “Event logs” on
page 26).
Chapter 3. Diagnostics47
USB keyboard, mouse, or pointing-device problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
SymptomAction
All or some keys on the
keyboard do not work.
The USB mouse or USB
pointing device does not work.
1. If you have installed a USB keyboard, run the Setup utility and enable
keyboardless operation to prevent the POST error message 301 from being
displayed during startup.
2. See http://www.ibm.com/servers/eserver/serverproven/compat/us/ for keyboard
compatibility.
3. Make sure that:
v The keyboard cable is securely connected.
v The server and the monitor are turned on.
4. Move the keyboard cable to a different USB connector.
5. Replace the following components one at a time, in the order shown, restarting
the server each time:
a. Keyboard
b. (Only if the problem occurred with a front USB connector) Internal USB
cable
c. (Trained service technician only) System board
1. Make sure that:
v The mouse is compatible with the server. See http://www.ibm.com/servers/
eserver/serverproven/compat/us/.
v The mouse or pointing-device USB cable is securely connected to the
server, and the device drivers are installed correctly.
v The server and the monitor are turned on.
2. If a USB hub is in use, disconnect the USB device from the hub and connect it
directly to the server.
3. Move the mouse or pointing device cable to another USB connector.
4. Replace the following components one at a time, in the order shown, restarting
the server each time:
a. Mouse or pointing device
b. (Only if the problem occurred with a front USB connector) Internal USB
cable
c. (Trained service technician only) System board
48IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Memory problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v For additional memory troubleshooting information, refer to the "Troubleshooting Memory - IBM
BladeCenter and System x" document at http://www-947.ibm.com/support/entry/portal/
docdisplay?brand=5000020&lndocid=MIGR-5081319.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
SymptomAction
The amount of system memory
that is displayed is less than the
amount of installed physical
memory.
Note: Each time you install or remove a DIMM, you must disconnect the server
from the power source; then, wait 10 seconds before restarting the server.
1. Make sure that:
v No error LEDs are lit on the operator information panel.
v Memory mirroring does not account for the discrepancy.
v The memory modules are seated correctly.
v You have installed the correct type of memory (see “Installing a memory
module” on page 217).
v If you changed the memory, you updated the memory configuration in the
Setup utility.
v All banks of memory are enabled. The server might have automatically
disabled a memory bank when it detected a problem, or a memory bank
might have been manually disabled.
2. Check the POST event log for error message 289:
v If a DIMM was disabled by a systems-management interrupt (SMI), replace
the DIMM.
v If a DIMM was disabled by the user or by POST, run the Setup utility and
enable the DIMM.
3. Run memory diagnostics (see “Running the diagnostic programs” on page 72).
4. Make sure that there is no memory mismatch when the server is at the
minimum memory configuration (one 1 GB DIMM in slot 3).
5. Add one pair of DIMMs at a time, making sure that the DIMMs in each pair
match. Install the DIMMs in the sequence that is described in “Installing a
memory module” on page 217.
6. Reseat the DIMMs, and then restart the server.
7. Reverse the DIMMs between the channels (of the same microprocessor), and
then restart the server. If the problem is related to a DIMM, replace the failing
DIMM.
8. (Trained service technician only) Install the failing DIMM into a DIMM connector
for microprocessor 2 (if installed) to verify that the problem is not the
microprocessor or the DIMM connector.
9. (Trained service technician only) Replace the system board.
Chapter 3. Diagnostics49
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v For additional memory troubleshooting information, refer to the "Troubleshooting Memory - IBM
BladeCenter and System x" document at http://www-947.ibm.com/support/entry/portal/
docdisplay?brand=5000020&lndocid=MIGR-5081319.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
SymptomAction
Multiple rows of DIMMs in a
branch are identified as failing.
Multiple rows of DIMMs in a
branch are identified as failing.
Note: The highest-numbered
DIMM failed disabling other
DIMM(s) in the same channel.
Note: Each time you install or remove a DIMM, you must disconnect the server
from the power source; then, wait 10 seconds before restarting the server.
1. Reseat the DIMMs; then, restart the server.
2. Remove the lowest-numbered DIMM pair of those that are identified and
replace it with an identical pair of known good DIMMs; then, restart the server.
Repeat as necessary. If the failures continue after all identified pairs are
replaced, go to step 4.
3. Return the removed DIMMs, one pair at a time, to their original connectors,
restarting the server after each pair, until a pair fails. Replace each DIMM in the
failed pair with an identical known good DIMM, restarting the server after each
DIMM. Replace the failed DIMM. Repeat step 3 until you have tested all
removed DIMMs.
4. Replace the lowest-numbered DIMM pair of those identified; then, restart the
server. Repeat as necessary.
5. Reverse the DIMMs between the channels (of the same microprocessor), and
then restart the server. If the problem is related to a DIMM, replace the failing
DIMM.
6. (Trained service technician only) Install the failing DIMM into a DIMM connector
for microprocessor 2 (if installed) to verify that the problem is not the
microprocessor or the DIMM connector.
7. (Trained service technician only) Replace the system board.
Note: Each time you install or remove a DIMM, you must disconnect the server
from the power source; then, wait 10 seconds before restarting the server.
1. Reseat the DIMMs; then, restart the server.
2. Remove the DIMM with lit error LED and replace it with an identical known
good DIMM; then, restart the server. Repeat as necessary. If the failures
continue after all identified DIMMs are replaced, go to step 4.
3. Return the removed DIMMs, one at a time, to their original connectors,
restarting the server after each DIMM, until a DIMM fails. Replace each failing
DIMM with an identical known good DIMM, restarting the server after each
DIMM replacement. Repeat step 3 until you have tested all removed DIMMs.
4. Replace the DIMM with lit error LED; then, restart the server. Repeat as
necessary.
5. Reverse the DIMMs between the channels (of the same microprocessor), and
then restart the server. If the problem is related to a DIMM, replace the failing
DIMM.
6. (Trained service technician only) Install the failing DIMM into a DIMM connector
for microprocessor 2 (if installed) to verify that the problem is not the
microprocessor or the DIMM connector.
7. (Trained service technician only) Replace the system board.
50IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Microprocessor problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
SymptomAction
The server goes directly to the
POST Event Viewer when
turned on.
1. Correct any errors that are indicated by the LEDs (see “Light path diagnostics
LEDs” on page 65).
2. Make sure that the server supports all the microprocessors and that the
microprocessors match in speed and cache size. To compare the
microprocessor information, run the Setup utility and select SystemInformation, then select System Summary , and then Processor Details.
3. (Trained service technician only) Reseat the microprocessors.
4. (Trained service technician only) Remove microprocessor 2 and restart the
server.
5. (Trained service technician only) Replace the following components, in the
order shown, restarting the server each time:
v Microprocessors
v System board
Monitor or video problems
Some IBM monitors have their own self-tests. If you suspect a problem with your
monitor, see the documentation that comes with the monitor for instructions for
testing and adjusting the monitor. If you cannot diagnose the problem, call for
service.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
SymptomAction
Testing the monitor.
1. Make sure that the monitor cables are firmly connected.
2. Try using the other video port.
3. Try using a different monitor on the server, or try testing the monitor on a
different server.
4. Run the diagnostic programs (see “Running the diagnostic programs” on page
72). If the monitor passes the diagnostic programs, the problem might be a
video device driver.
5. (Trained service technician only) Replace the system board
Chapter 3. Diagnostics51
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
SymptomAction
The screen is blank.
The monitor works when you
turn on the server, but the
screen goes blank when you
start some application
programs.
1. If the server is attached to a KVM switch, bypass the KVM switch to eliminate it
as a possible cause of the problem: connect the monitor cable directly to the
correct connector on the rear of the server.
2. The IMM remote presence function is disabled if you install an optional video
adapter. To use the IMM remote presence function, remove the optional video
adapter.
3. Make sure that:
v The server is turned on. If there is no power to the server, see “Power
problems” on page 55.
v The monitor cables are connected correctly.
v The monitor is turned on and the brightness and contrast controls are
adjusted correctly.
4. Make sure that the correct server is controlling the monitor, if applicable.
5. Make sure that damaged server firmware is not affecting the video; see
“Recovering the server firmware” on page 109 for information about recovering
from server firmware failure.
6. Observe the checkpoint LEDs on the light path diagnostics panel; if the codes
are changing, go to the next step.
7. Replace the following components one at a time, in the order shown, restarting
the server each time:
a. Monitor
b. Video adapter (if one is installed)
c. (Trained service technician only) System board
8. See “Solving undetermined problems” on page 154 for information about
solving undetermined problems.
1. Make sure that:
v The application program is not setting a display mode that is higher than the
capability of the monitor.
v You installed the necessary device drivers for the application.
2. Run video diagnostics (see “Running the diagnostic programs” on page 72).
v If the server passes the video diagnostics, the video is good; see “Solving
undetermined problems” on page 154 for information about solving
undetermined problems.
v If the server fails the video diagnostics, (trained service technician only)
replace the system board.
52IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
SymptomAction
The monitor has screen jitter, or
the screen image is wavy,
unreadable, rolling, or distorted.
1. If the monitor self-tests show that the monitor is working correctly, consider the
location of the monitor. Magnetic fields around other devices (such as
transformers, appliances, fluorescent lights, and other monitors) can cause
screen jitter or wavy, unreadable, rolling, or distorted screen images. If this
happens, turn off the monitor.
Attention: Moving a color monitor while it is turned on might cause screen
discoloration.
Move the device and the monitor at least 305 mm (12 in.) apart, and turn on
the monitor.
Notes:
a. To prevent diskette drive read/write errors, make sure that the distance
between the monitor and any external diskette drive is at least 76 mm (3
in.).
b. Non-IBM monitor cables might cause unpredictable problems.
2. Reseat the monitor cable
3. Replace the following components one at a time, in the order shown, restarting
the server each time:
a. Monitor cable
b. Video adapter (if one is installed)
c. Monitor
d. (Trained service technician only) System board
Wrong characters appear on the
screen.
1. If the wrong language is displayed, update the server firmware with the correct
language.
2. Reseat the monitor cable.
3. Replace the following components one at a time, in the order shown, restarting
the server each time:
a. Monitor
b. (Trained service technician only) System board
Chapter 3. Diagnostics53
Optional-device problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
SymptomAction
An IBM optional device that was
just installed does not work.
An IBM optional device that
used to work does not work
now.
1. Make sure that:
v The device is designed for the server (see http://www.ibm.com/servers/
eserver/serverproven/compat/us/).
v You followed the installation instructions that came with the device and the
device is installed correctly.
v You have not loosened any other installed devices or cables.
v You updated the configuration information in the Setup utility. Whenever
memory or any other device is changed, you must update the configuration.
2. Reseat the device that you just installed.
3. Replace the device that you just installed.
1. Make sure that all of the hardware and cable connections for the device are
secure.
2. If the device comes with test instructions, use those instructions to test the
device.
3. Reseat the failing device.
4. Follow the instructions for device maintenance, such as keeping the heads
clean, and troubleshooting in the documentation that comes with the device.
5. Replace the failing device.
54IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Power problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
SymptomAction
The power-control button does
not work, and the reset button
does not work (the server does
not start).
Note: The power-control button
will not function until
approximately 3 minutes after
the server has been connected
to power.
1. Make sure Make sure that both power supplies installed in the server are of the
same type. Mixing different power supplies in the server will cause a system
error (the system-error LED on the front panel turns on and the PS and CNFG
LEDs on the operator information panel are lit).
2. Make sure that:
v The power cords are correctly connected to the server and to a working
electrical outlet.
v The type of memory that is installed is correct.
v The LEDs on the power supply do not indicate a problem (see
“Power-supply LEDs” on page 69).
v The microprocessors are installed in the correct sequence.
3. Make sure that the power-control button and the reset button are working
correctly:
a. Disconnect the server power cords.
b. Reseat the operator information panel assembly cable.
c. Reconnect the power cords.
d. Press the power-control button to restart the server. If the button does not
work, replace the operator information panel assembly.
e. Press the reset button (on the light path diagnostics panel) to restart the
server. If the button does not work, replace the operator information panel
assembly.
4. Replace the following components one at a time, in the order shown, restarting
the server each time:
a. Hot-swap power supplies
b. (Trained service technician only) System board
5. See “Solving power problems” on page 153.
6. See “Solving undetermined problems” on page 154.
Chapter 3. Diagnostics55
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
SymptomAction
The OVER SPEC LED on the
light path diagnostics panel is
lit, and the 12v channel A LED
on the system board is lit.
The OVER SPEC LED on the
light path diagnostics panel is
lit, and the 12v channel B LED
on the system board is lit.
1. Disconnect the server power cords.
2. Remove the following components:
v Optical drive
v Fans
v Hard disk drives
v Hard disk drive backplanes
3. Restart the server. If the OVER SPEC and 12v channel LEDs are still lit,
(trained service technician only) replace the system board.
4. Reinstall the components listed in step 2 one at a time, in the order shown,
restarting the server each time. If the 12v channel A LED is lit, the component
that you just reinstalled is defective. Replace the defective component.
v Fans
v Hard disk drive backplanes
v Hard disk drives
v Optical drive
5. (Trained service technician only) Replace the system board.
1. Disconnect the server power cords.
2. Remove the following components:
v PCI riser-card assembly in PCI connector 1 on the system board
v All DIMMs
v (Trained service technician only) Microprocessor 2
3. Restart the server. If the OVER SPEC and 12v channel LEDs are still lit,
(trained service technician only) replace the system board.
4. Reinstall the components listed in step 2 one at a time, in the order shown,
restarting the server each time. If the 12v channel B LED is lit, the component
that you just reinstalled is defective. Replace the defective component.
v (Trained service technician only) Microprocessor 2
v All DIMMs
v PCI riser-card assembly in PCI connector 1
5. (Trained service technician only) Replace the system board.
56IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
SymptomAction
The OVER SPEC LED on the
light path diagnostics panel is
lit, and the 12v channel C LED
on the system board is lit.
1. Disconnect the server power cords.
2. Remove the following components:
v Tape drive, if one is installed (see “Removing a tape drive” on page 214for
more information)
v SAS riser-card assembly
v DIMMs 1 through 9
v (Trained service technician only) Microprocessor 1
3. Move switch 2 on switch block 4 (SW4) to the On position to force power on;
then, restart the server. If the OVER SPEC and 12v channel LEDs are still lit,
(trained service technician only) replace the system board.
4. Reinstall the components listed in step 2 one at a time, in the order shown,
restarting the server each time. If the 12v channel C LED is lit, the component
that you just reinstalled is defective. Replace the defective component.
v (Trained service technician only) Microprocessor 1
v DIMMs 1 through 9
v SAS riser-card assembly
v Tape drive, if one is installed (see “Installing a tape drive” on page 216 for
more information)
The OVER SPEC LED on the
light path diagnostics panel is
lit, and the 12v channel D LED
on the system board is lit.
1. Disconnect the server power cords.
2. (Trained service technician only) Remove microprocessor 1.
3. Move switch 2 on switch block 4 (SW4) to the On position to force power on;
then, restart the server. If the OVER SPEC and 12v channel LEDs are still lit,
(trained service technician only) replace the system board.
4. (Trained service technician only) Move switch on switch block back to the Off
position; then, reinstall microprocessor 1.
5. Restart the server. If the 12v channel D LED is lit, (trained service technician
only) replace microprocessor 1.
Chapter 3. Diagnostics57
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
SymptomAction
The OVER SPEC LED on the
light path diagnostics panel is
lit, and the 12v channel E LED
on the system board is lit.
The OVER SPEC LED on the
light path diagnostics panel is
lit, and the 240 V AUX failure
LED on the system board is lit.
The server does not turn off.
1. Disconnect the server power cords.
2. Remove the following components:
v Optional PCI video graphics adapter power cable, if one is installed
(connector J154 on the system board)
v Optional PCI video graphics adapter, if one is installed
v PCI riser-card assembly in PCI connector 2 on the system board
v (Trained service technician only) Microprocessor 2
3. Restart the server. If the OVER SPEC and 12v channel LEDs are still lit,
(trained service technician only) replace the system board.
4. Reinstall the components listed in step 2 one at a time, in the order shown,
restarting the server each time. If the 12v channel E LED is lit, the component
that you just reinstalled is defective. Replace the defective component.
a. (Trained service technician only) Microprocessor 2
b. PCI riser-card assembly in PCI connector 2 on the system board
c. Optional PCI video graphics adapter, if one was installed
d. Power cable from optional PCI video graphics adapter to connector J154 on
the system board, if you removed one in step 2.
1. Disconnect the server power cords.
2. Remove the following components:
v All PCI adapters and PCI riser-card assemblies
v SAS riser-card assembly
v Operator information panel assembly
v Optional two-port Ethernet adapter, if installed
3.Move switch 2 on switch block 4 (SW4) to the On position to force power on;
then, restart the server. If the OVER SPEC and the 240 V ac AUX failure LEDs
are still lit, (trained service technician only) replace the system board.
4. Reinstall the components listed in step 2, one at a time, in the order shown,
restarting the server each time. If the 240 V ac AUX failure LED is lit, the
component that you just reinstalled is defective. Replace the defective
component.
v Operator information panel assembly
v SAS riser-card assembly
v Optional two-port Ethernet adapter, if installed
v All PCI adapters and PCI riser-card assemblies
1. Turn off the server by pressing the power-control button for 5 seconds.
2. Restart the server.
3. If the server fails POST and the power-control button does not work, disconnect
the power cord for 20 seconds; then, reconnect the power cord and restart the
server.
4. If the problem remains, suspect the system board.
58IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
SymptomAction
The server unexpectedly shuts
down, and the LEDs on the
operator information panel are
not lit.
See “Solving undetermined problems” on page 154.
Serial device problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
SymptomAction
The number of serial ports that
are identified by the operating
system is less than the number
of installed serial ports.
A serial device does not work.
1. Make sure that:
v Each port is assigned a unique address in the Setup utility and none of the
serial ports is disabled.
v The serial-port adapter (if one is present) is seated correctly.
2. Reseat the serial port adapter, if one is present.
3. Replace the serial port adapter, if one is present.
1. Make sure that:
v The device is compatible with the server.
v The serial port is enabled and is assigned a unique address.
v The device is connected to the correct connector (see “Rear view” on page
12).
2. Reseat the following components:
a. Failing serial device
b. Serial cable
3. Replace the following components one at a time, in the order shown, restarting
the server each time:
a. Failing serial device
b. Serial cable
c. (Trained service technician only) System board
Chapter 3. Diagnostics59
ServerGuide problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
SymptomAction
The ServerGuide Setup and
Installation CD will not start.
The ServeRAID program cannot
view all installed drives, or the
operating system cannot be
installed.
The operating-system
installation program
continuously loops.
The ServerGuide program will
not start the operating-system
CD.
The operating system cannot be
installed; the option is not
available.
1. Make sure that the server supports the ServerGuide program and has a
startable (bootable) CD or DVD drive.
2. If the startup (boot) sequence settings have been changed, make sure that the
CD or DVD drive is first in the startup sequence.
3. If more than one CD or DVD drive is installed, make sure that only one drive is
set as the primary drive. Start the CD from the primary drive.
1. Make sure that there are no duplicate IRQ assignments.
2. Make sure that the hard disk drive is connected correctly.
3. Make sure that the hard disk drive cables are securely connected (see “Internal
cable routing and connectors” on page 172).
Make more space available on the hard disk.
Make sure that the operating-system CD is supported by the ServerGuide program.
For a list of supported operating-system versions, go to http://www.ibm.com/
systems/management/serverguide/sub.html, click IBM Service and Support Site,
click the link for your ServerGuide version, and scroll down to the list of supported
Microsoft Windows operating systems.
Make sure that the server supports the operating system. If it does, no logical drive
is defined (RAID servers). Run the ServerGuide program and make sure that setup
is complete.
60IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Software problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
SymptomAction
You suspect a software
problem.
1. To determine whether the problem is caused by the software, make sure that:
v The server has the minimum memory that is needed to use the software. For
memory requirements, see the information that comes with the software. If
you have just installed an adapter or memory, the server might have a
memory-address conflict.
v The software is designed to operate on the server.
v Other software works on the server.
v The software works on another server.
2. If you received any error messages when using the software, see the
information that comes with the software for a description of the messages and
suggested solutions to the problem.
3. Contact the software vendor.
Universal Serial Bus (USB) port problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
SymptomAction
A USB device does not work.
1. Make sure that:
v The correct USB device driver is installed.
v The operating system supports USB devices.
2. Make sure that the USB configuration options are set correctly in the Setup
utility menu (see the Installation and User’s Guide for more information).
3. If you are using a USB hub, disconnect the USB device from the hub and
connect it directly to the server.
4. Move the device cable to a different USB connector.
Video problems
See “Monitor or video problems” on page 51.
Chapter 3. Diagnostics61
Light path diagnostics
Light path diagnostics is a system of LEDs on various external and internal
components of the server. When an error occurs, LEDs are lit throughout the
server. By viewing the LEDs in a particular order, you can often identify the source
of the error.
When LEDs are lit to indicate an error, they remain lit when the server is turned off,
provided that the server is still connected to power and the power supply is
operating correctly.
Before you work inside the server to view light path diagnostics LEDs, read the
safety information that begins on page vii and “Handling static-sensitive devices” on
page 171.
If an error occurs, view the light path diagnostics LEDs in the following order:
1. Look at the operator information panel on the front of the server.
v If the information LED is lit, it indicates that information about a suboptimal
condition in the server is available in the IMM event log or in the system
event log.
v If the system-error LED is lit, it indicates that an error has occurred; go to
step 2.
The following illustration shows the operator information panel.
Operator information
panel
Light path
diagnostics LEDs
Release latch
2. To view the light path diagnostics panel, slide the latch to the left on the front of
the operator information panel and pull the panel forward. This reveals the light
path diagnostics panel. Lit LEDs on this panel indicate the type of error that has
occurred.
Operator information
panel
Light path
diagnostics LEDs
Release latch
The following illustration shows the light path diagnostics panel.
62IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
A checkpoint code is either a byte or a word value produced by server firmware
and sent to the I/O port indicating the point at which the system stopped during
the boot block and power-on self test (POST). It does not provide error codes or
suggest replacement components. These codes can be used by IBM service
and support for more in-depth troubleshooting.
Checkpoint
code display
Note any LEDs that are lit, and then push the light path diagnostics panel back
into the server.
Notes:
a. Do not run the server for an extended period of time while the light path
diagnostics panel is pulled out of the server.
b. Light path diagnostics LEDs remain lit only while the server is connected to
power.
Look at the system service label on the top of the server, which gives an
overview of internal components that correspond to the LEDs on the light path
diagnostics panel. This information and the information in “Light path diagnostics
LEDs” on page 65 can often provide enough information to diagnose the error.
3. Remove the server cover and look inside the server for lit LEDs. A lit LED on or
beside a component identifies the component that is causing the error.
The following illustration shows the LEDs on the system board.
Chapter 3. Diagnostics63
Remind button
12v channel error LEDs indicate an overcurrent condition. Table 11 on page 153
identifies the components that are associated with each power channel, and the
order in which to troubleshoot the components.
The following illustration shows the LEDs on the riser card.
You can use the remind button on the light path diagnostics panel to put the
system-error LED on the operator information panel into Remind mode. When you
press the remind button, you acknowledge the error but indicate that you will not
take immediate action. The system-error LED flashes while it is in Remind mode
and stays in Remind mode until one of the following conditions occurs:
v All known errors are corrected.
64IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
v The server is restarted.
v A new error occurs, causing the system-error LED to be lit again.
Light path diagnostics LEDs
The following table describes the LEDs on the light path diagnostics panel and
suggested actions to correct the detected problems.
Note: Check the system-event log and the IMM event log for additional information
before you replace a FRU.
LEDProblemAction
None, but
the
systemerror LED
is lit.
BRDAn error has occurred on the system
CNFGA hardware configuration error has
An error has occurred and cannot be
diagnosed, or the IMM has failed. The
error is not represented by a light path
diagnostics LED.
board.
occurred. (This LED is used with the
MEM and CPU LEDs.)
Use the Setup utility to check the system-event log for
information about the error.
1. Check the LEDs on the system board to identify the
2. Check the system-event log for information about the error.
3. Replace any failed or missing replaceable components,
4. If a voltage regulator has failed, replace the system board.
1. If the CNFG LED and the CPU LED are lit, complete the
2. If the CNFG LED and the MEM LED are lit, complete the
component that is causing the error. The BRD LED can be
lit for the following conditions:
v Battery
v Missing PCI riser-card assembly
v Failed voltage regulator
such as the battery (see “Removing the battery” on page
229 for more information) or PCI riser-card assembly (see
“Removing a PCI riser-card assembly” on page 188 for
more information).
following steps:
a. Check the microprocessors that were just installed to
make sure that they are compatible with each other
(see “Installing a microprocessor and heat sink” on
page 240 for additional information about
microprocessor requirements).
b. (Trained service technician only) Replace the
incompatible microprocessor.
c. Check the system-error logs for information about the
error. Replace any components that are identified in the
error log.
following steps:
a. Check the system-event log in the Setup utility or IMM
error messages. Follow steps indicated in “POST error
codes” on page 28 and “Integrated management
module error messages” on page 114.
Chapter 3. Diagnostics65
LEDProblemAction
CPUWhen only the CPU LED is lit, a
microprocessor has failed.
When the CPU and CNFG LEDs are lit,
an invalid microprocessor configuration
has occurred.
DASDA hard disk drive error has occurred. A
hard disk drive has failed or is missing.
1. Determine whether the CNFG LED is also lit. If the CNFG
LED is not lit, a microprocessor has failed.
a. Make sure that the failing microprocessor, which is
b. If the failure remains, go to http://www.ibm.com/
2. If the CNFG LED is lit, then an invalid microprocessor
configuration has occurred.
a. Make sure that the microprocessors are compatible with
b. (Trained service technician only) Replace an
c. If the failure remains, go to http://www.ibm.com/
1. Check the LEDs on the hard disk drives for the drive with a
lit status LED and reseat the hard disk drive.
2. Reseat the hard disk drive backplane.
3. For more information, see “Hard disk drive problems” on
page 44.
4. If the error remains, replace the following components in
the order listed, restarting the server after each:
a. Replace the hard disk drive (see “Removing a hot-swap
b. Replace the hard disk drive backplane (see “Removing
5. If the problem remains, go to http://www.ibm.com/support/
docview.wss?uid=psg1SERVCALL.
indicated by a lit LED on the system board, is installed
correctly. See “Installing a microprocessor and heat
sink” on page 240 for information about installing a
microprocessor.
systems/support/supportsite.wss/
docdisplay?brandind=5000008&lndocid=SERV-CALL for
additional troubleshooting information.
each other. They must match in speed and cache size.
To compare the microprocessor information, run the
Setup utility and select System Information, then
select System Summary, and then select ProcessorDetails.
incompatible microprocessor.
systems/support/supportsite.wss/
docdisplay?brandind=5000008&lndocid=SERV-CALL for
additional troubleshooting information.
hard disk drive” on page 210 for more information).
the SAS hard disk drive backplane” on page 235 for
more information).
66IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
LEDProblemAction
FANA fan has failed, is operating too slowly,
or has been removed. The TEMP LED
might also be lit.
1. Reseat the failing fan, which is indicated by a lit LED near
the fan connector on the system board..
2. Replace the failing fan, which is indicated by a lit LED near
the fan connector on the system board (see “Removing a
hot-swap fan” on page 223 for more information).
Notes:
1. If an LED that is next to an unused fan connector on the
system board is lit, a PCI riser-card assembly might be
missing; replace the PCI riser-card assembly. One PCI
riser-card assembly must always be present in PCI riser
connector 2.
2. When the BRD LED is lit and all cooling zones all asserted
at the same time by removing the cover and check if PCI
riser-card 2 LED is on.
LINKReserved.
LOGAn error message has been written to
the system-event log
Check the IMM system event log and the system-error log for
information about the error. Replace any components that are
identified in the error logs. (See “Event logs” on page 26 for
more information.)
MEMWhen only the MEM LED is lit, a
memory error has occurred.
When both the MEM and CNFG LEDs
are lit, the memory configuration is
invalid or the PCI Option ROM is out of
resource.
Note: Each time you install or remove a DIMM, you must
disconnect the server from the power source; then, wait 10
seconds before restarting the server.
1. If the MEM LED and the CNFG LED are lit, complete the
following steps:
a. Check the system-event log in the Setup utility or IMM
2. If the CNFG LED is not lit, the system might detect a
memory error. Complete the following steps to correct the
problem:
a. Update the server firmware to the latest level (see
b. Reseat the DIMM.
c. Check the system-event log in the Setup utility or IMM
NMIA nonmaskable interrupt has occurred,
Check the system-event log for information about the error.
or the NMI button has been pressed.
OVER
SPEC
The server was shut down because of a
power-supply overload condition on one
of the power channels. The power
supplies are using more power than
their maximum rating.
1. If any of the power channel error LEDs (A, B, C, D, E, or
AUX) on the system board are lit also, see the section
about power-channel error LEDs in “Power problems” on
page 55. (See “Internal connectors, LEDs, and jumpers” on
page 15 for the location of the power channel error LEDs.)
2. Check the power-supply LEDs for an error indication (AC
LED and DC LED are not both lit, or the information LED is
lit). Replace a failing power supply.
3. Remove optional devices from the server.
error messages. Follow steps indicated in “POST error
codes” on page 28 and “Integrated management
module error messages” on page 114.
“Updating the firmware” on page 253).
error messages. Follow steps indicated in “POST error
codes” on page 28 and “Integrated management
module error messages” on page 114.
Chapter 3. Diagnostics67
LEDProblemAction
PCIAn error has occurred on a PCI bus or
on the system board. An additional LED
is lit next to a failing PCI slot.
PSA power supply has failed. Power supply
1 or 2 has failed. When both the PS and
CNFG LEDs are lit, the power supply
configuration is invalid.
RAIDReserved
SPThe service processor (the IMM) has
failed.
TEMPThe system temperature has exceeded
a threshold level. A failing fan can cause
the TEMP LED to be lit.
VRMReserved.
1. Check the LEDs on the PCI slots to identify the component
that is causing the error.
2. Check the system-event log for information about the error.
3. If you cannot isolate the failing adapter through the LEDs
and the information in the system-event log, remove one
adapter at a time from the failing PCI bus, and restart the
server after each adapter is removed.
4. If the failure remains, go to http://www.ibm.com/systems/
support/supportsite.wss/docdisplay?brandind=5000008
&lndocid=SERV-CALL for additional troubleshooting
information.
1. Check the power-supply that has an lit amber LED (see
“Power-supply LEDs” on page 69).
2. Make sure that the power supplies are seated correctly.
3. Remove one of the power supplies to isolate the failed
power supply.
4. Make sure that both power supplies installed in the server
are of the same type.
5. Replace the failed power supply.
1. Remove power from the server; then, reconnect the server
to power and restart the server.
2. Update the firmware on the IMM.
3.If the failure remains, go to http://www.ibm.com/systems/
support/supportsite.wss/docdisplay?brandind=5000008
&lndocid=SERV-CALL for additional troubleshooting
information.
1. Check the error log to identify where the over-temperature
condition was measured. If a fan has failed, replace it.
2. Make sure that the room temperature is not too high. See
“Features and specifications” on page 7 for temperature
information.
3. Make sure that the air vents are not blocked.
4. If the failure remains, go to http://www.ibm.com/systems/
support/supportsite.wss/docdisplay?brandind=5000008
&lndocid=SERV-CALL for additional troubleshooting
information.
68IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Power-supply LEDs
The following minimum configuration is required for the DC LED on the power
supply to be lit:
v Power supply
v Power cord
v
The following minimum configuration is required for the server to start:
v One microprocessor (slot 1)
v One 2 GB DIMM per microprocessor on the system board (slot 3 if only one
microprocessor is installed)
v One power supply
v Power cord
v Three cooling fans
v One PCI riser-card assembly in PCI riser connector 2
v ServeRAID SAS controller
The following illustration shows the locations of the power-supply LEDs.
The following table describes the problems that are indicated by various
combinations of the power-supply LEDs and the power-on LED on the operator
information panel and suggested actions to correct the detected problems.
Chapter 3. Diagnostics69
Table 8. Power-supply LEDs
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
AC power-supply LEDs
DescriptionActionNotesACDCError
OffOffOffNo ac power to
the server or a
problem with the
ac power source
OffOffOnNo ac power to
the server or a
problem with the
ac power source
and the power
supply had
detected an
internal problem
OffOnOffFaulty power
supply
OffOnOnFaulty power
supply
OnOffOffPower-supply not
fully seated,
faulty system
board, or faulty
power-supply
OnOff or
Flashing
OnOnOffNormal operation
OnOnOnPower supply is
OnFaulty power
supply
faulty but still
operational
1. Check the ac power to the server.
2. Make sure that the power cord is
connected to a functioning power
source.
3. Turn the server off and then turn the
server back on.
4. If the problem remains, replace the
power supply.
1. Replace the power supply.
2. Make sure that the power cord is
connected to a functioning power
source.
Replace the power supply.
Replace the power supply.
1. (Trained service technician only)
Reseat the power supply.
2. If a power channel error LED on the
system board is not lit, replace the
power-supply (see the documentation
that comes with the power supply for
instructions).
3. If a power channel error LED on the
system board is lit, (trained service
technician only) replace the system
board.
Replace the power supply.
Replace the power supply.
This is a normal
condition when no
ac power is present.
This happens only
when a second
power supply is
providing power to
the server.
Typically indicates
that a power supply
is not fully seated.
70IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
The following table describes the problems that are indicated by various
combinations of the power-supply LEDs on a dc power supply and suggested
actions to correct the detected problems.
DC power-supply LEDs
DescriptionActionNotesIN OKOUT OKError (!)
OnOnOffNormal operation
OffOffOffNo dc power to the
server or a problem
with the dc power
source.
OffOffOnNo dc power to the
server or a problem
with the dc power
source and the
power-supply had
detected an internal
problem.
OffOnOffFaulty
power-supply
OffOnOnFaulty
power-supply
OnOffOffPower-supply not
fully seated, faulty
system board, or
faulty power-supply
OnOffOnFaulty
power-supply
OnOnOnPower-supply is
faulty but still
operational
1. Check the dc power to the
server.
2. Make sure that the power
cord is connected to a
functioning power source.
3. Restart the server. If the error
remains, check the
power-supply LEDs.
4. Replace the power-supply.
v Make sure that the power cord
is connected to a functioning
power source.
v Replace the power supply
(see the documentation that
comes with the power supply
for instructions).
Replace the power supply.
Replace the power supply.
1. (Trained service technician
only) Reseat the power
supply.
2. If a power channel error LED
on the system board is not lit,
replace the power-supply
(see the documentation that
comes with the power supply
for instructions).
3. If a power channel error LED
on the system board is lit,
(trained service technician
only) replace the system
board.
Replace the power supply.
Replace the power supply.
This is a normal
condition when no dc
power is present.
This happens only
when a second power
supply is providing
power to the server.
Typically indicates a
power-supply is not
fully seated.
Chapter 3. Diagnostics71
Diagnostic programs, messages, and error codes
The diagnostic programs are the primary method of testing the major components
of the server. As you run the diagnostic programs, text messages are displayed on
the screen and are saved in the test log. A diagnostic text message indicates that a
problem has been detected and provides the action you should take as a result of
the text message.
Make sure that the server has the latest version of the diagnostic programs. To
download the latest version, complete the following steps.
Note: Changes are made periodically to the IBM Web site. The actual procedure
might vary slightly from what is described in this document.
1. Go to http://www.ibm.com/systems/support/.
2. Under Product support, click System x.
3. Under Popular links, click Software and device drivers.
4. Click IBM System x3650 M3 to display the matrix of downloadable files for the
server.
Utilities are available to reset and update the code on the integrated USB flash
device, if the diagnostic partition becomes damaged and does not start the
diagnostic programs. For more information and to download the utilities, go to
http://www.ibm.com/jct01004c/systems/support/supportsite.wss/
docdisplay?lndocid=MIGR-5072294&brandind=5000008.
Running the diagnostic programs
To run the diagnostic programs, complete the following steps:
1. If the server is running, turn off the server and all attached devices.
2. Turn on all attached devices; then, turn on the server.
3. When the prompt Press F2 for Dynamic System Analysis (DSA) is displayed,
press F2.
Note: The DSA Preboot diagnostic program might appear to be unresponsive
for an unusual length of time when you start the program. This is normal
operation while the program loads. The loading process may take up to 10
minutes.
4. Optionally, select Quit to DSA to exit from the stand-alone memory diagnostic
program.
Note: After you exit from the stand-alone memory diagnostic environment, you
must restart the server to access the stand-alone memory diagnostic
environment again.
5. Type gui to display the graphical user interface, or select cmd to display the
DSA interactive menu.
6. Follow the instructions on the screen to select the diagnostic test to run.
If the diagnostic programs do not detect any hardware errors but the problem
remains during normal server operations, a software error might be the cause. If
you suspect a software problem, see the information that comes with your software.
A single problem might cause more than one error message. When this happens,
correct the cause of the first error message. The other error messages usually will
not occur the next time you run the diagnostic programs.
72IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Exception: If multiple error codes or light path diagnostics LEDs indicate a
microprocessor error, the error might be in a microprocessor or in a microprocessor
socket. See “Microprocessor problems” on page 51 for information about diagnosing
microprocessor problems.
If the server stops during testing and you cannot continue, restart the server and try
running the diagnostic programs again. If the problem remains, replace the
component that was being tested when the server stopped.
Diagnostic text messages
Diagnostic text messages are displayed while the tests are running. A diagnostic
text message contains one of the following results:
Passed: The test was completed without any errors.
Failed: The test detected an error.
Aborted: The test could not proceed because of the server configuration.
Additional information concerning test failures is available in the extended
diagnostic results for each test.
Viewing the test log
To view the test log when the tests are completed, type the view command in the
DSA interactive menu, or select Diagnostic Event Log in the graphical user
interface. To transfer DSA collections to an external USB device, type the copy
command in the DSA interactive menu.
Diagnostic messages
The following table describes the messages that the diagnostic programs might
generate and suggested actions to correct the detected problems. Follow the
suggested actions in the order in which they are listed in the column.
Chapter 3. Diagnostics73
Table 9. DSA Preboot messages
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
numberComponentTestStateDescriptionAction
089-801-xxxCPUCPU Stress
Test
AbortedInternal
program error.
1. Turn off and restart the system.
2. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
http://www.ibm.com/support/
docview.wss?uid=psg1SERV-DSA.
3. Run the test again.
4. Make sure that the system firmware is at the
latest level. The installed firmware level is shown
in the diagnostic event log in the Firmware/VPD
section for this component. For more
information, see “Updating the firmware” on
page 253.
5. Run the test again.
6. Turn off and restart the system if necessary to
recover from a hung state.
7. Run the test again.
8. Replace the following components one at a time,
in the order shown, and run this test again to
determine whether the problem has been solved:
a. (Trained service technician only)
Microprocessor board
b. (Trained service technician only)
Microprocessor
9. If the failure remains, go to the IBM Web site for
more troubleshooting information at
http://www.ibm.com/systems/support/
supportsite.wss/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
74IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
numberComponentTestStateDescriptionAction
089-802-xxxCPUCPU Stress
Test
AbortedSystem
resource
availability error.
1. Turn off and restart the system.
2. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
http://www.ibm.com/support/
docview.wss?uid=psg1SERV-DSA.
3. Run the test again.
4. Make sure that the system firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this component. For
more information, see “Updating the firmware”
on page 253.
5. Run the test again.
6. Turn off and restart the system if necessary to
recover from a hung state.
7. Run the test again.
8. Make sure that the system firmware is at the
latest level. The installed firmware level is
shown in the DSA event log in the
Firmware/VPD section for this component. For
more information, see “Updating the firmware”
on page 253.
9. Run the test again.
10. Replace the following components one at a
time, in the order shown, and run this test
again to determine whether the problem has
been solved:
a. (Trained service technician only)
Microprocessor board
b. (Trained service technician only)
Microprocessor
11. If the failure remains, go to the IBM Web site
for more troubleshooting information at
http://www.ibm.com/systems/support/
supportsite.wss/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
Chapter 3. Diagnostics
75
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
numberComponentTestStateDescriptionAction
089-901-xxxCPUCPU Stress
Test
FailedTest failure.
1. Turn off and restart the system if necessary to
recover from a hung state.
2. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
http://www.ibm.com/support/
docview.wss?uid=psg1SERV-DSA.
3. Run the test again.
4. Make sure that the system firmware is at the
latest level. The installed firmware level is shown
in the diagnostic event log in the Firmware/VPD
section for this component. For more
information, see “Updating the firmware” on
page 253.
5. Run the test again.
6. Turn off and restart the system if necessary to
recover from a hung state.
7. Run the test again.
8. Replace the following components one at a time,
in the order shown, and run this test again to
determine whether the problem has been solved:
a. (Trained service technician only)
Microprocessor board
b. (Trained service technician only)
Microprocessor
9. If the failure remains, go to the IBM Web site for
more troubleshooting information at
http://www.ibm.com/systems/support/
supportsite.wss/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
76IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
numberComponentTestStateDescriptionAction
166-801-xxxIMMIMM I2C Test AbortedIMM I2C test
stopped: the
IMM returned
an incorrect
response
length.
166-802-xxxIMMIMM I2C Test AbortedIMM I2C test
stopped: the
test cannot be
completed for
an unknown
reason.
1. Turn off the system and disconnect it from the
power source. You must disconnect the system
from ac power to reset the IMM.
2. After 45 seconds, reconnect the system to the
power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
http://www.ibm.com/support/
docview.wss?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in
the diagnostic event log in the Firmware/VPD
section for this component. For more
information, see “Updating the firmware” on
page 253.
6. Run the test again.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
http://www.ibm.com/systems/support/
supportsite.wss/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
1. Turn off the system and disconnect it from the
power source. You must disconnect the system
from ac power to reset the IMM.
2. After 45 seconds, reconnect the system to the
power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
http://www.ibm.com/support/
docview.wss?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in
the diagnostic event log in the Firmware/VPD
section for this component. For more
information, see “Updating the firmware” on
page 253.
6. Run the test again.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
http://www.ibm.com/systems/support/
supportsite.wss/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
Chapter 3. Diagnostics
77
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
numberComponentTestStateDescriptionAction
166-803-xxxIMMIMM I2C Test AbortedIMM I2C test
stopped: the
node is busy;
try later.
166-804-xxxIMMIMM I2C Test AbortedIMM I2C test
stopped: invalid
command.
1. Turn off the system and disconnect it from the
power source. You must disconnect the system
from ac power to reset the IMM.
2. After 45 seconds, reconnect the system to the
power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
http://www.ibm.com/support/
docview.wss?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in
the diagnostic event log in the Firmware/VPD
section for this component. For more
information, see “Updating the firmware” on
page 253.
6. Run the test again.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
http://www.ibm.com/systems/support/
supportsite.wss/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
1. Turn off the system and disconnect it from the
power source. You must disconnect the system
from ac power to reset the IMM.
2. After 45 seconds, reconnect the system to the
power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
http://www.ibm.com/support/
docview.wss?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in
the diagnostic event log in the Firmware/VPD
section for this component. For more
information, see “Updating the firmware” on
page 253.
6. Run the test again.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
http://www.ibm.com/systems/support/
supportsite.wss/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
78IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
numberComponentTestStateDescriptionAction
166-805-xxxIMMIMM I2C Test AbortedIMM I2C test
aborted: invalid
command for
the given LUN.
166-806-xxxIMMIMM I2C Test AbortedIMM I2C test
aborted: timeout
while
processing the
command.
1. Turn off the system and disconnect it from the
power source. You must disconnect the system
from ac power to reset the IMM.
2. After 45 seconds, reconnect the system to the
power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
http://www.ibm.com/support/
docview.wss?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in
the diagnostic event log in the Firmware/VPD
section for this component. For more
information, see “Updating the firmware” on
page 253.
6. Run the test again.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
http://www.ibm.com/systems/support/
supportsite.wss/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
1. Turn off the system and disconnect it from the
power source. You must disconnect the system
from ac power to reset the IMM.
2. After 45 seconds, reconnect the system to the
power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
http://www.ibm.com/support/
docview.wss?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in
the diagnostic event log in the Firmware/VPD
section for this component. For more
information, see “Updating the firmware” on
page 253.
6. Run the test again.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
http://www.ibm.com/systems/support/
supportsite.wss/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
Chapter 3. Diagnostics
79
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
numberComponentTestStateDescriptionAction
166-807-xxxIMMIMM I2C Test AbortedIMM I2C test
aborted: out of
space.
166-808-xxxIMMIMM I2C Test AbortedIMM I2C test
aborted:
reservation
canceled or
invalid
reservation ID.
1. Turn off the system and disconnect it from the
power source. You must disconnect the system
from ac power to reset the IMM.
2. After 45 seconds, reconnect the system to the
power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
http://www.ibm.com/support/
docview.wss?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in
the diagnostic event log in the Firmware/VPD
section for this component. For more
information, see “Updating the firmware” on
page 253.
6. Run the test again.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
http://www.ibm.com/systems/support/
supportsite.wss/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
1. Turn off the system and disconnect it from the
power source. You must disconnect the system
from ac power to reset the IMM.
2. After 45 seconds, reconnect the system to the
power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
http://www.ibm.com/support/
docview.wss?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in
the diagnostic event log in the Firmware/VPD
section for this component. For more
information, see “Updating the firmware” on
page 253.
6. Run the test again.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
http://www.ibm.com/systems/support/
supportsite.wss/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
80IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
numberComponentTestStateDescriptionAction
166-809-xxxIMMIMM I2C Test AbortedIMM I2C test
aborted:
request data
was truncated.
166-810-xxxIMMIMM I2C Test AbortedIMM I2C test
aborted:
request data
length is invalid.
1. Turn off the system and disconnect it from the
power source. You must disconnect the system
from ac power to reset the IMM.
2. After 45 seconds, reconnect the system to the
power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
http://www.ibm.com/support/
docview.wss?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in
the diagnostic event log in the Firmware/VPD
section for this component. For more
information, see “Updating the firmware” on
page 253.
6. Run the test again.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
http://www.ibm.com/systems/support/
supportsite.wss/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
1. Turn off the system and disconnect it from the
power source. You must disconnect the system
from ac power to reset the IMM.
2. After 45 seconds, reconnect the system to the
power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
http://www.ibm.com/support/
docview.wss?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in
the diagnostic event log in the Firmware/VPD
section for this component. For more
information, see “Updating the firmware” on
page 253.
6. Run the test again.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
http://www.ibm.com/systems/support/
supportsite.wss/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
Chapter 3. Diagnostics
81
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, Types 4255, 7945, and 7949 server,” on page 157 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
numberComponentTestStateDescriptionAction
166-811-xxxIMMIMM I2C Test AbortedIMM I2C test
aborted:
request data
field length limit
is exceeded.
166-812-xxxIMMIMM I2C Test AbortedIMM I2C Test
aborted: a
parameter is
out of range.
1. Turn off the system and disconnect it from the
power source. You must disconnect the system
from ac power to reset the IMM.
2. After 45 seconds, reconnect the system to the
power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
http://www.ibm.com/support/
docview.wss?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in
the diagnostic event log in the Firmware/VPD
section for this component. For more
information, see “Updating the firmware” on
page 253.
6. Run the test again.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
http://www.ibm.com/systems/support/
supportsite.wss/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
1. Turn off the system and disconnect it from the
power source. You must disconnect the system
from ac power to reset the IMM.
2. After 45 seconds, reconnect the system to the
power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
http://www.ibm.com/support/
docview.wss?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in
the diagnostic event log in the Firmware/VPD
section for this component. For more
information, see “Updating the firmware” on
page 253.
6. Run the test again.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
http://www.ibm.com/systems/support/
supportsite.wss/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
82IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Loading...
+ hidden pages
You need points to download manuals.
1 point = 1 manual.
You can buy points or you can get point for every manual you upload.