IBM System x3650 M3 4255, System x3650 M3 7945, System x3650 M3 7949 Service Manual

IBM System x3650 M3 Types 4255, 7945, and 7949

P roble m Dete rminatio n and Se rvi ce Gui de
IBM System x3650 M3 Types 4255, 7945, and 7949

P roble m Dete rminatio n and Se rvi ce Gui de
Note: Before using this information and the product it supports, read the general information in Appendix B, “Notices,” on page 281, the IBM Safety Information and IBM Environmental Notices and User's Guide on the IBM System x Documentation CD, and the IBM Warranty Information document that comes with your server.
Eleventh Edition (September 2012)
US Government Users Restricted Rights – Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
Contents
Safety ............................vii
Guidelines for trained service technicians ...............viii
Inspecting for unsafe conditions .................viii
Guidelines for servicing electrical equipment .............ix
Safety statements ........................x
Chapter 1. Start here.......................1
Diagnosing a problem .......................1
Undocumented problems .....................4
Chapter 2. Introduction ......................5
Related documentation ......................5
Notices and statements in this document................6
Features and specifications.....................7
Server controls, LEDs, and connectors ................9
Front view ..........................9
Rear view ..........................12
Internal connectors, LEDs, and jumpers................15
System-board internal connectors .................15
System-board external connectors.................16
System-board switches and jumpers ................17
System-board LEDs ......................20
System-board optional device connectors ..............22
PCI riser-card adapter connectors .................22
PCI riser-card assembly LEDs ..................22
SAS riser-card connectors and LEDs................24
Chapter 3. Diagnostics .....................25
Diagnostic tools ........................25
POST ............................26
Event logs .........................26
POST error codes.......................28
Checkout procedure .......................41
About the checkout procedure ..................41
Performing the checkout procedure ................42
Troubleshooting tables ......................43
DVD drive problems ......................43
General problems .......................44
Hard disk drive problems ....................44
Hypervisor problems ......................46
Intermittent problems......................47
USB keyboard, mouse, or pointing-device problems ..........48
Memory problems .......................49
Microprocessor problems ....................51
Monitor or video problems ....................51
Optional-device problems ....................54
Power problems .......................55
Serial device problems .....................59
ServerGuide problems .....................60
Software problems ......................61
Universal Serial Bus (USB) port problems ..............61
Video problems........................61
Light path diagnostics ......................62
© Copyright IBM Corp. 2012 iii
Remind button ........................64
Light path diagnostics LEDs ...................65
Power-supply LEDs .......................69
Diagnostic programs, messages, and error codes ............72
Running the diagnostic programs .................72
Diagnostic text messages ....................73
Viewing the test log ......................73
Diagnostic messages .....................73
Tape alert flags ........................108
Recovering the server firmware ..................109
Automatic boot failure recovery (ABR) ................112
Nx boot failure.........................113
System event messages log ...................114
Integrated management module error messages ............114
Solving power problems .....................153
Solving Ethernet controller problems ................154
Solving undetermined problems ..................154
Problem determination tips ....................155
Chapter 4. Parts listing, Types 4255, 7945, and 7949 server.......157
Replaceable server components ..................157
Product recovery CDs ......................165
Power cords .........................165
Chapter 5. Removing and replacing server components ........169
Installation guidelines ......................169
System reliability guidelines...................170
Working inside the server with the power on ............171
Handling static-sensitive devices .................171
Returning a device or component ................172
Internal cable routing and connectors ...............172
Removing and replacing consumable parts and Tier 1 CRUs .......176
Removing the cover .....................177
Installing the cover ......................178
Removing the microprocessor 2 air baffle..............178
Installing the microprocessor 2 air baffle ..............180
Removing the DIMM air baffle ..................180
Installing the DIMM air baffle ..................182
Removing the fan bracket ...................182
Installing the fan bracket ....................184
Removing an IBM virtual media key................185
Installing an IBM virtual media key ................186
Removing a USB hypervisor memory key .............186
Installing a USB hypervisor memory key ..............187
Removing a PCI riser-card assembly ...............188
Installing a PCI riser-card assembly ................189
Removing a PCI adapter from a PCI riser-card assembly ........190
Installing a PCI adapter in a PCI riser-card assembly .........191
Removing the optional two-port Ethernet adapter ...........193
Installing the optional two-port Ethernet adapter ...........194
Storing the full-length-adapter bracket ...............196
Removing the SAS riser-card and controller assembly .........197
Installing the SAS riser-card and controller assembly .........198
Removing a ServeRAID SAS controller from the SAS riser card .....201
Installing a ServeRAID SAS controller on the SAS riser card ......202
Removing an optional ServeRAID adapter advanced feature key .....204
iv IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Installing an optional ServeRAID adapter advanced feature key .....205
Removing a ServeRAID SAS controller battery from the remote battery tray 207 Installing a ServeRAID SAS controller battery on the remote battery tray 208
Removing a hot-swap hard disk drive ...............210
Installing a hot-swap hard disk drive ...............210
Removing a simple-swap hard disk drive ..............211
Installing a simple-swap hard disk drive ..............212
Removing an optional CD-RW/DVD drive..............213
Installing an optional CD-RW/DVD drive ..............214
Removing a tape drive ....................214
Installing a tape drive .....................216
Removing a memory module (DIMM) ...............217
Installing a memory module ...................217
Installing a DIMM ......................222
Removing a hot-swap fan ...................223
Installing a hot-swap fan ....................224
Removing a hot-swap ac power supply ..............225
Installing a hot-swap ac power supply ...............226
Removing the battery .....................229
Installing the battery .....................231
Removing the operator information panel assembly ..........233
Installing the operator information panel assembly ..........233
Removing and replacing Tier 2 CRUs ................234
Removing the bezel .....................234
Installing the bezel ......................234
Removing the SAS hard disk drive backplane ............235
Installing the SAS hard disk drive backplane ............235
Removing the simple-swap hard disk drive backplate .........236
Installing the simple-swap hard disk drive backplate ..........237
Removing and replacing FRUs ..................238
Removing a microprocessor and heat sink .............238
Installing a microprocessor and heat sink..............240
Thermal grease .......................244
Removing a heat-sink retention module ..............246
Installing a heat-sink retention module ...............246
Removing the system board ..................247
Installing the system board ...................248
Removing the 240 VA safety cover ................250
Installing the 240 VA safety cover ................251
Chapter 6. Configuration information and instructions ........253
Updating the firmware ......................253
Configuring the server ......................253
Using the Setup utility .....................255
Using the Boot Selection Menu program ..............260
Starting the backup server firmware................260
Using the ServerGuide Setup and Installation CD...........260
Using the integrated management module .............263
Using the USB memory key for VMware hypervisor ..........264
Using the remote presence capability and blue-screen capture ......265
Using the embedded hypervisor .................266
Enabling the Broadcom Gigabit Ethernet Utility program ........267
Configuring the Gigabit Ethernet controller .............267
Using the LSI Configuration Utility program .............268
IBM Advanced Settings Utility program...............270
Updating IBM Systems Director ..................270
Contents v
Updating the Universal Unique Identifier (UUID) ............271
Updating the DMI/SMBIOS data ..................274
Appendix A. Getting help and technical assistance ..........279
Before you call ........................279
Using the documentation.....................279
Getting help and information from the World Wide Web .........279
Software service and support ...................280
Hardware service and support ...................280
IBM Taiwan product service ....................280
Appendix B. Notices ......................281
Trademarks..........................281
Important notes ........................282
Particulate contamination.....................283
Documentation format ......................283
Telecommunication regulatory statement ...............284
Electronic emission notices ....................284
Federal Communications Commission (FCC) statement ........284
Industry Canada Class A emission compliance statement ........284
Avis de conformité à la réglementation d'Industrie Canada .......284
Australia and New Zealand Class A statement ............284
European Union EMC Directive conformance statement ........285
Germany Class A statement ..................285
Japan VCCI Class A statement .................286
Japan Electronics and Information Technology Industries Association (JEITA)
statement ........................286
Korea Communications Commission (KCC) statement .........286
Russia Electromagnetic Interference (EMI) Class A statement ......287
People's Republic of China Class A electronic emission statement ....287
Taiwan Class A compliance statement ...............287
Index ............................289
vi IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Safety
Before installing this product, read the Safety Information.
Antes de instalar este produto, leia as Informações de Segurança.
Pred instalací tohoto produktu si prectete prírucku bezpecnostních instrukcí.
Læs sikkerhedsforskrifterne, før du installerer dette produkt.
Lees voordat u dit product installeert eerst de veiligheidsvoorschriften.
Ennen kuin asennat tämän tuotteen, lue turvaohjeet kohdasta Safety Information.
Avant d'installer ce produit, lisez les consignes de sécurité.
Vor der Installation dieses Produkts die Sicherheitshinweise lesen.
Prima di installare questo prodotto, leggere le Informazioni sulla Sicurezza.
Les sikkerhetsinformasjonen (Safety Information) før du installerer dette produktet.
Antes de instalar este produto, leia as Informações sobre Segurança.
Antes de instalar este producto, lea la información de seguridad.
Läs säkerhetsinformationen innan du installerar den här produkten.
© Copyright IBM Corp. 2012 vii
Bu ürünü kurmadan önce güvenlik bilgilerini okuyun.
Guidelines for trained service technicians
This section contains information for trained service technicians.
Inspecting for unsafe conditions
Use the information in this section to help you identify potential unsafe conditions in an IBM product that you are working on. Each IBM product, as it was designed and manufactured, has required safety items to protect users and service technicians from injury. The information in this section addresses only those items. Use good judgment to identify potential unsafe conditions that might be caused by non-IBM alterations or attachment of non-IBM features or optional devices that are not addressed in this section. If you identify an unsafe condition, you must determine how serious the hazard is and whether you must correct the problem before you work on the product.
Consider the following conditions and the safety hazards that they present: v Electrical hazards, especially primary power. Primary voltage on the frame can
cause serious or fatal electrical shock.
v Explosive hazards, such as a damaged CRT face or a bulging capacitor. v Mechanical hazards, such as loose or missing hardware.
To inspect the product for potential unsafe conditions, complete the following steps:
1. Make sure that the power is off and the power cord is disconnected.
2. Make sure that the exterior cover is not damaged, loose, or broken, and observe any sharp edges.
3. Check the power cord: v Make sure that the third-wire ground connector is in good condition. Use a
meter to measure third-wire ground continuity for 0.1 ohm or less between the external ground pin and the frame ground.
v Make sure that the power cord is the correct type, as specified in “Power
cords” on page 165.
v Make sure that the insulation is not frayed or worn.
4. Remove the cover.
5. Check for any obvious non-IBM alterations. Use good judgment as to the safety of any non-IBM alterations.
viii IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
6. Check inside the server for any obvious unsafe conditions, such as metal filings, contamination, water or other liquid, or signs of fire or smoke damage.
7. Check for worn, frayed, or pinched cables.
8. Make sure that the power-supply cover fasteners (screws or rivets) have not been removed or tampered with.
Guidelines for servicing electrical equipment
Observe the following guidelines when you service electrical equipment: v Check the area for electrical hazards such as moist floors, nongrounded power
extension cords, and missing safety grounds.
v Use only approved tools and test equipment. Some hand tools have handles that
are covered with a soft material that does not provide insulation from live electrical currents.
v Regularly inspect and maintain your electrical hand tools for safe operational
condition. Do not use worn or broken tools or testers.
v Do not touch the reflective surface of a dental mirror to a live electrical circuit.
The surface is conductive and can cause personal injury or equipment damage if it touches a live electrical circuit.
v Some rubber floor mats contain small conductive fibers to decrease electrostatic
discharge. Do not use this type of mat to protect yourself from electrical shock.
v Do not work alone under hazardous conditions or near equipment that has
hazardous voltages.
v Locate the emergency power-off (EPO) switch, disconnecting switch, or electrical
outlet so that you can turn off the power quickly in the event of an electrical accident.
v Disconnect all power before you perform a mechanical inspection, work near
power supplies, or remove or install main units.
v Before you work on the equipment, disconnect the power cord. If you cannot
disconnect the power cord, have the customer power-off the wall box that supplies power to the equipment and lock the wall box in the off position.
v Never assume that power has been disconnected from a circuit. Check it to
make sure that it has been disconnected.
v If you have to work on equipment that has exposed electrical circuits, observe
the following precautions: – Make sure that another person who is familiar with the power-off controls is
near you and is available to turn off the power if necessary.
– When you are working with powered-on electrical equipment, use only one
hand. Keep the other hand in your pocket or behind your back to avoid creating a complete circuit that could cause an electrical shock.
– When you use a tester, set the controls correctly and use the approved probe
leads and accessories for that tester.
– Stand on a suitable rubber mat to insulate you from grounds such as metal
floor strips and equipment frames.
v Use extreme care when you measure high voltages. v To ensure proper grounding of components such as power supplies, pumps,
blowers, fans, and motor generators, do not service these components outside of their normal operating locations.
v If an electrical accident occurs, use caution, turn off the power, and send another
person to get medical aid.
Safety ix
Safety statements
Important:
Each caution and danger statement in this document is labeled with a number. This number is used to cross reference an English-language caution or danger statement with translated versions of the caution or danger statement in the Safety Information document.
For example, if a caution statement is labeled “Statement 1,” translations for that caution statement are in the Safety Information document under “Statement 1.”
Be sure to read all caution and danger statements in this document before you perform the procedures. Read any additional safety information that comes with the server or optional device before you install the device.
Attention: Use No. 26 AWG or larger UL-listed or CSA certified telecommunication line cord.
x IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Statement 1:
DANGER
Electrical current from power, telephone, and communication cables is hazardous.
To avoid a shock hazard: v Do not connect or disconnect any cables or perform installation,
maintenance, or reconfiguration of this product during an electrical storm.
v Connect all power cords to a properly wired and grounded electrical
outlet.
v Connect to properly wired outlets any equipment that will be attached to
this product.
v When possible, use one hand only to connect or disconnect signal
cables.
v Never turn on any equipment when there is evidence of fire, water, or
structural damage.
v Disconnect the attached power cords, telecommunications systems,
networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures.
v Connect and disconnect cables as described in the following table when
installing, moving, or opening covers on this product or attached devices.
To Connect: To Disconnect:
1. Turn everything OFF.
2. First, attach all cables to devices.
3. Attach signal cables to connectors.
4. Attach power cords to outlet.
5. Turn device ON.
1. Turn everything OFF.
2. First, remove power cords from outlet.
3. Remove signal cables from connectors.
4. Remove all cables from devices.
Safety xi
Statement 2:
CAUTION: When replacing the lithium battery, use only IBM Part Number 33F8354 or an equivalent type battery recommended by the manufacturer. If your system has a module containing a lithium battery, replace it only with the same module type made by the same manufacturer. The battery contains lithium and can explode if not properly used, handled, or disposed of.
Do not:
v Throw or immerse into water v Heat to more than 100°C (212°F) v Repair or disassemble
Dispose of the battery as required by local ordinances or regulations.
xii IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Statement 3:
CAUTION: When laser products (such as CD-ROMs, DVD drives, fiber optic devices, or transmitters) are installed, note the following:
v Do not remove the covers. Removing the covers of the laser product could
result in exposure to hazardous laser radiation. There are no serviceable parts inside the device.
v Use of controls or adjustments or performance of procedures other than
those specified herein might result in hazardous radiation exposure.
DANGER
Some laser products contain an embedded Class 3A or Class 3B laser diode. Note the following.
Laser radiation when open. Do not stare into the beam, do not view directly with optical instruments, and avoid direct exposure to the beam.
Class 1 Laser Product Laser Klasse 1 Laser Klass 1 Luokan 1 Laserlaite Appareil A Laser de Classe 1
`
Safety xiii
Statement 4:
18 kg (39.7 lb) 32 kg (70.5 lb) 55 kg (121.2 lb)
CAUTION: Use safe practices when lifting.
Statement 5:
CAUTION: The power control button on the device and the power switch on the power supply do not turn off the electrical current supplied to the device. The device also might have more than one power cord. To remove all electrical current from the device, ensure that all power cords are disconnected from the power source.
2
1
xiv IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Statement 8:
CAUTION: Never remove the cover on a power supply or any part that has the following label attached.
Hazardous voltage, current, and energy levels are present inside any component that has this label attached. There are no serviceable parts inside these components. If you suspect a problem with one of these parts, contact a service technician.
Statement 12:
CAUTION: The following label indicates a hot surface nearby.
Statement 26:
CAUTION: Do not place any object on top of rack-mounted devices.
This server is suitable for use on an IT power-distribution system whose maximum phase-to-phase voltage is 240 V under any distribution fault condition.
Important: This product is not suitable for use with visual display workplace devices according to Clause 2 of the German Ordinance for Work with Visual Display Units.
Safety xv
xvi IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Chapter 1. Start here
You can solve many problems without outside assistance by following the troubleshooting procedures in this Problem Determination and Service Guide and on the IBM Web site. This document describes the diagnostic tests that you can perform, troubleshooting procedures, and explanations of error messages and error codes. The documentation that comes with your operating system and software also contains troubleshooting information.
Diagnosing a problem
Before you contact IBM or an approved warranty service provider, follow these procedures in the order in which they are presented to diagnose a problem with your server:
1. Determine what has changed. Determine whether any of the following items were added, removed, replaced,
or updated before the problem occurred:
v BIOS code v Device drivers v Firmware v Hardware components v Software
If possible, return the server to the condition it was in before the problem occurred.
2. Collect data. Thorough data collection is necessary for diagnosing hardware and software
problems. a. Document error codes and system-board LEDs.
v System error codes: See “Viewing the test log” on page 73 for
information about error codes.
v Software or operating-system error codes: See the documentation for
the software or operating system for information about a specific error code. See the manufacturer's Web site for documentation.
v Light path diagnostics LEDs: See “Light path diagnostics LEDs” on
page 65 for information about light path diagnostics LEDs that are lit.
v System-board LEDs: See “System-board LEDs” on page 20 for
information about system-board LEDs that are lit.
b. Collect system data.
Run Dynamic System Analysis (DSA) to collect information about the hardware, firmware, software, and operating system. Have this information available when you contact IBM or an approved warranty service provider. For instructions for running the DSA program, see “Running the diagnostic programs” on page 72.
If you have to download the latest version of DSA, go to http://www.ibm.com/systems/support/supportsite.wss/ docdisplay?brandind=5000008&lndocid=SERV-DSA or complete the following steps.
© Copyright IBM Corp. 2012 1
Note: Changes are made periodically to the IBM Web site. The actual procedure might vary slightly from what is described in this document.
1) Go to http://www.ibm.com/systems/support/.
2) Under Product support, click System x.
3) Under Popular links, click Software and device drivers.
4) Under Related downloads, click Dynamic System Analysis (DSA). For information about DSA command-line options, go to
http://publib.boulder.ibm.com/infocenter/toolsctr/v1r0/index.jsp?topic=/ com.ibm.xseries.tools.doc/erep_tools_dsa.html or complete the following steps:
1) Go to http://publib.boulder.ibm.com/infocenter/toolsctr/v1r0/index.jsp.
2) In the navigation pane, click IBM System x and BladeCenter Tools Center.
3) Click Tools reference > Error reporting and analysis tools > IBM Dynamic System Analysis.
3. Follow the problem-resolution procedures. The four problem-resolution procedures are presented in the order in which they
are most likely to solve your problem. Follow these procedures in the order in which they are presented:
a. Check for and apply code updates.
Most problems that appear to be caused by faulty hardware are actually caused by BIOS code, system firmware, device firmware, or device drivers that are not at the latest levels.
Important: Some cluster solutions require specific code levels or coordinated code updates. If the device is part of a cluster solution, verify that the latest level of code is supported for the cluster solution before you update the code.
1) Determine the existing code levels. In DSA, click Firmware/VPD to view system firmware levels, or click
Software to view operating-system levels.
2) Download and install updates of code that is not at the latest level. To display a list of available updates for your server, go to
http://www.ibm.com/systems/support/supportsite.wss/ docdisplay?brandind=5000008&lndocid=MIGR-4JTS2T or complete the following steps.
Note: Changes are made periodically to the IBM Web site. The actual procedure might vary slightly from what is described in this document.
a) Go to http://www.ibm.com/systems/support/. b) Under Product support, click System x. c) Under Popular links, click Software and device drivers. d) Click System x3650 M3 to display the list of downloadable files for
the server.
You can install code updates that are packaged as an UpdateXpress System Pack or UpdateXpress CD image. An UpdateXpress System Pack contains an integration-tested bundle of online firmware and device-driver updates for your server.
Be sure to separately install any listed critical updates that have release dates that are later than the release date of the UpdateXpress System Pack or UpdateXpress image.
2 IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
When you click an update, an information page is displayed, including a list of the problems that the update fixes. Review this list for your specific problem; however, even if your problem is not listed, installing the update might solve the problem.
b. Check for and correct an incorrect configuration.
If the server is incorrectly configured, a system function can fail to work when you enable it; if you make an incorrect change to the server configuration, a system function that has been enabled can stop working.
1) Make sure that all installed hardware and software are supported. See http://www.ibm.com/servers/eserver/serverproven/compat/us/ to
verify that the server supports the installed operating system, optional devices, and software levels. If any hardware or software component is not supported, uninstall it to determine whether it is causing the problem. You must remove nonsupported hardware before you contact IBM or an approved warranty service provider for support.
2) Make sure that the server, operating system, and software are
installed and configured correctly.
Many configuration problems are caused by loose power or signal cables or incorrectly seated adapters. You might be able to solve the problem by turning off the server, reconnecting cables, reseating adapters, and turning the server back on. For information about performing the checkout procedure, see “Checkout procedure” on page
41. If the problem is associated with a specific function (for example, if a
RAID hard disk drive is marked offline in the RAID array), see the documentation for the associated controller and management or controlling software to verify that the controller is correctly configured.
Problem determination information is available for many devices such as RAID and network adapters.
For problems with operating systems or IBM software or devices, complete the following steps.
Note: Changes are made periodically to the IBM Web site. The actual procedure might vary slightly from what is described in this document.
a) Go to http://www.ibm.com/systems/support/. b) Under Product support, click System x. c) From the Product family list, select System x3650 M3. d) Under Support & downloads, click Documentation, Install, and
Use to search for related documentation.
c. Check for troubleshooting procedures and RETAIN tips.
Troubleshooting procedures and RETAIN tips document known problems and suggested solutions. To search for troubleshooting procedures and RETAIN tips, complete the following steps.
Note: Changes are made periodically to the IBM Web site. The actual procedure might vary slightly from what is described in this document.
1) Go to http://www.ibm.com/systems/support/.
2) Under Product support, click System x.
3) From the Product family list, select System x3650 M3.
4) Under Support & downloads, click Troubleshoot.
Chapter 1. Start here 3
5) Select the troubleshooting procedure or RETAIN tip that applies to your problem:
v Troubleshooting procedures are under Diagnostic. v RETAIN tips are under Troubleshoot.
d. Check for and replace defective hardware.
If a hardware component is not operating within specifications, it can cause unpredictable results. Most hardware failures are reported as error codes in a system or operating-system log. For more information, see “Troubleshooting tables” on page 43 and Chapter 5, “Removing and replacing server components,” on page 169. Hardware errors are also indicated by light path diagnostics LEDs.
A single problem might cause multiple symptoms. Follow the troubleshooting procedure for the most obvious symptom. If that procedure does not diagnose the problem, use the procedure for another symptom, if possible.
If the problem remains, contact IBM or an approved warranty service provider for assistance with additional problem determination and possible hardware replacement. To open an online service request, go to http://www.ibm.com/support/electronic/. Be prepared to provide information about any error codes and collected data.
Undocumented problems
If you have completed the diagnostic procedure and the problem remains, the problem might not have been previously identified by IBM. After you have verified that all code is at the latest level, all hardware and software configurations are valid, and no light path diagnostics LEDs or log entries indicate a hardware component failure, contact IBM or an approved warranty service provider for assistance. To open an online service request, go to http://www.ibm.com/support/electronic/. Be prepared to provide information about any error codes and collected data and the problem determination procedures that you have used.
4 IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Chapter 2. Introduction
This Problem Determination and Service Guide contains information to help you solve problems that might occur in your IBM or 7949 server. It describes the diagnostic tools that come with the server, error codes and suggested actions, and instructions for replacing failing components.
Replaceable components are of four types: v Consumable Parts: Purchase and replacement of consumable
parts(components, such as batteries and printer cartridges, that have depletable life) is your responsibility. If IBM acquires or installs a consumable part at your request, you will be charged for the service.
v Tier 1 customer replaceable unit (CRU): Replacement of Tier 1 CRUs is your
responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation.
v Tier 2 customer replaceable unit: You may install a Tier 2 CRU yourself or
request IBM to install it, at no additional charge, under the type of warranty service that is designated for your server.
v Field replaceable unit (FRU): FRUs must be installed only by trained service
technicians.
For information about the terms of the warranty and getting service and assistance, see the Warranty and Support Information document on the IBM Documentation CD.
®
System x3650 M3 Type 4255, 7945,
Related documentation
In addition to this document, the following documentation also comes with the server:
v Installation and User's Guide
This document is in Portable Document Format (PDF) on the IBM Documentation CD. It provides general information about setting up and cabling the server, including information about features, and how to configure the server. It also contains detailed instructions for installing, removing, and connecting optional devices that the server supports.
v Rack Installation Instructions
This printed document contains instructions for installing the server in a rack.
v Safety Information
This document is in PDF on the IBM Documentation CD. It contains translated caution and danger statements. Each caution and danger statement that appears in the documentation has a number that you can use to locate the corresponding statement in your language in the Safety Information document.
v Warranty and Support Information
This printed document contains information about the terms of the warranty.
v Environmental Notices and User Guide
This document is in PDF on the IBM Documentation CD. It contains translated environmental notices.
v IBM License Agreement for Machine Code
This document is in PDF on the IBM Documentation CD. It provides translated versions of the IBM License Agreement for Machine code for your product.
© Copyright IBM Corp. 2012 5
v Licenses and Attributions Documents
This document is in PDF. It contains information about the open-source notices.
Depending on the server model, additional documentation might be included on the IBM Documentation CD.
®
The System x
and xSeries®Tools Center is an online information center that contains information about tools for updating, managing, and deploying firmware, device drivers, and operating systems. The System x and xSeries Tools Center is at http://publib.boulder.ibm.com/infocenter/toolsctr/v1r0/index.jsp.
The server might have features that are not described in the documentation that you received with the server. The documentation might be updated occasionally to include information about those features, or technical updates might be available to provide additional information that is not included in the server documentation. These updates are available from the IBM Web site. To check for updated documentation and technical updates, complete the following steps.
Note: Changes are made periodically to the IBM Web site. The actual procedure might vary slightly from what is described in this document.
1. Go to http://www.ibm.com/systems/support/.
2. Under Product support, click System x.
3. Under Popular links, click Publications lookup.
4. From the Product family menu, select System x3650 M3 and click Continue.
Notices and statements in this document
The caution and danger statements in this document are also in the multilingual Safety Information document, which is on the Documentation CD. Each statement is numbered for reference to the corresponding statement in your language in the Safety Information document.
The following notices and statements are used in this document:
v Note: These notices provide important tips, guidance, or advice. v Important: These notices provide information or advice that might help you avoid
inconvenient or problem situations.
v Attention: These notices indicate potential damage to programs, devices, or
data. An attention notice is placed just before the instruction or situation in which damage might occur.
v Caution: These statements indicate situations that can be potentially hazardous
to you. A caution statement is placed just before the description of a potentially hazardous procedure step or situation.
v Danger: These statements indicate situations that can be potentially lethal or
extremely hazardous to you. A danger statement is placed just before the description of a potentially lethal or extremely hazardous procedure step or situation.
6 IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Features and specifications
The following information is a summary of the features and specifications of the server. Depending on the model, some features might not be available, or some specifications might not apply.
Racks are marked in vertical increments of 4.45 cm (1.75 inches). Each increment is referred to as a unit, or “U.” A 1-U-high device is 1.75 inches tall.
Notes:
1. Power consumption and heat output vary depending on the number and type of optional features that are installed and the power-management optional features that are in use.
2. The sound levels were measured in controlled acoustical environments according to the procedures specified by the American National Standards Institute (ANSI) S12.10 and ISO 7779 and are reported in accordance with ISO
9296. Actual sound-pressure levels in a given location might exceed the average values stated because of room reflections and other nearby noise sources. The declared sound-power levels indicate an upper limit, below which a large number of computers will operate.
Chapter 2. Introduction 7
Table 1. Features and specifications
Microprocessor:
v Supports up to two Intel Xeon
multi-core microprocessors (one installed)
v Level-3 cache v QuickPath Interconnect (QPI) links
speed up to 6.4 GT per second
Note:
v Do not install an Intel Xeon
series microprocessor and an Xeon 5600 series microprocessor in the same server.
v Use the Setup utility to determine the
type and speed of the microprocessors.
v For a list of supported microprocessors,
see http://www.ibm.com/servers/eserver/ serverproven/compat/us/.
Memory:
v Minimum: 2 GB v Maximum: 288 GB
– 48 GB using unbuffered DIMMs
(UDIMMs)
– 288 GB using registered DIMMs
(RDIMMs)
v Type: PC3-10600R-999, 800, 1067, and
1333 MHz, ECC, DDR3 registered or unbuffered SDRAM DIMMs
v Slots: 18 dual inline v Supports (depending on the model):
– 2 GB and 4 GB unbuffered DIMMs – 2 GB, 4 GB, 8 GB, and 16 GB
registered DIMMs
SATA optical drives (optional):
v DVD-ROM v Multi-burner
Hard disk drive expansion bays (depending on the model:
v Eight 2.5-inch SAS hot-swap bays for
hard disk drive bays with option to add eight more 2.5-inch SAS hot-swap hard disk drive bays
v Four 2.5-inch simple-swap, solid state
SATA hard disk drive bays
PCI expansion slots:
v Two PCI Express riser cards with two
PCI Express x8 slots (x8 lanes) each, standard
v Support for the following optional riser
cards: – Two 133 MHz/64-bit PCI-X 1.0a slots – One PCI Express x16 slot (x16
lanes)
5500
Size (2U):
v Height: 85.2 mm (3.346 in.) v Depth: EIA flange to rear - 698 mm
(27.480 in.), Overall - 729 mm (28.701 in.)
v Width: With top cover - 443.6 mm
(17.465 in.), With front bezel - 482.0 mm (18.976 in.)
v Weight: approximately 21.09 kg (46.5
lb) to 25 kg (55 lb) depending upon configuration
Integrated functions:
v Integrated management module (IMM),
which provides service processor control and monitoring functions, video controller, and (when the optional virtual media key is installed) remote keyboard, video, mouse, and remote hard disk drive capabilities
v Dedicated or shared management
network connections
v Serial over LAN (SOL) and serial
redirection over Telnet or Secure Shell (SSH)
v One systems-management RJ-45 for
connection to a dedicated systems-management network
v Support for remote management
presence through an optional virtual media key
v Broadcom BCM5709 Gb Ethernet
controller with TCP/IP Offload Engine (TOE) and Wake on LAN support
v Four Ethernet ports (two on system
board and two additional ports when the optional IBM Dual-Port 1 Gb Ethernet Daughter Card is installed)
v One serial port, shared with the
integrated management module (IMM)
v Four Universal Serial Bus (USB) ports
(two on front, two on rear of server), v2.0 supporting v1.1, plus one or more dedicated internal USB ports on the SAS riser card
v Two video ports (one on front and one
on rear of server)
v One SATA tape connector, one USB
tape connector, and one tape power connector on SAS riser card (some models)
v Support for hypervisor function through
an optional USB flash device on the SAS riser card (not available on simple-swap models)
Note: In messages and documentation, the term service processor refers to the integrated management module (IMM).
Video controller (integrated into IMM):
v Matrox G200eV (two analog ports - one
front and one rear that can be connected at the same time) Note: The maximum video resolution is 1600 x 1200 at 75 Hz. – SVGA compatible video controller – DDR2 250 MHz SDRAM video
memory controller – Avocent Digital Video Compression – 16 MB of video memory (not
expandable)
ServeRAID controller (depending on the model):
v A ServeRAID-BR10il v2 SAS/SATA
adapter that provides RAID levels 0, 1, and 1E (comes standard on some hot-swap models).
v An optional ServeRAID-BR10il SAS/SATA
adapter that provides RAID levels 0, 1, and 1E can be ordered.
v An optional ServeRAID-MR10i SAS/SATA
adapter that provides RAID levels 0, 1, 5, 6, 10, 50, and 60 can be ordered.
v An optional ServerRAID M1015
SAS/SATA adapter that provides RAID levels 0, 1, and 10 with optional RAID 5/50 and SED (Self Encrypting Drive) upgrade.
v An optional ServeRAID-M5014 SAS/SATA
adapter that provides RAID levels 0, 1, 5, 10 and 50 with optional battery and RAID 6/60 and SED upgrade.
v An optional ServeRAID-M5015 SAS/SATA
adapter with battery that provides RAID levels 0, 1, 5, 10, and 50 with optional RAID 6/60 and SED upgrade
Notes:
1. RAID is supported in hot-swap models only.
2. The ServeRAID controllers are installed in a PCI Express x8 mechanical slot (x4 electrical); however, the controllers run at x4 bandwidth.
8 IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 1. Features and specifications (continued)
Electrical input with hot-swap ac power supplies:
v Sine-wave input (47 - 63 Hz) required v Input voltage range automatically
selected
v Input voltage low range:
– Minimum: 100 V ac – Maximum: 127 V ac
v Input voltage high range:
– Minimum: 200 V ac – Maximum: 240 V ac
v Input kilovolt-amperes (kVA)
approximately: – Minimum: 0.090 kVA – Maximum: 0.700 kVA
Environment:
v Air temperature:
– Server on: 10°C to 35°C (50.0°F to
95.0°F); altitude: 0 to 914.4 m (3000 ft). Decrease system temperature by 1°C for every 1000-foot increase in altitude.
– Server off: 5°C to 45°C (41.0°F to
113.0°F); maximum altitude: 3048 m (10000 ft)
– Shipment: -40°C to +60°C (-40°F to
140°F); maximum altitude: 3048 m (10000 ft)
v Humidity:
– Server on: 20% to 80%; maximum
dew point: 21°C; maximum rate of change 5°C per hour.
– Server off: 8% to 80%; maximum
dew point: 27°C
– Shipment: 5% to 100%
v Particulate contamination:
Attention: Airborne particulates and reactive gases acting alone or in combination with other environmental factors such as humidity or temperature might pose a risk to the server. For information about the limits for particulates and gases, see “Particulate contamination” on page
283.
Hot-swap fans:
Three - provide redundant cooling.
Power supply:
v Up to two hot-swap power supplies for
redundancy support – 460-watt ac – 675-watt ac – 675-watt high-efficiency ac – 675-watt dc
Note: You cannot mix 460-watt and 675-watt power supplies, or high-efficiency and non-high-efficiency power supplies, or ac and dc power supplies in the server.
Acoustical noise emissions:
v Declared sound power, idle: 6.3 bel v Declared sound power, operating: 6.5 bel
Heat output:
Approximate heat output: v Minimum configuration: 662 Btu per hour
(194 watts)
v Maximum configuration: 2302 Btu per
hour (675 watts)
Server controls, LEDs, and connectors
This section describes the controls, light-emitting diodes (LEDs), and connectors.
Front view
The following illustration shows the controls, connectors, and hard disk drive bays on the front of the server.
Hard disk drive activity LED (green)
Hard disk drive status LED (amber)
Rack release latch
Hard disk drive bays
Hard disk drive activity LED: Each hard disk drive has an activity LED. When this LED is flashing, it indicates that the drive is in use.
Hard disk drive status LED: Each hard disk drive has a status LED. When this LED is lit, it indicates that the drive has failed. When this LED is flashing slowly (one flash per second), it indicates that the drive is being rebuilt as part of a RAID
Video connector
USB 1 connector
Bay 15Bay 0
CD/DVD drive activity LED
USB 2 connector
CD/DVD eject button
Operator information panel
Rack release latch
CD/DVD drive (optical drive)
Chapter 2. Introduction 9
configuration. When the LED is flashing rapidly (three flashes per second), it indicates that the controller is identifying the drive.
Video connector: Connect a monitor to this connector. The video connectors on the front and rear of the server can be used simultaneously.
USB connectors: Connect a USB device, such as USB mouse, keyboard, or other USB device, to either of these connectors.
Operator information panel: This panel contains controls, light-emitting diodes (LEDs), and connectors. For information about the controls and LEDs on the operator information panel, see “Operator information panel.”
Rack release latches: Press these latches to release the server from the rack.
Optional CD/DVD-eject button: Press this button to release a CD or DVD from the
CD-RW/DVD drive.
Optional CD/DVD drive activity LED: When this LED is lit, it indicates that the CD-RW/DVD drive is in use.
Operator information panel
The following illustration shows the controls and LEDs on the operator information panel.
The following controls and LEDs are on the operator information panel: v Power-control button and power-on LED: Press this button to turn the server
on and off manually or to wake the server from a reduced-power state. The states of the power-on LED are as follows:
Off: AC power is not present, or the power supply or the LED itself has failed. – Flashing rapidly (4 times per second): The server is turned off and is not
ready to be turned on. The power-control button is disabled. This will last approximately 20 to 40 seconds.
Note: Approximately 40 seconds after the server is connected to ac power, the power-control button becomes active.
Flashing slowly (once per second): The server is turned off and is ready to
be turned on. You can press the power-control button to turn on the server. – Lit: The server is turned on. – Fading on and off: The server is in a reduced-power state. To wake the
server, press the power-control button or use the IMM Web interface.
10 IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Note: If this LED is off, it does not mean that there is no electrical power in the server. The LED might be burned out. To remove all electrical power from the server, you must disconnect the power cord from the electrical outlet.
v Ethernet icon LED: This LED lights the Ethernet icon. v Ethernet activity LEDs: When any of these LEDs is lit, it indicates that the
server is transmitting to or receiving signals from the Ethernet LAN that is connected to the Ethernet port that corresponds to that LED.
v Information LED: When this LED is lit, it indicates that a noncritical event has
occurred. An LED on the light path diagnostics panel is also lit to help isolate the error.
v System-error LED: When this LED is lit, it indicates that a system error has
occurred. An LED on the light path diagnostics panel is also lit to help isolate the error.
v Release latch: Slide this latch to the left to access the light path diagnostics
panel, which is behind the operator information panel.
v Locator button and locator LED: Use this LED to visually locate the server
among other servers. It is also used as the physical presence for Trusted Platform Module (TPM). Press this button to turn on or turn off this LED locally. You can use IBM Systems Director to light this LED remotely.
Light path diagnostics panel
The light path diagnostics panel is on the top of the operator information panel.
To access the light path diagnostics panel, slide the blue release button on the operator information panel to the left. Pull forward on the operator information panel until the hinge of the panel is free of the server chassis. Then pull down on the operator information panel, so that you can view the light path diagnostics panel information.
Operator information panel
Light path diagnostics LEDs
Release latch
The following illustration shows the controls and LEDs on the light path diagnostics panel.
Notes:
1. Do not run the server for an extended period of time while the light path
diagnostics panel is pulled out of the server.
2. Light path diagnostics LEDs remain lit only while the server is connected to
power.
Chapter 2. Introduction 11
Rear view
Checkpoint code display
v Remind button: This button places the system-error LED on the front panel into
Remind mode. In Remind mode, the system-error LED flashes rapidly until the problem is corrected, the server is restarted, or a new problem occurs.
By placing the system-error LED indicator in Remind mode, you acknowledge that you are aware of the last failure but will not take immediate action to correct the problem. The remind function is controlled by the IMM.
v NMI button: Press this button to force a nonmaskable interrupt to the
microprocessor, if you are directed by IBM service and support.
v Reset button: Press this button to reset the server and run the power-on
self-test (POST). You might have to use a pen or the end of a straightened paper clip to press the button. The reset button is in the lower-right corner of the light path diagnostics panel.
The following illustration shows the connectors on the rear of the server.
Ethernet connectors: Use any of these connectors to connect the server to a network.
Power-cord connector: Connect the power cord to this connector.
USB connectors: Connect a USB device, such as USB mouse, keyboard, or other
USB device, to any of these connectors.
Serial connector: Connect a 9-pin serial device to this connector. The serial port is shared with the integrated management module (IMM). The IMM can take control of the shared serial port to perform text console redirection and to redirect serial traffic, using Serial over LAN (SOL).
Video connector: Connect a monitor to this connector. The video connectors on the front and rear of the server can be used simultaneously.
12 IBM System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Loading...
+ 286 hidden pages