IBM 8668, xSeries 232 8668 Hardware Maintenance Manual

ERserver
򔻐򗗠򙳰
Hardware Maintenance Manual
xSeries 232, Type 8668
ER s e r v e r
򔻐򗗠򙳰
Hardware Maintenance Manual
xSeries 232, Type 8668
Note
Before using this information and the product it supports, be sure to read the general information under “Notices” on page 167.
First Edition June 2001 (updated June 2002) The following paragraph does not apply to the United Kingdom or any country where such provisions are
inconsistent with local law:
INTERNATIONAL BUSINESS MACHINES CORPORATION PROVIDES THIS PUBLICATION AS ISWITHOUT WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain transactions, therefore, this statement may not apply to you.
This publication could include technical inaccuracies or typographical errors. Changes are periodically made to the information herein; these changes will be incorporated in new editions of the publication. IBM may make improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time.
This publication was developed for products and services offered in the United States of America. IBM may not offer the products, services, or features discussed in this document in other countries, and the information is subject to change without notice. Consult your local IBM representative for information on the products, services, and features available in your area.
Requests for technical information about IBM products should be made to your IBM reseller or IBM marketing representative.
© Copyright International Business Machines Corporation 2000, 2001. All rights reserved.
US Government Users Restricted Rights – Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
About this manual
This manual contains diagnostic information, a Symptom-to-FRU index, service information, error codes, error messages, and configuration information for the
®
IBM
Important: This manual is intended for trained servicers who are familiar with
Important safety information
Be sure to read all caution and danger statements in this book before performing any of the instructions. See “Safety information” on page 133.
Leia todas as instruções de cuidado e perigo antes de executar qualquer operação.
xSeries 232.
IBM PC Server products.
Prenez connaissance de toutes les consignes de type Attention et Danger avant de procéder aux opérations décrites par les instructions.
Lesen Sie alle Sicherheitshinweise, bevor Sie eine Anweisung ausführen.
© Copyright IBM Corp. 2000, 2001 iii
Online Support
Accertarsi di leggere tutti gli avvisi di attenzione e di pericolo prima di effettuare qualsiasi operazione.
Lea atentamente todas las declaraciones de precaución y peligro ante de llevar a cabo cualquier operación.
Use the World Wide Web (WWW) to download Diagnostic, BIOS Flash, and device driver files.
File download address is:
http://www.ibm.com/pc/support
iv Hardware Maintenance Manual: xSeries 232, Type 8668
Contents
About this manual ..........iii
Important safety information ........iii
Online Support .............iv
General checkout ..........1
General information .........3
Features and specifications..........3
Notices and statements in this book ......5
What the IBM xSeries 232 offers ........5
Reliability, availability, and serviceability features . . 6
Server controls and indicators ........7
Turning on the server ..........9
Turning off the server ..........10
Operator information panel .........11
Diagnostics.............13
Diagnostic tools overview .........13
POST................13
Error logs ..............14
SCSI messages (some models) ........14
Diagnostic programs and error messages ....15
Text messages ............15
Starting the diagnostic programs ......16
Viewing the test log ..........17
Recovering BIOS code ...........17
Identifying problems using status LEDs .....18
Power supply LEDs ..........19
Light Path Diagnostics panel .......19
Replacing the battery ...........20
Configuration ............23
Using the Configuration/Setup Utility program . . 23
Starting the Configuration/Setup Utility program 23
Main menu of the Configuration/Setup Utility 24
Using passwords ...........27
Using the SCSISelect utility program ......28
Starting the SCSISelect utility program ....28
Choices available from the SCSISelect menu . . 28
Using the PXE Boot Agent Utility program . . . 29
Using the ServerGuide CDs .........31
Features at a glance ..........32
Setup and configuration overview ......33
System Partition ............34
Typical NOS installation .........34
Setting up or updating multiple servers ....34
Installing your NOS without ServerGuide . . . 35
Additional programs included with ServerGuide 35
Installing options ..........37
Major components of the xSeries 232 server . . . 37
Components by model .........37
System board locations ..........40
System board option connectors ......40
System board internal cable connectors ....41
System board external port connectors ....43
System board switches and jumpers .....43
System board LED locations ........45
Light Path Diagnostics panel .......46
Before you begin ............47
System reliability considerations ......48
Working inside a server with power on ....48
Installing components ...........48
Removing the cover, door, and bezel .....48
Working with adapters .........54
Internal drives ............58
Memory modules ...........65
Microprocessor kit installation .......67
Power supplies ............71
Fans ................80
Completing the installation ........82
Connecting external options .........85
Input/output connector locations ......86
Input/output ports ...........86
Cabling the server ............95
Cabling the tower model .........95
Cabling the rack model .........96
FRU information (service only) ....97
Operator information bracket ........97
Power switch panel............99
Rear cable bracket............101
PCI adapter card support assembly ......102
DASD backplane assembly .........103
Power supply backplane .........103
System board .............105
Symptom-to-FRU index .......107
Beep Symptoms ............107
No Beep symptoms ...........109
Diagnostic Panel LED ..........109
Diagnostic error codes ..........111
Error symptoms ............115
Power supply LED errors .........116
POST error codes ............117
Service processor error codes ........122
SCSI error codes ............123
Temperature error messages ........123
Fan error messages ...........124
Power error messages ..........124
System shutdown ............124
Power related system shutdown ......124
Temperature related system shutdown ....125
DASD checkout ............125
Host Built-In Self Test (BIST) ........126
Bus fault messages ...........126
Undetermined Problems..........126
Parts listing, xSeries 232 ......129
© Copyright IBM Corp. 2000, 2001 v
System ...............130
Keyboards ..............131
Powercords..............132
Related service information .....133
Safety information............133
General safety ............133
Electrical safety............134
Safety inspection guide .........135
Handling electrostatic discharge-sensitive
devices ..............136
Grounding requirements ........137
Safety notices (multi-lingual translations) . . . 137
Send us your comments! .........166
Problem determination tips .........167
Notices ...............167
Trademarks ..............168
vi Hardware Maintenance Manual: xSeries 232, Type 8668
General checkout
The server diagnostic programs are stored in upgradable read-only memory (ROM) on the system board. These programs are the primary method of testing the major components of the server: the system board, Ethernet controller, video controller, RAM, keyboard, mouse (pointing device), diskette drive, serial ports, hard drives, and parallel port. You can also use them to test some external devices. See, “Diagnostic tools overview” on page 13.
Also, if you cannot determine whether a problem is caused by the hardware or by the software, you can run the diagnostic programs to confirm that the hardware is working properly.
When you run the diagnostic programs, a single problem might cause several error messages. When this occurs, work to correct the cause of the first error message. After the cause of the first error message is corrected, the other error messages might not occur the next time you run the test.
A failed system might be part of a shared DASD cluster (two or more systems sharing the same external storage device(s). Prior to running diagnostics, verify that the failing system is not part of a shared DASD cluster.
A system might be part of a cluster if:
v The customer identifies the system as part of a cluster. v One or more external storage units are attached to the system and at least one of
the attached storage units is additionally attached to another system or unidentifiable source.
v One or more systems are located near the failing system.
If the failing system is suspected to be part of a shared DASD cluster, all diagnostic tests can be run except diagnostic tests which test the storage unit (DASD residing in the storage unit) or the storage adapter attached to the storage unit.
Notes:
1. For systems that are part of a shared DASD cluster, run one test at a time in looped mode. Do not run all tests in looped mode, as this could enable the DASD diagnostic tests.
2. If multiple error codes are displayed, diagnose the first error code displayed.
3. If the computer hangs with a POST error, go to “Error logs” on page 14.
4. If the computer hangs and no error is displayed, “Diagnostic programs and
error messages” on page 15.
5. Power supply problems, “Power supply LED errors” on page 116.
6. For intermittent problems, check the error log; “Undetermined Problems” on
page 126.
© Copyright IBM Corp. 2000, 2001 1
001 IS THE SYSTEM PART OF A CLUSTER?
YES. Schedule maintenance with the customer. Shut down all systems
related to the cluster. Run storage test.
NO. Go to step 002.
002 THE SYSTEM IS NOT PART OF A CLUSTER
v Power-off the computer and all external devices. v Check all cables and power cords. v Set all display controls to the middle position. v Power-on all external devices. v Power-on the computer. v Record any POST error messages displayed on the screen. If an error is
displayed, look up the first error in the “Error logs” on page 14.
v Check the information LED panel System Error LED; if on, see
“Operator information panel” on page 11.
v Check the System Error Log. If an error was recorded by the system, see
“Error logs” on page 14.
v Start the Diagnostic Programs. See “Starting the diagnostic programs” on
page 16.
v Check for the following responses:
1. One beep.
2. Readable instructions or the Main Menu.
003 DID YOU RECEIVE BOTH OF THE CORRECT RESPONSES?
YES. Run the Diagnostic Programs. If necessary, refer to “Starting the
diagnostic programs” on page 16.
NO. Find the failure symptom in “Symptom-to-FRU index” on page 107.
2 Hardware Maintenance Manual: xSeries 232, Type 8668
General information
The IBM (SMP) server. It is ideally suited for networking environments that require superior microprocessor performance, efficient memory management, flexibility, and reliable data storage.
The xSeries 232 contains several IBM X-Architecture increase server performance and reliability. For more information about X-Architecture features, refer to “What the IBM xSeries 232 offers” on page 5. You can obtain more information about the IBM X-Architecture technologies and features at http://www.pc.ibm.com/us/eserver/xseries/xarchitecture/index.html.
If you have access to the World Wide Web, you can obtain up-to-date information about the xSeries 232 model and other IBM server products at the following World Wide Web address:
http://www.pc.ibm.com/eserver/xseries/
The information label containing the serial number, machine type, model number, and agency marks for your server is located as follows:
Tower model On the bottom of the server, on the rear of the server, and on the
Rack model On the side of the server, on the rear of the server, and on the front
xSeries 232 is a high-performance, symmetric multiprocessing
technologies, which help
front of the server below the bezel
of the server below the bezel
Features and specifications
The following table provides a summary of the features and specifications for the xSeries 232 server.
© Copyright IBM Corp. 2000, 2001 3
Table 1. Features and specifications
Microprocessor:
®
v Intel
Pentium®III
v 256 or 512 KB Level-2 cache v Supports up to two
microprocessors
Memory:
v Maximum: 4 GB v Type: ECC, SDRAM, PC133,
registered DIMMs
v Slots: Four (two-way
interleaved)
Drives standard:
v Diskette: 1.44 MB v CD-ROM: 48X IDE
Expansion bays:
v Hot-swap: Six slim high v Non-hot-swap: Three 5.25-inch
(one used by CD-ROM drive)
v You can install a drive-bay
expansion kit to convert two of the 5.25-inch bays so that they support three hot-swap drives
PCI expansion slots:
v One 33 MHz/32-bit v Two 33 MHz/64-bit v Two 66 MHz/64-bit
Power supplies:
Depending on model: one 385-watt nonredundant power supply; or two 250-watt (115-230 V ac) power supplies with a maximum of three 250-watt units for power-supply redundancy
Acoustical noise emissions:
v Sound power, idling: 6.6 bel maximum v Sound power, operating: 6.8 bel
maximum
v Sound pressure, operating: 53 dBa
maximum
Video:
v S3 video controller v Compatible with SVGA and VGA v 8 MB video memory
Size (rack model 5U)
v Height: 217.2 mm (8.6 in.) v Depth: 688 mm (27.1 in.) v Width: 427.8 mm (16.8 in.) v Weight: approximately 35.4 kg (78 lb)
when fully configured
Size (tower model)
v Height: 439.8 mm (17.3 in.) v Depth: 700 mm (27.6 in.) v Width: 217.2 mm (8.6 in.) v Weight: approximately 37.64 kg (83 lb)
when fully configured
Integrated functions:
v Ultra160 SCSI controller v One 10BASE-T/100BASE-TX/100BASE-
FX, Intel Ethernet controller with alert on LAN
and Wake on LAN
®
support
v Two serial ports v Two Universal Serial Bus (USB) ports v Keyboard port v Mouse port v Video port v Integrated system management
processor (ISMP) with two ISM (RS-485) RJ-45 connectors
Environment:
v Air temperature:
– Server on: 10° to 35°C (50.0° to
95.0°F). Altitude: 0 to 914 m (2998.7 ft)
– Server on: 10° to 32°C (50.0° to
89.6°F). Altitude: 914 m (2998.7 ft) to 2133 m (6998.0 ft)
– Server off: 10° to 43°C (50.0° to
109.4°F). Maximum altitude: 2133 m (6998.0 ft)
v Humidity:
– Server on: 8% to 80% – Server off: 8% to 80%
Heat output:
Approximate heat output in British thermal units (Btu) per hour v Minimum configuration: 683 Btu
(200 watts)
v Maximum configuration: 1877 Btu
(550 watts)
Electrical input:
v Sine-wave input (50-60 Hz)
required
v Input voltage low range:
– Minimum: 100 V ac – Maximum: 127 V ac
v Input voltage high range:
– Minimum: 200 V ac – Maximum: 240 V ac
v Input kilovolt-amperes (kVA)
approximately: – Minimum: 0.08 kVA – Maximum: 0.52 kVA
4 Hardware Maintenance Manual: xSeries 232, Type 8668
Notices and statements in this book
The caution and danger statements used in this book also appear in the multilingual Safety Information book provided on the IBM Documentation CD. Each caution and danger statement is numbered for easy reference to the corresponding statements in the safety book.
The following types of notices and statements are used in this book:
v Note: These notices provide important tips, guidance, or advice. v Important: These notices provide information or advice that might help you
avoid inconvenient or problem situations.
v Attention: These notices indicate possible damage to programs, devices, or data.
An attention notice is placed just before the instruction or situation in which damage could occur.
v Caution: These statements indicate situations that can be potentially hazardous
to you. A caution statement is placed just before the description of a potentially hazardous procedure step or situation.
v Danger: These statements indicate situations that can be potentially lethal or
extremely hazardous to you. A danger statement is placed just before the description of a potentially lethal or extremely hazardous procedure step or situation.
What the IBM xSeries 232 offers
The unique design of the server takes advantage of advancements in symmetric multiprocessing (SMP), data storage, and memory management. The server combines:
v IBM X-Architecture technology
IBM X-Architecture leverages proven innovative IBM technologies to build the most powerful, and reliable Intel processor-based servers in the world.
v Impressive performance using an innovative approach to SMP
The server supports up to two Pentium III microprocessors. The server comes with one microprocessor installed; you can install an additional microprocessor to enhance performance and provide SMP capability.
v Large data-storage and hot-swap capabilities
All models of the server support up to six hot-swap hard disk drives. This hot-swap feature enables you to remove and replace hard disk drives without turning off the server.
The addition of an optional digital linear tape drive (DLT) enables quick backup of large amounts of data.
v IBM integrated system management processor (ISMP)
The integrated system management processor provides environmental monitoring for your server. When environmental conditions exceed thresholds or when system components fail, the ISMP lights the Light Path Diagnostics to indicate the location of the problem. Critical errors, as well as environmental and configuration information, are also included in the error log.
If an optional Remote Supervisor Adapter is installed on the Advanced System Management (ASM) interconnect network, you can view the system health; update the ISMP code; power on, power off, and restart the server; view the error log; view the vital product data; and send alerts over the ASM interconnect network.
v Redundant power capabilities
LEDs
General information 5
Depending on model, the server comes with either one 385-watt nonredundant power supply, or two 250-watt power-supply units. The two power-supply units provide redundant power for many server configurations. For power loads above 250 watts, a third optional power supply can be installed to provide a full 500 watts of power.
The NON LED on the system board is lit when the power load is 250 watts or greater with two power supplies, or when the power load is 500 watts or greater with three power supplies.
v Large system memory
The memory bus in your server supports up to 4 GB (GB equals approximately 1 000 000 000 bytes) of two-way interleaved system memory. The memory controller provides error correcting code (ECC) support for up to four industry-standard PC133, 3.3 V, 168-pin, 8-byte, registered, synchronous­dynamic-random access memory (SDRAM) dual inline memory modules (DIMMs).
v Integrated network environment support
The server comes with an Ethernet controller on the system board. This Ethernet controller has an interface for connecting to 10-Mbps or 100-Mbps networks. The server automatically selects between 10BASE-T and 100BASE-TX. The controller provides full-duplex (FDX) capability, which enables simultaneous transmission and reception of data on the Ethernet local area network (LAN).
v Redundant network interface card
The addition of an optional, redundant network interface card (NIC) provides a failover capability to a redundant Ethernet connection. If a problem occurs with the primary Ethernet connection, all Ethernet traffic associated with this primary connection is automatically switched to the redundant NIC. This switching occurs without data loss and without user intervention.
v Optional PCI adapters
The server uses peripheral component interconnect (PCI) bus architecture to provide compatibility with a wide range of existing hardware devices and software applications. The server supports up to five PCI adapters in the expansion slots.
Reliability, availability, and serviceability features
Three of the most important features in server design are reliability, availability, and serviceability (RAS). These factors help to ensure the integrity of the data stored on your server; that your server is available when you want to use it; and that should a failure occur, you can diagnose and repair the failure with minimal inconvenience.
The following is an abbreviated list of the RAS features that your server supports: v Menu-driven setup, system configuration, RAID configuration, and diagnostic
programs
v Power-on self-test (POST) v ROM-resident diagnostics v Integrated system management processor (ISMP) v Predictive failure alerts v Remote system problem-determination support v Power and temperature monitoring v Microprocessor built-in self-test (BIST) v Internal error signal monitoring v Configuration checking
6 Hardware Maintenance Manual: xSeries 232, Type 8668
v CPU/VRM failure identification through Light Path Diagnostics technology and
alerting
v Diagnostic support of ServeRAID v Hot-swap drive bays v Error codes and messages and system error logging v Upgradable BIOS, diagnostics, and ISMP code v Automatic restart after a power failure v Parity checking on the SCSI bus v Error checking and correcting (ECC) memory v Redundant hot-swap power-supply option v Redundant hot-swap cooling v Redundant Ethernet capabilities (with optional adapter) v Vital product data (VPD) on processor complex, system board, power backplane,
SCSI backplane, and each power supply
v Operator information panel and group of diagnostic LEDs on the system board v Remind button to temporarily turn off LEDs for nonvital alerts
Server controls and indicators
The following illustrations show the controls and indicators on the front of the server.
adapters and Ethernet adapters
Operator information panel
Tower model
Power-control
button shield
(if installed)
Power-control
button
Reset
button
Cover-release
latch
Serial
number
Hard disk drive activity light (green)
Hard disk drive status light (amber)
General information 7
Rack model
Operator information panel
Power-control
button shield
(if installed)
Power-control
button
Reset button
Hard disk drive status light (amber)
Cover-release latch
Hard disk drive activity light (green)
Serial number
Power-control button shield: You can install this circular disk over the power-control button to prevent accidental manual power-off. This disk is provided with your server.
Power-control button: Press this button to manually shut down and turn off the server.
Reset button: Press this button to reset the server and run the power-on self-test (POST). You might need to use a pen or the end of a straightened paper clip to press the button.
Operator information panel: The lights on this panel give status information for your server. See “Operator information panel” on page 11.
Cover release latch: Slide this lever to release the cover.
Serial number: This number uniquely identifies your server.
Hard disk drive status light: Each of the hot-swap drives has a hard disk drive
status light. When this amber light is on continuously, the drive has failed.
If a ServeRAID adapter is installed and this amber light flashes slowly (one flash per second), the drive is being rebuilt. When the light flashes rapidly (three flashes per second), the controller is identifying the drive.
Hard disk drive activity light: Each of the hot-swap drives has a hard disk drive light. When this green light is flashing, the controller is accessing the drive.
8 Hardware Maintenance Manual: xSeries 232, Type 8668
Rear view (tower shown)
DC power LED (green)
DC
GOODACGOOD
AC power LED (green)
Two 250-watt configuration 385-watt configuration
DC power LED (green)
AC power LED (green)
AC power light: This green light provides status information about the power supply. During typical operation, both the ac and dc power lights are on.
DC power light: This green light provides status information about the power supply. During typical operation, both the ac and dc Power lights are on.
Turning on the server
After you plug the power cord of your server into the power supply and an electrical outlet, the server can be started in any of the following ways:
v You can press the power-control button on the front of the server to start the
server.
v If the server is turned on, a power failure occurs, and unattended-start mode is
enabled in the Configuration/Setup Utility program, the server will start automatically when power is restored.
v If ac power is present, the server is off, and the wake-up feature is enabled in
the Configuration/Setup Utility program, the wake-up feature will turn on the server at the set time.
v If ac power is present, the system is off, and the Wake on LAN feature is
enabled, the system can be turned on by a network wake-up frame from the onboard Intel Ethernet controller.
v If ac power is present, the system is off, and the system is connected to an ASM
interconnect network, which contains at least one system with an optional Remote Supervisor Adapter installed, the system can be turned on from the Remote Supervisor Adapter user interface.
v If ac power is present, the system is off, and an optional Remote Supervisor
Adapter is installed in the system, the system can be turned on from the Remote Supervisor Adapter user interface.
General information 9
Turning off the server
Turning off the server refers to the act of disconnecting the server from the power source.
Statement 5:
CAUTION:
The power-control button on the device and the power switch on the power supply do not turn off the electrical current supplied to the device. The device also might have more than one power cord. To remove all electrical current from the device, ensure that all power cords are disconnected from the power source.
2
1
You can turn off the server in any of the following ways: v You can press the power-control button on the front of the server. This starts an
orderly shutdown of the operating system, if this feature is supported by your operating system.
Note: After performing an orderly shutdown, wait at least 5 seconds before you
press the power-control button to turn on the server again.
v You might need to press and hold the power-control button for more than 4
seconds to cause an immediate shutdown of the server and to force the power off. You can use this feature if the operating system stops functioning.
v If the system was turned on by the wake-up feature or Wake on LAN feature,
you can turn it off by either a software runtime utility or by the fail-safe, power-down counter.
v You can disconnect the server power cords from the electrical outlets to shut off
all power to the server.
Note: After disconnecting the power cords, wait approximately 15 seconds for
your system to stop running. Watch for the power-on light to stop blinking.
v If the system is connected to an ASM interconnect network which contains at
least one system with a Remote Supervisor Adapter installed, the system can be turned off from the Remote Supervisor Adapter user interface.
v If a Remote Supervisor Adapter is installed in the system, the system can be
turned off from the Remote Supervisor Adapter user interface.
10 Hardware Maintenance Manual: xSeries 232, Type 8668
Operator information panel
The following illustration shows the location of the lights on the operator information panel on the front of the server (see “Server controls and indicators”
Power-on light
POST complete light
OK
TX
LINK
100
OK
MB
RX
SCSI hard disk drive in-use light
Information light
on page 7).
Ethernet speed light
Ethernet link status light
Ethernet transmit/ receive activity light
System-error light
Power-on light: This green LED lights when system power is present in the server. When this light flashes, the server is in Standby mode (the system power supply is turned off and current is present). If this light is not on, the power cord is not connected, the power supply has failed, or this LED has failed.
POST complete light: This green LED lights when the server completes the power-on self-test (POST).
SCSI hard disk drive in-use light: This green LED lights when there is activity on a hard disk drive.
Information light: This amber LED lights when the information log contains information about certain conditions in your server that might affect performance. For example, the light will be on if your server has multiple power supplies and does not have redundant power. An LED on the diagnostic panel on the system board will also be on.
System-error light: This amber LED lights when a system error occurs. An LED on the diagnostic panel on the system board may also be on to further isolate the error. See “Identifying problems using status LEDs” on page 18.
Ethernet speed light: This green LED lights when the Ethernet LAN speed is 100 Mbps during typical operation, when the system is powered off the LED remains lit.
Ethernet link status light: This green LED lights when there is an active connection on the Ethernet port during typical operation, when the system is powered off the LED remains lit.
Ethernet transmit/receive activity light: This green LED lights when there is transmit or receive activity to or from the server.
General information 11
12 Hardware Maintenance Manual: xSeries 232, Type 8668
Diagnostics
This section provides basic troubleshooting information to help you resolve some common problems that might occur with the server.
If you cannot locate and correct the problem using the information in this section, refer to “Symptom-to-FRU index” on page 107 for more information.
Diagnostic tools overview
The following tools are available to help you identify and resolve hardware-related problems:
v POST beep codes, error messages, and error logs
The power-on self-test (POST) generates beep codes and messages to indicate successful test completion or the detection of a problem. See “POST” for more information.
v Light Path Diagnostics
The server has light-emitting diodes (LEDs) to help you identify problems with server components. These LEDs are part of the Light Path Diagnostics feature that is built into the server. By following the path of lights, you can quickly identify the type of system error. See “Identifying problems using status LEDs”
on page 18 for more information.
v Diagnostic programs and error messages
The server diagnostic programs are stored in upgradable read-only memory (ROM) on the system board. These programs are the primary method of testing the major components of the server. See “Diagnostic programs and error messages” on page 15 for more information.
POST
When you turn on the server, it performs a series of tests to check the operation of server components and some of the options installed in the server. This series of tests is called the power-on self-test, or POST.
If POST finishes without detecting any problems, a single beep sounds, the first screen of your operating system or application program appears, and the System POST Complete (OK) light is illuminated on the operator information panel.
If POST detects a problem, more than one beep sounds and an error message appears on your screen. See “Beep Symptoms” on page 107 and “POST error codes” on page 117 for more information.
Notes:
1. If you have a power-on password or administrator password set, you must type the password and press Enter, when prompted, before POST will continue.
2. A single problem might cause several error messages. When this occurs, work to correct the cause of the first error message. After you correct the cause of the first error message, the other error messages usually will not occur the next time you run the test.
© Copyright IBM Corp. 2000, 2001 13
Error logs
The POST error log contains the three most recent error codes and messages that the system generated during POST. The system error log contains all messages issued during POST and system status messages from the integrated system management processor.
Note: It is possible during power-on that some system error log entries may
You can view the contents of the error logs from the Configuration/Setup Utility program or from the diagnostic programs.
Log viewing from Configuration/Setup
To view the contents of the error log from the Configuration/Setup Utility, start the Configuration/Setup Utility program; then, select Error Logs from the main menu. See “Starting the Configuration/Setup Utility program” on page 23 for more information.
Log viewing from diagnostic programs
To view the contents of the error log from the diagnostic programs, start the diagnostic programs; select Hardware Info from the top of the diagnostic programs screen; select System Error Log from the list that appears; then, follow the instructions on the screen. See “Starting the diagnostic programs” on page 16 for more information.
contain a date of 01/01/1990 with a time of 00:00:00. These entries occur prior to the ISMP clock being initialized and may be ignored.
SCSI messages (some models)
If you receive a SCSI error message while using the SCSISelect Utility, use the following list to determine the possible cause of the error and what action to take.
Note: If your system does not have a hard disk drive, ignore any message that
indicates that the BIOS code is not installed.
One or more of the following might be causing the problem.
v A failing SCSI device (adapter or drive) v An improper SCSI configuration v Duplicate SCSI IDs in the same SCSI chain v An improperly installed SCSI terminator v A defective SCSI terminator v An improperly installed cable v A defective cable
To solve the problem, verify that: v The external SCSI devices are turned on. External SCSI devices must be turned
on before the server.
v The cables for all external SCSI devices are connected correctly. v The last device in each SCSI chain is terminated properly. v The SCSI devices are configured correctly.
If the above items are correct, run the diagnostic programs to obtain additional information about the failing device. If the error remains or recurs, call for service.
14 Hardware Maintenance Manual: xSeries 232, Type 8668
Diagnostic programs and error messages
The server diagnostic programs are stored in upgradable read-only memory (ROM) on the system board. These programs are the primary method of testing the major components of the server.
Diagnostic error messages indicate that a problem exists; they are not intended to be used to identify a failing part.
Sometimes the first error to occur causes additional errors. In this case, the server displays more than one error message. Always follow the suggested action instructions for the first error message that appears.
The following sections contain the error codes that might appear in the detailed test log and summary log when running the diagnostic programs.
The error code format is as follows:
fff-ttt-iii-date-cc-text message
where: fff is the three-digit function code that indicates the function being tested
when the error occurred. For example, function code 089 is for the microprocessor.
ttt is the three-digit failure code that indicates the exact test failure that was
encountered.
iii is the three-digit device ID. date is the date that the diagnostic test was run and the error recorded. cc is the check value that is used to verify the validity of the information. text message
is the diagnostic message that indicates the reason for the problem.
Text messages
The diagnostic text message format is as follows:
Function Name: Result (test specific string)
where:
Function Name
is the name of the function being tested when the error occurred. This corresponds to the function code (fff) given in the previous list.
Result can be one of the following:
Passed
Failed This result occurs when the diagnostic test discovers an error. User Aborted
This result occurs when the diagnostic test completes without any errors.
This result occurs when you stop the diagnostic test before it is complete.
Diagnostics 15
Not Applicable
This result occurs when you specify a diagnostic test for a device that is not present.
Aborted
This result occurs when the test could not proceed because of the system configuration.
Warning
This result occurs when a possible problem is reported during the diagnostic test, such as when a device that is to be tested is not installed.
Test Specific String
This is additional information that you can use to analyze the problem.
Starting the diagnostic programs
You can press F1 while running the diagnostic programs to obtain help information. You also can press F1 from within a help screen to obtain online documentation from which you can select different categories. To exit from the help information and return to where you left off, press Esc.
To start the diagnostic programs:
1. Turn on the server and watch the screen.
2. When the message F2 for Diagnostics appears, press F2.
3. Type the appropriate password; then, press Enter.
4. Select either Extended or Basic from the top of the screen.
5. When the Diagnostic Programs screen appears, select the test you want to run
from the list that appears; then, follow the instructions on the screen.
Notes:
a. If the server stops during testing and you cannot continue, restart the server
and try running the diagnostic programs again. If the problem remains, call for service.
b. The keyboard and mouse (pointing device) tests assume that a keyboard
and mouse are attached to the server.
c. If you run the diagnostic programs with no mouse attached to your server,
you will not be able to navigate between test categories using the Next Cat and Prev Cat buttons. All other functions provided by mouse-selectable buttons are also available using the function keys.
d. You can test the USB keyboard by using the regular keyboard test. The
regular mouse test can test a USB mouse. Also, you can run the USB hub test only if there are no USB devices attached.
e. You can view server configuration information (such as system
configuration, memory contents, interrupt request (IRQ) use, direct memory access (DMA) use, device drivers, and so on) by selecting Hardware Info from the top of the screen.
If the diagnostic problems do not detect any hardware errors but the problem remains during typical server operations, a software error might be the cause. If you suspect a software problem, refer to the information that comes with the software package.
16 Hardware Maintenance Manual: xSeries 232, Type 8668
Viewing the test log
When the tests have completed, you can view the test log by selecting Utility from the top of the screen and then selecting View Test Log.
Notes:
1. You can view the test log only while you are in the diagnostic programs. When you exit the diagnostic programs, the test log is cleared (saved test logs are not affected). To save the test log so that you can view it later, click Save Log on the diagnostic programs screen and specify a location and name for the saved log file.
2. To save the test log to a diskette, you must use a diskette that you have formatted yourself; this function does not work with preformatted diskettes. If the diskette has sufficient space for the test log, the diskette may contain other data.
Recovering BIOS code
If the BIOS code has become damaged, such as from a power failure during a flash update, you can recover the BIOS using the flash boot block recovery jumper and a BIOS flash diskette.
Note: You can obtain a BIOS flash diskette from one of the following sources:
v Use the ServerGuide program to make a BIOS flash diskette. v Download a BIOS flash diskette from the World Wide Web. Go to
http://www.ibm.com/pc/support/, click IBM Server Support, and make the selections for your server.
The flash memory of your server contains a protected area that cannot be overwritten. The recovery boot block is a section of code in this protected area that enables the server to start up and to read a flash diskette. The flash utility recovers the system BIOS code from the BIOS recovery files on the diskette.
Diagnostics 17
The following illustration shows the location of the flash boot block recovery jumper on the system board.
Pin 1
Pin 2
Pin 3
System board switch block (SW1)
Flash boot block recovery jumper (J16)
To recover the BIOS:
1. Turn off the server and peripheral devices and disconnect all external cables and power cords; then, remove the cover.
2. Locate the flash boot block recovery jumper block (J16) on the system board.
3. Move the jumper to pins 1 and 2 to enable BIOS recovery mode.
4. Insert the BIOS flash diskette into the diskette drive.
5. Restart the server.
The Recovery Boot screen appears. A progress report, Loading data from diskette xx%, is displayed. When programming is underway, a further
progress report, Programming block n of 7 yy%, is displayed. When the procedure completes, the message Recovery complete, remove the diskette and return boot block switch to the off position before rebooting is displayed.
6. Remove the flash diskette from the diskette drive.
7. Turn off the server.
8. Move the jumper on the boot-block jumper block to pins 2 and 3 to return to
normal startup mode.
9. Restart the server.
Identifying problems using status LEDs
If the System Error light in the operator information panel on the front of the server is on, one or more LEDs inside the server may be on. Use the Light Path Diagnostics feature to identify the type of error that occurred. See “Diagnostic tools overview” on page 13.
18 Hardware Maintenance Manual: xSeries 232, Type 8668
You can use the Light Path Diagnostics feature built into the server to quickly identify the type of system error that occurred. The server is designed so that any LEDs that are illuminated remain illuminated when the server shuts down as long as the ac power source is good and the power supplies can supply +5 V dc current to the server. This feature helps you isolate the problem if an error causes the server to shut down. See “Diagnostic tools overview” on page 13.
Power supply LEDs
The ac and dc power LEDs on the power supply provide status information about the power supply.
DC power LED (green)
AC power LED (green)
For information about interpreting these lights, see “Power supply LED errors” on page 116.
Light Path Diagnostics panel
The following illustration shows the LEDs on the diagnostic panel on the system board. See “Diagnostic Panel LED” on page 109 for information on identifying problems using these LEDs.
DC
GOODACGOOD
DC power LED (green)
AC power LED (green)
MEM
CPU PCI A PCI B PCI C
VRM
DASD
SP
PS1 PS2 PS3
NON
OVER
NMI
TEMP
FAN
REMIND
Diagnostics 19
Replacing the battery
When replacing the battery, you must replace it with a lithium battery of the same type from the same manufacturer. To avoid possible danger, read and follow the safety statement below.
To order replacement batteries, call 1-800-772-2227 within the United States, and 1-800-465-7999 or 1-800-465-6666 within Canada. Outside the U.S. and Canada, call your IBM
Note: After you replace the battery, you must reconfigure the system and reset the
system date and time.
Statement 2
CAUTION: When replacing the lithium battery, use only IBM Part Number 33F8354 or an equivalent type battery recommended by the manufacturer. If your system has a module containing a lithium battery, replace it only with the same module type made by the same manufacturer. The battery contains lithium and can explode if not properly used, handled, or disposed of.
Do not:
v Throw or immerse into water. v Heat to more than 100 C (212 F) v Repair or disassemble
®
reseller or IBM marketing representative.
Dispose of the battery as required by local ordinances or regulations.
To replace the battery:
1. Read “Safety information” on page 133 and follow any special handling and installation instructions supplied with the replacement battery.
2. Turn off the server and peripheral devices and disconnect all external cables and power cords.
3. Remove the battery:
4. a. Use one finger to slightly slide the battery out from its socket. The spring
mechanism will push the battery out toward you as you slide it from the socket.
b. Use one finger to lift the battery clip over the battery. c. Use your thumb and index finger to pull the battery from under the battery
clip.
20 Hardware Maintenance Manual: xSeries 232, Type 8668
d. Ensure that the battery clip is touching the base of the battery socket by
pressing gently on the clip.
5. Insert the new battery: a. Tilt the battery so that you can insert it into the socket, under the battery
clip.
b. As you slide it under the battery clip, press the battery down into the
socket.
6. Reinstall the server cover and connect the cables.
7. Turn on the server.
8. Start the Configuration/Setup Utility program and set configuration
parameters.
v Set the system date and time. v Set the power-on password. v Reconfigure the server.
Diagnostics 21
22 Hardware Maintenance Manual: xSeries 232, Type 8668
Loading...
+ 148 hidden pages