Fourth Edition (June 2000)
Part Number 143091-004
Spare Part Number 158549-001
Compaq Computer Corporation
Notice
The information in this publication is subject to change without notice.
COMPAQ COMPUTER CORPORATION SHALL NOT BE LIABLE FOR TECHNICAL OR
EDITORIAL ERRORS OR OMISSIONS CONTAINED HEREIN, NOR FOR INCIDENTAL OR
CONSEQUENTIAL DAMAGES RESULTING FROM THE FURNISHING, PERFORMANCE, OR USE
OF THIS MATERIAL. THIS INFORMATION IS PROVIDED “AS IS” AND COMPAQ COMPUTER
CORPORATION DISCLAIMS ANY WARRANTIES, EXPRESS, IMPLIED OR STATUTORY AND
EXPRESSLY DISCLAIMS THE IMPLIED WARRANTIES OF MERCHANTABILITY, FITNESS FOR
PARTICULAR PURPOSE, GOOD TITLE AND AGAINST INFRINGEMENT.
This publication contains information protected by copyright. No part of this publication may be
photocopied or reproduced in any form without prior written consent from Compaq Computer Corporation.
The software described in this guide is furnished under a license agreement or nondisclosure agreement.
The software may be used or copied only in accordance with the terms of the agreement.
Compaq, Deskpro, Fastart, Compaq Insight Manager, Systempro, Systempro/LT, ProLiant, ROMPaq,
QVision, SmartStart, NetFlex, QuickFind, PaqFax, ProSignia, registered United States Patent and
Trademark Office.
Neoserver, Netelligent, Systempro/XL, SoftPaq, QuickBlank, QuickLock are trademarks and/or service
marks of Compaq Computer Corporation.
Microsoft, MS-DOS, Windows, and Windows NT are registered trademarks of Microsoft Corporation.
Pentium is a registered trademark and Xeon is a trademark of Intel Corporation.
Other product names mentioned herein may be trademarks and/or registered trademarks of their respective
companies.
Compaq ProLiant ML370 Maintenance and Service Guide
Maintenance and Service Guide
Fourth Edition (June 2000)
Part Number 143091-004
Spare Part Number 158549-001
About This Guide
Symbols in Text.........................................................................................................vii
This maintenance and service guide is a troubleshooting guide that can be used for reference
when servicing Compaq ProLiant ML370 Servers.
IMPORTANT: The installation of options and servicing of this product shall be performed by individuals
who are knowledgeable of the procedures, precautions, and hazards associated with equipment
containing hazardous energy circuits.
Symbols in Text
These symbols may be found in the text of this guide. They have the following meanings.
About This Guide
WARNING: To reduce the risk of personal injury from electric shock and hazardous energy
levels, only authorized service technicians should attempt to repair this equipment. Improper
repairs could create conditions that are hazardous.
WARNING: Text set off in this manner indicates that failure to follow directions in the warning
could result in bodily harm or loss of life.
CAUTION: Text set off in this manner indicates that failure to follow directions could result in
damage to equipment or loss of information.
IMPORTANT: Text set off in this manner presents clarifying information or specific instructions.
NOTE: Text set off in this manner presents commentary, sidelights, or interesting points of information.
viii Compaq ProLiant ML370 Maintenance and Service Guide
Compaq Technician Notes
WARNING: Only authorized technicians trained by Compaq should attempt to repair this
equipment. All troubleshooting and repair procedures are detailed to allow only
subassembly/module-level repair. Because of the complexity of the individual boards and
subassemblies, no one should attempt to make repairs at the component level or to make
modifications to any printed wiring board. Improper repairs can create a safety hazard.
WARNING: To reduce the risk of personal injury from electric shock and hazardous energy
levels, do not exceed the level of repair specified in these procedures. Because of the
complexity of the individual boards and subassemblies, do not attempt to make repairs at the
component level or to make modifications to any printed wiring board. Improper repairs could
create conditions that are hazardous.
WARNING: To reduce the risk of electric shock or damage to the equipment:
■ If the system has multiple power supplies, disconnect power from the system by
unplugging all power cords from the power supplies.
■ Do not disable the power cord grounding plug. The grounding plug is an important safety
feature.
■ Plug the power cord into a grounded (earthed) electrical outlet that is easily accessible at
all times.
CAUTION: To properly ventilate your system, you must provide at least 12 inches (30.5 cm) of
clearance at the front and back of the computer.
CAUTION: The computer is designed to be electrically grounded. To ensure proper operation,
plug the AC power cord into a properly grounded AC outlet only.
IMPORTANT: Any indications of component replacement or printed wiring board modifications may void
any warranty.
Where to Go for Additional Help
In addition to this guide, the following information sources are available:
■ User documentation
■ Online information at the Compaq website
http://www.compaq.com/support/servers/index.html
■ Compaq Service Quick Reference Guide
■ Service training guides
■ Compaq service advisories and bulletins
■ Compaq QuickFind
■ Compaq Insight Manager
■ Compaq download facility: Call 1-281-518-1418 using your modem.
Integrated Management Display
Some Compaq server models include a Compaq Integrated Management Display (IMD), an
integrated, 16 × 4 character display mounted on the front of the server. This display provides
easy-to-use menu-driven access to server information, including model number, LCD firmware
revision, and POST operations.
Telephone Numbers
For the name of your nearest Compaq authorized reseller:
■ In the United States, call 1-800-345-1518.
■ In Canada, call 1-800-263-5868.
For Compaq technical support:
■ In the United States and Canada, call 1-800-386-2172.
■ For Compaq technical support phone numbers outside the United States and Canada, visit
the Compaq website at:
http://www.compaq.com
About This Guide ix
Chapter
1
Illustrated Parts Catalog
This chapter provides the illustrated parts breakdown and a spare parts list for
Compaq ProLiant™ ML370 Servers. See Table 1-1 and Table 1-2 for the names of referenced
spare parts.
1-2 Compaq ProLiant ML370 Maintenance and Service Guide
Mechanical Parts Exploded View
2
3
1
6
5
4
7
Figure 1- 1. Mechanical parts expl oded view
8
Mechanical Spare Parts List
ItemDescriptionSpare Part Number
Chassis
1Chassis157986-001
2Small access panelPart of kit 387762-001
3Large access panelPart of kit 387762-001
4Right side panel (tower model only)Part of kit 387762-001
5Front bezel (tower model only)217492-001
6Rack front plate (rack-mountable model only)163489-001
7Feet (tower model only)333575-001
8Hot-plug drive cage, 6 x 1-inch387089-001
91-Ghz processor spare221536-001
Illustrated Parts Catalog 1-3
Table 1-1
Mechanical Spare Parts List
1-4 Compaq ProLiant ML370 Maintenance and Service Guide
System Components Exploded View
12
14a
17
15
13
19
1
16
10
20
11
9
Figure 1-2. System components exploded view
27a
18
21a
22
8
26
System Components Spare Parts
System Components Spare Parts List
ItemDescriptionSpare Part Number
System Components
9Power supply, 325 W402151-001
10I/O fan, 92 mm173907-001
11CPU fan, 120 mm326873-001
123V lithium battery179322-001
13Power switch with LED indicators and cable157928-001
Boards
14Processors
a) 6/600-MHz, with heat sink (heat sink may vary)166146-001
b) 6/667-MHz, with heat sink (heat sink may vary)*166109-001
c) 6/733-MHz, with heat sink *166108-001
d) 6/800-MHz, with heat sink *187781-001
e) 6/866-MHz, with heat sink *187782-001
f) 6/933-MHz, with heat sink *196628-001
Illustrated Parts Catalog 1-5
Table 1-2
15System board157824-001
16Processor Power Module157825-001
17Riser board, with brace157925-001
18SCSI backplane board163903-001
2218-GB, 1-inch, 10K rpm, Wide Ultra2 hard drive143920-001
Cables
23Miscellaneous data cable kit *386559-001
* Not shown
327659-001
179963-001
continued
1-6 Compaq ProLiant ML370 Maintenance and Service Guide
Table 1-2
System Components Sp ar e Part s List
ItemDescriptionSpare Part Number
Cables
continued
24Miscellaneous signal cable kit *158530-001
a) Hard drive/CD-ROM drive data cable
b) Diskette drive cable assembly
c) 1-device SCSI cable assembly
d) 3-device SCSI cable assembly
25Miscellaneous power cable kit *158529-001
a) Diskette and CD-ROM drive power cable assembly
b) Power supply power cable assembly
c) Removable media bay power cable assembly
d) Signal power cable assembly
Miscellaneous
26Hard drive blank122759-001
continued
27Miscellaneous plastics kit140479-001
a) Removable media drive bay blank
28Country kit *158531-001
29Maintenance and service guide *158549-001
30Illustrated parts map *158550-001
31Miscellaneous hardware kit *281847-001
a) Slide rail, 22-inch, pair (rackmount model only)
32Rack conversion kit *163490-001
33Keyboard *386209-001
34Baffle *163364-001
44AC Power supply, 225-W, hot-plug redundant *283623-001
45DC Power supply, 225-W, hot-plug redundant *333593-001
* Not shown
continued
Chapter 2
Removal and Replacement Procedures
This chapter provides subassembly/module-level removal and replacement procedures for
Compaq ProLiant ML370 Servers. The ProLiant ML370 Server is available in tower and rack
models. The Compaq ProLiant ML370Maintenance and Service Guide shows the tower model.
In most cases, the rack model removal and replacement procedures are slightly different from
the tower model procedures. After completing all necessary removal and replacement
procedures, run the diagnostics program to verify that all components operate properly.
To service Compaq ProLiant ML370 Servers, you might need the following:
■ Torx T-15 screwdriver
■ From the Compaq SmartStart and Support Software CD:
G System Configuration Utility
G Drive Array Advanced Diagnostics
G Diagnostics
2-2 Compaq ProLiant ML370 Maintenance and Service Guide
Electrostatic Discharge Information
A discharge of static electricity can damage static-sensitive devices or microcircuitry. Proper
packaging and grounding techniques are necessary precautions to prevent damage. To prevent
electrostatic damage, observe the following precautions:
■ Transport products in static-safe containers such as conductive tubes, bags, or boxes.
■ Keep electrostatic-sensitive parts in their containers until they arrive at static-free stations.
■ Cover work stations with approved static-dissipating material. Provide a wrist strap
connected to the work surface and properly grounded tools and equipment.
■ Keep work area free of nonconductive materials such as ordinary plastic assembly aids
and foam packing.
■ Make sure you are always properly grounded when touching a static-sensitive component
or assembly.
■ Avoid touching pins, leads, or circuitry.
■ Always place drives PCB assembly-side down.
■ Use conductive field service tools.
Symbols in Equipment
CLASS 1 LASER PRODUCT
WARNING: Any surface or area of the equipment marked with these symbols
indicates the presence of a hot surface or hot component. If this surface is
contacted, the potential for injury exists. To reduce the risk of injury from a hot
component, allow the surface to cool before touching it.
WARNING: Any surface or area of the equipment marked with these symbols
indicates the presence of electric shock hazards. The enclosed area contains no
operator serviceable parts. To reduce the risk of injury from electric shock
hazards, do not open this enclosure.
WARNING: Any RJ-45 receptacle marked with these symbols indicates a Network
Interface Connection. To reduce the risk of electric shock, fire, or damage to the
equipment, do not plug telephone or telecommunications connectors into this
receptacle.
WARNING: This label or equivalent is located on the surface of your CD-ROM
drive. This label indicates that the product is classified as a CLASS 1 LASER
PRODUCT.
Power supplies or systems marked with these symbols indicate the equipment is
supplied by multiple sources of power.
WARNING: To reduce the risk of injury from electric shock, remove all power
cords to completely disconnect power from the system.
Weight in kg
Weight in lb
Preparation Procedures
Before beginning to remove any serviceable parts, determine whether the part is hot-pluggable
or non-hot-pluggable.
Hot-Pluggable Devices
Hot-pluggable devices in the ProLiant ML370 Servers include SCSI hard drives and power
supplies (with redundant power supply option kit).
IMPORTANT: It is not necessary to turn off the server to replace hot-plug devices when they are not in
active use.
Non-Hot-Pluggable Devices
Removal and Replacement Procedures 2-3
Any product or assembly marked with these symbols indicates that the
component exceeds the recommended weight for one individual to handle
safely.
If the part is non-hot-pluggable, the server must be powered down. Non-hot-pluggable parts
include the processors, all boards, DIMMs, fans, drive cages, and the standard power supply.
Powering Down the Server
System power in ProLiant ML370 Servers does not completely shut off with the front panel
Power On/Standby switch. The switch toggles between On and Standby, rather than On and Off.
The standby position removes power from most electronics and the drives, but portions of the
hot-plug power supply and some internal circuitry remain active. To completely remove all
power from the system, you must disconnect all power cords from the server.
WARNING: To reduce the risk of personal injury or damage to the equipment, ensure that only
one component is extended at a time. A rack may become unstable if more than one
component is extended for any reason.
WARNING: Because the rack allows you to stack computer components on a vertical rather
than horizontal plane, you must take precautions to provide for rack stability and safety. It is
important that you follow these precautions to provide for rack stability and safety and to
protect both personnel and property. Heed all cautions and warnings throughout the installation
instructions that came with the server.
WARNING: To reduce the risk of injury from electric shock, remove all power
cords to completely disconnect power from the system.
2-4 Compaq ProLiant ML370 Maintenance and Service Guide
WARNING: To reduce the risk of personal injury or damage to the equipment: If the server is
mounted in a rack, internal devices may not be accessible for removal and replacement. If the
server must be removed from the rack for device accessibility, remove the server from the rack
and place it on a sturdy table or workbench. Refer to the ProLiant ML370 Servers Setup andInstallation Guide for further information on working with racks.
CAUTION: The system power in the server does not completely shut off from the front Power
On/Standby switch. Moving the switch from On to Standby leaves some portions of the hot-plug
power supply and some internal circuitry active. Disconnect all power cords from the server to
remove all power from the system.
CAUTION: Electrostatic discharge can damage electronic components. Be sure you are
properly grounded before beginning any installation procedure. For more information, see the
section titled “Electrostatic Discharge Information” earlier in this chapter.
To power down the server:
1. Press the Power On/Standby switch to Standby. This places the server in standby mode,
thereby disabling the main power supply output and providing auxiliary power to the
server. Standby does not completely remove power from the system.
2. Verify that the system power LED indicator on the front panel, near the Power
On/Standby switch, is amber and that the fan noise stops.
3. Disconnect all AC power cords from the AC outlet, then from the server.
4. Disconnect all external peripheral devices from the server.
5. For stability and safety, properly positioning the server is critical. If the server is mounted
in a rack, remove the rack model server from the rack and place it on a sturdy table or
workbench. Refer to the ProLiant ML370 Servers Setup and Installation Guide for
instructions.
Rack Warnings
WARNING: To reduce the risk of personal injury, make sure that the rack is adequately
stabilized before extending a component outside the rack. A rack may become unstable if more
than one component is extended for any reason. Extend only one component at a time.
WARNING: To reduce the risk of personal injury or damage to the equipment, be sure that:
■ The leveling jacks are extended to the floor.
■ The full weight of the rack rests on the leveling jacks.
■ The stabilizers are attached to the rack if it is a single rack installation.
■ The racks are coupled together in multiple rack installations.
Server Warnings and Precautions
WARNING: To reduce the risk of personal injury from hot surfaces, allow the internal system
components to cool before touching.
WARNING: To reduce the risk of electric shock or damage to the equipment:
■ Do not disable the power cord grounding plug. The grounding plug is an important safety
feature.
■ Plug the power cord into a grounded (earthed) electrical outlet that is easily accessible at
all times.
■ Install the power supply before connecting the power cord to the power supply.
■ Unplug the power cord before removing the power supply from the server.
■ Disconnect power from the server by unplugging the power cord from either the electrical
outlet or the server. If the system has multiple power supplies, disconnect power from the
system by unplugging all power cords from the power supplies.
CAUTION: Protect the server from power fluctuations and temporary interruptions with a
regulating uninterruptible power supply (UPS). This device protects the hardware from damage
caused by power surges and voltage spikes and keeps the system in operation during a power
failure.
Removal and Replacement Procedures 2-5
CAUTION: ProLiant ML370 Serversmust always be operated with the system unit cover on.
Proper cooling will not be achieved if the system unit cover is removed.
2-6 Compaq ProLiant ML370 Maintenance and Service Guide
Front Bezel
NOTE: This procedure applies to the ProLiant ML370 tower model only.
To open the front bezel:
1. Unlock the front bezel keylock.
2. Swing the front bezel out away from the server.
To remove the front bezel, continue with the following step.
3. Lift up the front bezel, then pull it away from the chassis.
Figure 2-1. Removing the front bezel
Reverse steps 1 through 3 to replace the front bezel.
Rack Front Plate
NOTE: This procedure applies to the ProLiant ML370 rack-mountable model only.
To remove the rack front plate:
Removal and Replacement Procedures 2-7
1. Remove the ten T-15 screws from the rack front plate
2. Pull the rack front plate away from the server
2
Figure 2-2. Removing the rack front plate
Reverse steps 1 and 2 to replace the rack front plate.
.
.
1
2-8 Compaq ProLiant ML370 Maintenance and Service Guide
Feet
NOTE: This procedure applies to the ProLiant ML370 tower model only.
To remove the feet from the chassis, one at a time:
1. Perform the preparation procedures. See “Powering Down the Server” earlier in this
chapter.
2. Place the server on its right side (opposite from the large access panel).
3. Remove the T-15 screw from each foot ➊.
4. Pivot each foot down ➋; then pull it off the base of the chassis ➌.
2
3
1
Figure 2-3. Removing the feet from the chassis
Reverse steps 1 through 4 to replace the feet. Make sure each foot snaps securely into its holder.
Large Access Panel
WARNING: To reduce the risk of personal injury from hot surfaces, allow internal system
components to cool before touching them.
To remove the large access panel:
1. Perform the preparation procedures. See “Powering Down the Server” earlier in this
chapter.
2. Open the front bezel (tower model only). See “Front Bezel” earlier in this chapter.
Removal and Replacement Procedures 2-9
3. Loosen the two thumbscrews securing the large access panel to the front of the chassis
4. Slide the large access panel back about 1.5 cm (0.5 inches), then pull it away from the
chassis
Figure 2-4. Removing the large access panel
.
1
2
Reverse steps 1 through 4 to replace the large access panel.
.
2-10 Compaq ProLiant ML370 Maintenance and Service Guide
Small Access Panel
WARNING: To reduce the risk of personal injury from hot surfaces, allow internal system
components to cool before touching them.
To remove the small access panel:
1. Perform the preparation procedures. See “Powering Down the Server” earlier in this
chapter.
2. Open the front bezel (tower model only). See “Front Bezel” earlier in this chapter.
3. Loosen the thumbscrew securing the small access panel to the chassis ➊.
4. Slide the small access panel toward the rear of the unit about 1.5 cm (0.5 inches), then off
to the side
➋.
2
1
Figure 2-5. Removing the small access panel
Reverse steps 1 through 4 to replace the small access panel.
Right Side Panel
NOTE: This panel does not exist on the rack-mountable model. This procedure applies to the tower
model only.
To remove the right side panel from the chassis:
1. Perform the preparation procedures. See “Powering Down the Server” earlier in this
chapter.
2. Remove the front bezel (tower model only). See “Front Bezel” earlier in this chapter.
3. Remove the feet on the base of the right side panel. See “Feet” earlier in this chapter.
4. Remove the two Torx T-15 screws securing the right side panel to the front of the
chassis
Removal and Replacement Procedures 2-11
.
5. Pull the right side panel back, then away from the chassis
2
Figure 2-6. Removing the right side panel
Reverse steps 1 through 5 to replace the right side panel.
.
1
1
2-12 Compaq ProLiant ML370 Maintenance and Service Guide
Drives and Related Components
Compaq ProLiant ML370 Servers ship standard with a hot-plug drive cage containing
six 1.0-inch hot-plug drive bays. Four removable media bays contain one third-height diskette
drive and one half-height IDE CD-ROM drive. Two bays can contain a second CD-ROM drive,
tape drives, hard drives, or any SCSI device.
Compaq ProLiant ML370 Servers support up to ten mass storage devices including the diskette
drive and IDE CD-ROM drive. Figure 2-7 and Table 2-1 describe the drive configurations.
1
2
3
4
5
6
7
8
9
Figure 2-7. Drives and related components
5432
Drives and Related Components
ItemDescription
Drive activity LED indicator
Power On/Standby status LED indicator
Power On/Standby switch
Key lock
Diskette drive
CD-ROM drive
Removable media slot
Removable media slot
Hot-plug drive bays 0-5
10
Table 2-1
Cable Routing Diagrams
CAUTION: When routing cables, always ensure that the cables are not in a position where they
will be pinched or crimped.
IDE CD-ROM Drive Cable Diagram
1
Removal and Replacement Procedures 2-13
1
Figure 2-8. IDE CD-ROM drive cable
Diskette Drive Cable Diagram
1
Figure 2-9. Diskette drive cable
1
2-14 Compaq ProLiant ML370 Maintenance and Service Guide
SCSI Cable Diagram
1
2
Figure 2-10. SCSI cable diagram: the three-device, terminated SCSI cable connected from
internal SCSI Port 1 to the vacant bays of the removable media area, and the point-to-point SCSI
cable connected from internal SCSI Port 2 to the backplane of the hot-plug drive cage
1
2
Hot-Plug Drive Cage
To remove the hot-plug drive cage:
1. Perform the preparation procedures. See “Powering Down the Server” earlier in this
chapter.
2. Remove the front bezel (tower model only). See “Front Bezel” earlier in this chapter.
3. Remove the large access panel. See “Large Access Panel” earlier in this chapter.
4. Disconnect all cables from the hot-plug drive cage: one SCSI cable and two power
connectors.
Removal and Replacement Procedures 2-15
5. Remove the four T-15 screws securing the hot-plug drive cage to the chassis
6. Slide the hot-plug drive cage out the front of the chassis
2
Figure 2-11. Removing the hot-plug drive cage
.
1
1
Reverse steps 1 through 6 to replace the hot-plug drive cage.
.
1
1
CAUTION: Make sure that all power and signal cables to the hot-plug drive cage have been
reseated properly.
2-16 Compaq ProLiant ML370 Maintenance and Service Guide
IDE CD-ROM Drive
To remove the CD-ROM drive:
1. Perform the preparation procedures. See “Powering Down the Server” earlier in this
chapter.
2. Open the front bezel (tower model only). See “Front Bezel” earlier in this chapter.
3. Remove the large access panel. See “Large Access Panel” earlier in this chapter.
4. Disconnect all cables from the CD-ROM drive.
5. Remove the two T-15 screws and washers securing the CD-ROM drive to the chassis
6. Slide the CD-ROM drive out the front of the chassis
2
Figure 2-12. Removing the CD-ROM drive
.
1
1
Reverse steps 1 through 6 to replace the CD-ROM drive.
.
Diskette Drive
To remove the diskette drive:
1. Perform the preparation procedures. See “Powering Down the Server” earlier in this
chapter.
2. Open the front bezel (tower model only). See “Front Bezel” earlier in this chapter.
3. Remove the large access panel. See “Large Access Panel” earlier in this chapter.
4. Disconnect all cables from the diskette drive.
Removal and Replacement Procedures 2-17
5. Remove the two T-15 screws and washers securing the diskette drive to the chassis
6. Slide the diskette drive out the front of the chassis
2
Figure 2-13. Removing the diskette drive
.
1
1
Reverse steps 1 through 6 to replace the diskette drive.
.
2-18 Compaq ProLiant ML370 Maintenance and Service Guide
Hard Drive Blank
IMPORTANT: A hard drive blank must be installed in an unused hard drive slot. Failure to install a hard
drive blank could result in system errors.
To remove a hard drive blank:
1. Open the front bezel (tower model only). See “Front Bezel” earlier in this chapter.
2. Squeeze the top and bottom tabs of the blank
3. Pull the blank out of the hard drive cage
1
1
Figure 2-14. Removing a hard drive blank
.
.
2
Reverse steps 1 through 3 to replace a hard drive blank.
Hot-Plug SCSI Hard Drive
To remove a hot-plug SCSI hard drive:
1. Open the front bezel (tower model only). See “Front Bezel” earlier in this chapter.
Removal and Replacement Procedures 2-19
2. Press the release button
3. Pull the hot-plug SCSI hard drive out of the bay
Figure 2-15. Removing a hot-plug SCSI hard drive
and swing the lever outward from the bottom.
.
3
Reverse steps 1 through 3 to replace the hot-plug SCSI hard drive.
2
1
2-20 Compaq ProLiant ML370 Maintenance and Service Guide
Removable Media Drive Bay Blank
To remove a removable media drive bay blank:
1. Open the front bezel (tower model only). See “Front Bezel” earlier in this chapter.
2. Remove the four Torx T-15 screws securing the removable media drive bay blank to the
chassis
.
3. Remove the removable media drive bay blank
2
.
1
Figure 2-16. Removing a removable media drive bay blank
Reverse steps 1 through 3 to replace a removable media drive bay blank.
1
CPU Fan
Removal and Replacement Procedures 2-21
To remove the CPU fan:
1. Perform the preparation procedures. See “Powering Down the Server” earlier in this
chapter.
2. Open the front bezel (tower model only). See “Front Bezel” earlier in this chapter.
3. Remove the server large access panel. See “Large Access Panel” earlier in this chapter.
4. Unplug the CPU fan from the system board.
5. Remove the four screws
6. Remove the CPU fan
1
Figure 2-17. Removing the CPU fan
Reverse steps 1 through 6 to replace the CPU fan.
securing the fan assembly to the chassis.
.
2
2-22 Compaq ProLiant ML370 Maintenance and Service Guide
I/O Fan
To remove the I/O fan:
1. Perform the preparation procedures. See “Powering Down the Server” earlier in this
chapter.
2. Open the front bezel (tower model only). See “Front Bezel” earlier in this chapter.
3. Remove the large and small access panels. See “Large Access Panel” and “Small Access
Panel” earlier in this chapter.
4. Loosen the single thumbscrew securing the I/O fan to the chassis ➊.
5. Tilt the top of the I/O fan forward ➋.
6. Disconnect the I/O fan cable from the system board ➌.
7. Lift the I/O fan away from the chassis.
1
3
2
Figure 2-18. Removing the I/O fan
Reverse steps 1 through 7 to replace the I/O fan.
Power Switch with Cable and LED Indicators
To remove the power switch and cable assembly:
1. Perform the preparation procedures. See “Powering Down the Server” earlier in this
chapter.
2. Open the front bezel (tower model only). See “Front Bezel” earlier in this chapter.
3. Remove the small access panel. See “Small Access Panel” earlier in this chapter.
4. Remove the single T-15 screw securing the power switch to the chassis ➊.
5. Slide the power switch housing back ➋, then lift it out of the chassis ➌.
2
Removal and Replacement Procedures 2-23
Figure 2-19. Removing the power switch with LED and cable
3
1
2-24 Compaq ProLiant ML370 Maintenance and Service Guide
6. Disconnect the power switch from the system board
4
.
Figure 2-20. Unplugging the power switch from the system board
Reverse steps 1 through 6 to replace the power switch and cable assembly.
Riser Board Expansion Slots
Figure 2-21. Riser board expansion slots
Removal and Replacement Procedures 2-25
6
2
5
3
4
4
3
5
2
6
1
ItemDescription
,
, , ,
Primary PCI bus slots 1 and 2 (32-bit)
Secondary PCI bus slots 3-6 (64-bit)
Table 2-2
Riser Board Expansion Slots
2-26 Compaq ProLiant ML370 Maintenance and Service Guide
I/O Expansion Slot Cover
To remove an I/O expansion slot cover:
1. Perform the preparation procedures. See “Powering Down the Server” earlier in this
chapter.
2. Open the front bezel (tower model only). See “Front Bezel” earlier in this chapter.
3. Remove the small access panel. See “Small Access Panel” earlier in this chapter.
4. Press down on the release button at the top of the I/O expansion slot retention lever
5. Swing the I/O expansion slot retention lever upward
6. Lift the I/O expansion slot cover up and out of the server
3
Figure 2-22. Removing an I/O expansion slot cover
.
.
1
2
.
Reverse steps 1 through 6 to replace an I/O expansion slot cover.
I/O Expansion Board
To remove an I/O expansion board:
1. Perform the preparation procedures. See “Powering Down the Server” earlier in this
chapter.
2. Open the front bezel (tower model only). See “Front Bezel” earlier in this chapter.
3. Remove the small access panel. See “Small Access Panel” earlier in this chapter.
4. Remove any peripheral devices attached to the I/O expansion board.
5. Release the I/O expansion board retention lever. See “I/O Expansion Slot Cover” earlier in
this chapter.
Removal and Replacement Procedures 2-27
6. Lift the I/O expansion board up and out of the server
1
Figure 2-23. Removing an I/O expansion board
Reverse steps 1 through 6 to replace an I/O expansion board.
IMPORTANT: If only removing the board, install an expansion slot cover.
.
2-28 Compaq ProLiant ML370 Maintenance and Service Guide
Riser Board and Brace
To remove the riser board and brace:
1. Perform the preparation procedures. See “Powering Down the Server” earlier in this
chapter.
2. Open the front bezel (tower model only). See “Front Bezel” earlier in this chapter.
3. Remove any I/O expansion boards (if installed) from the riser board. See “I/O Expansion
Board” earlier in this chapter.
4. Remove the large access panel. See “Large Access Panel” earlier in this chapter.
5. Remove the two Torx T-15 screws securing the riser board and brace to the chassis
6. Slide the riser board and brace out of the server
1
2
1
Figure 2-24. Removing the riser board and brace
.
Reverse steps 1 through 6 to replace the riser board and brace. Make sure that the board is
seated securely before the screws are tightened.
.
Processor
To remove the processor:
Removal and Replacement Procedures 2-29
1. Perform the preparation procedures. See “Powering Down the Server” earlier in this
chapter.
2. Open the front bezel (tower model only). See “Front Bezel” earlier in this chapter.
3. Remove the large access panel. See “Large Access Panel” earlier in this chapter.
4. Push in the tabs at each side of the processor until you hear two clicks
5. While holding the tabs in, pull out the processor ➋.
1
1
1
2
Figure 2-25. Removing the processor
Reverse steps 1 through 5 to replace the processor.
.
Use the System Configuration Utility to reconfigure the system if a faulty processor is replaced.
See Chapter 3, “Diagnostics and Troubleshooting,” for more information.
NOTE: Individual heat sinks may vary from the one(s) shown above.
2-30 Compaq ProLiant ML370 Maintenance and Service Guide
Processor Power Module
To remove the Processor Power Module:
1. Perform the preparation procedures. See “Powering Down the Server” earlier in this
chapter.
2. Open the front bezel (tower model only). See “Front Bezel” earlier in this chapter.
3. Remove the large access panel. See “Large Access Panel” earlier in this chapter.
4. Press outward on the tabs at each end of the Processor Power Module ➊.
5. Pull the Processor Power Module out of the socket ➋.
1
2
Figure 2-26. Removing the Processor Power Module
Reverse steps 1 through 5 to replace a Processor Power Module. The clips on the Processor
Power Module will snap into a locked position automatically when the Processor Power Module
is fully seated in the socket. The module is keyed to ensure correct alignment.
Use System Configuration Utility to reconfigure the server. See Chapter 3, “Diagnostics and
Troubleshooting,” for more information.
Memory
Removal and Replacement Procedures 2-31
Compaq ProLiant ML370 Servers ship standard with a 128-MB Synchronous DRAM
(SDRAM) dual inline memory module (DIMM) installed in DIMM slot 1. Memory is
expandable to 4 GB (when 1-GB DIMMs are available).
1
2
3
4
Figure 2-27. DIMM slots on the system board
Table 2-3
SDRAM DIMM Slot Locations
ItemDescription
CAUTION: Use only Compaq DIMMs. DIMMs from other sources may adversely affect data
integrity. Power-On Self-Test (POST) will warn of nonsupported DIMMs.
IMPORTANT: The following guidelines MUST be followed when installing or replacing memory:
■ Use 133-MHz, 64-, 128-, 256-, 512-MB or 1-GB registered SDRAM DIMMs, 3.3 volts, 72 bits wide,
with Error Correction Code (ECC).
■ Any combination of the above modules can be used.
■ Install SDRAM DIMM modules one at a time in sequential order, starting with DIMM slot 1.
■ Install DIMMs of the same speed. Do not install DIMM modules supporting different speeds.
■ Install DIMMs only in one direction. Ensure that the notch on the module matches the tab on the
DIMM slot and that the module is fully inserted and properly seated.
SDRAM DIMM slot 1 (populated with standard 128-MB DIMM)
SDRAM DIMM slot 2
SDRAM DIMM slot 3
SDRAM DIMM slot 4
2-32 Compaq ProLiant ML370 Maintenance and Service Guide
To remove an SDRAM DIMM:
1. Perform the preparation procedures. See “Powering Down the Server” earlier in this
chapter.
2. Open the front bezel (tower model only). See “Front Bezel” earlier in this chapter.
3. Remove the large access panel. See “Large Access Panel” earlier in this chapter.
4. Press both SDRAM DIMM slot release levers outward ➊.
5. Pull the DIMM module up from the board ➋.
2
1
1
Figure 2-28. Removing a DIMM module from the system board
Reverse steps 1 through 5 to replace the DIMM. The DIMM can be installed in only one way.
Match the notch on the DIMM module with the tab on the memory socket. Push the DIMM
module down into the socket, making sure that the module is fully inserted and properly seated.
Power Supply
To remove the power supply:
1. Perform the preparation procedures. See “Powering Down the Server” earlier in this
chapter.
2. Open the front bezel (tower model only). See “Front Bezel” earlier in this chapter.
3. Remove the large access panel. See “Large Access Panel” earlier in this chapter.
4. Slide the hot-plug drive cage out slightly by loosening the screws attaching it to the
chassis. See “Hot-Plug Drive Cage” earlier in this chapter.
5. Disconnect the power supply cable from the system board.
Removal and Replacement Procedures 2-33
WARNING: To reduce the risk of electric shock, do not disassemble the power supply or
attempt to repair it. It should be replaced only with the specified Compaq spare part.
Figure 2-29. Disconnecting the power supply cable from the system board
2-34 Compaq ProLiant ML370 Maintenance and Service Guide
6. Disconnect all other power supply cables.
7. Remove the four Torx T-15 screws securing the power supply to the back of the
chassis
.
8. Pull the power supply slightly back, then out the side of the chassis
2
1
Figure 2-30. Removing the power supply
Reverse steps 1 through 8 to replace the power supply.
.
Hot-Plug SCSI Drive Backplane
To remove the hot-plug SCSI drive backplane:
1. Perform the preparation procedures. See “Powering Down the Server” earlier in this
chapter.
2. Open the front bezel (tower model only). See “Front Bezel” earlier in this chapter.
3. Remove the large access panel. See “Large Access Panel” earlier in this chapter.
4. Remove all hard drive blanks and hot-plug hard drives. See “Hard Drive Blank” and
“Hot-Plug SCSI Hard Drive” earlier in this chapter.
5. Remove the hot-plug drive cage. See “Hot-Plug Drive Cage” earlier in this chapter.
6. Remove the three Torx T-15 screws securing the hot-plug SCSI drive backplane to the
hot-plug SCSI drive cage
7. Slide the hot-plug SCSI drive backplane down slightly. Then pull it from the mounting
posts
.
.
Removal and Replacement Procedures 2-35
1
2
1
1
Figure 2-31. Removing the hot-plug SCSI drive backplane
Reverse steps 1 through 7 to replace the hot-plug SCSI drive backplane.
2-36 Compaq ProLiant ML370 Maintenance and Service Guide
System Board
To remove the system board:
1. Perform the preparation procedures. See “Powering Down the Server” earlier in this
chapter.
2. Open the front bezel (tower model only). See “Front Bezel” earlier in this chapter.
3. Remove the small access panel. See “Small Access Panel” earlier in this chapter.
4. Remove the large access panel. See “Large Access Panel” earlier in this chapter.
5. Remove all expansion boards. See “I/O Expansion Board” earlier in this chapter.
6. Remove the riser board and brace. See “Riser Board and Brace” earlier in this chapter.
7. Disconnect all internal and external cables
8. Loosen the thumbscrew securing the system board to the chassis
.
.
9. Slide the system board forward to release it from the six retaining guides, then pull the
system board out the top of the chassis
1
1
1
1
.
2
3
Figure 2-32. Removing the system board
Reverse steps 1 through 9 to replace the system board.
Reverse steps 1 through 6 to replace the Compaq Integrated Smart Array Controller upgrade
module.
.
1
2-38 Compaq ProLiant ML370 Maintenance and Service Guide
Internal Replacement Battery
WARNING: This server contains an internal lithium manganese dioxide battery. There is risk of
fire and burns if the battery pack is not handled properly. To reduce the risk of personal injury:
■ Do not attempt to recharge the battery.
■ Do not expose to temperatures higher than 60°C.
■ Do not disassemble, crush, puncture, short external contacts, or dispose of in fire or water.
■ Replace only with the Compaq spare designated for this product.
To install the internal replacement battery:
1. Perform the preparation procedures. See “Powering Down the Server” earlier in this
chapter.
2. Open the front bezel (tower model only). See “Front Bezel” earlier in this chapter.
3. Remove the small access panel. See “Small Access Panel” earlier in this chapter.
4. Remove the battery from the system board.
Figure 2-34. Removing the internal replacement battery
5. Install the replacement battery, then run the System Configuration utility to reconfigure
the server.
CAUTION: Loss of BIOS Settings will occur when the lithium battery is removed. BIOS settings
must be reconfigured whenever the battery is replaced.
CAUTION: Batteries, battery packs, and accumulators should not be disposed of
together with the general household waste. In order to forward them to recycling or
proper disposal, please use the public collection system or return them to Compaq,
your authorized Compaq Partners, or their agents.
Chapter 3
Diagnostics and Troubleshooting
This chapter describes software and firmware diagnostic tools available for all Compaq server
products. The sections in this chapter are:
■ Diagnostic Tools Utility Overview
■ Default Configuration
■ Utilities Access
■ Power-On Self-Test (POST)
■ Diagnostics Software
■ Array Diagnostic Utility (ADU)
■ Integrated Management Log
■ Rapid Error Recovery
■ Remote Service Features
■ ROMPaq Error Recovery Options
■ Compaq Insight Manager
3-2 Compaq ProLiant ML370 Maintenance and Service Guide
Diagnostic Tools Utility Overview
These utilities were developed to assist in diagnosing problems, testing the hardware, and
monitoring and managing Compaq server hardware.
Diagnostic Tools
ToolWhat it isHow to run it
Table 3-1
Compaq Diagnostics
Program
Compaq Inspect
Utility
Compaq Insight
Manager
Compaq Survey
Utility
A utility to assist testing and/or verifying
operation of Compaq hardware. If
problems are found, Compaq
Diagnostics isolates failure(s) down to
replaceable part, whenever possible.
The Inspect Utility provides a report
detailing system information.
A client/server application used to
remotely manage Compaq hardware in
a network environment. Reports
hardware fault conditions (both failure
and prefailure) and collects data for
reporting and graphing.
An online information gathering agent
that runs on servers, gathering critical
hardware and software information
from various sources. A utility for
servers running Windows NT or
NetWare.
If a significant change occurs between
data gathering intervals, previous
information is marked, and the surveytext file is overwritten to reflect the
latest configuration and changes since
last configuration. This allows a
historical record of change events for
server hardware and software.
Diagnostics and Utilities are located on
Compaq system partition (if installed) or
your hard drive and must be accessed
when a system configuration error is
detected during Power-On Self-Test
(POST). Compaq Diagnostics software is
also available on Compaq SmartStart and
Support Software CD. A Diagnostics
diskette can be created from SmartStart
and Support Software CD, and
Diagnostics run from diskette.
The Inspect Utility can be run from either
the main menu of the System
Configuration Utility or the Compaq
Diagnostics program.
For more information, refer to the
Compaq Management CD and the
Compaq Insight Manager User Guide.
More information on viewing and printing
the event list can be found in the Compaq
Insight Manager section of this chapter.
Install Survey from SmartStart, Compaq
Integration Maintenance Utility, or from
Compaq Management CD.
continued
Table 3-1
Diagnostic Tools
ToolWhat it isHow to run it
continued
Diagnostics and Troubleshooting 3-3
Array Diagnostics
Utility (ADU)
Drive Array
Advanced
Diagnostics (DAAD)
Integrated
Management Log
A Windows-based tool designed to run
on all Compaq systems that support
Compaq array controllers. Two main
functions of ADU are to collect all
possible information about the array
controllers in the system and generate a
list of detected problems.
This tool is available for all Compaq
servers covered by this guide.
The predecessor to ADU, DAAD is a
DOS-based tool for Compaq servers
with Smart Array Controllers. DAAD
collects information about the array
controllers in the system and offers a
list of detected problems.
A log of system events, such as system
failures or nonfatal error conditions.
View events in the Integrated
Management Log:
■ On the Integrated Management
Display
■ From within Compaq Insight
Manager
■ From within Compaq Survey Utility
Use the information provided in Array
Diagnostics Utility (ADU) later in this
chapter.
For a list of Compaq servers still
supported by this tool, visit the Compaq
website:
www.compaq.com
The Integrated Management Log requires
Compaq Operating System-dependent
drivers. Refer to Compaq Support
Software CD for instructions on installing
the appropriate drivers.
System
Configuration Utility
Utility to easily configure the hardware
installed in or connected to the server.
Specifically, it can:
■ Resolve resource conflicts in areas
such as memory, port addresses,
and interrupts (IRQs)
■ Configure PCI boards automatically
■ Provide switch and jumper settings
■ Manage installation of memory,
processor upgrades, and mass
storage devices such as hard
drives, tape drives, and diskette
drives
■ Store configuration information in
nonvolatile memory
■ Assist in installation of an operating
system
■ Assist in running diagnostic tools
such as TEST and INSPECT
If the server has bootable CD-ROM drive,
run Compaq System Configuration Utility
directly from Support Software CD
supplied with Smart Controller Option Kit,
or SmartStart and Support Software CD
supplied with server. Use the CD supplied
for latest version supporting Smart
Controller.
If the server does not have a bootable
CD-ROM drive, create diskettes with
latest version of the System
Configuration Utility from Support
Software CD, or SmartStart and Support
Software CD.
3-4 Compaq ProLiant ML370 Maintenance and Service Guide
Default Configuration
When the system is first powered up, the system ROM detects the unconfigured state of the
hardware and provides default configuration settings for most devices. By providing this
initialization, the system can run Diagnostics and other software applications before running the
normal SmartStart and System Configuration programs.
Default Configuration Messages
IMPORTANT: If you chose to format and partition the boot drive before running SmartStart and the
System Configuration programs, this may prohibit creating a Compaq System Partition and the off-line
remote management features that it provides.
If you insert a System Configuration, Diagnostics, or SmartStart and Support Software CD into
the CD-ROM drive prior to powering up the server, the system ROM boots to that utility. If the
system ROM does not detect one of those CDs, you will be prompted for the intended operating
system. The system reboots if any operating system-dependent configurations have changed
with the new operating system selection. If the selected operating system-dependent
configurations are the same as the current configurations, the system boots normally. If you
enter a wrong choice, on subsequent re-boots you may change the operating system.
Inspect Utility
The Inspect Utility provides configuration information such as the contents of the operating
system startup files, the current memory configuration, the ROM version, and Server Health
Log information. It operates with MS-DOS and in the MS-DOS emulation mode of OS/2.
Running the Inspect Utility
Printing the Inspect Listing
Select Print on the Inspect screen to print a copy of the Inspect listing. Keep a copy of the listing
with each server for later reference.
1. Turn the server off, and back on. Then press F10 when the cursor appears in the upper
right corner of the screen.
2. At the main menu, select Diagnostics and Utilities.
3. Press Enter.
4. Select Inspect Computer and press Enter.
5. Follow the instructions to view system information.
Utilities Access
The Compaq SmartStart and Support Software CD contains the SmartStart program and many
of the Compaq utilities needed to maintain the system, including:
■ System Configuration Utility
■ Array Configuration Utility
■ Array Diagnostic Utility
■ ROMPaq Firmware Upgrade Utilities
■ Compaq Diagnostics
CAUTION: Do not select the Erase Utility when running the SmartStart and Support
Software CD. This will result in data loss to the entire system.
Running Compaq Utilities
Compaq utilities may be accessed from the Compaq System partition on the hard drive, from
diskette, or from the Compaq SmartStart and Support Software CD.
Running the Utilities on the Compaq System Partition
Diagnostics and Troubleshooting 3-5
If the system was installed using SmartStart, the Compaq utilities will automatically be
available on the Compaq system partition. The Compaq system partition could also have been
created during a manual system installation.
To run the utilities on the Compaq system partition, boot the system and press F10 when you
see: “Press F10 for Compaq system partition utilities.” Then select the utilities from the menu.
■ The System Configuration Utility is available under the System Configuration menu.
■ The Array Configuration Utility is available under the System Configuration menu.
■ Compaq Diagnostics (Test and Inspect) are available under the Diagnostics and Utilities
menu.
■ The ROMPaq Firmware Upgrade Utility is available under the Diagnostics and Utilities
menu.
3-6 Compaq ProLiant ML370 Maintenance and Service Guide
Running the Utilities from Diskette
■ Run the utilities from their individual diskettes. If you have a utility diskette newer than
the version on the SmartStart and Support Software CD, use that diskette.
■ Create a diskette version of the utility from the SmartStart and Support Software CD.
To create diskette versions of the utilities from the CD:
1. Boot the Compaq SmartStart and Support Software CD.
2. From the Compaq System Utilities screen, select Create Support Software Disks then
Next.
3. Select the diskette you would like to create from the list, then follow the instructions on
the screen.
Running the Utilities from the Compaq SmartStart and Support
Software CD
IMPORTANT: Only the System Configuration Utility and the Array Configuration Utility and the Array
Diagnostic Utility can be executed from the Compaq SmartStart and Support Software CD. All other
utilities must be executed from the Compaq system partition or from diskette.
To run these utilities directly from the Compaq SmartStart and Support Software CD:
1. Boot the Compaq SmartStart and Support Software CD.
2. From the Compaq System Utilities screen, select the utility you wish to run, then select
Next.
G To execute the System Configuration Utility, select Run System Configuration Utility.
G To execute the Array Configuration Utility, select Run Array Configuration Utility.
G To execute the Array Diagnostic Utility, select Run Array Diagnostic Utility.
Power-On Self-Test (POST)
POST is a series of diagnostic tests that run automatically on Compaq computers when the
system is turned on. POST checks the following assemblies to ensure that the computer system
is functioning properly:
■ System ROM
■ Keyboard
■ Power supply
■ System board
■ Memory
■ Controllers
■ Diskette drives
■ Hard drives
■ Processor Power Modules
■ Fans
Diagnostics and Troubleshooting 3-7
POST Error Messages
If POST finds an error in the system, an error condition is indicated by an audible and/or visual
message. If an error code displays on the screen during POST or after resetting the system, use
the instructions in the POST Error Messages table. The error messages and codes listed include
all codes generated by Compaq products. The system generates only those codes that are
applicable to the configuration and options.
NOTE: Many of the actions listed require you to run Diagnostics or the Compaq System Configuration
Utility. Steps for running these utilities are provided following the POST Error Messages tables.
In each case, the Recommended Action column lists the steps necessary to correct the problem.
After completing each step, run the Diagnostics program to verify whether the error condition
has been corrected. If the error code reappears, perform the next step, then run the Diagnostics
program again. Follow this procedure until Diagnostics no longer detects an error condition.
3-8 Compaq ProLiant ML370 Maintenance and Service Guide
POST Error Messages
Error CodeAudible BeepsProbable Source of ProblemRecommended Action
A Critical Error
occurred prior to this
power-up
Unsupported
Processor Detected
System Halted
FATAL ROM ERROR:
The System ROM is
not properly
programmed
101 - ROM Error1 long, 1 shortSystem ROM checksumRun Diagnostics. Replace failed
101 - I/O ROM ErrorNoneOptions ROM checksumRun Diagnostics. Replace failed
102 - System Board
Failure
104 - ASR-2 Timer
Failure
105 - Current System
ROM is corrupt—
now booting
redundant System
ROM
NoneA catastrophic system error,
1 long, 1 shortProcessor not supported by
1 long, 1 shortROM programmed in factory
NoneDMA, timers, and so on
NoneSystem board failure
2LNon-booted ROM image is
Table 3-2
which caused the server to
crash, has been logged.
current system ROM
incorrectly
corrupt
Run Diagnostics. Replace failed
assembly as indicated.
Check documentation for
supported processors, and if
supported, remove the
processor and update the
system to latest ROM.
Replace the physical ROM part.
assembly as indicated.
assembly as indicated.
Replace the system board. Run
the Compaq System
Configuration Utility.
Run Diagnostics. Replace failed
assembly as indicated.
Flash the ROM utilizing
ROMPaq.
162 - System
Options Not Set
163—Time & Date
Not Set
164 - Memory Size
Error
172 - Configuration
Nonvolatile Memory
Invalid
173 - Slot ID
Mismatch
174 - Configuration/
Slot Mismatch Device
Not Found
2SConfiguration incorrectRun the System Configuration
Utility and correct.
2SInvalid time or date in
configuration memory
2SConfiguration memory incorrectRun the System Configuration
NoneNonvolatile configuration corrupt
or jumper installed
NoneBoard replaced, configuration not
updated
NonePCI board not foundRun the System Configuration
Run the System Configuration
Utility and correct.
Utility and correct.
Run the System Configuration
Utility and correct.
Run the System Configuration
Utility and correct.
Utility and correct.
continued
Diagnostics and Troubleshooting 3-9
Table 3-2
POST Error Messages
Error CodeAudible BeepsProbable Source of ProblemRecommended Action
1 long, 1 shortUnable to size memoryCheck DIMM installation, and if
error persists, call Compaq
service provider.
NoneSwitch not set properly during
installation or upgrade
1 shortProcessor in slot x failedRun Diagnostics and replace
NonePowerSafe Module (DC-DC
Converter) failed
NoneIndicated PPM (DC-DC Converter)
failed
Verify switch settings.
failed processor.
Run Diagnostics. Replace failed
assembly as indicated.
Replace failed assembly as
indicated.
continued
3-10 Compaq ProLiant ML370 Maintenance and Service Guide
Table 3-2
POST Error Messages
Error CodeAudible BeepsProbable Source of ProblemRecommended Action
continued
215 - Processor
Power Module has
lost Redundancy in
Socket x
215 - Nonfunctioning
Voltage Regulator
Module for
Processors
216 - Voltage
Regulator Module for
Processor X no
longer redundant
216 - Processor PPM
has lost Redundancy,
Module X
218 - Cache
Accelerators Not
Installed. System
Halted.
219 - Tag Update
Rules SRAM Failure.
System Halted.
219 - Snoop Rules
SRAM Failure.
System Halted.
NonePPM (DC-DC Converter) has lost
redundancy.
NonePPM (DC-DC Converter) has
failed or lost redundancy.
NoneRedundancy failed in PPMTo restore redundancy, replace
NoneIndicated PPM (DC-DC Converter)
has lost redundancy
NoneCache accelerators not installed
or improperly installed
NoneCatastrophic chipset failureCall Compaq service provider.
NoneCatastrophic chipset failureCall Compaq service provider.
Run Diagnostics. Replace failed
assembly as indicated.
Run Diagnostics. Replace failed
assembly as indicated.
the PPM.
Replace failed assembly as
indicated.
Check cache accelerator
installation.
220 - Cache
Accelerator Slot x
Initialization Failed.
System Halted.
221 - Power Fault On
Processor Bus X
301 - Keyboard ErrorNoneKeyboard failureTurn off the computer, then
301 - Keyboard Error
or Test Fixture
Installed
NoneCache accelerator in slot x
improperly installed or bad
NoneA PPM on indicated bus is in a
failed state.
NoneKeyboard failureReplace the keyboard.
Check cache accelerator
installation and if properly
installed, replace.
Run Diagnostics. Replace failed
assembly as indicated.
reconnect the keyboard.
continued
Diagnostics and Troubleshooting 3-11
Table 3-2
POST Error Messages
Error CodeAudible BeepsProbable Source of ProblemRecommended Action
continued
ZZ-301 - Keyboard
Error
303 - Keyboard
Controller Error
304 - Keyboard or
System Unit Error
40X - Parallel Port X
Address Assignment
Conflict
601 - Diskette
Controller Error
605 - Diskette Drive
Type Error
NoneKeyboard failure. (ZZ represents
the Keyboard Scan Code.)
NoneSystem board, keyboard, or
mouse controller failure
NoneKeyboard, keyboard cable, or
system board failure
2SBoth external and internal ports
are assigned to parallel port X.
NoneDiskette controller circuitry
failure
2SMismatch in drive typeRun the System Configuration
1. A key is stuck. Try to free
it.
2. Replace the keyboard.
1. Run Diagnostics.
2. Replace failed assembly
as indicated.
1. Make sure the keyboard is
attached.
2. Run Diagnostics to
determine which is in
error.
3. Replace the part indicated.
Run the System Configuration
Utility and correct.
1. Make sure the diskette
drive cables are attached.
2. Replace the diskette drive
and/or cable.
3. Replace the system board.
Utility to set diskette type
correctly.
1151 - Com Port 1
Address Assignment
Conflict
1152 - Com
Port 2, 3,
or 4 Address
Assignment Conflict
1610 - Temperature
violation detected.
Waiting for system to
cool
1611 - Fan failure
detected
2SBoth external and internal serial
ports are assigned to COM1.
2SBoth external and internal serial
ports are assigned to COM2,
COM3 or COM4.
2SAmbient system temperature too
hot
2SRequired fan not installed or
spinning
Run the System Configuration
Utility and correct.
Run the System Configuration
Utility and correct.
Check fan in system
environment.
Check fans.
continued
3-12 Compaq ProLiant ML370 Maintenance and Service Guide
Table 3-2
POST Error Messages
Error CodeAudible BeepsProbable Source of ProblemRecommended Action
continued
1611 - I/O Fan
(Fan X) failure
detected
1611 - CPU Fan
(Fan X) failure
detected
1612 - Primary
power supply failure
1613 - Low System
Battery
1615 - Power Supply
Failure, Power
Supply Unplugged, or
Power Supply Fan
Failure in Bay X
1617 - Fan controller
not responding
1617 - I/O Fan
controller not
responding
1617 - CPU Fan
controller not
responding
2SI/O fan has failedReplace the failed fan.
2SCPU fan has failedReplace the failed fan.
2SPrimary power supply has failedReplace power supply as soon
as possible.
NoneReal time clock system battery is
running low on power
NoneA power supply has failed.Replace or check specified
2SFan controller failureCheck and replace failed
2SI/O fan controller failureCheck and replace failed
2SCPU fan controller failureCheck and replace failed
Run Diagnostics. Replace failed
assembly as indicated.
power supply.
controller assembly.
controller assembly.
controller assembly.
1620 - Locked SCSI
Bus Detected. Verify
SCSI bus cabling.
System halted.
1621 - Current SCSI
bus cable
configuration is not
recommended
1622 - Internal SCSI
Jumper Board Not
Installed
NoneSCSI bus failureRun Diagnostics. Replace failed
assembly as indicated.
NoneImproper SCSI bus cablingCheck documentation for proper
SCSI bus cabling.
NoneThe system has detected that the
array enabler board is not
installed.
Install the array enabler board.
continued
Diagnostics and Troubleshooting 3-13
Table 3-2
POST Error Messages
Error CodeAudible BeepsProbable Source of ProblemRecommended Action
continued
1702 - SCSI cable
error detected;
system halted.
1703 - SCSI cable
error detected.
Internal SCSI cable
not attached to
system board
connector. System
halted.
1720 - Slot x Drive
Array—SMART hard
Drive Detects
Imminent Failure
SCSI: Port n:
SCSI ID x
1721 - Slot x Drive
Array—Drive
parameter tracking
predicts imminent
failure. The following
devices should be
replaced when
conditions permit. Do
not replace drive
unless all other
drives in the array
are online! Back up
data before replacing
drive(s) if using
RAID 0.
NoneTermination or cabling problem
with the system board integrated
SCSI controller.
NoneIncorrect cablingEnsure that the integrated SCSI
NonePrefailure conditionIndicated drive has reported a
NoneMonitor and performance
threshold exceeded condition.
Refer to cabling diagrams in
Chapter 2 of this guide, the
cabling guidelines on the
Compaq website, and your
Compaq server documentation.
controller has SCSI termination
attached.
SMART predictive-failure
condition and may fail at some
time in the future. If drive is part
of a non-fault-tolerant
configuration, back up all data
before replacing drive and
restore all data afterward. If
drive is part of a fault-tolerant
configuration do not replace the
drive unless all other drives in
the array are online. Press F1 to
resume.
Replace the drive when it is
reasonable for you to do. The
drive has not failed, but Compaq
recommends you replace the
drive.
continued
3-14 Compaq ProLiant ML370 Maintenance and Service Guide
Table 3-2
POST Error Messages
Error CodeAudible BeepsProbable Source of ProblemRecommended Action
continued
*1724 - Slot x Drive
Array—Physical
Drive Position
Change(s) Detected –
Logical drive
configuration has
automatically been
updated
*1727 - Slot x Drive
Array—New Logical
Drive(s) Attachment
Detected. If more
than 32 logical
drives, this message
will be followed by:
Auto-configuration
failed: Too many
logical drives.
NoneDrive changeIndicates that logical drive
configuration has been updated
automatically following physical
drive position changes. Press F1
to resume.
configuration has been updated
automatically due to
replacement of Array
Accelerator (or controller) with
one having different memory
size. Press F1 to resume.
NoneAdditional drivesThis message indicates that the
controller has detected an
additional array of drives that
was attached when the power
was off. The logical drive
configuration information has
been updated to add the new
logical drives. The maximum
number of logical drives
supported is 32. Additional
logical drives will not be added
to the configuration. Press F1 to
resume.
1729 - Slot 1 Drive
Array—Disk
consistency
initialization in
progress—RAID 4/5
performance may be
lower until Auto
Reliability Monitoring
has completed
automatic parity
consistency
initialization
1730 - Fixed Disk 0
does not support
DMA Mode.
NoneInitial RAID configuration in
progress.
NoneFixed disk drive error detectedRun the System Configuration
This message is normal
following the initial
configuration of RAID 4 or
RAID 5 logical drives. This POST
message will go away and
performance of the controller
will improve after the parity data
has been initialized by ARM (an
automatic process than runs in
the background on the
controller).
Utility and correct.
continued
Diagnostics and Troubleshooting 3-15
Table 3-2
POST Error Messages
Error CodeAudible BeepsProbable Source of ProblemRecommended Action
continued
1731 - Fixed Disk 1
NoneHard drive errorRun the System Configuration
does not support
DMA Mode.
1740 - Fixed Disk 0
NoneHard drive errorRun the System Configuration
failed Set Block
Mode command
1741 - Fixed Disk 1
NoneHard drive errorRun the System Configuration
failed Set Block
Mode command
1750 - Fixed Disk 0
NoneHard drive errorRun the System Configuration
failed Identify
command
1751 - Fixed Disk 1
NoneHard drive errorRun the System Configuration
failed Identify
command
1760 - Fixed Disk 0
NoneHard drive errorRun the System Configuration
does not support
Block Mode
1761 - Fixed Disk 1
NoneHard drive errorRun the System Configuration
does not support
Block Mode
1764 - Slot x Drive Array Capacity Expansion Process is temporarily disabled
(followed by one of the following):
■ Expansion will resume when Array Accelerator has been reattached.
■ Expansion will resume when Array Accelerator has been replaced.
■ Expansion will resume when Array Accelerator RAM allocation is successful.
■ Expansion will resume when Array Accelerator battery reaches full charge.
■ Expansion will resume when automatic data recovery has been completed.
Utility and correct.
Utility and correct.
Utility and correct.
Utility and correct.
Utility and correct.
Utility and correct.
Utility and correct.
Reattach or replace Array
Accelerator, wait until the Array
Accelerator batteries have
charged, or for automatic data
recovery to complete, as
indicated.
*1766 - Slot x Drive Array requires System ROM Upgrade. Run Systems ROMPaq
Utility.
1767 - Slot x Drive Array Option ROM is Not Programmed Correctly or may Conflict
with the Memory Address Range of an Option Card. Check the Memory Address
Configuration of installed Option Card(s) or run Options ROMPaq Utility to attempt
SMART-2/E Option ROM Reprogramming.
Run the latest Systems ROMPaq
Utility to upgrade the system
ROMs. Press F1 to resume.
Remove or reconfigure
conflicting Option cards,
especially any cards that are not
recognized by the System
Configuration Utility. Try
reprogramming the ROM on the
SMART-2/E Controller using the
latest Options ROMPaq
(version 2.29 or higher).
continued
3-16 Compaq ProLiant ML370 Maintenance and Service Guide
Table 3-2
POST Error Messages
Error CodeAudible BeepsProbable Source of ProblemRecommended Action
continued
1768 - Slot x Drive
Array—Resuming
logical drive
expansion process.
1769 - Slot x Drive
Array—Drive(s)
disabled due to
failure during
expand. Select F1 to
continue with logical
drives disabled.
Select F2 to accept
data loss and to
re-enable logical
drives.
1770 - Slot x drive
array—critical drive
firmware problem
detected—please
upgrade firmware on
the following drive(s)
using Options
ROMPaq (available
from
www.compaq.com):
SCSI port (y) SCSI
ID (x)
whenever a controller reset or
power cycle occurs while array
expansion is in progress.
NoneSMART-2 Controller errorData has been lost while
expanding the array; therefore,
the drives have been
temporarily disabled. Press F2
to accept the data loss and
re-enable the logical drives.
Restore data from backup.
NoneFirmware update need indicatedThe indicated drives are running
firmware that is known to cause
intermittent problems. Please
use the Compaq Options
ROMPaq utility to upgrade
firmware on all drives to the
latest revision. Press F1 or F2 to
resume.
1771 - Primary Disk
Port Address
Assignment Conflict
1772 - Secondary
Disk Port Address
Assignment Conflict
1773 - Primary Fixed
Disk Port Assignment
Conflict
NoneInternal and external hard drive
controllers are both assigned to
the primary address.
NoneAddress assignment conflict:
internal and external hard drive
controllers are both assigned to
the secondary address.
NoneFixed disk drive errorRun the System Configuration
Run the System Configuration
Utility and correct.
Run the System Configuration
Utility and correct.
Utility and correct.
continued
Diagnostics and Troubleshooting 3-17
Table 3-2
POST Error Messages
Error CodeAudible BeepsProbable Source of ProblemRecommended Action
continued
1774 - Slot x Drive
Array—Obsolete
data found in Array
Accelerator.
Select F1 to discard
contents of Array
Accelerator.
Select F2 to write
contents of Array
Accelerator to drives.
1775 - Slot x Drive
Array—ProLiant
Storage System Not
Responding SCSI
Port (y): Check
storage system
power switch and
cables. Turn the
system power off
while checking the
ProLiant power and
cable connections,
then turn the system
power back on to
retry.
NoneSMART-2 Controller errorData found in Array Accelerator
is older than data found on
drives. Press F1 to discard the
older data in the Array
Accelerator and retain the
newer data on the drives.
NoneStorage system problemTurn off power to system.
Check external ProLiant power
switch – external drives must all
be powered up before or at the
same time as the main system.
Check cables. If retry does not
help, try replacing the cable,
ProLiant firmware, ProLiant
backplane, or the Smart Array
Controller. Press F1 to resume.
continued
3-18 Compaq ProLiant ML370 Maintenance and Service Guide
Table 3-2
POST Error Messages
Error CodeAudible BeepsProbable Source of ProblemRecommended Action
continued
1776 - Slot x Drive
ArrayProLiant
drive storage
enclosure problem
detected (followed by
one or more of the
following):
SCSI Port (y) Cooling
fan malfunction
detected
SCSI port (y):
Overheated condition
detected
SCSI port (y) Side
panel must be closed
to prevent
overheating
SCSI port (y)
Redundant power
supply malfunction
detected
SCSI port (y): Wide
SCSI transfer failed
SCSI port (y):
Interrupt signal
inoperative
(RESUME = F1 KEY)
NoneTemperature violation detectedCheck cooling fan operation by
placing hand over fan. Check
internal plenum cooling fan in
tower servers or storage
systems. If fan is not operating,
check for obstructions and
check all internal connector.
Replace unit side panel if
removed.
If the ProLiant Storage System
power LED is amber instead of
green, this indicates a
redundant power supply failure.
Check SCSI cables. If the
message indicates to check
SCSI cables, please verify your
cabling with the diagrams in the
Integrated Smart Array
Controller Installation Guide. If
the routing is correct, replace
cables on the specified port
until the POST message is
eliminated.
Press F1 to resume.
continued
Diagnostics and Troubleshooting 3-19
Table 3-2
POST Error Messages
Error CodeAudible BeepsProbable Source of ProblemRecommended Action
continued
1778 - Slot x Drive
Array resuming
Automatic Data
Recovery process
1779 - Slot x Drive
Array—replacement
drives detected OR
previously failed
drive(s) now appear
to be operational:
Port (y): SCSI ID (x):
Restore data from
backup if
replacement drive x
has been installed.
1780 - Disk 0 FailureNoneHard drive/format errorRun Diagnostics. Replace failed
1781 - Disk 1 FailureNoneHard drive/format errorRun Diagnostics. Replace failed
1782 - Disk
Controller Failure
1783 - Slot x Drive
Array Controller
Failure
NoneThis message appears whenever
a controller reset or power cycle
occurs while Automatic Data
Recovery is in progress.
NoneIntermittent drive failure and/or
possible loss of data
NoneHard disk drive circuitry errorRun Diagnostics. Replace failed
NoneROM installation problem or
Array Accelerator board problem
No action required. This
message appears whenever a
controller reset or power cycle
occurs while Automatic Data
Recovery is in progress.
If this message appears and
drive x (identified by its SCSI ID)
has not been replaced, this
indicates an intermittent drive
failure. This message also
appears once immediately
following drive replacement
whenever data must be restored
from backup.
assembly as indicated.
assembly as indicated.
assembly as indicated.
If this message appears
immediately following a ROM
installation, the ROM is
defective or not installed
properly. Check that the
controller is firmly inserted in its
slot. Try upgrading the System
ROMs. Otherwise, replace the
Integrated Smart Array
Controller.
1784 - Slot x Drive
Array Drive Failure.
The following SCSI
drive(s) should be
replaced: SCSI
port (y) SCSI ID (x).
NoneDefective drive and/or cablesCheck for loose cables. Replace
defective drive x and/or
cable(s).
continued
3-20 Compaq ProLiant ML370 Maintenance and Service Guide
Table 3-2
POST Error Messages
Error CodeAudible BeepsProbable Source of ProblemRecommended Action
continued
1785 - Drive Array
not Configured
(followed by one of
the following):
■ Run Compaq
Array
Configuration
Utility
■ No drives
detected
■ Drive positions
cannot be
changed—Run
Drive Array
Advanced
Diagnostics if
previous
positions are
unknown. Then
turn system
power OFF and
move drives to
their original
positions.
■ Configuration
information
indicates drive
positions beyond
the capability of
this controller.
This may be due
to drive
movement from
a controller that
supports more
drives than the
current
controller.
■ Configuration
information
indicates drives
were configured
on a controller
with newer
firmware.
NoneDrive array configuration not
detected
Run the Compaq Array
Configuration Utility.
Turn off system and check SCSI
cable connections to make sure
drives are attached properly.
Run Drive Array Advanced
Diagnostics if previous positions
are unknown. Then turn system
power off and move drives to
their original positions.
To avoid data loss, turn system
power off and reattach drives to
the original controller.
To avoid data loss, reattach
drives to original controller or
upgrade the controller firmware
to the version on the original
controller using Option ROMPaq.
Press F1 key to resume.
continued
Diagnostics and Troubleshooting 3-21
Table 3-2
POST Error Messages
Error CodeAudible BeepsProbable Source of ProblemRecommended Action
continued
1786 - Drive Array
Recovery Needed.
The following SCSI
drive(s) need
Automatic Data
Recovery: SCSI
port 1: SCSI ID 0.
Select F1 to continue
with recovery of data
to drive. Select F2 to
continue without
recovery of data to
drive.
Or
Slot 1 Drive Array
Recovery needed.
Automatic Data
Recovery previously
Aborted! The
following SCSI
drive(s) need
Automatic Data
Recovery: SCSI
port 1: SCSI ID 0.
Select F1 to retry
Automatic Data
Recovery to drive.
Select F2 to continue
without starting
Automatic data
Recovery data to
drive.
1787 - Slot x Drive
Array Operating in
Interim Recovery
Mode. The following
SCSI drive(s) should
be replaced: SCSI
port (y): SCSI ID (x)
NoneSystem is in Interim Data
Recovery mode. Data has not yet
been recovered.
NoneHard drive X failed or a cable is
loose or defective. Following a
system restart, this message
reminds you that drive X is
defective and fault tolerance is
being used.
Press F1 to allow Automatic
Data Recovery to begin. Data
will automatically be restored to
drive X now that the drive has
been replaced or now seems to
be working.
Or
Press F2 and the system will
continue to operate in the
interim Data Recovery mode.
The “previously aborted”
version of the 1786 POST
message will appear if the
previous rebuild attempt was
aborted for any reason. Run
Drive Array Advanced
Diagnostics (ADU) for more
information. If the replacement
drive was failed, try using
another replacement drive. If
rebuild was aborted due to a
read error from another physical
drive in the array, back up all
readable data on the array, run
Diagnostics Surface Analysis,
and then restore your data.
1. Replace drive X as soon as
possible.
2. Check loose cables.
3. Replace defective cables.
continued
3-22 Compaq ProLiant ML370 Maintenance and Service Guide
Table 3-2
POST Error Messages
Error CodeAudible BeepsProbable Source of ProblemRecommended Action
continued
*1788 - Slot x drive
array reports
incorrect drive
replacement. The
following SCSI
drive(s) should have
been replaced: SCSI
port (y): SCSI ID (x).
The following SCSI
drive(s) were
incorrectly replaced:
SCSI port (y):
SCSI ID (z).
Select F1 to continue
drive array will
remain disabled.
Select F2 to reset
configuration—all
data will be lost.
* The 1788 error message might display inadvertently due to a bad power cable connection to the drive, noise on the
data cable, or a defective SCSI cable. If this message was due to a bad power cable connection, but not incorrect drive
replacement, repair the connection and press F2. If this message is not eliminated by pressing F2, run the Array
Diagnostic Utility (ADU) to resolve.
—Or—
If this message was not due to a bad power cable connection, and no drive replacement took place, this could indicate
noise on the data cable. Check the cable for proper routing.
—Or—
If this message is due to a defective SCSI cable, replace the cable.
NoneDrives are not installed in their
original positions, so the drives
have been disabled. See note
below.
Reinstall the drives correctly, as
indicated.
Press F1 to restart the computer
with the drive array disabled.
—Or—
Press F2 to use the drives as
configured and lose all the data
on them.
continued
Diagnostics and Troubleshooting 3-23
Table 3-2
POST Error Messages
Error CodeAudible BeepsProbable Source of ProblemRecommended Action
continued
1789 - Slot x Drive
array SCSI physical
drive(s) Not
Responding, Check
cables or replace the
following drives: SCSI
port (x): SCSI ID (x)
Select F1 to
continue—drive
array will remain
disabled.
Select F2 to fail
drive(s) that are not
responding—Interim
Recovery Mode will
be enabled if
configured for fault
tolerance.
1790 Disk 0 Configuration
Error
1791 - Disk 1 ErrorNoneHard drive error or wrong drive
1792 - Drive Array
Reports Valid Data
Found in Array
Accelerator.
Data will
automatically be
written to drive array.
NoneCable or hard drive failure1. Check the cable
NoneHard drive error or wrong drive
type
type
NoneThis indicates that while the
system was in use, power was
interrupted while data was in the
Array Accelerator memory.
Power was then restored
within 8 to 10 days, and the data
in the Array Accelerator was
flushed to the drive array.
connections.
2. If cables are connected,
replace the drive.
3. If you do not want to
replace the drive now,
press F2.
Run the System Configuration
Utility and Diagnostics and
correct.
Run the System Configuration
Utility and Diagnostics and
correct.
No action necessary; no data
has been lost. Perform orderly
system shutdowns to avoid data
remaining in the Array
Accelerator.
1793 - Drive Array—
Array Accelerator
Battery Depleted
Data Lost
(Error message 1794
also displays.)
NoneThis indicates that while the
system was in use, power was
interrupted while data was in the
Array Accelerator memory. Array
Accelerator batteries failed. Data
in the Array Accelerator has been
lost.
Power was not restored within
8 to 10 days. Perform orderly
system shutdowns to avoid data
remaining in the Array
Accelerator.
continued
3-24 Compaq ProLiant ML370 Maintenance and Service Guide
Table 3-2
POST Error Messages
Error CodeAudible BeepsProbable Source of ProblemRecommended Action
continued
1794 - Drive Array—
Array Accelerator
Battery Charge Low.
Array Accelerator is
temporarily disabled.
Array Accelerator will
be re-enabled when
battery reaches full
charge.
1799 - Drive Array—
Drive(s) Disabled due
to Array Accelerator
Data Loss. Select
“F1” to continue with
logical drives
disabled. Select “F2”
to accept data loss
and to re-enable
logical drives.
Beeps only: 2 Long2LNo valid memory is present in
NoneHard parity error while reading
data from posted-writes memory
NoneHard parity error while writing
data to posted-writes memory
NoneVolume failed due to loss of data
in posted-writes memory.
the system.
Enable Array Accelerator.
Enable the Array Accelerator.
Press F1 to continue with
logical drives disabled or F2 to
accept data loss and re-enable
logical drives.
Replace failed memory with
valid memory.
Beeps only:
2 Long + 2 Short
(Run System
Configuration Utility F10 key)
(RESUME - F1 KEY)NoneAs indicated to continue
2L, 2SPower is cycled. Temperature too
hot. Processor fan not installed
or spinning.
NoneA configuration error occurred
during POST.
Check fans.
Press F10 to run System
Configuration Utility.
Press F1.
3-26 Compaq ProLiant ML370 Maintenance and Service Guide
Diagnostics Software
Test Error Codes Tables include all test error codes generated by Compaq Server Diagnostics.
Each code has a corresponding description and recommended action(s).
When you select Diagnostics and Utilities from the System Configuration Utility main menu,
the utility prompts you to test, inspect, upgrade, and diagnose the server.
Diagnostics and Utilities are located on the Compaq system partition on the hard drive and must
be accessed when a system configuration error is detected during the Power-On Self-Test
(POST). Compaq Diagnostics software is also available on the Compaq SmartStart and Support
Software CD. You can create a Diagnostics diskette from the SmartStart and Support
Software CD and run Diagnostics from diskette. See procedure in the “Running Compaq
Utilities” section earlier in this chapter.
The following options are available from the Diagnostics and Utilities menu:
■ Test Computer
■ Inspect Computer
■ Upgrade Firmware
■ Remote Utilities
■ Diagnose Drive Array
Diagnostic error codes are generated when the Diagnostics software recognizes a problem.
These error codes help identify possible defective subassemblies.
Steps for Diagnostics
In each case, the Recommended Action column lists the steps necessary to correct the problem.
After completing each step, run the Diagnostics Utility to verify that the error condition has
been corrected. If the error code reappears:
1. Perform the next step listed in the table.
2. Run the Diagnostics program again.
3. Follow this procedure until the Diagnostics program no longer detects an error condition.
100-199, Primary Processor Test Error Codes
The 100 series of diagnostic error codes identifies failures with processor and system board
functions generated by Compaq Server Diagnostics.
Primary Processor Test Error Codes
Error CodeDescriptionRecommended Action
101-xxCPU test failedReplace the processor board and retest.
103-xxDMA page registers test failed
104-xxInterrupt controller master test
failed
105-xxPort 61 error
106-xxKeyboard controller self-test failed
Diagnostics and Troubleshooting 3-27
Table 3-3
Replace the processor board and retest.
107-xxCMOS RAM test failed
108-xxCMOS interrupt test failed
109-xxCMOS clock load data test failed
110-xxProgrammable timer load data test
failed
111-xxRefresh detect test failed
113-xxProtected mode test failed
114-xxSpeaker test failed1. Verify the speaker connection and retest.
116-xxCache test failedReplace the system board and retest.
122-xxMultiprocessor Dispatch test failed
123-xxInterprocessor Communication
test failed
199-xxInstalled devices test failed1. Check the system configuration and retest.
1. Replace the battery/clock module and retest.
2. Replace the system board and retest.
Replace the processor board and retest.
2. Replace the speaker and retest.
3. Replace the system board and retest.
1. Check the system configuration and retest.
2. Replace the processor board and retest.
3. Replace the system board and retest.
2. Verify cable connections and retest.
3. Check switch and/or jumper settings and retest.
4. Run the Configuration Utility and retest.
5. Replace the processor board and retest.
6. Replace the system board and retest.
3-28 Compaq ProLiant ML370 Maintenance and Service Guide
200-299, Memory Test Error Codes
The 200 series of diagnostic error codes identifies failures with the memory subsystem.
Table 3-4
Memory Test Error Codes
Error CodeDescriptionRecommended Action
200-xxInvalid memory configuration.Reinsert memory modules in correct location and retest.
201-xxMemory machine ID test failed.
202-xxMemory system ROM checksum
failed.
1. Replace the system ROM and retest.
2. Replace the processor board and retest.
3. Replace the memory expansion board and retest.
203-xxMemory write/read test failed.
204-xxMemory address test failed.
205-xxWalking I/O test failed.
206-xxIncrement pattern test failed
207-xxInvalid memory
configuration-check DIMM
installation. DIMMs installed
have 8K refresh.
208-xxInvalid memory speed detected -
check DIMM installation. Slow
DIMMs may cause data loss.
209-01RAM long test failed.
209-02Error while saving test block.
209-03Error while restoring test block.
209-89ECC error during testing.
210-xxRandom pattern test failed.1. Replace the memory module and retest.
211-01RAND test failed.
211-02Error while saving test block
1. Replace the memory module and retest.
2. Replace the processor board and retest.
3. Replace the memory expansion board and retest.
Replace DIMMs.
Replace DIMMs with timing greater than 60 ns.
1. Replace the memory module/board and retest.
2. Replace the system board and retest.
3. Replace the memory expansion board and retest.
2. Replace the processor board and retest.
3. Replace the memory expansion board and retest.
1. Replace the memory module/board and retest.
2. Replace the system board and retest.
211-03Error while restoring test block
211-04Insufficient memory to perform the
test
211-05Failed inverted pattern compare,
but RAM was correct
211-89ECC error during testing
214-01Data error during noise test
214-89ECC error during testing
215Non-functioning DC-DC converter
for processor X
1. Replace the memory module/board and retest.
2. Replace the system board and retest.
Replace the DC-DC converter (Processor Power
Module).
300-399, Keyboard Test Error Codes
The 300 series of diagnostic error codes identifies failures with keyboard and system board
functions.
Error CodeDescriptionRecommended Action
Diagnostics and Troubleshooting 3-29
Table 3-5
Keyboard Test Error Codes
301-xxKeyboard short test, 8042 self-test
failed.
302-xxKeyboard long test failed.
303-xxKeyboard LED test, 8042 self-test
failed.
304-xxKeyboard typematic test failed.
400-499, Parallel Printer Test Error Codes
The 400 series of diagnostic error codes identifies failures with parallel printer interface card or
system board functions.
Parallel Printer Test Error Codes
Error CodeDescriptionRecommended Action
401-xxPrinter failed or not connected.
402-xxPrinter data register failed.
403-xxPrinter pattern test failed.
498-xxPrinter failed or not connected.
1. Check the keyboard connection. If disconnected,
turn off the computer, connect the keyboard,
power up, and retest.
2. Replace the keyboard and retest.
3. Replace the system board and retest.
Table 3-6
1. Connect the printer and retest.
2. Check the power to the printer and retest.
3. Install the loopback connector and retest.
4. Check the switch on the serial/parallel interface
board (if applicable) and retest.
5. Replace the serial/parallel interface board
(if applicable) and retest.
6. Replace the system board and retest.
3-30 Compaq ProLiant ML370 Maintenance and Service Guide
500-599, Graphics Controller Unit Test Error Codes
The 500 series of diagnostic error codes identifies failures with video or system board functions.
Table 3-7
Graphics Display Unit Test Error Codes
Error CodeDescriptionRecommended Action
501-xxGraphics controller test failed
502-xxGraphics memory test failed
503-xxGraphics attribute test failed
504-xxGraphics character set test failed
505-xxGraphics 80x25-mode 9x14
character cell test failed
506-xxGraphics 80x25-mode 8x8
character cell test failed
507-xxGraphics 40x25-mode test failed
508-xxGraphics 320x200-mode color
set 0 test failed
509-xxGraphics 320x200-mode color
set 1 test failed
510-xxGraphics 640x200-mode test
failed
511-xxGraphics screen memory page test
failed
512-xxGraphics gray scale test failed
514-xxGraphics white screen test failed
516-xxGraphics noise pattern test failed
1. Replace the monitor and retest.
2. Replace the system board and retest.
600-699, Diskette Drive Test Error Codes
The 600 series of diagnostic error codes identifies failures with diskette, diskette drive, or
system board functions.
Diskette Drive Test Error Codes
Error CodeDescriptionRecommended Action
Diagnostics and Troubleshooting 3-31
Table 3-8
600-xxDiskette ID drive types test failed.
601-xxDiskette format failed.
602-xxDiskette read test failed.
603-xxDiskette write/read/compute test
failed.
604-xxDiskette random seek test failed.
605-xxDiskette ID media failed.
606-xxDiskette speed test failed.
607-xxDiskette wrap test failed.
608-xxDiskette write protect test failed.
609-xxDiskette reset controller test
failed.
610-xxDiskette change line test failed.
699-xxDiskette drive/media ID error1. Replace the media and retest.
1100-1199, Serial Test Error Codes
The 1100 series of diagnostic error codes identifies failures with serial/parallel interface board
or system board functions.
1. Replace the diskette and retest.
2. Check and/or replace the diskette power and
signal cables and retest.
3. Replace the diskette drive and retest.
4. Replace the system board and retest.
2. Run the Configuration Utility and retest.
Table 3-9
Serial Test Error Codes
Error CodeDescriptionRecommended Action
1101-xxSerial port test failed.
1109-xxClock register test failed
1. Check the switch settings on the Serial/Parallel
Interface board (if applicable) and retest.
2. Replace the Serial/Parallel Interface board
(if applicable) and retest.
3. Replace the system board and retest.
3-32 Compaq ProLiant ML370 Maintenance and Service Guide
1200-1299, Modem Communications Test Error Codes
The 1200 series of diagnostic error codes identifies failures with the modem.
Table 3-10
Modem Communications Test Error Codes
Error CodeDescriptionRecommended Action
1201-xxModem internal loopback test
failed.
1202-xxModem time-out test failed.
1203-xxModem external termination test
failed.
1204-xxModem auto originate test failed.
1206-xxDial multi-frequency tone test
failed.
1210-xxModem direct connect test failed.
6000-6099, Compaq NIC Boards Test Error Codes
The 6000 series of diagnostic error codes identifies failures with various Compaq Network
Interface Controllers.
The 6500 series of diagnostic error codes identifies failures with SCSI hard drives, SCSI hard
drive controller boards, SCSI hard drive cabling, and system board functions. If the system uses
a drive array controller, see the section for Array Diagnostic Utility (ADU).
SCSI Hard Drive Test Error Codes
Error CodeDescriptionRecommended Action
Diagnostics and Troubleshooting 3-33
Table 3-12
6500-xxSCSI disk ID drive types test failed.
6502-xxSCSI disk unconditional format test
failed.
6505-xxSCSI disk read test failed.
6506-xxSCSI disk SA/media test failed.
6509-xxSCSI disk erase tape test failed.
6523-xxSCSI disk random read test failed.
6528-xxMedia load/unload test failed.
6600-6699, SCSI/IDE CD-ROM Drive Test Error Codes
The 6600 series of diagnostic error codes identifies failures with the CD-ROM drive cabling,
CD-ROM drives, adapter boards, or the system board assembly.
Table 3-13
SCSI/IDE CD-ROM Drive Test Error Codes
Error CodeDescriptionRecommended Action
6600-xxCD-ROM ID failed.
6605-xxCD-ROM read failed.
6608-xxSCSI controller test
1. Run the System Configuration Utility and
verify the drive type.
2. Replace the SCSI disk drive signal and
power cables and retest.
3. Replace the SCSI controller and retest.
4. Replace the SCSI disk drive and retest.
5. Replace the system board and retest.
1. Replace the CD-ROM media and retest.
2. Check and/or replace the signal cable and
retest.
3. Check the switch settings on the adapter
board (if applicable).
4. Replace the SCSI controller (if applicable)
and retest.
5. Replace the CD-ROM drive and retest.
6. Replace the system board and retest.
3-34 Compaq ProLiant ML370 Maintenance and Service Guide
6700-6799, SCSI Tape Drive Test Error Codes
The 6700 series of diagnostic error codes identifies failures with tape cartridges, tape drives,
media changers, tape drive cabling, adapter boards, or the system board assembly.
Table 3-14
SCSI Tape Drive Test Error Codes
Error CodeDescriptionRecommended Action
6700-xxSCSI tape ID drive types test failed.
6706-xxSCSI disk SA/media test failed.
6708-xxSCSI controller test
6709-xxSCSI disk erase tape test failed.
6728-xxMedia load/unload test failed.
7079Battery test
1. Run the System Configuration Utility and
2. Replace the SCSI tape drive signal and
3. Replace the SCSI controller and retest.
4. Replace the SCSI tape drive and retest.
5. Replace the system board and retest.
8600-8699, Pointing Device Interface Test Error Codes
The 8600 diagnostic error codes identifies failures with the pointing device (mouse, trackball,
and so on) or the system board assembly.
Table 3-15
Pointing Device Interface Test Error Codes
Error CodeDescriptionRecommended Action
8601-xxPointing Device Interface test failed.1. Replace with a working pointing device and
2. Replace the system board and retest.
verify the drive type.
power cables and retest.
retest.
Array Diagnostic Utility (ADU)
Array Diagnostic Utility (ADU) is a Windows-based software tool designed to run on all
Compaq servers that support Compaq array controllers, and are running SmartStart 4.10 or later.
The two main functions of ADU are to collect all possible information about array controllers in
the system and to generate a list of detected problems. The error messages and codes listed
include all codes generated by Compaq products. The system generates only codes applicable to
the configuration and options in the server.
ADU works by issuing multiple commands to the array controllers to determine if a problem
exists. This data can then be saved to a file. In severe situations, this file can be sent to Compaq
for analysis. In most cases, ADU provides enough information to initiate problem resolution
immediately.
NOTE: ADU does not write to the drives, destroy data, or change or remove configuration information.
Starting ADU
1. Insert the SmartStart CD into the CD-ROM drive.
2. Reboot the system from the SmartStart CD.
3. Select Array Diagnostic Utility (ADU) from the System Utilities menu.
A “Please Wait” panel displays, indicating that ADU is identifying the system parameters.
ADU gathers information from all of the array controllers in the system. The time it takes to
gather this information depends on the size of the system. When the information gathering
process is complete, ADU displays the main screen or a panel indicating any problems
detected.
Diagnostics and Troubleshooting 3-35
CAUTION: Do not cycle the power during this process. ADU must perform low-level operations
that, if interrupted, could cause the controller to revert to a previous level of firmware if the
firmware was soft-upgraded.
4. To generate an ADU report, select File then Save Data from the command menu.
3-36 Compaq ProLiant ML370 Maintenance and Service Guide
Table 3-16
ADU Diagnostic Messages
MessageDescriptionRecommended Action
Accelerator
board not
detected
Accelerator error
log
Accelerator
parity read
errors: n
Accelerator
parity write
errors: n
Accelerator
status: Cache
was
automatically
configured
during last
controller reset.
Array controller did not detect a
configured Array Accelerator board
This log lists the last 32 parity errors
on transfers to or from memory on
the Array Accelerator board, and
displays starting memory address,
transfer count, and operation (read
and write).
Number of times that read memory
parity errors were detected during
transfers from memory on the Array
Accelerator board
Number of times that write memory
parity errors were detected during
transfers to memory on the Array
Accelerator board
This can occur when cache board is
replaced with one of a different size
Install the Array Accelerator board on the array
controller. If an Array Accelerator board is
already installed, check for proper seating on
the array controller board.
If there are many parity errors, you may need to
replace the Array Accelerator board.
If there are many parity errors, you may need to
replace the Array Accelerator board.
If there are many parity errors, you may need to
replace the Array Accelerator board.
Normal operations should continue.
Accelerator
status: Data in
the cache was
lost due to some
reason other
than the battery
being discharged
Accelerator
status: Dirty data
detected has
reached limit.
Cache still
enabled, but
writes no longer
being posted
Accelerator
status: Dirty data
detected. Unable
to write dirty
data to drives
Data in the cache was lost, but not
because of the battery being
discharged.
The number of cache lines containing
dirty data that cannot be flushed
(written) to the drives has reached a
preset limit. The cache is still
enabled, but writes are no longer
being posted. This error usually
occurs when there is a problem with
the drive(s).
At least one cache line contains dirty
data that the controller has been
unable to flush (write) to the drives.
This error usually occurs when there
is a problem with the drive(s).
Ensure that the Array Accelerator is properly
seated. If the error continues, you may need to
replace the Array Accelerator.
Resolve the problem with the drive(s). The
controller will then be able to write dirty data to
drives and posted write operations will be
restored.
Resolve the problem with the drive(s). The
controller will then be able to write dirty data to
drives.
continued
Diagnostics and Troubleshooting 3-37
Table 3-16
ADU Diagnostic Messages
MessageDescriptionRecommended Action
continued
Accelerator
status: Excessive
ECC errors
detected in at
least one cache
line. As a result,
at least one
cache line is no
longer in use
Accelerator
status: Obsolete
data detected
Accelerator
status: Obsolete
data was
discarded
Accelerator
status: Obsolete
data was flushed
(written) to
drives
Accelerator
status:
Permanently
disabled
At least one line in the cache is no
longer in use due to excessive ECC
errors detected during use of the
memory associated with that cache
line.
During reset initialization obsolete
data was found in cache. This was
due to drives being moved and
written to by another controller.
During reset initialization obsolete
data was found in cache and was
discarded (not written to drives).
During reset initialization obsolete
data was found in cache. Obsolete
data was written to the drives, but
newer data may have been
overwritten.
Array Accelerator board has been
permanently disabled. It remains
disabled until it is reinitialized using
the Array Configuration Utility (ACU).
Replacement of cache should be considered. If
cache replacement is not done remaining cache
lines should continue to operate properly.
Normal operations should continue. The
controller will either write data to the drives or
discard the data completely.
Normal operations should continue.
If newer data was overwritten, you may need to
restore newer data; otherwise, normal
operations should continue.
Check the Disable Code field. Run Array
Configuration Utility (ACU) to reinitialize the
Array Accelerator board.
Accelerator
status: Possible
data loss in
cache
Accelerator
status:
Temporarily
disabled
Accelerator
status:
Unrecognized
status
Accelerator
status: Valid data
found at reset
Accelerator
status: Warranty
alert
Possible data loss detected during
power-up due to all batteries being
below sufficient voltage level and no
presence of identification signatures
on the Array Accelerator board.
Array Accelerator board has been
temporarily disabled.
A status returned from the Array
Accelerator board that ADU does not
recognize.
Valid data was found in posted write
memory at reinitialization. Data will
be flushed to disk.
Catastrophic problem with Array
Accelerator board. Refer to other
messages on Diagnostics screen for
exact meaning of this message.
There is no way to determine if dirty or bad data
was in the cache and is now lost.
Check the Disable Code field.
Obtain the latest version of ADU.
Not an error or data loss condition; no action
required
Replace the Array Accelerator board.
continued
3-38 Compaq ProLiant ML370 Maintenance and Service Guide
Table 3-16
ADU Diagnostic Messages
MessageDescriptionRecommended Action
continued
Adapter/NVRAM
ID mismatch
Array Accelerator
battery pack X
not fully charged
Array Accelerator
battery pack X
below reference
voltage
(recharging)
Board in use by
Expand
operation
Board not
attached
Configuration
signature is zero
Configuration
signature
mismatch
Nonvolatile RAM has an ID for a
different controller from the one
physically present in the slot.
Battery is not fully charged.If 75% of batteries present are fully charged,
Battery pack on Array Accelerator is
below required voltage levels.
Array Accelerator memory is in use
by Expand operation.
Array controller configured for use
with Array Accelerator board, but one
is not attached.
ADU detected that nonvolatile RAM
contains a configuration signature of
zero. Old versions of the System
Configuration Utility could cause this.
Array Accelerator board configured
for a different array controller board.
Configuration signature on Array
Accelerator board does not match the
one stored on the array controller
board.
Run the System Configuration Utility.
Array Accelerator is fully operational. If less
than 75% of batteries are fully charged, allow
36 hours to recharge them.
Allow enough time for batteries to recharge
(36 hours). If batteries have not recharged
after 36 hours, replace the Array Accelerator
board.
Operate system without Array Accelerator board
until Expand operation completes.
Attach Array Accelerator board to array
controller.
Run the latest version of System Configuration
Utility to configure the controller and nonvolatile
RAM.
To recognize the Array Accelerator board, run
the Array Configuration Utility.
Controller
communication
failure occurred
Controller
detected.
NVRAM
configuration not
present
Controller
firmware needs
upgrading
Controller is
located in
special “video”
slot
Controller communication failure
occurred.
Nonvolatile RAM does not contain a
configuration for this controller.
Controller firmware is below the
latest recommended version.
Controller is installed in slot for
special video control signals. If
controller is used in this slot, LED
indicators on front panel may not
function properly.
ADU was unable to successfully issue
commands to the controller in this slot.
Run the System Configuration Utility to
configure the nonvolatile RAM.
Run Options ROMPaq to upgrade the controller
to the latest firmware revision.
Install the controller in a different slot and run
the System Configuration Utility to configure the
controller and nonvolatile RAM. Then run the
Array Configuration Utility to configure
controller.
continued
Diagnostics and Troubleshooting 3-39
Table 3-16
ADU Diagnostic Messages
MessageDescriptionRecommended Action
continued
Controller is not
configured
Controller
reported POST
error.
Error Code: x
Controller
restarted with a
signature of zero
Disable
command issued
Drive (bay) X
firmware needs
upgrading
If controller was previously
configured and you change drive
locations, there may be a problem
with placement of the drives. ADU
examines each physical drive and
looks for drives that have been
moved to new drive bays.
The controller returned an error from
its internal Power-On Self Tests.
ADU did not find a valid configuration
signature to use to get the data.
Nonvolatile RAM may not be present
(unconfigured) or the signature
present in nonvolatile RAM may not
match the signature on the
controller.
Posted-writes have been disabled by
the issuing of the Accelerator Disable
command. This occurred because of
an operating system device driver.
Firmware on this physical drive is
below the latest recommended
version.
Look for messages indicating which drives have
been moved. If none appear and drive swapping
did not occur, run the Array Configuration Utility
to configure the controller and run the System
Configuration Utility to configure nonvolatile
RAM. Do not run either utility if you believe
drive swapping has occurred.
Replace the controller.
Run the System Configuration Utility to
configure the controller and nonvolatile RAM.
Restart the system. Run the Array Configuration
Utility to reinitialize the Array Accelerator board.
Run the Options ROMPaq Utility to upgrade the
drive firmware to the latest revision.
Drive (bay) X has
insufficient
capacity for its
configuration
Drive (bay) X has
invalid M&P
stamp
Drive (bay) X has
loose cable
Drive (bay) X is a
replacement
drive
Drive has insufficient capacity to be
used in this logical drive
configuration.
Physical drive has invalid monitor
and performance data.
The array controller could not
communicate with this drive at
power-up. This drive has not
previously failed.
This drive has been replaced. This
message displays if a drive is
replaced in a fault tolerant logical
volume.
Replace this drive with a larger capacity drive.
Run the System Configuration Utility to properly
initialize this drive.
Check all cable connections first. The cables
could be bad, loose, or disconnected. Turn on
the system and attempt to reconnect
signal/power cable to the drive. If the error
recurs, replace the cable. If the error recurs
again, the drive may need to be replaced.
If the replacement was intentional, allow the
drive to rebuild.
continued
3-40 Compaq ProLiant ML370 Maintenance and Service Guide
Table 3-16
ADU Diagnostic Messages
MessageDescriptionRecommended Action
continued
Drive (bay) X is a
replacement
drive marked OK
Drive (bay) X is
failed
Drive (bay) X is
undergoing drive
recovery
Drive (bay) X
needs replacing
Drive (bay) X
upload code not
readable
Drive (bay) X
was
inadvertently
replaced
Drive Monitoring
features are
unobtainable
This drive has been replaced and
marked OK by the firmware. This
may occur if a drive has an
intermittent failure (for example, if a
drive has previously failed, and then
when ADU is run, the drive starts
working again).
The indicated physical drive has
failed.
This drive is being rebuilt from the
corresponding mirror or parity data.
The 210-megabyte hard drive has
firmware version 2.30 or 2.31.
An error occurred while ADU was
trying to read the upload code
information from this drive.
The physical drive was incorrectly
replaced after another drive failed.
ADU unable to get monitor and
performance data due to fatal
command problem such as drive
time-out, or unable to get data
because these features are not
supported on the controller.
Replace the drive.
Replace the drive.
Normal operations should resume.
Replace the drive.
If there were multiple errors, this drive may
need to be replaced.
Replace the drive that was incorrectly replaced
and replace the original drive that failed. Do not
run the System Configuration Utility and try to
reconfigure; data will be lost.
Check for other errors (time-outs, and so on). If
no other errors occur, upgrade the firmware to a
version that supports monitor and performance,
if desired.
Drive Monitoring
is NOT enabled
for SCSI Port x
Drive ID x
Drive time-out
occurred on
physical drive
bay X
Drive X indicates
position Y
Duplicate write
memory error
The monitor and performance
features have not been enabled on
this drive.
ADU issued a command to a physical
drive and the command was never
acknowledged.
Message indicates physical drive that
appears to be scrambled or in a drive
bay other than the one for which it
was originally configured.
Data could not be written to the Array
Accelerator board in duplicate due to
the detection of parity errors. This is
not a data loss situation.
Run the System Configuration Utility to initialize
the monitor and performance features.
The drive or cable may be bad. Check the other
error messages on the Diagnostics screen to
determine resolution.
Examine the graphical drive representation on
ADU to determine proper drive locations.
Remove drive X and place it in drive position Y.
Rearrange the drives according to the ADU
instructions.
Replace the Array Accelerator board.
continued
Diagnostics and Troubleshooting 3-41
Table 3-16
ADU Diagnostic Messages
MessageDescriptionRecommended Action
continued
Error occurred
reading RIS copy
from SCSI Port x
Drive ID x
FYI: Drive (bay) X
is non-Compaq
supplied
Identify
controller data
did not match
with NVRAM
Identify logical
drive data did
not match with
NVRAM
Insufficient
adapter
resources
An error occurred while ADU was
trying to read the RIS from this drive.
The installed drive was not supplied
by Compaq.
The identify controller data from the
array controller did not match the
information stored in nonvolatile
RAM. This could occur if new,
previously configured drives have
been placed in a system that has
also been previously configured. It
could also occur if the firmware on
the controller has been upgraded and
the System Configuration Utility was
not run.
The identify unit data from the array
controller did not match with the
information stored in nonvolatile
RAM. This could occur if new,
previously configured drives have
been placed in a system that has
also been previously configured.
The adapter does not have sufficient
resources to perform operations to
the Array Accelerator board. Drive
rebuild may be occurring.
If there were multiple errors, this drive may
need to be replaced.
If problems exist with this drive, replace it with
a Compaq drive.
Check the identify controller data under the
Inspect Utility. If the firmware version field is the
only difference between the controller and
nonvolatile RAM data, this is not a problem.
Otherwise, run the System Configuration Utility.
Run the System Configuration Utility to
configure the controller and nonvolatile RAM.
Operate the system without the Array
Accelerator board until the drive rebuild
completes.
Less than 75%
batteries at
sufficient voltage
Less than 75%
batteries at
sufficient
voltage. Battery
pack X below
reference
voltage.
Logical drive X
failed due to
cache error
Operation of Array Accelerator board
has been disabled due to less
than 75% of battery packs being at
sufficient voltage level.
Battery pack on Array Accelerator is
below required voltage levels.
This logical drive failed due to a
catastrophic cache error.
Allow sufficient time for batteries to recharge
(36 hours). If batteries have not recharged
after 36 hours, replace the Array Accelerator
board.
Allow sufficient time for batteries to recharge
(36 hours). If batteries have not recharged
after 36 hours, replace the Array Accelerator
board.
Replace the Array Accelerator board and
reconfigure using the Array Configuration Utility.
continued
3-42 Compaq ProLiant ML370 Maintenance and Service Guide
Table 3-16
ADU Diagnostic Messages
MessageDescriptionRecommended Action
continued
Logical Drive X
status = FAILED
Logical Drive X
status =
INTERIM
RECOVERY
Logical Drive X
status = LOOSE
CABLE
DETECTED
Logical Drive X
status = NEEDS
RECOVER
This status could be issued for
several reasons. If this logical drive is
configured for No Fault Tolerance
and one or more drives fail, this
status will occur. If mirroring is
enabled, and any two mirrored drives
fail, this status will occur. If Data
Guarding is enabled, and two or
more drives fail in this unit, this
status will occur. This status may
also occur if another configured
logical drive is in the WRONG DRIVE
REPLACED or LOOSE CABLE
DETECTED state.
A physical drive in this logical drive
has failed. The logical drive is
operating in interim recovery mode
and is vulnerable.
A physical drive has a cabling
problem.
A physical drive in this logical drive
has failed and has now been
replaced. This drive needs to be
rebuilt from the mirror drive or the
parity data.
Check for drive failures, wrong drive replaced,
or loose cable messages. If there was a drive
failure, replace the failed drive(s), then restore
the data for this logical drive from the tape
backup. Otherwise, follow the wrong drive
replaced or loose cable detected procedures.
Replace the failed drive as soon as possible.
Turn the system off and attempt to reattach the
cable onto the drive. If the error recurs, replace
the cable.
When starting the system, select the "F1rebuild
drive" option to rebuild the replaced drive.
Logical Drive X
status =
OVERHEATED
Logical Drive X
status =
OVERHEATING
Logical Drive X
status =
RECOVERING
Logical Drive X
status = WRONG
DRIVE REPLACED
Loose cable
detected—
logical drives
may be marked
FAILED until
corrected
The Intelligent Array Expansion
System drive temperature is beyond
safe operating levels and the system
has shut down to avoid damage.
The Intelligent Array Expansion
System drive temperature is beyond
safe operating levels and the system
has shut down to avoid damage.
A physical drive in this logical drive
has failed and has now been
replaced. The replaced drive is
rebuilding from the mirror drive or
the parity data.
A physical drive in this logical drive
has failed. The incorrect drive was
replaced.
ADU found loose cable. If logical
drives are marked FAILED, those
logical drives will be unusable until
problem corrected.
Check the fans and the operating environment.
Check the fans and the operating environment.
Nothing needs to be done. Normal operations
should resume.
Replace the drive that was incorrectly replaced.
Then, replace the original drive that failed with a
new drive. Do not run the System Configuration
Utility to reconfigure; data on drive will be lost.
Power down the system. Check cable(s) for tight
connection to logical drives. Restart system. If
same message recurs, cable(s) may be bad.
continued
Diagnostics and Troubleshooting 3-43
Table 3-16
ADU Diagnostic Messages
MessageDescriptionRecommended Action
continued
Loose cable
detected—
logical drives
may be marked
FAILED until
corrected
Mirror data
miscompare
No configuration
for accelerator
board
NVRAM
configuration
present,
controller not
detected
RIS copies
between drives
do not match
SCSI Port x Drive
ID x has
exceeded
threshold(s)
Controller unable to communicate
with one or more physical drives,
probably because of a cabling
problem. Logical drives may be in a
FAILED state until the condition is
corrected, preventing access to data
on the controller.
Data was found at reinitialization in
the posted write memory; however,
the mirror data compare test failed
resulting in data being marked as
invalid. Data loss is possible.
The Array Accelerator board has not
been configured.
Nonvolatile RAM has a configuration
for an array controller, but there is no
board in this slot. Either a board has
been removed from the system or a
board has been placed in the wrong
slot.
Drives on controller contain copies of
RIS which do not match.
Monitor and performance threshold
for this drive has been violated.
Check all controller and drive cable
connections.
Replace the Array Accelerator board.
If the Array Accelerator board is present, run the
Array Configuration Utility (ACU) to configure the
board.
Place the array controller in the proper slot or
run the System Configuration Utility to
reconfigure nonvolatile RAM to reflect the
removal or new position.
Upgrade ADU to most recent version.
Check for the particular threshold that has been
violated.
SCSI Port x Drive
ID x is not
stamped for
monitoring
SCSI Port x Drive
ID x RIS copy
mismatch
SCSI Port x Drive
ID x failed—
REPLACE (failure
message)
SCSI Port x Drive
ID x firmware
needs upgrading
Drive has not been stamped with
monitor and performance features.
Copies of RIS on drive do not match.Drive may need to be replaced. Check for other
ADU found a drive that needs to be
replaced.
Drive firmware is below
recommended version.
Run the Array Configuration Utility (ACU).
Changing the configuration and saving should
cause ACU to stamp drive with monitor and
performance features. To do this without
destroying the current configuration, change
Array Accelerator size and save configuration.
Change the Array Accelerator back to original
size and save again.
errors.
Replace drive or correct condition that caused
error.
Run Options ROMPaq to upgrade drive to latest
firmware revision.
continued
3-44 Compaq ProLiant ML370 Maintenance and Service Guide
Table 3-16
ADU Diagnostic Messages
MessageDescriptionRecommended Action
continued
SCSI Port x Drive
ID x has a loose
cable. SMART is
unable to
communicate
with drive
SCSI Port x Drive
ID x was
replaced on a
good volume:
(failure message)
SCSI port X,
drive ID Y
firmware needs
upgrading
Set configuration
command issued
Soft Firmware
Upgrade
required
Drive has loose cablePower down the system. Check drive cable for
tight connection.
ADU found drive was replaced even
though volume was fine.
Drive’s firmware may cause
problems and should be upgraded.
The configuration of the array
controller has been updated. The
Array Accelerator board may remain
disabled until it is reinitialized.
ADU has determined that the
controller is running firmware that
has been soft upgraded by the
Compaq Upgrade Utility. However,
the firmware running is not present
on all drives. This could be caused by
the addition of new drives in the
system.
No action needs to be taken.
Run Options ROMPaq to upgrade the firmware
on the drive to a later version.
Run the System Configuration Utility to
reinitialize the Array Accelerator board.
Run the Compaq Upgrade Utility to place the
latest firmware on all drives.
Unable to
communicate
with drive on
SCSI Port x,
Drive ID x
Unknown disable
code
Unrecoverable
read error
WARNING—
Drive Write
Cache is enabled
on X
Controller could not communicate
with drive.
A code was returned from the Array
Accelerator board that ADU does not
recognize.
Read parity errors were detected
when attempting to read same data
from both sides of mirrored memory.
Data loss will occur.
Drive has its internal write cache
enabled. The drive may be a
third-party drive or the operating
parameters of the drive may have
been altered. This condition may
cause data corruption if power to the
drive is interrupted.
If the amber LED on the drive is on, replace the
drive.
Obtain the latest version of ADU.
Replace the Array Accelerator board.
Replace the drive with a Compaq supplied drive,
or restore the operating parameters of the drive.
continued
Diagnostics and Troubleshooting 3-45
Table 3-16
ADU Diagnostic Messages
MessageDescriptionRecommended Action
continued
Warning bit
detected
Write memory
error
Wrong
Accelerator
A monitor and performance threshold
violation may have occurred. Status
of a logical drive may not be OK.
Data could not be written to cache
memory. This typically means that
parity error was detected while
writing data to cache. This could be
caused by incomplete connection
between cache and controller. This is
not a data loss circumstance.
This could mean that either the board
was replaced in the wrong slot or
placed in a system that was
previously configured with another
board type. Included with this
message is a message indicating the
type of adapter sensed by ADU and a
message indicating the type of
adapter last configured nonvolatile
RAM.
Check the other error messages on the
diagnosis screen for an indication of the
problem.
With power to system turned off, verify that the
cache board is fully connected to controller.
Check the diagnosis screen for other error
messages. Run the System Configuration Utility
to update the system configuration.
3-46 Compaq ProLiant ML370 Maintenance and Service Guide
Integrated Management Log
On servers supporting the Integrated Management Display, the Compaq Integrated Management
Log (IML) replaces the Critical Error Log and Correctable Memory Logs. IML records system
events and stores them in an easily viewable form. Each event is marked with a time-stamp with
one-minute granularity.
Events listed in the Integrated Management Log are categorized as one of four event severity
levels:
■ Statusindicates that the message is informational only.
■ Repairedindicates that corrective action has been taken.
■ Cautionindicates a nonfatal error condition.
■ Criticalindicates a component failure.
The Integrated Management Log requires Compaq operating system-dependent drivers. Refer to
the Compaq Support Software CD for instructions on installing the appropriate drivers.
Refer to the setup and installation guide that came with the server or the Integrated ManagementDisplay User Guide provided with the Integrated Management Display option kit.
Multiple Ways of Viewing the Log
You can view an event in the IML in several ways:
■ On the Integrated Management Display
■ From within Compaq Insight Manager
■ From within the Compaq Survey Utility
■ From within the IML Management Utility
Integrated Management Display
The Integrated Management Display is an LCD panel that presents information directly at the
server, assisting in diagnosing and servicing the server without a keyboard and monitor.
Compaq Insight Manager
Compaq Insight Manager is a comprehensive management tool to monitor and control the
operation of Compaq servers and clients. Compaq Insight Manager consists of two components:
a Windows-based console application, and server- or client-based management data collection
agents. Starting with Compaq Insight Manager 4.0, the agents for Windows NT and NetWare
are also Web-enhanced; that is, these agents enable Web browser access and monitoring of
management information.
The management agents monitor over 1,000 management parameters. Key subsystems are
instrumented to make health, configuration, and performance data available to agent software.
Agents act upon data by initiating information, such as network interface or storage subsystem
performance statistics.
Loading...
+ hidden pages
You need points to download manuals.
1 point = 1 manual.
You can buy points or you can get point for every manual you upload.