This software and related documentation are provided under a license agreement containing restrictions on use and disclosure and are protected by intellectual property laws. Except
as expressly permitted in your license agreement or allowed by law, you may not use, copy, reproduce, translate, broadcast, modify, license, transmit, distribute, exhibit, perform,
publish, or display any part, in any form, or by any means. Reverse engineering, disassembly, or decompilation of this software, unless required by law for interoperability, is
prohibited.
The information contained herein is subject to change without notice and is not warranted to be error-free. If you find any errors, please report them to us in writing.
If this is software or related documentation that is delivered to the U.S. Government or anyone licensing it on behalf of the U.S. Government, then the following notice is applicable:
U.S. GOVERNMENT END USERS. Oracle programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation, delivered
to U.S. Government end users are "commercial computer software" pursuant to the applicable Federal Acquisition Regulation and agency-specific supplemental regulations. As
such, use, duplication, disclosure, modification, and adaptation of the programs, including any operating system, integrated software, any programs installed on the hardware, and/or
documentation, shall be subject to license terms and license restrictions applicable to the programs. No other rights are granted to the U.S. Government.
This software or hardware is developed for general use in a variety of information management applications. It is not developed or intended for use in any inherently dangerous
applications, including applications that may create a risk of personal injury. If you use this software or hardware in dangerous applications, then you shall be responsible to take all
appropriate fail-safe, backup, redundancy, and other measures to ensure its safe use. Oracle Corporation and its affiliates disclaim any liability for any damages caused by use of this
software or hardware in dangerous applications.
Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners.
Intel and Intel Xeon are trademarks or registered trademarks of Intel Corporation. All SPARC trademarks are used under license and are trademarks or registered trademarks of
SPARC International, Inc. AMD, Opteron, the AMD logo, and the AMD Opteron logo are trademarks or registered trademarks of Advanced Micro Devices. UNIX is a registered
trademark of The Open Group.
This software or hardware and documentation may provide access to or information about content, products, and services from third parties. Oracle Corporation and its affiliates are
not responsible for and expressly disclaim all warranties of any kind with respect to third-party content, products, and services unless otherwise set forth in an applicable agreement
between you and Oracle. Oracle Corporation and its affiliates will not be responsible for any loss, costs, or damages incurred due to your access to or use of third-party content,
products, or services, except as set forth in an applicable agreement between you and Oracle.
Documentation Accessibility
For information about Oracle's commitment to accessibility, visit the Oracle Accessibility Program website at http://www.oracle.com/pls/topic/lookup?ctx=acc&id=docacc.
Access to Oracle Support
Oracle customers that have purchased support have access to electronic support through My Oracle Support. For information, visit http://www.oracle.com/pls/topic/lookup?
ctx=acc&id=info or visit http://www.oracle.com/pls/topic/lookup?ctx=acc&id=trs if you are hearing impaired.
Ce logiciel et la documentation qui l’accompagne sont protégés par les lois sur la propriété intellectuelle. Ils sont concédés sous licence et soumis à des restrictions d’utilisation et
de divulgation. Sauf stipulation expresse de votre contrat de licence ou de la loi, vous ne pouvez pas copier, reproduire, traduire, diffuser, modifier, breveter, transmettre, distribuer,
exposer, exécuter, publier ou afficher le logiciel, même partiellement, sous quelque forme et par quelque procédé que ce soit. Par ailleurs, il est interdit de procéder à toute ingénierie
inverse du logiciel, de le désassembler ou de le décompiler, excepté à des fins d’interopérabilité avec des logiciels tiers ou tel que prescrit par la loi.
Les informations fournies dans ce document sont susceptibles de modification sans préavis. Par ailleurs, Oracle Corporation ne garantit pas qu’elles soient exemptes d’erreurs et vous
invite, le cas échéant, à lui en faire part par écrit.
Si ce logiciel, ou la documentation qui l’accompagne, est concédé sous licence au Gouvernement des Etats-Unis, ou à toute entité qui délivre la licence de ce logiciel ou l’utilise pour
le compte du Gouvernement des Etats-Unis, la notice suivante s’applique:
U.S. GOVERNMENT END USERS. Oracle programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation, delivered
to U.S. Government end users are "commercial computer software" pursuant to the applicable Federal Acquisition Regulation and agency-specific supplemental regulations. As
such, use, duplication, disclosure, modification, and adaptation of the programs, including any operating system, integrated software, any programs installed on the hardware, and/or
documentation, shall be subject to license terms and license restrictions applicable to the programs. No other rights are granted to the U.S. Government.
Ce logiciel ou matériel a été développé pour un usage général dans le cadre d’applications de gestion des informations. Ce logiciel ou matériel n’est pas conçu ni n’est destiné
à être utilisé dans des applications à risque, notamment dans des applications pouvant causer des dommages corporels. Si vous utilisez ce logiciel ou matériel dans le cadre d’
applications dangereuses, il est de votre responsabilité de prendre toutes les mesures de secours, de sauvegarde, de redondance et autres mesures nécessaires à son utilisation dans des
conditions optimales de sécurité. Oracle Corporation et ses affiliés déclinent toute responsabilité quant aux dommages causés par l’utilisation de ce logiciel ou matériel pour ce type
d’applications.
Oracle et Java sont des marques déposées d’Oracle Corporation et/ou de ses affiliés. Tout autre nom mentionné peut correspondre à des marques appartenant à d’autres propriétaires
qu’Oracle.
Intel et Intel Xeon sont des marques ou des marques déposées d’Intel Corporation. Toutes les marques SPARC sont utilisées sous licence et sont des marques ou des marques
déposées de SPARC International, Inc. AMD, Opteron, le logo AMD et le logo AMD Opteron sont des marques ou des marques déposées d’Advanced Micro Devices. UNIX est une
marque déposée d’The Open Group.
Ce logiciel ou matériel et la documentation qui l’accompagne peuvent fournir des informations ou des liens donnant accès à des contenus, des produits et des services émanant de
tiers. Oracle Corporation et ses affiliés déclinent toute responsabilité ou garantie expresse quant aux contenus, produits ou services émanant de tiers, sauf mention contraire stipulée
dans un contrat entre vous et Oracle. En aucun cas, Oracle Corporation et ses affiliés ne sauraient être tenus pour responsables des pertes subies, des coûts occasionnés ou des
dommages causés par l’accès à des contenus, produits ou services tiers, ou à leur utilisation, sauf mention contraire stipulée dans un contrat entre vous et Oracle.
Accessibilité de la documentation
Pour plus d’informations sur l’engagement d’Oracle pour l’accessibilité à la documentation, visitez le site Web Oracle Accessibility Program, à l'adresse http://www.oracle.com/
pls/topic/lookup?ctx=acc&id=docacc.
Accès au support électronique
Les clients Oracle qui ont souscrit un contrat de support ont accès au support électronique via My Oracle Support. Pour plus d'informations, visitez le site http://www.oracle.com/
pls/topic/lookup?ctx=acc&id=info ou le site http://www.oracle.com/pls/topic/lookup?ctx=acc&id=trs si vous êtes malentendant.
Page 5
Contents
Using This Documentation ............. ................ ................ ................ ................ ... 11
Index ............. ................ ................ ................ ................ ................ ................ ... 319
10Oracle Server X5-2L Service Manual • May 2015
Page 11
Using This Documentation
■
Overview – This service manual explains how to remove and replace parts in the Oracle
Server X5-2L, and how to maintain and troubleshoot the system.
■
Audience – This guide is intended for trained technicians and authorized service personnel
who have been instructed on the hazards within the equipment and qualified to remove and
replace hardware.
■
Required knowledge – Advanced experience troubleshooting and replacing hardware.
Product Documentation Library
Documentation and resources for this product and related products are available at http://www.
oracle.com/goto/X5-2L/docs.
Feedback
Provide feedback about this documentation at http://www.oracle.com/goto/docfeedback.
Using This Documentation11
Page 12
12Oracle Server X5-2L Service Manual • May 2015
Page 13
About the Oracle Server X5-2L
These sections describe the controls, connectors, LEDs, system components, and replaceable
components of the server.
Note - Always update the server with the latest firmware, drivers, and other hardware-related
software by downloading the latest software release package when you first receive the server,
and for every new software release. For information about the software release packages and
how to download the software, refer to “Getting Server Firmware and Software Updates” in
“Oracle Server X5-2L Installation Guide ”.
DescriptionLinks
Review the product description.“Product Description” on page 13
Review the controls and connectors on the server.“About Controls and Connectors” on page 14
Review system components.“About System Components” on page 19
Product Description
The Oracle Server X5-2L is an enterprise-class, two rack unit (2U) server. It supports the
following components:
■
Up to two Intel processors. Processors with the following capabilities are supported:
■
18-core, 2.3 GHz, 145W processors
■
12-core, 2.6 GHz, 135W processors
■
10-core, 2.6 GHz, 105W processors
■
8-core, 2.4 GHz, 85W processors
■
Up to 12 DIMMs per processor for a maximum of 24 DDR4 DIMMs and a maximum of
768 GB of memory are supported in dual-processor systems. A maximum of 12 DDR4
DIMMs and a maximum of 384 GB of memory are supported in single-processor systems.
RDIMMs (8 GB and 16 GB) and LRDIMMs (32 GB) are supported.
■
Six PCIe Gen3 slots in the dual-processor systems. PCIe slots 1, 2, and 3 are nonfunctional
in single-processor systems.
■
Storage drive configurations can comprise both hard disk drives (HDDs) or solid state disk
drives (SSDs). Configurations include:
About the Oracle Server X5-2L13
Page 14
About Controls and Connectors
■
Up to twelve 3.5-inch hot-pluggable SAS HDDs
■
Up to twenty-four 2.5-inch hot-pluggable SAS HDDs or SSDs, with optional support for
up to four PCIe-based NVMe SSDs
■
Up to eight 2.5-inch hot-pluggable SAS HDDs or SSDs, with optional support for a
SATA DVD and up to four PCIe-based NVMe SSDs
■
(Optional) Up to two rear-mounted 2.5-inch SAS storage drives in the twelve 3.5-inch
storage drive and twenty-four 2.5-inch storage drive configurations
■
Two hot-pluggable, redundant 1000W power supplies.
■
An on-board Oracle Integrated Lights Out Manager (Oracle ILOM) service processor (SP)
based on the Emulex Pilot 3 chip.
■
The optional Oracle System Assistant server setup tool, which is embedded on a preinstalled
USB flash drive.
About Controls and Connectors
The following sections describe the controls, indicators, connectors, and drives located on the
front and rear panels.
■
“Front Panel Components on Twelve Drive Systems” on page 16
■
“Front Panel Components on Twenty-Four Drive Systems” on page 17
■
“Rear Panel Components and Cable Connections” on page 18
Related Information
■
“About System Components” on page 19
■
“Illustrated Parts Breakdown” on page 19
■
“Troubleshooting Using the Server Front and Rear Panel Status Indicators ” on page 30
Front Panel Components on Eight Drive Systems
Note - Systems with eight drives can contain an optional DVD drive. Systems with twelve
drives and twenty-four drives do not contain a DVD drive.
14Oracle Server X5-2L Service Manual • May 2015
Page 15
About Controls and Connectors
FIGURE 1
Figure Legend
1
Locator LED/Locator button: white
2
Service Required LED: amber
3
Power/OK LED: green
4
Power button
5
SP OK LED: green
6
Product Serial Number (PSN) and Radio Frequency Identification (RFID) label
7
Storage drive 0 (HDD/SSD)
8
Storage drive 1 (HDD/SSD)
9
Storage drive 2 (HDD/SSD/NVMe 0)
10
Storage drive 3 (HDD/SSD/NVMe 1)
11
Storage drive 4 (HDD/SSD/NVMe 2)
12
Storage drive 5 (HDD/SSD/NVMe 3)
13
Storage drive 6 (HDD/SSD)
14
Storage drive 7 (HDD/SSD)
15
Optional SATA DVD drive
16
Top Fan Fault LED: amber
17
Power Supply (PS) Fault LED: amber
18
System Over Temperature Warning LED: amber
19
USB 2.0 connectors (2)
Front Panel With Eight 2.5-Inch Drives and DVD Configuration
Related Information
■
“Front Panel Components on Twelve Drive Systems” on page 16
■
“Front Panel Components on Twenty-Four Drive Systems” on page 17
About the Oracle Server X5-2L15
Page 16
About Controls and Connectors
■
“Troubleshooting Using the Server Front and Rear Panel Status Indicators ” on page 30
Front Panel Components on Twelve Drive Systems
FIGURE 2
Figure Legend
1
Locator LED/Locator button: white
2
Service Required LED: amber
3
Power/OK LED: green
4
Power button
5
SP OK LED: green
6
Storage drive 0 (HDD/SSD)
7
Storage drive 1 (HDD/SSD)
8
Storage drive 2 (HDD/SSD)
9
Storage drive 3 (HDD/SSD)
10
Storage drive 4 (HDD/SSD)
11
Storage drive 5 (HDD/SSD)
12
Storage drive 6 (HDD/SSD)
13
Storage drive 7 (HDD/SSD)
14
Storage drive 8 (HDD/SSD)
15
Storage drive 9 (HDD/SSD)
16
Storage drive 10 (HDD/SSD)
17
Storage drive 11 (HDD/SSD)
18
Top Fan Fault LED: amber; Power Supply (PS) Fault LED: amber; System Over Temperature Warning LED:
amber
Front Panel With Twelve 3.5-Inch Drives Configuration
16Oracle Server X5-2L Service Manual • May 2015
Page 17
About Controls and Connectors
19
USB 2.0 connectors (2)
Related Information
■
“Front Panel Components on Eight Drive Systems” on page 14
■
“Front Panel Components on Twenty-Four Drive Systems” on page 17
■
“Troubleshooting Using the Server Front and Rear Panel Status Indicators ” on page 30
Front Panel Components on Twenty-Four Drive
Systems
FIGURE 3
Figure Legend
1
Locator LED/Locator button: white
2
Service Required LED: amber
3
Power/OK LED: green
4
Power button
5
SP OK LED: green
6
Storage drives 0 through 11 (HDD/SSD; Storage drives 3 and 4: HDD/SSD/NVMe)
7
Storage drives 12 through 23 (HDD/SSD; Storage drives 19 and 20: HDD/SSD/NVMe)
8
Top Fan Fault LED: amber; Power Supply (PS) Fault LED: amber; System Over Temperature Warning LED:
amber
Front Panel With Twenty-Four 2.5-Inch Drives Configuration
About the Oracle Server X5-2L17
Page 18
About Controls and Connectors
9
USB 2.0 connectors (2)
Related Information
■
“Front Panel Components on Eight Drive Systems” on page 14
■
“Front Panel Components on Twelve Drive Systems” on page 16
■
“Troubleshooting Using the Server Front and Rear Panel Status Indicators ” on page 30
Rear Panel Components and Cable Connections
Note - Systems that are configured with twelve and twenty-four front panel storage drives also
contain two rear-mounted storage drives. Systems that are configured with eight front panel
storage drives do not contain rear-mounted storage drives.
FIGURE 4
Figure Legend
1
Rear storage drive 0 (HDD/SSD)
2
Rear storage drive 1 (HDD/SSD)
3
Power supply unit 0 (PS0)
4
Power supply unit 1 (PS1)
5
PCIe slot 1 (Nonfunctional in single-processor systems.)
6
PCIe slot 2 (Nonfunctional in single-processor systems.)
7
PCIe slot 3 (Nonfunctional in single-processor systems. This slot is the default slot for the optional Oracle PCIe
Switch Controller Card.)
8
Network (NET) 100/1000/10000 ports (NET3–NET0) (NET2 and NET3 are nonfunctional in single-processor
systems.)
9
System status LEDs: Locator/Button: white; Service Required: amber; Power/OK: green
Rear Panel Components With Storage Drives Configuration
18Oracle Server X5-2L Service Manual • May 2015
Page 19
About System Components
10
Serial management (SER MGT)/RJ-45 serial port
11
Oracle Integrated Lights Out Manager (ILOM) service processor (SP) network management 10/100/1000BASET port (NETMGT)
12
USB 2.0 ports (2)
13
DB-15 video connector
14
PCIe slot 4
15
PCIe slot 5
16
PCIe slot 6 (This slot is the default slot for the Oracle Storage 12 Gb/s SAS PCIe RAID HBA.)
Note - For more information on rear panel components and connections, refer to “Rear Panel
Connector Locations” on page 42. See also “Rear Panel Pinhole Switches” on page 47.
Related Information
■
“Troubleshooting Using the Server Front and Rear Panel Status Indicators ” on page 30
■
“Disconnect Cables From the Server” on page 60
■
“Reconnect Power and Data Cables” on page 233
■
“Identifying the Server Ports”
About System Components
These sections describe the components of the server:
■
“Illustrated Parts Breakdown” on page 19
■
“Customer-Replaceable Units” on page 21
■
“Field-Replaceable Units” on page 22
Related Information
■
“Servicing CRUs That Do Not Require Server Power-Off”
■
“Servicing CRUs That Require Server Power-Off”
■
“Servicing FRUs”
Illustrated Parts Breakdown
The following figure identifies the major components of the server.
About the Oracle Server X5-2L19
Page 20
About System Components
FIGURE 5
System Components
Figure
Description
Legend
1aTwelve 3.5-inch storage drives
1bTwenty-four 2.5-inch storage drives
1cEight 2.5-inch storage drives
2aFront disk backplane for twelve 3.5-inch drives
2bFront disk backplane for twenty-four 2.5-inch drives
2cFront disk backplane for eight 2.5-inch drives
20Oracle Server X5-2L Service Manual • May 2015
Page 21
About System Components
Figure
Description
Legend
3Left LED indicator module
4System chassis
5Motherboard assembly
6System battery
7Fan modules
8Processors and heatsinks (Single-processor systems contain only a single processor in socket P0; socket P1
contains a cover to protect processor socket pins.)
9DIMMs (Only twelve DIMMs are supported in single-processor systems and the DIMMs must be installed
in P0 DIMM sockets.)
10Air baffle
11Top cover
12Oracle Storage 12 Gb/s SAS PCIe RAID HBA and super capacitor
13PCIe cards (PCIe slots 1, 2, and 3 are nonfunctional in single-processor systems.)
14I/O and internal USB card
15Internal USB drive
16Rear-mounted storage drives
17Rear storage drive backplane
18Power supplies
19Right LED indicator module
20DVD drive (optional)
Customer-Replaceable Units
The following table lists the customer-replaceable units (CRUs) in the server and directs you to
the replacement instructions.
CRUDescriptionReplacement Instructions
BatteryLithium coin-cell battery that powers the CMOS BIOS
and real-time clock.
DIMMsAdd or replace memory modules in the system.“Servicing the DIMMs (CRU)” on page 101
Storage drivesStorage drive configurations can comprise both hard
disk drives (HDDs) or solid state disk drives (SSDs).
Configurations include:
■ Up to twelve 3.5-inch hot-pluggable SAS HDDs
■ Up to twenty-four 2.5-inch hot-pluggable SAS
HDDs or SSDs, with optional support for up to four
PCIe-based NVMe SSDs
“Servicing the Battery (CRU)” on page 126
“Servicing Storage Drives and Rear Drives
(CRU)” on page 71
About the Oracle Server X5-2L21
Page 22
About System Components
CRUDescriptionReplacement Instructions
■ Up to eight 2.5-inch hot-pluggable SAS HDDs or
SSDs, with optional support for up to four PCIebased NVMe SSDs
Fan modulesFour fan modules for cooling the server components.“Servicing Fan Modules (CRU)” on page 89
PCIe cardsOptional add-on cards that can expand the functionality
Power supply unit
(PSU)
of the server.
Two fully redundant AC-powered power supplies.“Servicing Power Supplies (CRU)” on page 93
“Servicing PCIe Cards (CRU)” on page 115
Related Information
■
“Field-Replaceable Units” on page 22
■
“Illustrated Parts Breakdown” on page 19
■
“Servicing CRUs That Do Not Require Server Power-Off”
■
“Servicing CRUs That Require Server Power-Off”
Field-Replaceable Units
The following table lists the field-replaceable units (FRUs) in the server and directs you to the
replacement instructions.
FRUDescriptionReplacement Instructions
Processor and
heatsink
Disk backplanesProvide power and communications connectors for
SAS cablesProvide signals between the front disk backplane and
NVMe cablesProvide signals between the Oracle PCIe NVMe switch
Oracle Storage 12
Gb/s SAS PCIe
RAID HBA
DVD driveOptional DVD drive on configurations with eight 2.5-
Front LED/USB
indicator modules
Motherboard
assembly
The processor that carries out the instructions of the
system.
storage drives.
the Oracle Storage 12 Gb/s SAS PCIe RAID HBA,
and from the front disk backplane to the rear disk
backplane.
card and the disk backplane.
Located in PCIe slot 6, the Oracle Storage 12 Gb/s SAS
PCIe RAID HBA manages SAS storage drives.
inch drives.
Contain the push-button circuitry and LEDs that are
displayed on the bezel of the chassis.
Provides connectors for the DIMMs, processors, PCIe
risers, and other components.
“Servicing Processors (FRU)” on page 129
“Servicing the Front and Rear Storage Drive Backplanes
(FRU)” on page 146
“Servicing SAS Cables (FRU)” on page 167
“Servicing NVMe Cables (FRU)” on page 174
“Servicing the Oracle Storage 12 Gb/s SAS PCIe RAID
HBA Card (FRU)” on page 180
“Servicing the DVD Drive (FRU)” on page 187
“Servicing the Front LED/USB Indicator Modules
(FRU)” on page 193
“Servicing the Motherboard Assembly
(FRU)” on page 209
22Oracle Server X5-2L Service Manual • May 2015
Page 23
Related Information
■
“Customer-Replaceable Units” on page 21
■
“Illustrated Parts Breakdown” on page 19
■
“Servicing FRUs”
About System Components
About the Oracle Server X5-2L23
Page 24
24Oracle Server X5-2L Service Manual • May 2015
Page 25
Troubleshooting and Diagnostics
This section includes information about troubleshooting hardware component faults for the
Oracle Server X5-2L. It contains the following topics.
DescriptionLink
Maintenance-related information and procedures that you
can use to troubleshoot and repair server hardware issues.
Information about software and firmware diagnostic
tools that you can use to isolate problems, monitor the
server, and exercise the server subsystems.
Information about attaching devices to the server to
perform troubleshooting.
Information about contacting Oracle support.“Getting Help” on page 48
Troubleshooting Server Component Hardware Faults
“Troubleshooting Server Component Hardware
Faults” on page 25
“Troubleshooting With Diagnostic Tools” on page 39
“Attaching Devices to the Server” on page 41
This section contains maintenance-related information and procedures that you can use to
troubleshoot and repair server hardware issues. The following topics are covered.
DescriptionSection Links
Troubleshooting overview information and procedure.“Troubleshooting Server Hardware
Resources for troubleshooting and diagnostic
information.
Discerning the server state using the front and rear panel
indicators.
Causes, actions, and preventative measures for problems
related to the cooling subsystem.
Causes, actions, and preventative measures for problems
related to the power subsystem.
Faults” on page 26
“Troubleshooting and Diagnostic
Information” on page 29
“Troubleshooting Using the Server Front and Rear Panel
Status Indicators ” on page 30
“Troubleshooting System Cooling
Issues” on page 35
“Troubleshooting Power Issues” on page 37
Troubleshooting and Diagnostics25
Page 26
Troubleshooting Server Component Hardware Faults
Troubleshooting Server Hardware Faults
When a server hardware fault event occurs the system lights the Service Required LED and
captures the event in the system event log (SEL). If you have set up notifications through
Oracle ILOM, you also receive an alert through the notification method you chose. When you
become aware of a hardware fault, you should address it immediately.
To investigate a hardware fault, see the following:
■
“Basic Troubleshooting Process” on page 26
■
“Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface” on page 27
Basic Troubleshooting Process
Use the following process to address a hardware fault (for the step-by-step procedure, see
“Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface” on page 27).
1. Identify the server subsystem containing the fault.
You can use Oracle ILOM to identify the failed component.
2.
Review the Oracle Server X5-2L Product Notes.
The product notes contain up-to-date information about the server, including hardwarerelated issues.
3. Prepare the server for service using Oracle ILOM.
If you have determined that the hardware fault requires service (physical access to the
server), use Oracle ILOM to take the server offline, activate the Locate LED, and power off
the server.
4. Prepare the service work space.
Before servicing the server, prepare the work space, ensuring ESD protection for the server
and components.
See “Preparing for Service”.
5. Service components.
To service the components, see the removal, installation, and replacement procedures in this
document.
Note - A component designated as a field-replaceable unit (FRU) must be replaced by
Oracle Service personnel. Contact Oracle Service.
6. Clear the fault in Oracle ILOM.
Depending on the component, you might need to clear the fault in Oracle ILOM. Generally,
components that have a FRU ID clear the fault automatically.
26Oracle Server X5-2L Service Manual • May 2015
Page 27
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Related Information
■
See “Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface” on page 27
Troubleshoot Hardware Faults Using the Oracle ILOM Web
Interface
Note - The screens shown in this procedure might differ from those for your server.
This procedure uses the basic troubleshooting steps described in “Basic Troubleshooting
Process” on page 26.
Use this procedure to troubleshoot hardware faults with the Oracle ILOM web interface and, if
necessary, prepare the server for service.
Note - This procedure provides one basic approach to troubleshooting hardware faults. It
uses a combination of the Oracle ILOM web interface and command-line interface (CLI).
However, the procedure can be performed using only the Oracle ILOM CLI interface. For
more information about the Oracle ILOM web interface and CLI, refer to the Oracle ILOM
documentation.
Before beginning this procedure, obtain the latest version of the Oracle Server X5-2L ProductNotes.
1.
Log in to the server SP Oracle ILOM web interface.
Open a browser and direct it using the IP address of the server SP. Enter a user name (with
administrator privileges) and password at the log-in screen. The Summary Information screen
appears.
The Status section of the Summary Information screen provides information about the server
subsystems, including:
■
Processors
■
Memory
■
Power
■
Cooling
■
Storage
■
Networking
Troubleshooting and Diagnostics27
Page 28
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
2.
In the Status section of the Summary Information screen, identify the server
subsystem that requires service.
In the above example, the Status screen shows that the Processor subsystem requires service.
This indicates that a hardware component within the subsystem is in a fault state.
3.
To identify the component, click on Processors in the Status section.
The Processor screen appears.
The above example shows the Processors subsystem screen and indicates that CPU 0 has a
fault.
4.
To get more information, click one of the Open Problems links.
28Oracle Server X5-2L Service Manual • May 2015
Page 29
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
The Open Problems screen provides detailed information, such as the time the event occurred,
the component and subsystem name, and a description of the issue. It also includes a link to an
Oracle KnowledgeBase article.
Tip - The System Log provides a chronological list of all the system events and faults that
have occurred since the log was last reset and includes additional information, such as severity
levels and error counts. The System Log also includes information on device not reported in the
Subsystem Summary screen. To access the System Log, in the left panel, click System Log .
In this example, the hardware fault with CPU 0 requires local/physical access to the server.
5.
Before going to the server, review the Oracle Server X5-2L Product Notes for
information related to the issue or the component.
The product notes contain up-to-date information about the server, including hardware-related
issues.
6.
To prepare the server for service, see “Preparing for Service”.
After servicing the component, you might need to clear the fault in Oracle ILOM. For more
information, refer the service procedure for the component.
7.
Service the component.
Troubleshooting and Diagnostic Information
The following table lists diagnostic and troubleshooting-related procedures and references that
can assist you with resolving server issues.
DescriptionLink
Diagnostic information for the x86 servers, including
procedures for performing runtime and firmware-based
tests, using Oracle ILOM, and running U-Boot and UEFI
Diagnostics tests to exercise the system and isolate subtle
and intermittent hardware-related problems.
Administrative information for the Oracle Sun Server
X-5 series servers, including information about how to
use Oracle System Assistant and using the Oracle ILOM
system event log (SEL) to identify a problem's possible
source.
Information about troubleshooting front and rear panel
status indicators.
Information about how to manage server hardware faults
using the Oracle ILOM Fault Management Shell.
Oracle x86 Servers Diagnostics, Applications, and
Utilities Guide for Servers with Oracle ILOM 3.1 and
Oracle ILOM 3.2.x
Oracle X5 Series Servers Administration Guide
“Troubleshooting Using the Server Front and Rear Panel
Status Indicators ” on page 30
“Managing Server Hardware Faults Through the Oracle
ILOM Fault Management Shell” on page 39
Troubleshooting and Diagnostics29
Page 30
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Troubleshooting Using the Server Front and Rear
Panel Status Indicators
These sections describe the status indicators (LEDs) located on the front and rear of the server,
including those found on components and ports. This section includes the following topics:
■
“Server Boot Process and Normal Operating State Indicators” on page 30
■
“Server System-Level Status Indicators” on page 31
■
“Server Fan Status Indicators” on page 32
■
“Storage and Boot Drive Indicators” on page 32
■
“Power Supply Status Indicators” on page 33
■
“Network Management Port Status Indicators” on page 33
■
“Ethernet Ports Status Indicators” on page 34
■
“Motherboard Status Indicators” on page 34
Related Information
■
“Front Panel Components on Eight Drive Systems” on page 14
■
“Front Panel Components on Twelve Drive Systems” on page 16
■
“Front Panel Components on Twenty-Four Drive Systems” on page 17
■
“Rear Panel Components and Cable Connections” on page 18
Server Boot Process and Normal Operating State Indicators
A normal server boot process involves two indicators, the service processor SP OK LED
indicator and the Power/OK LED indicator.
When AC power is connected to the server, the server boots into standby power mode:
1. The SP OK LED blinks slowly (0.5 seconds on, 0.5 seconds off) while the SP is starting,
and the main Power/OK LED remains off until the SP is ready.
2. After a few minutes, the main Power/OK LED slowly flashes the standby blink pattern (0.1
seconds on, 2.9 seconds off), indicating that the SP (and Oracle ILOM) is ready for use. In
standby power mode, the server is not initialized or fully powered on at this point.
When powering on the server (either by the Power button or Oracle ILOM), the server boots to
full power mode:
1. The Power/OK LED blinks slowly (0.5 seconds on, 0.5 seconds off), and the SP OK LED
remains lit (no blinking).
2. When the server has successfully booted, the Power/OK LED remains lit. When the Power/
OK LED and the SP OK LED indicators remain lit, the server is in full power mode.
30Oracle Server X5-2L Service Manual • May 2015
Page 31
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Note - The green Power/OK LED indicator and the green SP OK indicator remain lit (no
blinking) when the server is in a normal operating state.
Server System-Level Status Indicators
There are seven system-level status indicators (LEDs), some of which are located on both the
server front panel and the server back panel. The following table describes these indicators.
TABLE 1
Status
Indicator
Name
Locator LED
and button
Service
Required
Power/OKGreenIndicates the operational state of the chassis.
SP OKGreenIndicates the state of the service processor.
Server System-Level Status Indicators
IconColorState and Meaning
WhiteIndicates the location of the server.
■ OFF – Server is operating normally.
■ FAST BLINK – Use Oracle ILOM to activate this LED indicator to enable you to locate a
particular system quickly and easily.
■ Pressing the Locate button will toggle the LED indicator fast blink on or off.
AmberIndicates the fault state of the server.
■ OFF – The server is operating normally.
■ STEADY ON – A fault is present on the server. This LED indicator lights whenever a fault
indicator lights for a replaceable component on the server.
Note - When this LED indicator is lit, a system console message might appear that includes a
recommended service action.
■ OFF – AC power is not present or the Oracle ILOM boot is not complete.
■ STANDBY BLINK – Standby power is on, but the chassis power is off and the Oracle
ILOM SP is running.
■ SLOW BLINK – Startup sequence has been initiated on the host. This pattern should begin
soon after you power on the server. This status indicates either: power-on self-test (POST)
code checkpoint tests are running on the server host system, or the host is transitioning from
the powered-on state to the standby state on shutdown.
■ STEADY ON – The server is powered on, and all host POST code checkpoint tests
are complete. The server is in one of the following states: the server host is booting the
operating system (OS), or the server host is running the OS.
Top FanTopAmberIndicates that one or more of the internal fan modules have failed.
■ OFF – Service processor (SP) is not running.
■ SLOW BLINK – SP is booting.
■ STEADY ON – SP is fully operational.
■ OFF – Indicates steady state; no service is required.
■ STEADY ON – Indicates service required.
Troubleshooting and Diagnostics31
Page 32
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Status
Indicator
Name
Rear Power
Supply Fault
System Over
Temperature
Warning
IconColorState and Meaning
RearAmberIndicates that one of the server power supplies has failed.
■ OFF – Indicates steady state; no service is required.
■ STEADY ON – Indicates service required; service the power supply.
AmberIndicates a warning for an overtemperature condition.
■ OFF – Normal operation; no service is required.
■ STEADY ON – The system is experiencing an overtemperature warning condition.
Note - This is a warning indication, not a fatal overtemperature. Failure to correct this might
result in the system overheating and shutting down unexpectedly.
Server Fan Status Indicators
Each fan module has one status indicator (LED). The LEDs are located on the chassis midwall adjacent to and aligned with the fan modules and are visible when the top cover fan door is
open.
TABLE 2
Status
Indicator
Name
Fan StatusNoneAmber■ Off – The fan module is correctly installed and operating within specification.
Server Fan Status Indicators
IconColorState and Meaning
■ Amber – The fan module is faulty. The front TOP FAN LED and the front and rear
panel Service Required LEDs are also lit if the system detects a fan module fault.
Storage and Boot Drive Indicators
There are three status indicators (LEDs) on each drive.
TABLE 3
Status
Indicator Name
OK/ActivityGreen■ OFF – Power is off or installed drive is not recognized by the system.
Service
Required
32Oracle Server X5-2L Service Manual • May 2015
Server Front Storage and Boot Disk Drive Indicators
IconColorState and Meaning
Amber■ OFF – The storage drive is operating normally.
†
■ STEADY ON – The drive is engaged and is receiving power.
■ RANDOM BLINKING – There is disk activity. Status indicator LED blinks on and off to
indicate activity.
■ STEADY ON – The system has detected a fault with the storage drive.
Page 33
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Status
Indicator Name
OK to RemoveBlue■ STEADY ON – The storage drive can be removed safely during a hot-plug operation.
†
If a rear backplane hard disk's OK/Activity LED shuts off, it may also indicate that a rear hard disk has spun down into a temporary "suspend" mode, in which
case the disk would again spin up when coming out of suspend mode.
IconColorState and Meaning
■ OFF – The storage drive has not been prepared for removal.
Power Supply Status Indicators
There are two status indicators (LEDs) on each power supply. These indicators are visible from
the rear of the server.
TABLE 4
Status
Indicator
Name
AC OK/ DC
OK
Service
Required
Server Power Supply Indicators
IconColorState and Meaning
Green■ OFF – No AC power is present.
■ SLOW BLINK – Normal operation. Input power is within
specification. DC output voltage is not enabled.
■ STEADY ON – Normal operation. Input AC power and DC
output voltage are within specification.
Amber■ OFF – Normal operation. No service action is required.
■ STEADY ON – The power supply (PS) has detected a PS fan
failure, PS overtemperature, PS over current, or PS over or under
voltage.
Network Management Port Status Indicators
The server has one 10/100/1000BASE-T Ethernet management domain interface, labeled NET
MGT. There are two status indicators (LEDs) on this port. These indicators are visible from the
rear of the server.
TABLE 5
Status
Indicator
Name
ActivityTop leftAmber■ ON – Link up.
Link speedTop rightGreen■ Green ON – 1000BASE-T link.
Network Management Port Status Indicators
LocationColorState and Meaning
■ OFF – No link or down link.
■ FLASHING – Packet activity.
■ OFF – 10/100BASE-T link.
Troubleshooting and Diagnostics33
Page 34
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Ethernet Ports Status Indicators
The server has four Gigabit Ethernet ports (NET 3, NET 2, NET 1, and NET 0). There are two
status indicators on each port. These indicators (LEDs) are visible from the rear of the server.
Note - Ethernet ports NET 2 and NET 3 are nonfunctional in single-processor systems.
TABLE 6
Status
Indicator Name
ActivityTop leftGreen■ ON – No function.
Link speedTop rightBicolored: Amber/
Gigabit Ethernet Port Status Indicators
LocationColorState and Meaning
■ OFF– No activity.
■ FLASHING – Packet activity.
Green
■ OFF – 100BASE-T link.
■ Amber ON – 1000BASE-T link.
■ Green ON – 10GBBASE-T link.
■ FLASHING – No function.
Motherboard Status Indicators
The motherboard contains the following status indicators (LEDs).
■
DIMM Fault Status Indicators: Each of the 24 DIMM sockets on the motherboard has
an amber fault status indicator (LED) associated with it. If Oracle ILOM determines that
a DIMM is faulty, pressing the Fault Remind button on the motherboard I/O card signals
the service processor to light the fault LED associated with the failed DIMM. For more
information on DIMM fault status indicators and the location of the Fault Remind Button,
see “Servicing the DIMMs (CRU)” on page 101.
■
Processor Fault Status Indicators: The motherboard includes a fault status indicator
(LED) adjacent to each of the two processor sockets. These LEDs indicate when a processor
fails. Pressing the Fault Remind button on the motherboard I/O card signals the service
processor to light the fault status indicators associated with the failed processors. For more
information on processor fault status indicators and the location of the Fault Remind Button,
see “Servicing Processors (FRU)” on page 129.
■
Fault Remind Status Indicator: This status indicator (LED) is located next to the Fault
Remind button and is powered from the super capacitor that powers the fault LEDs on the
motherboard. This LED lights to indicate that the fault remind circuitry is working properly
in cases where no components have failed and, as a result, none of the component fault
LEDs illuminate. For more information on the fault remind status indicator and the location
of the Fault Remind Button, see “Using the Server Fault Remind Button” on page 108.
■
STBY PWRGD Status Indicator: This green status indicator (LED) is labeled STBY
PWRGD and is located on the motherboard near the rear of the server. This LED lights
to inform a service technician that the motherboard is receiving standby power from at
34Oracle Server X5-2L Service Manual • May 2015
Page 35
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
least one of the power supplies. This LED is provided to help prevent service actions on
the server internal components while the AC power cords are installed and power is being
supplied to the server.
Note - The service instructions for all internal components require that all AC power be
removed from the power supplies prior to the server top cover being removed.
Troubleshooting System Cooling Issues
Maintaining the proper internal operating temperature of the server is crucial to the health of the
server. To prevent server shutdown and damage to components, address over temperature and
hardware-related issues as soon as they occur. If your server has a temperature-related fault, the
cause of the problem might be:
■
“External Ambient Temperature Too High” on page 35
■
“Airflow Blockage” on page 35
■
“Cooling Areas Compromised” on page 36
■
“Hardware Component Failure” on page 36
External Ambient Temperature Too High
Server component cooling relies on the movement of cool air through the server. The cool air is
pulled into the server from its external environment. If the ambient temperature of the server's
external environment is too high, cooling does not occur, and the internal temperature of the
server and its components increases. This can cause poor server performance or a failure of one
or more components.
Action: Check the ambient temperature of the server space against the environmental
specifications for the server. If the temperature is not within the required operating range,
remedy the situation immediately.
Prevention: Periodically check the ambient temperature of the server space to ensure that it
is within the required range, especially if you have made any changes to the server space (for
example, added additional servers). The temperature must be consistent and stable.
Airflow Blockage
The server cooling system uses fans to pull cool air in from the server front intake vents and
exhaust warm air out the server back panel vents. If the front or back vents are blocked, the
Troubleshooting and Diagnostics35
Page 36
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
airflow through the server is disrupted and the cooling system fails to function properly causing
the server internal temperature to rise.
Action: Inspect the server front and back panel vents for blockage from dust or debris.
Additionally, inspect the server interior for improperly installed components or cables that can
block the flow of air through the server.
Prevention: Periodically inspect and clean the server vents using an ESD certified vacuum
cleaner. Ensure that all components, such as cards, cable, fans, air baffles and dividers are
properly installed. Never operate the server without the top cover installed.
Cooling Areas Compromised
To function properly, the server has cooling areas that are maintained by an air baffle,
component filler panels, and the server top cover. These server components need to be in place
for the server to function as a sealed system. If internal cooling areas are compromised, the
server cooling system, which relies on the movement of cool air through the server, cannot
function properly, and the airflow inside the server becomes chaotic and non-directional.
Action: Inspect the server interior to ensure that the air baffle is properly installed. Ensure that
all external-facing slots (storage drive, DVD, PCIe) are occupied with either a component or a
component filler panel. Ensure that the server top cover is in place and sits flat and snug on top
of the server.
Prevention: When servicing the server, ensure that the air baffle is installed correctly and that
the server has no unoccupied external-facing slots. Never operate the server without the top
cover installed.
Hardware Component Failure
Components, such as power supplies and fan modules, are an integral part of the server cooling
system. When one of these components fails, the server internal temperature can rise. This rise
in temperature can cause other components to enter into an over-temperature state. Additionally,
some components, such as processors, might overheat when they are failing, which can also
generate an over-temperature event.
To reduce the risk related to component failure, power supplies and fan modules are installed
in pairs to provide redundancy. Redundancy ensures that if one component in the pair fails,
the other functioning component can continue to maintain the subsystem. For example, power
supplies serve a dual function; they provide both power and airflow. If one power supply fails,
the other functioning power supply can maintain both the power and the cooling subsystems.
Action: Investigate the cause of the over-temperature event, and replace failed components
immediately. For hardware troubleshooting information, see “Troubleshooting Server Hardware
Faults” on page 26.
36Oracle Server X5-2L Service Manual • May 2015
Page 37
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Prevention: Component redundancy is provided to allow for component failure in critical
subsystems, such as the cooling subsystem. However, once a component in a redundant
system fails, the redundancy no longer exists, and the risk for server shutdown and component
failures increases. Therefore, it is important to maintain redundant systems and replace failed
components immediately.
Troubleshooting Power Issues
If your server does not power on, the cause of the problem might be:
■
“AC Power Connection” on page 37
■
“Power Supplies (PSUs)” on page 37
■
“Top Cover” on page 38
AC Power Connection
The AC power cords are the direct connection between the server power supplies and the power
sources. The server power supplies need separate stable AC circuits. Insufficient voltage levels
or fluctuations in power can cause server power problems. The power supplies are designed to
operate at a particular voltage and within an acceptable range of voltage fluctuations (refer to
“Electrical Power Requirements” in “Oracle Server X5-2L Installation Guide ”).
Action: Check that both AC power cords are connected to the server. Check that the correct
power is present at the outlets and monitor the power to verify that it is within the acceptable
range. You can verify proper connection and operation by checking the power supply (PS)
indicator panels, which are located at the back of the server on the power supplies. Lit green AC
OK indicators show a properly functioning power supply. An amber AC OK indicator indicates
that the AC power to the power supply is insufficient.
Prevention: Use the AC power cord Velcro retaining clips and position the cords to minimize
the risk of accidental disconnection. Ensure that the AC circuits that supply power to the server
are stable and not overburdened.
Power Supplies (PSUs)
The server power supplies (PSUs) provide the necessary server voltages from the AC power
outlets. If the power supplies are inoperable, unplugged, or disengaged from the internal
connectors, the server cannot power on.
Note - The Velcro straps shown in the following figure should be used to secure the power cord
connectors to the rear of the power supplies. The Velcro retaining straps minimize the risk of
accidental disconnection.
Troubleshooting and Diagnostics37
Page 38
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Action: Check that the AC cables are connected to both power supplies. Check that the power
supplies are operational (the PSU indicator panel should have a lit green AC OK indicator).
Ensure that the power supply is properly installed. A power supply that is not fully engaged
with its internal connector does not have power applied and does not have a lit green AC OK
indicator.
Prevention: When a power supply fails, replace it immediately. To ensure redundancy, the
server has two power supplies. This redundant configuration prevents server downtime, or
an unexpected shutdown, due to a failed power supply. The redundancy allows the server to
continue to operate if one of the power supplies fails. However, when a server is being powered
by a single power supply, the redundancy no longer exists, and the risk for downtime or an
unexpected shutdown increases. When installing a power supply, ensure that it is fully seated
and engaged with its connector inside the drive bay. A properly installed power supply has a lit
green AC OK indicator.
Top Cover
The server top cover is an important component for a properly functioning server. The top cover
is vital to the server cooling subsystem because it helps to maintain the pressure areas within
the server, and the top cover also protects against accidental exposure to hazardous voltages
and damage to internal components. For these reasons, the server top cover is interlocked to
the server power. The interlock is created by a switch that is mounted inside the server on the
housing for the power supply. The switch engages when the cover is installed, allowing power
38Oracle Server X5-2L Service Manual • May 2015
Page 39
Troubleshooting With Diagnostic Tools
to be applied to the server. When the cover is not installed, the server cannot power on. If the
server is powered on when the cover is removed, power is immediately switched off.
Action: If the server does not power on, check that the switch is intact and properly
aligned. Ensure that the server top cover is in place and sits flat and snug on top of the
server. Ensure that the interlock switch components have not been damaged, removed, or
misaligned. For instructions for properly installing the top cover, see “Install the Server Top
Cover” on page 226.
Prevention: After removing the top cover, take care that it does not get bent. When servicing
the server, take care that the internally mounted interlock switch component does not get
damaged or misaligned. Never operate the server without the top cover installed.
Managing Server Hardware Faults Through the
Oracle ILOM Fault Management Shell
The Oracle ILOM Fault Management Shell enables Oracle Services personnel to view and
manage fault activity on a managed servers and other types of devices.
For more information about how to use the Oracle ILOM Fault Management Shell, see the
Oracle ILOM User's Guide for System Monitoring and Diagnostics Firmware Release 3.2.x in
the Oracle Integrated Lights Out Manager (ILOM) 3.2 Documentation Library at http://www.
oracle.com/goto/ILOM/docs.
Caution - The purpose of the Oracle ILOM Fault Management Shell is to help Oracle Services
personnel diagnose system problems. Customers should not launch this shell or run fault
management commands in the shell unless requested to do so by Oracle Services.
Troubleshooting With Diagnostic Tools
The server and its accompanying software and firmware contain diagnostic tools and features
that can help you isolate component problems, monitor the status of a functioning system,
and exercise one or more subsystem to disclose more subtle or intermittent hardware-related
problems.
Each diagnostic tool has its own specific strength and application. Review the tools listed in
this section and determine which tool might be best to use for your situation. Once you have
determined the tool to use, you can access it locally, while at the server, or remotely.
■
“Diagnostic Tools” on page 40
Troubleshooting and Diagnostics39
Page 40
Troubleshooting With Diagnostic Tools
■
“Diagnostic Tool Documentation” on page 41
Diagnostic Tools
The selection of diagnostic tools available for your server range in complexity from a
comprehensive validation test suite (Oracle VTS) to a chronological event log (Oracle ILOM
System Log). The selection of diagnostic tools also includes standalone software packages,
firmware-based tests, and hardware-based LED indicators.
The following table summarizes the diagnostic tools that you can use when troubleshooting or
monitoring your server.
Diagnostic Tool TypeWhat It DoesAccessibilityRemote Capability
Oracle ILOMSP firmwareMonitors environmental
Preboot MenuSP firmware Enables you to restore some
Hardware-based
LED indicators
Power-On SelfTest (POST)
U-BootSP firmware Initializes and test aspects of
UEFI
Diagnostics
condition and component
functionality sensors, generates
alerts, performs fault isolation,
and provides remote access.
Oracle ILOM default settings
when Oracle ILOM is not
accessible.
Hardware
and SP
firmware
Host
firmware
SP firmware The UEFI diagnostics can
Indicates status of overall
system and particular
components.
Tests core components of
system: CPUs, memory,
and motherboard I/O bridge
integrated circuits.
the service processor (SP) prior
to booting the Oracle ILOM SP
and operating system. Tests SP
memory, SP, network devices
and I/O devices.
test and detect problems on
all processors, memory, disk
drives, and network ports. It is
used on newer Oracle systems,
Can function on either
standby power mode or
full power mode and is
not OS dependent.
Can function on standby
power and when
operating system is not
running.
Available when system
power is available.
Runs on startup.
Available when the
operating system is not
running.
Can function on standby
power and when
operating system is not
running.
You can use either
the Oracle ILOM
web interface or the
command-line interface
Designed for
remote and local
access.
Local, but remote
serial access is
possible if the
SP serial port is
connected to a
network-accessible
terminal server.
Local, but sensor
and indicators are
accessible from
Oracle ILOM
web interface or
command-line
interface (CLI).
Local, but can be
accessed through
Oracle ILOM
Remote System
Console Plus.
Local, but remote
serial access is
possible if the
SP serial port is
connected to a
network-accessible
terminal server.
Remote access
through Oracle
ILOM Remote
System Console
Plus.
40Oracle Server X5-2L Service Manual • May 2015
Page 41
Attaching Devices to the Server
Diagnostic Tool TypeWhat It DoesAccessibilityRemote Capability
Oracle Solaris
commands
Oracle Linux
commands
Oracle VTSDiagnostic
Operating
system
software
Operating
system
software
tool
standalone
software
such as the Oracle Server X52L.
Displays various kinds of
system information.
Displays various kinds of
system information.
Exercises and stresses the
system, running tests in
parallel.
(CLI) to run UEFI
diagnostics.
Requires operating
system.
Requires operating
system.
Requires the Solaris
operating system. Install
Oracle VTS software
separately.
Local, and over
network.
Local, and over
network.
View and control
over network.
Diagnostic Tool Documentation
The following table identifies where you can find more information about diagnostic tools.
Diagnostic ToolDocumentationLocation
Oracle ILOMOracle Integrated Lights Out Manager 3.2
Preboot Menu
System indicators and
sensors
U-Boot or UEFI Diags
Oracle VTSOracle VTS software and documentation
Documentation Library
Oracle x86 Servers Diagnostics Guide
This document“Troubleshooting Using the Server
Oracle x86 Servers Diagnostics Guide
http://www.oracle.com/goto/ILOM/docs
http://www.oracle.com/goto/
x86AdminDiag/docs
Front and Rear Panel Status Indicators
” on page 30
http://www.oracle.com/goto/
x86AdminDiag/docs
http://docs.oracle.com/cd/E19719-01/
index.html
Attaching Devices to the Server
The following sections contain procedural information for attaching devices to the server so you
can access diagnostic tools when troubleshooting and servicing the server:
■
“Attach Devices to the Server” on page 42
■
“Rear Panel Connector Locations” on page 42
■
“Configuring Serial Port Sharing” on page 44
■
“Server Operating System Names for the NVMe Storage Drives” on page 45
Troubleshooting and Diagnostics41
Page 42
Attach Devices to the Server
■
“Ethernet Port Boot Order and Device Naming” on page 46
■
“Rear Panel Pinhole Switches” on page 47
Attach Devices to the Server
This procedure explains how to connect devices to the server (remotely and locally), so that you
can interact with the service processor (SP) and the server console.
1.
Connect an Ethernet cable to the Gigabit Ethernet (NET) connectors as needed
for OS support.
See “Rear Panel Connector Locations” on page 42.
2.
To connect to the service processor's Oracle ILOM over the network, connect an
Ethernet cable to the Ethernet port labeled NET MGT.
See “Rear Panel Connector Locations” on page 42.
3.
To access the Oracle ILOM command-line interface (CLI) locally using the
management port, connect a serial null modem cable to the RJ-45 serial port
labeled SER MGT.
See “Rear Panel Connector Locations” on page 42.
4.
To interact with the system console locally, connect a mouse and keyboard to
the USB connectors and a monitor to the DB-15 video connector.
See “Rear Panel Connector Locations” on page 42.
Rear Panel Connector Locations
The following illustration shows and describes the locations of the back panel connectors. Use
this information to set up the server, so that you can access diagnostic tools and manage the
server during service.
42Oracle Server X5-2L Service Manual • May 2015
Page 43
No. Cable Port or Expansion SlotDescription
1Power supply 0 input power
The server has two power supply connectors, one for each power supply.
Attach Devices to the Server
Power supply 1 input power
Do not attach power cables to the power supplies until you have finished
connecting the data cables to the server. The server goes into standby power
mode, and the Oracle ILOM service processor initializes when the AC
power cables are connected to the power source. System messages might
be lost after 60 seconds if the server is not connected to a terminal, PC, or
workstation.
Note - Oracle ILOM will signal a fault on any installed power supply that
is not connected to an AC power source, since it might indicate a loss of
redundancy.
2Network management port (NET MGT)The service processor NET MGT port is the optional connection to the
Oracle ILOM service processor. The NET MGT port is configured by default
to use Dynamic Host Configuration Protocol (DHCP). The service processor
NET MGT port uses an RJ-45 cable for a 10/100/1000BASE-T connection.
3Serial management port (SER MGT)The service processor SER MGT port uses an RJ-45 cable and is the default
connection to the Oracle ILOM service processor. This port supports local
connections to the server and only recognizes Oracle ILOM command-line
interface (CLI) commands. Typically you connect a terminal or terminal
emulator to this port.
Note - This port does not support network connections.
4Ethernet ports (NET 3, NET 2, NET 1, NET0)The four 10-Gigabit Ethernet ports enable you to connect the system to the
network.
Note - Ethernet ports NET2 and NET3 are nonfunctional in single-processor
systems.
5, 6 USB ports (USB 0, USB 1)The two USB ports support hot-plugging. You can connect and disconnect
USB cables and peripheral devices while the server is running without
affecting system operations.
Troubleshooting and Diagnostics43
Page 44
Assign Serial Port Output Using the CLI
No. Cable Port or Expansion SlotDescription
7Video port (VGA)Use a 15-pin video cable to connect a VGA video device to the server.
Optionally, you can connect to the VGA port when installing the operating
system.
Configuring Serial Port Sharing
By default, the SP console (SER MGT) port sends serial port output from the server. Using
Oracle ILOM, you can specify that the host console (COM1) be assigned as owner of the server
serial port output. This feature is useful for Windows kernel debugging, as it enables you to
view non-ASCII character traffic from the host console.
You should set up the network on the SP before attempting to change the serial port owner to
the host server. If the network is not set up, and you switch the serial port owner to the host
server, you will be unable to connect using the CLI or web interface to change the serial port
owner back to the SP. To return the serial port owner setting to the SP, you need to restore
access to the serial port on the server. For more details about restoring access to the server
port on your server, see the Oracle Integrated Lights Out Manager (ILOM) 3.2 Documentation
Library at http://www.oracle.com/goto/ILOM/docs.
You can assign serial port output using either the Oracle ILOM web interface or the commandline interface (CLI). For instructions, see the following sections:
■
“Assign Serial Port Output Using the CLI” on page 44
■
“Assign Serial Port Output Using the Web Interface” on page 45
Assign Serial Port Output Using the CLI
1.
Open an SSH session and at the command line log in to the SP Oracle ILOM CLI.
Log in as a user with root or administrator privileges. For example:
ssh root@ipaddress
Where ipaddress is the IP address of the server SP.
For more information, see “Accessing Oracle ILOM” in the Oracle X5 Series ServersAdministration Guide.
The Oracle ILOM CLI prompt (->) appears.
2.
To set the serial port owner, type:
-> set /SP/serial/portsharing owner=host
Note - The serial port sharing value by default is owner=SP.
44Oracle Server X5-2L Service Manual • May 2015
Page 45
Assign Serial Port Output Using the Web Interface
3.
Connect a serial host to the server.
Assign Serial Port Output Using the Web Interface
1.
Log in to the SP Oracle ILOM web interface.
To log in, open a web browser and direct it using the IP address of the server SP. Log in as root
or a user with administrator privileges. See “Accessing Oracle ILOM” in the Oracle X5 SeriesServers Administration Guide.
The Summary Information screen appears.
2.
In the Oracle ILOM web interface, select ILOM Administration → Connectivity
from the navigation menu on the left side of the screen.
3.
Select the Serial Port tab.
The Serial Port Settings page appears.
Note - The serial port sharing setting by default is Service Processor.
4.
In the Serial Port Settings page, select HostServer as the serial port owner.
5.
Click Save for the changes to take effect.
6.
Connect a serial host to the server.
Server Operating System Names for the NVMe
Storage Drives
If the optional NVMe storage drives are installed in the server's front panel, they are labeled
NVMe0, NVMe1, NVMe2, and NVMe3. However, the server operating systems assign these
storage drives different names. For the corresponding names assigned by the operating systems,
see the following table.
TABLE 7
Storage Drive LabelsNames Assigned by the Server Operating Systems
NVMe0PCI-Express Slot 10
NVMe1PCI-Express Slot 11
NVMe2PCI-Express Slot 12
NVMe3PCI-Express Slot 13
Operating System NVMe Storage Drive Nomenclature
Troubleshooting and Diagnostics45
Page 46
Assign Serial Port Output Using the Web Interface
Ethernet Port Boot Order and Device Naming
This section contains information about the boot order and device naming for the four 10Gigabit Ethernet ports on the rear panel of the server. For location information, see “Rear Panel
Connector Locations” on page 42. From right to left, the ports are numbered NET 0 to NET
3.
Ethernet Port Boot Order
The order in which the BIOS detects the Ethernet ports during server boot is listed below.
Note - You can change the boot priority using the Boot Device Priority screen available in the
Boot menu of the BIOS Setup Utility.
1. NET 0
2. NET 1
3. NET 2
4. NET 3
Ethernet Port Device Naming
Note - Naming used by the interfaces might vary from that listed below depending on which
devices are installed in the system.
The device naming for the Ethernet interfaces is reported differently by different interfaces and
operating systems. The following table shows BIOS (physical) and operating system (logical)
naming conventions used for each interface. These naming conventions might vary depending
on conventions of your operating system and which devices are installed in the server.
PortBIOSSolarisLinuxWindows (example default
NET 03A00ixgbe 0eth 0Ethernet
NET 13A01ixgbe 1eth 1Ethernet 2
NET 28200ixgbe 2eth 2Ethernet 3
NET 38201ixgbe 3eth 3Ethernet 4
46Oracle Server X5-2L Service Manual • May 2015
names, see note below)
Page 47
Assign Serial Port Output Using the Web Interface
Note - For Windows, port names like Ethernet, Ethernet 2, Ethernet 3, and Ethernet 4 are used
by default. However, actual port naming is based on the order of enumeration, typically during
operating system installation. Additionally, Windows allows you to rename the ports to meet
application-specific needs.
Rear Panel Pinhole Switches
This section shows the location of the rear panel pinhole switches.
FIGURE 6
Rear Panel Pinhole Switches
Figure LegendDescription
1SP Reset
2Host Warm Reset
Troubleshooting and Diagnostics47
Page 48
Getting Help
Figure LegendDescription
3Non-maskable interrupt
†
Oracle Service use only.
Getting Help
The following sections describe how to get additional help to resolve server-related problems.
■
“Contacting Support” on page 48
■
“Locating the Chassis Serial Number” on page 49
Contacting Support
If the troubleshooting procedures in this chapter fail to solve your problem, use the following
table to collect information that you might need to communicate to support personnel.
(NMI)
†
System Configuration
Information Needed
Service contract number
System model
Operating environment
System serial number
Peripherals attached to the system
Email address and phone number
for you and a secondary contact
Street address where the system is
located
Superuser password
Summary of the problem and
the work being done when the
problem occurred
Other Useful Information
IP address
Server name (system host name)
Network or internet domain name
Proxy server configuration
Your Information
48Oracle Server X5-2L Service Manual • May 2015
Page 49
Getting Help
Related Information
■
“Locating the Chassis Serial Number” on page 49
Locating the Chassis Serial Number
You might need to have your server serial number when you ask for service on your system.
Record this number for future use. Use one of the following resources or methods to locate your
server serial number.
■
The serial number is located on the Radio-frequency Identification (RFID) label on the
bottom left side of the front panel bezel, below the general status LEDs.
For illustrations of the storage cell front panel, see “About Controls and
Connectors” on page 14.
■
The serial number is recorded on the service label that is attached to the top cover of the
system.
■
The serial number is recorded on the yellow Customer Information Sheet (CIS) that is
attached to your storage cell packaging.
■
Locate the yellow Customer Information Sheet (CIS) attached to your server packaging.
This sheet includes the serial number.
■
Using Oracle ILOM:
■
From the command-line interface (CLI), type the command: show/SYS.
■
From the web interface, view the serial number on the System Information screen.
■
Using Oracle System Assistant, view the serial number on the System Overview screen.
Troubleshooting and Diagnostics49
Page 50
50Oracle Server X5-2L Service Manual • May 2015
Page 51
Preparing for Service
These sections describe safety considerations and provide prerequisite procedures and
information about replacing components within the server.
DescriptionLinks
Understand the safety precautions, understand the
safety symbols, and take ESD precautions prior to
removing or installing parts in the server.
Understand the key identity properties auto-update
feature.
Assemble the required tools.“Required Tools” on page 54
Before working with components within the server,
power down the server and prepare for servicing.
Related Information
■
“Returning the Server to Operation”
“Safety Precautions” on page 51
“Safety Symbols” on page 52
“Electrostatic Discharge Safety” on page 52
“FRU Key Identity Properties (KIP) Automated
Update” on page 53
“Preparing the Server for Component
Replacement” on page 54
Safety Precautions
For your protection, observe the following safety precautions when setting up your equipment:
■
Follow all standard cautions, warnings, and instructions marked on the equipment and
described in the Oracle Server X5-2L Safety and Compliance Guide and Important SafetyInformation for Oracle's Hardware Systems.
■
Ensure that the voltage and frequency of your power source match the voltage and
frequency inscribed on the equipment electrical rating label.
■
Follow the electrostatic discharge safety practices as described in “Electrostatic Discharge
Safety” on page 52.
■
Disconnect both power supply cords before servicing components.
Preparing for Service51
Page 52
Safety Symbols
Related Information
■
“Safety Symbols” on page 52
■
“Electrostatic Discharge Safety” on page 52
Safety Symbols
The following symbols might appear in this document. Note their meanings.
Caution - Risk of personal injury or equipment damage. To avoid personal injury or
equipment damage, follow the instructions.
Caution - Hot surface. Avoid contact. Surfaces are hot and might cause personal injury if
touched.
Caution - Hazardous voltages are present. To reduce the risk of electric shock and danger to
personal health, follow the instructions.
Related Information
■
“Safety Precautions” on page 51
■
“Electrostatic Discharge Safety” on page 52
Electrostatic Discharge Safety
Devices that are sensitive to electrostatic discharge (ESD), such as the motherboard, PCIe cards,
drives, processors, and memory DIMMs require special handling.
Caution - The boards and drives contain electronic components that are extremely sensitive to
static electricity. Ordinary amounts of static electricity from clothing or the work environment
can destroy components. Do not touch the components along their connector edges.
Do the following when handling ESD-sensitive components:
■
Use an antistatic wrist strap.
Wear an antistatic wrist strap and use an antistatic mat when handling components such as
drive assemblies, boards, or cards. When servicing or removing server components, attach
an antistatic strap to your wrist and then to a metal area on the chassis. Then disconnect
the power cords from the server. Following this practice equalizes the electrical potentials
between you and the server.
Note - An antistatic wrist strap is not included in the Accessory Kit for the server. However,
antistatic wrist straps are included with options and components.
■
Use an antistatic mat.
Place ESD-sensitive components such as the motherboard, memory DIMMS, and other
printed circuit board (PCB) cards on an antistatic mat. The following items can be used as
an antistatic mat:
■
An antistatic bag used to wrap an Oracle replacement part
■
An Oracle ESD mat (orderable item)
■
A disposable ESD mat (shipped with some replacement parts or optional system
components)
Related Information
■
“Safety Precautions” on page 51
■
“Safety Symbols” on page 52
■
“Preparing the Server for Component Replacement” on page 54
Oracle ILOM includes a key identity properties (KIP) auto-update feature that ensures product
information that is used for service entitlement and warranty coverage is accurately maintained
by the server at all times, including during hardware replacement activities.
The KIP includes the server product name, product part number (PPN), and product serial
number (PSN). The KIP are stored in the FRUID (field-replaceable unit identifiers) container of
the three server FRUs that are designated quorum members.
The quorum members include:
■
Disk backplane (DBP), designated as a primary quorum member.
■
Motherboard (MB), designated as a backup quorum member.
■
Power supply (PS), designated as a backup quorum member.
When a server FRU that contains the KIP is removed and a replacement component is installed,
the KIP of the replacement component is programmed by Oracle ILOM to contain the same
KIP as the other two components.
Preparing for Service53
Page 54
Required Tools
Note - Only one of the quorum members can be replaced at a time. Automated updates can only
be completed when two of the three quorum members contain matching key identity properties.
Related Information
■
“Servicing Power Supplies (CRU)” on page 93
■
“Servicing the Front and Rear Storage Drive Backplanes (FRU)” on page 146
■
“Servicing the Motherboard Assembly (FRU)” on page 209
Required Tools
The server can be serviced with the following tools:
■
Antistatic wrist strap
■
Antistatic mat
■
Hex/flat head screwdriver
■
No. 2 Phillips screwdriver
Related Information
■
“Preparing for Service”
■
“Servicing CRUs That Do Not Require Server Power-Off”
■
“Servicing CRUs That Require Server Power-Off”
■
“Servicing FRUs”
Preparing the Server for Component Replacement
Note - When you are replacing the storage drives or power supplies, not all of these procedures
are necessary. See the replacement procedures for those components for more information.
Before you can remove and install components that are inside the server, you must perform the
procedures in the following sections:
■
“Powering Down the Server” on page 55
■
“Disconnect Cables From the Server” on page 60
■
“Extend the Server to the Maintenance Position” on page 61
■
“Remove the Server From the Rack” on page 63
54Oracle Server X5-2L Service Manual • May 2015
Page 55
Power Down the Server Gracefully Using the Oracle ILOM CLI
■
“Take Antistatic Measures” on page 64
■
“Remove the Server Top Cover” on page 64
■
“Remove the Fan Assembly Door From the Server” on page 65
■
“Remove the Disk Cage Cover From the Server” on page 67
Related Information
■
“Returning the Server to Operation”
Powering Down the Server
Determine how you want to power down the server by reviewing the options in the following
table.
DescriptionLink
Power down the server gracefully to prevent data from being
corrupted. Performing a graceful shutdown ensures that the
system is ready for restart.
If the server is not responding, or you must shut down the
server quickly, perform an immediate shutdown.
■ “Power Down the Server Gracefully Using the
Oracle ILOM CLI” on page 55
■ “Power Down the Server Gracefully Using the
Oracle ILOM Web Interface” on page 56
■ “Power Down the Server Gracefully Using the
Power Button” on page 57
■ “Power Down the Server for
Immediate Shutdown Using the Power
Button” on page 58
■ “Power Down the Server for Immediate
Shutdown Using the Oracle CLI” on page 59
■ “Power Down the Server for Immediate
Shutdown Using the Oracle ILOM Web
Interface” on page 59
Related Information
■
“Power On the Server” on page 233
Power Down the Server Gracefully Using the Oracle ILOM CLI
Performing a graceful shutdown ensures that all of your data is saved and the system is ready
for restart.
1.
Log in to the server as superuser or equivalent.
Depending on the nature of the problem, you might want to view the system status or the log
files or run diagnostics before you shut down the system. For log file information, refer to the
Preparing for Service55
Page 56
Power Down the Server Gracefully Using the Oracle ILOM Web Interface
Oracle Integrated Lights Out Manager (ILOM) 3.2 Documentation Collection at http://www.
oracle.com/goto/ILOM/docs.
2.
Notify affected users that the server will be powered down.
3.
Save any open files, and quit all running applications.
Refer to your application documentation for specific information about these processes.
4.
Log in to the Oracle ILOM command-line interface (CLI) using an Administrator
account.
For instructions, refer to the Oracle Integrated Lights Out Manager (ILOM) 3.2 Documentation
Collection.
5.
At the Oracle ILOM prompt, shut down the operating system:
-> stop /System
If the system is running the Oracle Solaris OS, refer to the Oracle Solaris system administration
documentation for additional information.
6.
Disconnect the power and cables from the server.
See “Disconnect Cables From the Server” on page 60.
Caution - When you power down the server using Oracle ILOM, the server enters standby
power mode. Power is still directed to the service processor remote management subsystem and
power supply fans. To completely power off the server, you must disconnect the power cords
from the power supplies.
Related Information
■
“Power Down the Server Gracefully Using the Oracle ILOM Web Interface” on page 56
■
“Power Down the Server Gracefully Using the Power Button” on page 57
■
“Power Down the Server for Immediate Shutdown Using the Power Button” on page 58
■
“Power On the Server” on page 233
Power Down the Server Gracefully Using the Oracle ILOM
Web Interface
Performing a graceful shutdown ensures that all of your data is saved and the system is ready
for restart.
1.
Log in to the server as superuser or equivalent.
56Oracle Server X5-2L Service Manual • May 2015
Page 57
Power Down the Server Gracefully Using the Power Button
Depending on the nature of the problem, you might want to view the system status or the log
files or run diagnostics before you shut down the system. For log file information, refer to the
Oracle Integrated Lights Out Manager (ILOM) 3.2 Documentation Collection at http://www.
oracle.com/goto/ILOM/docs.
2.
Notify affected users that the server will be powered down.
3.
Save any open files, and quit all running applications.
Refer to your application documentation for specific information about these processes.
4.
Log in to the Oracle ILOM web interface using an Administrator account.
The Oracle ILOM web interface System Information → Summary Information page appears.
5.
In the left pane, click Host Management → Power Control, and click Graceful
Shutdown and Power Off from the Select Action list.
6.
Click Save, and then click OK.
The host server performs an orderly power-down.
7.
Disconnect the power cords and data cables from the server.
See “Disconnect Cables From the Server” on page 60.
Caution - When you power down the server using Oracle ILOM, the server enters standby
power mode. Power is still directed to the service processor remote management subsystem and
power supply fans. To completely power off the server, you must disconnect the power cords
from the power supplies.
Related Information
■
“Power Down the Server Gracefully Using the Oracle ILOM CLI” on page 55
■
“Power Down the Server Gracefully Using the Power Button” on page 57
■
“Power Down the Server for Immediate Shutdown Using the Power Button” on page 58
■
“Power On the Server” on page 233
Power Down the Server Gracefully Using the Power Button
1.
Press and quickly release the Power button on the front panel.
This action causes ACPI-enabled operating systems to perform an orderly shutdown of the
operating system. Servers not running ACPI-enabled operating systems shut down to standby
power mode immediately.
When main power is off, the Power/OK LED on the front panel begins flashing, indicating that
the server is in standby power mode. See “Server System-Level Status Indicators” on page 31.
Preparing for Service57
Page 58
Power Down the Server for Immediate Shutdown Using the Power Button
2.
Disconnect the power cords and data cables from the server.
See “Disconnect Cables From the Server” on page 60.
Caution - When you power down the server using the Power button, the server enters standby
power mode. Power is still directed to the service processor remote management subsystem and
power supply fans. To completely power off the server, you must disconnect the power cords
from the power supplies.
Related Information
■
“Server System-Level Status Indicators” on page 31
■
“Power Down the Server Gracefully Using the Oracle ILOM CLI” on page 55
■
“Power Down the Server Gracefully Using the Oracle ILOM Web Interface” on page 56
■
“Power On the Server” on page 233
Power Down the Server for Immediate Shutdown Using the
Power Button
Caution - This procedure quickly forces the server main power off. You might corrupt your
system data during an immediate power-down, so use this procedure to power down the server
only after attempting the graceful power-down procedure.
1.
Press and hold the Power button for four seconds to force the main power off
and to enter standby power mode.
When main power is off, the Power/OK LED on the front panel begins flashing, indicating that
the server is in standby power mode. See “Server System-Level Status Indicators” on page 31.
2.
Disconnect the power cords and data cables from the server.
See “Disconnect Cables From the Server” on page 60.
Caution - When you power down the server using the Power button, the server enters standby
power mode. Power is still directed to the service processor remote management subsystem and
power supply fans. To completely power off the server, you must disconnect the power cords
from the power supplies.
Related Information
■
“Server System-Level Status Indicators” on page 31
■
“Power Down the Server for Immediate Shutdown Using the Oracle CLI” on page 59
■
“Power Down the Server for Immediate Shutdown Using the Oracle ILOM Web
Interface” on page 59
58Oracle Server X5-2L Service Manual • May 2015
Page 59
Power Down the Server for Immediate Shutdown Using the Oracle CLI
■
“Power On the Server” on page 233
Power Down the Server for Immediate Shutdown Using the
Oracle CLI
Caution - This procedure quickly forces the server main power off. You might corrupt your
system data during an immediate power-down, so use this procedure to power down the server
only after attempting the graceful power-down procedure.
1.
Log in to the Oracle ILOM command-line interface (CLI) using an Administrator
account.
Oracle ILOM displays the default command prompt (->), indicating that you have successfully
logged in to Oracle ILOM.
2.
From the CLI prompt, type the following command:
-> stop -f /System
The server powers down immediately.
3.
Disconnect the power and data cables from the server.
See “Disconnect Cables From the Server” on page 60.
Caution - When you power down the server using Oracle ILOM, the server enters standby
power mode. Power is still directed to the service processor remote management subsystem and
power supply fans. To completely power off the server, you must disconnect the power cords
from the power supplies.
Related Information
■
“Power Down the Server for Immediate Shutdown Using the Power Button” on page 58
■
“Power Down the Server for Immediate Shutdown Using the Oracle ILOM Web
Interface” on page 59
■
“Power On the Server” on page 233
Power Down the Server for Immediate Shutdown Using the
Oracle ILOM Web Interface
Caution - This procedure quickly forces the server main power off. You might corrupt your
system data during an immediate power-down, so use this procedure to power down the server
only after attempting the graceful power-down procedure.
Preparing for Service59
Page 60
Disconnect Cables From the Server
1.
Log in to the Oracle ILOM web interface using an Administrator account.
The Oracle ILOM web interface System Information → Summary Information page appears.
2.
In the left pane, click Host Management → Power Control, and click Immediate
Power Off in the Select Action list.
3.
Click Save, and then click OK.
The server powers down immediately.
4.
Disconnect the power and data cables from the server.
See “Disconnect Cables From the Server” on page 60.
Caution - When you power down the server using Oracle ILOM, the server enters standby
power mode. Power is still directed to the service processor remote management subsystem and
power supply fans. To completely power off the server, you must disconnect the power cords
from the power supplies.
Related Information
■
“Power Down the Server for Immediate Shutdown Using the Power Button” on page 58
■
“Power Down the Server for Immediate Shutdown Using the Oracle CLI” on page 59
■
“Power On the Server” on page 233
Disconnect Cables From the Server
Caution - The system supplies standby power to the circuit boards even when the system is
powered off.
1.
Label all cables connected to the server.
2.
Power down the server.
See “Powering Down the Server” on page 55.
3.
Disconnect the power cords from the rear of the server.
4.
Disconnect all data cables from the rear of the server.
5.
If your rackmount kit includes a cable management device, remove the cables
from it.
60Oracle Server X5-2L Service Manual • May 2015
Page 61
Extend the Server to the Maintenance Position
6.
Depending on the components that you are servicing, either extend the server to
the maintenance position, or remove the server from the rack.
Related Information
■
“Rear Panel Components and Cable Connections” on page 18
■
“Powering Down the Server” on page 55
■
“Extend the Server to the Maintenance Position” on page 61
■
“Remove the Server From the Rack” on page 63
■
“Reconnect Power and Data Cables” on page 233
Extend the Server to the Maintenance Position
The following components can be serviced with the server in the maintenance position:
■
Storage drives
■
Fan modules
■
Power supplies
■
DVD drive
■
DDR4 DIMMs
■
PCIe cards and cables
■
SAS cables
■
NVMe cables
■
Internal USB drives
■
System battery
■
Processors
■
Disk backplanes
■
Front indicator modules
■
Motherboard
If the server is installed in a rack with extendable slide-rails, use this procedure to extend the
server to the maintenance position.
1.
To prevent the rack from tipping forward when the server is extended, extend all
rack anti-tilt devices.
For instructions for stabilizing the rack, refer to “Stabilize the Rack for Installation” in “Oracle
Server X5-2L Installation Guide ”.
2.
Verify that no cables will be damaged or will interfere when the server is
extended.
Preparing for Service61
Page 62
Extend the Server to the Maintenance Position
Although the cable management arm (CMA) that is supplied with the server is hinged to
accommodate extending the server, you should ensure that all cables and cords are capable of
extending.
3.
From the front of the server, open and hold the left and right release latch covers
in the open position.
When in an open position, the release latch covers engage the slide-rail release latches [1].
Caution - Deploy any rack anti-tilt mechanism before releasing the slide-rail release latches.
FIGURE 7
4.
While the release latch covers are in the open position, slowly pull the server
Extending the Server to the Maintenance Position
forward until the slide-rails latch into a locked position [2].
The server is now in the extended maintenance position.
Related Information
■
“Disconnect Cables From the Server” on page 60
■
“Remove the Server From the Rack” on page 63
■
“Reinstall the Server in the Rack” on page 230
62Oracle Server X5-2L Service Manual • May 2015
Page 63
Remove the Server From the Rack
Remove the Server From the Rack
Caution - The server weighs approximately 63 pounds (28.5 kg). Two people are required to
unmount and carry the chassis.
1.
Disconnect all the cables and power cords from the server.
2.
Remove the cable management arm (CMA).
For instructions for removing the CMA, see the “Remove the Cable Management Arm” in
“Oracle Server X5-2L Installation Guide ”.
3.
Extend the server to the maintenance position.
See “Extend the Server to the Maintenance Position” on page 61.
4.
From the front of the server, pull the green slide-rail release tabs toward the front
of the server, and pull the server out of the rack until it is free of the rack rails [1
and 2].
A slide-rail release tab is located on each slide-rail.
FIGURE 8
5.
Set the server on a sturdy work surface.
Removing the Server From the Chassis
Related Information
■
“Disconnect Cables From the Server” on page 60
■
“Extend the Server to the Maintenance Position” on page 61
■
“Reinstall the Server in the Rack” on page 230
Preparing for Service63
Page 64
Take Antistatic Measures
Take Antistatic Measures
1.
Prepare an antistatic surface on which to set parts during removal and
installation.
Place electrostatic discharge (ESD-sensitive components on an antistatic mat. The following
items can be used as an antistatic mat:
■
■
■
2.
Attach an antistatic wrist strap.
When servicing or removing server components, attach an antistatic strap to your wrist and then
to a metal area on the chassis. Then perform procedures for servicing the server.
Note - An antistatic wrist strap is not included in the Accessory Kit for the server. However,
antistatic wrist straps are included with options and components.
An antistatic bag used to wrap a replacement part
An Oracle ESD mat (orderable item)
A disposable ESD mat (shipped with some replacement parts or optional system
components)
Related Information
■
“Extend the Server to the Maintenance Position” on page 61
■
“Remove the Server From the Rack” on page 63
■
“Remove the Server Top Cover” on page 64
■
“Remove the Fan Assembly Door From the Server” on page 65
■
“Remove Antistatic Measures” on page 230
Remove the Server Top Cover
Caution - If you remove the top cover without first removing the AC power cords, then not
only will the server host shut down, but also this condition will be flagged as a system fault
(that is, the fault LEDs will light).
Note - Servicing some components requires that the top cover be removed.
1.
Ensure that AC power cords are disconnected from the server power supplies.
2.
Unlatch the top cover [1].
Press the green release button on the top of the server cover.
64Oracle Server X5-2L Service Manual • May 2015
Page 65
Remove the Fan Assembly Door From the Server
FIGURE 9
3.
While pressing the top cover release button, slide the cover toward the rear of
Removing the Server Top Cover
the server [2].
4.
If necessary, open the fan assembly door.
5.
Lift up and remove the top cover [2].
Related Information
■
“Take Antistatic Measures” on page 64
■
“Remove the Fan Assembly Door From the Server” on page 65
■
“Install the Server Top Cover” on page 226
Remove the Fan Assembly Door From the Server
Before you service the server's storage drive backplane, you should first remove the server's
front fan assembly door.
Note - This procedure is only required for systems configured with twelve 3.5-inch storage
drives or twenty-four 2.5-inch storage drives. If your system is configured with eight 2.5-inch
storage drives, see “Remove the Disk Cage Cover From the Server” on page 67.
Preparing for Service65
Page 66
Remove the Fan Assembly Door From the Server
1.
Ensure that the AC power cords are disconnected from the server power
supplies.
2.
Use a No. 2 Phillips screwdriver to remove the screws that secure the fan
assembly door [1].
■
If your system is configured with twelve 3.5-inch storage drives, remove the
two screws on each side of the chassis and the three screws on top of the
chassis.
■
If your system is configured with twenty-four 2.5-inch storage drives,
remove the two screws on each side of the chassis.
FIGURE 10
3.
Slide the fan assembly door toward the rear of the server, and lift and remove the
Removing the Fan Assembly Door
door from the chassis [2].
Related Information
■
“Take Antistatic Measures” on page 64
■
“Remove the Server Top Cover” on page 64
■
“Install the Server Top Cover” on page 226
■
“Install the Fan Assembly Door” on page 228
66Oracle Server X5-2L Service Manual • May 2015
Page 67
Remove the Disk Cage Cover From the Server
Remove the Disk Cage Cover From the Server
Before you service the server's storage drive backplane, you should first remove the server's
disk cage cover.
Note - This procedure is only required for systems configured with eight 2.5-inch storage
drives. If your system is configured with twelve 3.5-inch storage drives or twenty-four 2.5-inch
storage drives, see “Remove the Fan Assembly Door From the Server” on page 65.
1.
Ensure that the AC power cords are disconnected from the server power
supplies.
2.
Using a No. 2 Phillips screwdriver, remove the twelve screws (three on each top
side of the chassis, and three on each side of the chassis) that secure the disk
cage cover [1].
Preparing for Service67
Page 68
Remove the Disk Cage Cover From the Server
FIGURE 11
Removing the Disk Cage Cover
3.
Slide the cover forward and lift to remove the disk cage cover from the chassis
[2].
Related Information
■
“Take Antistatic Measures” on page 64
■
“Remove the Server Top Cover” on page 64
■
“Install the Server Top Cover” on page 226
68Oracle Server X5-2L Service Manual • May 2015
Page 69
■
“Install the Disk Cage Cover” on page 229
Remove the Disk Cage Cover From the Server
Preparing for Service69
Page 70
70Oracle Server X5-2L Service Manual • May 2015
Page 71
Servicing CRUs That Do Not Require Server
Power-Off
These sections describe how to service customer-replaceable units (CRUs) that do not
require you to power off the server. For more information about CRUs, see “Illustrated Parts
Breakdown” on page 19 and “Customer-Replaceable Units” on page 21.
DescriptionLinks
Service storage and boot drives.“Servicing Storage Drives and Rear Drives
Service fan modules.“Servicing Fan Modules (CRU)” on page 89
Service power supplies.“Servicing Power Supplies (CRU)” on page 93
Related Information
■
“Servicing CRUs That Require Server Power-Off”
■
“Servicing FRUs”
(CRU)” on page 71
Servicing Storage Drives and Rear Drives (CRU)
These sections describe how to remove and install storage drives.
■
“Storage Drives Hot-Plug Conditions” on page 72
■
“Storage Drive Failure and RAID” on page 72
■
“Removing and Replacing an HDD or SSD Storage Drive” on page 73
■
“Removing and Replacing an NVMe Storage Drive Using Oracle Solaris” on page 79
■
“Removing and Replacing an NVMe Storage Drive Using Oracle Linux” on page 82
■
“Removing and Replacing a Rear Storage Drive” on page 86
Related Information
■
“Servicing SAS Cables (FRU)” on page 167
Servicing CRUs That Do Not Require Server Power-Off71
Page 72
Servicing Storage Drives and Rear Drives (CRU)
■
“Servicing NVMe Cables (FRU)” on page 174
Storage Drives Hot-Plug Conditions
The hard-disk drives (HDDs), solid-state drives (SSDs), and NVMe SSD drives installed in the
server are hot-pluggable, but this capability depends on how the drives are configured. To hotplug a drive you must be able to take the drive offline before you can remove it. When you take
the drive offline, you prevent any application from accessing the drive and remove the logical
software links to the drive.
The following conditions inhibit the ability to perform hot-plugging of a drive:
■
The drive provides the operating system, and the operating system is not mirrored on
another drive.
■
The drive cannot be logically isolated from the online operations of the server.
If either of the disk drive conditions is true, then you must shut down the system before you
replace the drive. See “Powering Down the Server” on page 55.
Note - Replacing a drive does not require removing the server from a rack.
Related Information
■
“Storage and Boot Drive Indicators” on page 32
■
“Removing and Replacing an HDD or SSD Storage Drive” on page 73
■
“Removing and Replacing an NVMe Storage Drive Using Oracle Solaris” on page 79
■
“Removing and Replacing an NVMe Storage Drive Using Oracle Linux” on page 82
■
“Removing and Replacing a Rear Storage Drive” on page 86
Storage Drive Failure and RAID
A single storage drive failure does not cause a data failure if the storage drives are configured
as a mirrored RAID 1 volume (optional). The storage drive can be removed, and when a new
storage drive is inserted, the contents are automatically rebuilt from the rest of the array with no
need to reconfigure the RAID parameters. If the replaced storage drive was configured as a hotspare, the new HDD is automatically configured as a new hot-spare.
For information about the implementation of RAID on this server, refer to “Configuring Server
Drives for OS Installation” in “Oracle Server X5-2L Installation Guide ”.
72Oracle Server X5-2L Service Manual • May 2015
Page 73
Remove an HDD or SSD Storage Drive
Removing and Replacing an HDD or SSD Storage
Drive
The following sections describe how to remove and replace an HDD or SSD storage drive.
■
“Remove an HDD or SSD Storage Drive” on page 73
■
“Install an HDD or SSD Storage Drive” on page 78
Remove an HDD or SSD Storage Drive
1.
Prepare the system for the drive removal.
Refer to “Take Antistatic Measures” on page 64.
2.
Identify the location of the drive that you want to remove.
The following illustrations show the location of the drives and the internal system software
designations for the drives. For information on storage drive status indicators (LEDs), refer to
“Storage and Boot Drive Indicators” on page 32.
FIGURE 12
Figure Legend
1
Storage drive 0 (HDD/SSD)
2
Storage drive 1 (HDD/SSD)
3
Storage drive 2 (HDD/SSD/NVMe 0)
4
Storage drive 3 (HDD/SSD/NVMe 1)
5
Storage drive 4 (HDD/SSD/NVMe 2)
6
Storage drive 5 (HDD/SSD/NVMe 3)
Drive Location and Numbering on a Server With Eight 2.5-Inch Drives
Servicing CRUs That Do Not Require Server Power-Off73
Page 74
Remove an HDD or SSD Storage Drive
7
Storage drive 6 (HDD/SSD)
8
Storage drive 7 (HDD/SSD)
9
Optional SATA DVD drive
FIGURE 13
Figure Legend
1
Storage drive 0 (HDD/SSD)
2
Storage drive 4 (HDD/SSD)
3
Storage drive 8 (HDD/SSD)
4
Storage drive 1 (HDD/SSD)
5
Storage drive 5 (HDD/SSD)
6
Storage drive 9 (HDD/SSD)
7
Storage drive 2 (HDD/SSD)
8
Storage drive 6 (HDD/SSD)
9
Storage drive 10 (HDD/SSD)
10
Storage drive 3 (HDD/SSD)
11
Storage drive 7 (HDD/SSD)
12
Storage drive 11 (HDD/SSD)
Drive Location and Numbering on a Server With Twelve 3.5-Inch Drives
74Oracle Server X5-2L Service Manual • May 2015
Page 75
Remove an HDD or SSD Storage Drive
FIGURE 14
Figure Legend
1
Storage drives 0 through 11 (HDD/SSD; Storage drives 3 and 4: HDD/SSD/NVMe)
2
Storage drives 12 through 23 (HDD/SSD; Storage drives 19 and 20: HDD/SSD/NVMe)
3.
If you are removing an HDD or SSD storage drive, type the required operating
Drive Location and Numbering on a Server With Twenty-Four 2.5-Inch Drives
system commands to stop the server from using the drive; otherwise, proceed to
the next step.
The exact commands required depend on the configuration of your drives. Unmount file
systems, or issue RAID commands as needed.
Note - The blue OK to Remove status indicator LED on the drive may or may not light, as
support for this varies depending on the operating system in use.
4.
On the drive you plan to remove, push the latch release button to open the drive
latch [1, 2].
Servicing CRUs That Do Not Require Server Power-Off75
Page 76
Remove an HDD or SSD Storage Drive
FIGURE 15
Opening the Drive Latch
Figure Legend
1
Pressing the latch release button.
2
Opening the latch.
Caution - The latch is not an ejector. Do not open the latch too far to the right. Doing so can
damage the latch.
5.
Grasp the latch and pull the drive out of the drive slot.
76Oracle Server X5-2L Service Manual • May 2015
Page 77
Remove an HDD or SSD Storage Drive
FIGURE 16
Removing a Storage Drive
6.
Consider your next steps:
■
If you are replacing the drive, continue to “Install an HDD or SSD Storage
Drive” on page 78.
■
If you are not replacing the drive, install a filler panel in the empty drive slot to maintain
proper airflow and perform administrative tasks to configure the server to operate without
the drive.
For information on how to install a storage drive filler panel, see “Remove and Install Filler
Panels” on page 226.
Related Information
■
“Storage and Boot Drive Indicators” on page 32
■
“Storage Drives Hot-Plug Conditions” on page 72
■
“Storage Drive Failure and RAID” on page 72
■
“Install an HDD or SSD Storage Drive” on page 78
■
“Install a Rear Storage Drive” on page 88
Servicing CRUs That Do Not Require Server Power-Off77
Page 78
Install an HDD or SSD Storage Drive
Install an HDD or SSD Storage Drive
1.
Remove the replacement drive from its packaging, and place the drive on an
antistatic mat.
2.
If necessary, remove the drive filler panel.
3.
Align the replacement drive with the drive slot.
The drive is physically addressed according to the slot in which it is installed. It is important to
install a replacement drive in the same slot as the drive that was removed.
4.
Slide the drive into the slot until the drive is fully seated.
FIGURE 17
Installing a Storage Drive
5.
Close the drive latch to lock the drive in place.
6.
Perform administrative procedures to reconfigure the drive.
The procedures that you perform at this point depend on how your data is configured. You
might need to partition the drive, create file systems, load data from backups, or have the drive
updated from a RAID configuration.
78Oracle Server X5-2L Service Manual • May 2015
Page 79
Unmount an NVMe Storage Drive
Related Information
■
“Storage and Boot Drive Indicators” on page 32
■
“Storage Drives Hot-Plug Conditions” on page 72
■
“Storage Drive Failure and RAID” on page 72
■
“Remove an HDD or SSD Storage Drive” on page 73
■
“Remove a Rear Storage Drive” on page 86
Removing and Replacing an NVMe Storage Drive
Using Oracle Solaris
Note - NVMe storage drives are supported only on servers that are running Oracle Solaris or
Oracle Linux operating systems. Servers that are running Oracle VM, Windows Server, Red
Hat Enterprise Linux, SUSE Linux Enterprise Server, or VMware ESXi do not support NVMe
drives.
The following sections describe how to remove and replace an NVMe storage drive on a server
that is running the Oracle Solaris operating system.
■
“Unmount an NVMe Storage Drive” on page 79
■
“Remove an NVMe Storage Drive From the Server” on page 80
■
“Verify Removal of an NVMe Storage Drive” on page 80
■
“Install an NVMe Storage Drive in the Server” on page 81
■
“Power On an NVMe Storage Drive and Attach a Device Driver” on page 81
Unmount an NVMe Storage Drive
1.
Log in to Oracle Solaris that is running on the server.
2.
To find the NVMe drive slot number, type:
# hotplug list –lc
This command produces output similar to the following for each of the NVMe drives installed
in the server:
# hotplug list –lc
Connection State Description
Path
Verify that the OK to Remove indicator (LED) on the NVMe storage drive is lit.
Remove an NVMe Storage Drive From the Server
Perform this procedure to physically remove an NVMe storage drive from the server.
1.
Identify the physical location of the NVMe drive that you want to remove.
For storage drive locations, see “Remove an HDD or SSD Storage Drive” on page 73.
2.
Verify that the OK to Remove indicator (LED) on the NVMe storage drive is lit.
3.
On the drive you plan to remove, push the latch release button to open the drive
latch.
See Figure 15.
4.
Grasp the latch and pull the drive out of the drive slot.
5.
Consider your next steps:
■
If you are replacing the drive, continue to “Verify Removal of an NVMe
Storage Drive” on page 80, and then “Install an NVMe Storage Drive in the
Server” on page 81.
■
If you are not replacing the drive, install a filler panel in the empty drive slot to maintain
proper airflow and perform administrative tasks to configure the server to operate without
the drive.
For information on how to install a storage drive filler panel, see “Remove and Install Filler
Panels” on page 226.
Verify Removal of an NVMe Storage Drive
To verify that the NVMe drive has been removed, type:
# hotplug list –lc
The following output appears for the NVMe drive that you removed:
80Oracle Server X5-2L Service Manual • May 2015
Page 81
Install an NVMe Storage Drive in the Server
# hotplug list –lc
Connection State Description
Path
Perform this procedure to physically install an NVMe storage drive into the server.
Note - After you physically remove an NVMe storage drive from the server, wait at least 10
seconds before installing a replacement drive.
1.
Remove the replacement drive from its packaging and place the drive on an
antistatic mat.
2.
If necessary, remove the drive filler panel.
3.
Align the replacement drive with the drive slot.
The drive is physically addressed according to the slot in which it is installed. It is important to
install a replacement drive in the same slot as the drive that was removed.
4.
Slide the drive into the slot until the drive is fully seated.
See Figure 17.
5.
Close the drive latch to lock the drive in place.
Power On an NVMe Storage Drive and Attach a Device Driver
1.
Type: # hotplug enable pcie13
2.
Type: # hotplug list –lc
The following status is displayed for the NVMe storage drive.
# hotplug list –lc
Connection State Description
Path
To check the NVMe drive health, firmware level, temperature, get error log,
SMART data, security erase, low level format, etc., type:
# nvmeadm list
Servicing CRUs That Do Not Require Server Power-Off81
Page 82
Unmount an NVMe Storage Drive
Related Information
■
“Storage and Boot Drive Indicators” on page 32
■
“Storage Drives Hot-Plug Conditions” on page 72
■
“Storage Drive Failure and RAID” on page 72
■
“Remove an HDD or SSD Storage Drive” on page 73
■
“Install an HDD or SSD Storage Drive” on page 78
Removing and Replacing an NVMe Storage Drive
Using Oracle Linux
Note - NVMe storage drives are supported only on servers that are running Oracle Solaris or
Oracle Linux operating systems. Servers that are running Oracle VM, Windows Server, Red
Hat Enterprise Linux, SUSE Linux Enterprise Server, or VMware ESXi do not support NVMe
drives.
The following sections describe how to remove and replace an NVMe storage drive on a server
that is running the Oracle Linux operating system.
■
“Unmount an NVMe Storage Drive” on page 82
■
“Remove an NVMe Storage Drive” on page 84
■
“Verify Removal of an NVMe Storage Drive” on page 84
■
“Install an NVMe Storage Drive in the Server” on page 85
■
“Power On an NVMe Storage Drive and Attach a Device Driver” on page 85
■
“Verify Operation of an NVMe Storage Drive” on page 85
Unmount an NVMe Storage Drive
1.
Log in to Oracle Linux that is running on the server.
2.
Remove the NVMe storage device path.
a.
To find the PCIe addresses (Bus Device Function), type:
# find /sys/devices |egrep ‘nvme[0-9][0-9]?$’
This commands returns output similar to the following:
In the above listing, the PCIe addresses are highlighted in bold text.
82Oracle Server X5-2L Service Manual • May 2015
Page 83
Unmount an NVMe Storage Drive
b.
To obtain the slot number (APIC ID) for the bus address, type the following
command to list all of the PCIe slot numbers and the corresponding bus
addresses:
# egrep –H ‘.*’ /sys/bus/pci/slots/*/address
This commands returns output similar to the following:
In the above output, the bus addresses for the corresponding NVMe instances are
highlighted in bold text.
Note - In the above output, notice that the instance names for the NVMe drives does not
correspond to the NVMe drive labels on the front of the server, that is, pci/slots/12/address:
0000:b2:00 corresponds to instance nvme0; however, on the front of the server, this drive is
labeled NVMe2. For a table that shows the relationship between the pci/slot# and the NVMe
storage drive label on the front of the server, see “Server Operating System Names for the
NVMe Storage Drives” on page 45.
c.
Disconnect all users of the NVMe drive and back up the NVMe drive data as
needed.
i.
Use the umount command to unmount any file systems that are mounted
on the device.
ii.
Remove the device from any multiple device (md) and Logical Volume
Manager (LVM) volume using it.
If the device is a member of an LVM Volume group, then it may be necessary
to move data off the device using the pvmove command, then use the vgreduce
command to remove the physical volume, and (optionally) pvremove to remove the
LVM meta data from the disk.
iii.
If the device uses multipathing, run multipath -l and note all the paths
to the device. Then, remove the multipathed device using the multipath
-f device command.
iv.
Run the blockdev --flushbufs device command to flush any outstanding
I/O to all paths to the device.
Servicing CRUs That Do Not Require Server Power-Off83
Page 84
Remove an NVMe Storage Drive
3.
To prepare the NVMe drive for removal, that is, detach the NVMe device driver
and power off the NVMe drive slot, type:
# echo 0 > /sys/bus/pci/slots/$slot/power
Where $slot is the slot number obtained in step Step 2.b above.
4.
Verify that the OK to Remove indicator (LED) on the NVMe drive is lit.
Remove an NVMe Storage Drive
Perform this procedure to physically remove an NVMe storage drive from the server.
1.
Identify the location of the NVMe drive that you want to remove.
For storage drive locations, see “Remove an HDD or SSD Storage Drive” on page 73.
2.
Verify that the OK to Remove indicator (LED) on the NVMe drive is lit.
3.
On the NVMe drive you plan to remove, push the latch release button to open the
drive latch.
See Figure 15.
4.
Grasp the latch and pull the drive out of the drive slot.
5.
Consider your next steps:
■
If you are replacing the drive, continue to “Verify Removal of an NVMe
Storage Drive” on page 84, and then “Install an NVMe Storage Drive in the
Server” on page 85.
■
If you are not replacing the drive, install a filler panel in the empty drive slot to maintain
proper airflow and perform administrative tasks to configure the server to operate without
the drive.
For information on how to install a storage drive filler panel, see “Remove and Install Filler
Panels” on page 226.
Verify Removal of an NVMe Storage Drive
1.
To check the NVMe drive enumeration so as to verify that NVMe drive has been
removed, type:
# lspci -nnd :0953
2.
View the command output and verify that the entry for the slot number that was
disabled no longer appears.
This commands returns output similar to the following:
Perform this procedure to physically install an NVMe storage drive into the server.
Note - After you physically remove an NVMe drive from the server, wait at least 10 seconds
before installing a replacement drive.
1.
Remove the replacement drive from its packaging and place the drive on an
antistatic mat.
2.
If necessary, remove the drive filler panel.
3.
Align the replacement drive with the drive slot.
The drive is physically addressed according to the slot in which it is installed. It is important to
install a replacement drive in the same slot as the drive that was removed.
4.
Slide the drive into the slot until the drive is fully seated.
See Figure 17.
5.
Close the drive latch to lock the drive in place.
Power On an NVMe Storage Drive and Attach a Device Driver
To power on the slot and attach the device driver, type:
# echo 1 /sys/bus/pci/slots/$slot/power
Where $slot is the slot number for the NVMe storage drive.
Verify Operation of an NVMe Storage Drive
To verify that an NVMe drive is operating properly, do one of the following:
■
Check the /var/log/messages log file.
■
Type: ls -l /dev/nvme*.
Related Information
■
“Storage and Boot Drive Indicators” on page 32
Servicing CRUs That Do Not Require Server Power-Off85
Page 86
Remove a Rear Storage Drive
■
“Storage Drives Hot-Plug Conditions” on page 72
■
“Storage Drive Failure and RAID” on page 72
■
“Remove an HDD or SSD Storage Drive” on page 73
■
“Install an HDD or SSD Storage Drive” on page 78
Removing and Replacing a Rear Storage Drive
The following sections describe how to remove and replace a rear storage drive.
■
“Remove a Rear Storage Drive” on page 86
■
“Install a Rear Storage Drive” on page 88
Remove a Rear Storage Drive
Note - Rear storage drives are supported in the twelve 3.5-inch storage drive and twenty-four
2.5-inch storage drive configurations only. Rear storage drives are not supported in the eight
2.5-inch storage drive configuration.
1.
Prepare the system for the drive removal.
Refer to “Take Antistatic Measures” on page 64.
2.
Identify the rear storage drive you want to remove.
For specific storage drive locations, see “Rear Panel Components and Cable
Connections” on page 18. For information on storage drive status indicators (LEDs), refer to
“Storage and Boot Drive Indicators” on page 32.
3.
Type the operating system commands required to stop the server from using the
drive.
The exact commands required depend on the configuration of your drives. Unmount file
systems, or issue RAID commands as needed.
Note - The blue OK to Remove status indicator LED on the drive may or may not light, as
support for this varies depending on the operating system in use.
4.
Remove the rear storage drive.
a.
On the storage drive that you want to remove, push the drive release button
to open the latch [1].
86Oracle Server X5-2L Service Manual • May 2015
Page 87
Remove a Rear Storage Drive
b.
Grasp the drive latch, and pull the drive out of the drive slot [2].
Caution - The drive latch is not an ejector. Do not open the latch too far to the right. Doing so
can damage the latch.
FIGURE 18
5.
Consider your next steps:
■
If you are replacing the drive, continue to “Install a Rear Storage Drive” on page 88.
■
If you are not replacing the drive, install a filler panel in the empty drive slot to maintain
proper airflow and perform administrative tasks to configure the server to operate without
the drive.
For information on how to install a storage drive filler panel, see “Remove and Install Filler
Panels” on page 226.
Removing a Rear-Mounted Storage Drive
Related Information
■
“Rear Panel Components and Cable Connections” on page 18
■
“Storage and Boot Drive Indicators” on page 32
■
“Remove an HDD or SSD Storage Drive” on page 73
Servicing CRUs That Do Not Require Server Power-Off87
Page 88
Install a Rear Storage Drive
Install a Rear Storage Drive
Note - Rear storage drives are supported in the twelve 3.5-inch storage drive and twenty-four
2.5-inch storage drive configurations only. Rear storage drives are not supported in the eight
2.5-inch storage drive configuration.
1.
If necessary, remove the drive filler panel.
2.
Determine the drive slot location for the replacement drive.
If you removed an existing drive from a slot in the system, you must install the replacement
drive in the same slot as the drive that was removed. Drives are physically addressed
according to the slot in which they are installed. See “Rear Panel Components and Cable
Connections” on page 18 for boot drive locations.
3.
Slide the drive into the drive slot until it is fully seated.
FIGURE 19
Installing a Rear-Mounted Storage Drive
4.
Close the drive latch to lock the drive in place.
5.
Perform administrative procedures to reconfigure the drive.
88Oracle Server X5-2L Service Manual • May 2015
Page 89
The procedures that you perform at this point depend on how your data is configured. You
might need to partition the drive, create file systems, load data from backups, or have the drive
updated from a RAID configuration.
Related Information
■
“Rear Panel Components and Cable Connections” on page 18
■
“Storage and Boot Drive Indicators” on page 32
■
“Install an HDD or SSD Storage Drive” on page 78
Servicing Fan Modules (CRU)
The fan modules are located in the front of the server. See the following procedures:
■
“Remove a Fan Module” on page 89
■
“Install a Fan Module” on page 92
Servicing Fan Modules (CRU)
Related Information
■
“Remove the Server Top Cover” on page 64
Remove a Fan Module
You do not have to power off the server to service fan modules.
You should not begin this procedure unless you have a replacement fan module and you are
ready to install it right away.
1.
Extend the server to the maintenance position.
See “Extend the Server to the Maintenance Position” on page 61.
2.
To access the fan modules, open the top cover fan assembly door.
Caution - Close the top cover fan assembly door within 60 seconds to maintain adequate
airflow to properly cool the server. Leaving the door open for more than 60 seconds, while the
server is running, might cause the server to automatically shut down.
3.
Identify the faulty fan module.
Servicing CRUs That Do Not Require Server Power-Off89
Page 90
Remove a Fan Module
Each fan module has a fan status indicator (LED) that is located next to the module. If the LED
is amber, the fan has failed. The location of the fan status LED is shown in the following figure.
FIGURE 20
Fan Module Locations and Fan Status Indicators
LED Color and State Meaning
Amber – The fan module is faulty. The front Top Fan LED and the front and rear panel Service Required LEDs are
also lit if the system detects a fan module fault.
4.
Use a No. 2 Phillips screwdriver to loosen the captive screw that secures the
faulty fan module in the chassis [1].
90Oracle Server X5-2L Service Manual • May 2015
Page 91
Remove a Fan Module
FIGURE 21
Removing a Fan Module
5.
Grasp both the captive screw and the opposite end of the module and lift the fan
module straight up and out of the chassis, and set it aside on an antistatic mat
[2].
Caution - When removing a fan module, do not rock it back and forth. Rocking the fan module
can cause damage to the motherboard connectors.
Caution - Do not service any other components in the fan compartment unless the system is
shut down and the power cords are removed.
6.
Consider your next step:
■
If you removed the fan assembly as part of another procedure, return to that procedure.
■
Otherwise, continue to “Install a Fan Module” on page 92.
Servicing CRUs That Do Not Require Server Power-Off91
Page 92
Install a Fan Module
1.
2.
Related Information
■
“Install a Fan Module” on page 92
Install a Fan Module
Remove the replacement fan module from its packaging, and place it on an
antistatic mat.
With the top cover fan assembly door open, install the replacement fan module
into the server [1].
The fan modules are keyed to ensure that they are installed in the correct orientation.
FIGURE 22
Installing a Fan Module
92Oracle Server X5-2L Service Manual • May 2015
Page 93
Servicing Power Supplies (CRU)
3.
Press down on the fan module and apply firm pressure to fully seat the fan
module.
4.
Verify that the fan module status indicator (LED) on the replacement fan module
is not illuminated.
5.
Using your thumb and forefinger, tighten the captive screw to secure the fan
module to the chassis. Then use a No. 2 Phillips screwdriver to tighten the screw
an additional 1/4 turn to secure the fan module to the chassis [2].
6.
Close the top cover fan assembly door.
7.
Verify that the Top Fan Fault LED on the front of the server and the Service
Required LEDs on the front and rear of the server are extinguished.
See “Server System-Level Status Indicators” on page 31 for more information about identifying
and interpreting system status indicators.
8.
Consider your next step:
■
If you installed the fan module as part of another procedure, return to that procedure.
■
Otherwise, return the server to operation. See “Returning the Server to Operation”.
Related Information
■
“Remove a Fan Module” on page 89
Servicing Power Supplies (CRU)
The server's redundant power supplies support concurrent maintenance, which enables you to
remove and replace a power supply without shutting down the server, provided that the other
power supply is online and working.
The server supports model A258 (1000 watt) power supplies. The A258 power supply unit
(PSU) provides conversion from the AC lines to the system, accepting ranges from 100-240
volts AC (VAC). These PSUs are designed to be hot-swappable, and in most cases will provide
fully redundant “1+1” power, allowing the system to suffer the loss of a PSU or an AC feed
with no loss to system availability.
In maximally configured systems, it is possible that the worst-case power consumption of the
system could exceed the capacity of a single PSU. The PSUs provide an over-subscription
mode, which allows the system to operate with fault-tolerance even with modest excursions
beyond the rated capacity of a single PSU. This over-subscription support is accomplished
Servicing CRUs That Do Not Require Server Power-Off93
Page 94
Remove a Power Supply
using hardware signaling between the PSU and motherboard circuitry, which can force the
system to throttle CPU and memory power in the event that a PSU is lost. The resulting power
savings will be enough to allow the system to continue to run (in a lower-performance state)
until the power problem is resolved.
The following status indicators (LEDs) are lit when a power supply fault is detected:
■
■
■
If a power supply fails and you do not have a replacement available, leave the failed power
supply installed to ensure proper airflow in the server. For more information, refer to the
following topic and procedures:
■
■
■
Front and rear Service Required LEDs
Amber Service Required LED on the faulty power supply
Rear Power Supply Fault LED located on the server front panel
“Power Supply Status Indicators” on page 33
“Remove a Power Supply” on page 94
“Install a Power Supply” on page 97
Related Information
■
“Troubleshooting Using the Server Front and Rear Panel Status Indicators ” on page 30
■
“Powering Down the Server” on page 55
Remove a Power Supply
1.
Prepare the system for power supply removal.
Refer to “Take Antistatic Measures” on page 64.
2.
Identify which power supply requires replacement.
94Oracle Server X5-2L Service Manual • May 2015
Page 95
Remove a Power Supply
FIGURE 23
Figure Legend
1
Power supply 0
2
Power supply 1
Power Supplies and Latches Locations
A lit amber Service Required LED on a power supply indicates that a failure was detected.
You can also use the Oracle ILOM showfaulty command at the Oracle ILOM command-line
prompt (->) to identify a power supply failure.
Alternatively, to list all known faults in the server, log into the Oracle Solaris OS and issue
the fmadm faulty command, or log into the Oracle ILOM service processor from the Oracle
ILOM Fault Management Shell and issue the fmadm faulty command. For more information
about how to use the Oracle ILOM Fault Management Shell and supported commands, see the
Oracle ILOM User's Guide for System Monitoring and Diagnostics Firmware Release 3.2.x in
the Oracle Integrated Lights Out Manager (ILOM) 3.2 Documentation Library at http://www.
oracle.com/goto/ILOM/docs.
Note - The fans of a failed power supply might still be spinning when the system is powered on.
The fans will stop spinning when you disconnect the power cords.
Servicing CRUs That Do Not Require Server Power-Off95
Page 96
Remove a Power Supply
3.
Gain access to the rear of the server where the faulty power supply is located.
4.
If the cable management arm (CMA) is installed, disconnect both CMA left-side
connectors and move the CMA out of the way.
For instructions for disconnecting the CMA left-side connectors, refer to “Remove the Cable
Management Arm” in “Oracle Server X5-2L Installation Guide ”.
Caution - When disconnecting the CMA left-side connectors, be sure to use your arm to
support the CMA so that it does not hang down under its own weight and stress the right-side
connectors; otherwise, the CMA might be damaged. You must continue to support the CMA
until you have reconnected both of the left-side connectors.
5.
Disconnect the power cord from the faulty power supply.
6.
Grasp the power supply handle and push the power supply latch to the left [1].
FIGURE 24
7.
Pull the power supply out of the chassis [2].
Caution - Whenever you remove a power supply, you should replace it with another power
Removing a Power Supply
supply; otherwise, the server might overheat due to improper airflow.
8.
Continue to “Install a Power Supply” on page 97.
96Oracle Server X5-2L Service Manual • May 2015
Page 97
Install a Power Supply
Related Information
■
“Rear Panel Components and Cable Connections” on page 18
■
“Power Supply Status Indicators” on page 33
■
“Install a Power Supply” on page 97
Install a Power Supply
Caution - Always replace the failed power supply with the same type model of power supply.
1.
Remove the replacement power supply from its packaging, and place it on an
antistatic mat.
2.
Align the replacement power supply with the empty power supply slot.
3.
Slide the power supply into the slot until it is fully seated.
You will hear an audible click when the power supply fully seats.
Servicing CRUs That Do Not Require Server Power-Off97
Page 98
Install a Power Supply
FIGURE 25
Installing a Power Supply
4.
Reconnect the power cord to the power supply.
5.
Verify that the amber Service Required LED on the replaced power supply and
the Service Required LEDs are not lit on the front and rear panels.
Note - After you have replaced Power Supply 0, you might need to reset the Oracle ILOM
service processor (SP) to propagate the key identity properties (KIP) data to the new power
supply. For instructions on resetting the SP, see the Oracle Integrated Lights Out Manager
(ILOM) 3.2 Configuration and Maintenance Guide at http://www.oracle.com/goto/ILOM/
docs. Power Supply 1 does not contain KIP data, and therefore does not require an SP reset
after replacement.
6.
If you disconnected the two CMA left-side connectors, reconnect the
connectors.
For instructions for reconnecting the CMA left-side connectors, refer to “Install the Cable
Management Arm” in “Oracle Server X5-2L Installation Guide ”.
98Oracle Server X5-2L Service Manual • May 2015
Page 99
Related Information
■
“Rear Panel Components and Cable Connections” on page 18
■
“Power Supply Status Indicators” on page 33
■
“Remove a Power Supply” on page 94
Install a Power Supply
Servicing CRUs That Do Not Require Server Power-Off99
Page 100
100Oracle Server X5-2L Service Manual • May 2015
Loading...
+ hidden pages
You need points to download manuals.
1 point = 1 manual.
You can buy points or you can get point for every manual you upload.