This software and related documentation are provided under a license agreement containing restrictions on use and disclosure and are protected by intellectual property laws. Except
as expressly permitted in your license agreement or allowed by law, you may not use, copy, reproduce, translate, broadcast, modify, license, transmit, distribute, exhibit, perform,
publish, or display any part, in any form, or by any means. Reverse engineering, disassembly, or decompilation of this software, unless required by law for interoperability, is
prohibited.
The information contained herein is subject to change without notice and is not warranted to be error-free. If you find any errors, please report them to us in writing.
If this is software or related documentation that is delivered to the U.S. Government or anyone licensing it on behalf of the U.S. Government, then the following notice is applicable:
U.S. GOVERNMENT END USERS: Oracle programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation,
delivered to U.S. Government end users are "commercial computer software" pursuant to the applicable Federal Acquisition Regulation and agency-specific supplemental
regulations. As such, use, duplication, disclosure, modification, and adaptation of the programs, including any operating system, integrated software, any programs installed on the
hardware, and/or documentation, shall be subject to license terms and license restrictions applicable to the programs. No other rights are granted to the U.S. Government.
This software or hardware is developed for general use in a variety of information management applications. It is not developed or intended for use in any inherently dangerous
applications, including applications that may create a risk of personal injury. If you use this software or hardware in dangerous applications, then you shall be responsible to take all
appropriate fail-safe, backup, redundancy, and other measures to ensure its safe use. Oracle Corporation and its affiliates disclaim any liability for any damages caused by use of this
software or hardware in dangerous applications.
Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners.
Intel and Intel Xeon are trademarks or registered trademarks of Intel Corporation. All SPARC trademarks are used under license and are trademarks or registered trademarks of
SPARC International, Inc. AMD, Opteron, the AMD logo, and the AMD Opteron logo are trademarks or registered trademarks of Advanced Micro Devices. UNIX is a registered
trademark of The Open Group.
This software or hardware and documentation may provide access to or information about content, products, and services from third parties. Oracle Corporation and its affiliates are
not responsible for and expressly disclaim all warranties of any kind with respect to third-party content, products, and services unless otherwise set forth in an applicable agreement
between you and Oracle. Oracle Corporation and its affiliates will not be responsible for any loss, costs, or damages incurred due to your access to or use of third-party content,
products, or services, except as set forth in an applicable agreement between you and Oracle.
Access to Oracle Support
Oracle customers that have purchased support have access to electronic support through My Oracle Support. For information, visit http://www.oracle.com/pls/topic/lookup?
ctx=acc&id=info or visit http://www.oracle.com/pls/topic/lookup?ctx=acc&id=trs if you are hearing impaired.
Ce logiciel et la documentation qui l'accompagne sont protégés par les lois sur la propriété intellectuelle. Ils sont concédés sous licence et soumis à des restrictions d'utilisation et
de divulgation. Sauf stipulation expresse de votre contrat de licence ou de la loi, vous ne pouvez pas copier, reproduire, traduire, diffuser, modifier, accorder de licence, transmettre,
distribuer, exposer, exécuter, publier ou afficher le logiciel, même partiellement, sous quelque forme et par quelque procédé que ce soit. Par ailleurs, il est interdit de procéder à toute
ingénierie inverse du logiciel, de le désassembler ou de le décompiler, excepté à des fins d'interopérabilité avec des logiciels tiers ou tel que prescrit par la loi.
Les informations fournies dans ce document sont susceptibles de modification sans préavis. Par ailleurs, Oracle Corporation ne garantit pas qu'elles soient exemptes d'erreurs et vous
invite, le cas échéant, à lui en faire part par écrit.
Si ce logiciel, ou la documentation qui l'accompagne, est livré sous licence au Gouvernement des Etats-Unis, ou à quiconque qui aurait souscrit la licence de ce logiciel pour le
compte du Gouvernement des Etats-Unis, la notice suivante s'applique :
U.S. GOVERNMENT END USERS: Oracle programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation,
delivered to U.S. Government end users are "commercial computer software" pursuant to the applicable Federal Acquisition Regulation and agency-specific supplemental
regulations. As such, use, duplication, disclosure, modification, and adaptation of the programs, including any operating system, integrated software, any programs installed on the
hardware, and/or documentation, shall be subject to license terms and license restrictions applicable to the programs. No other rights are granted to the U.S. Government.
Ce logiciel ou matériel a été développé pour un usage général dans le cadre d'applications de gestion des informations. Ce logiciel ou matériel n'est pas conçu ni n'est destiné à être
utilisé dans des applications à risque, notamment dans des applications pouvant causer un risque de dommages corporels. Si vous utilisez ce logiciel ou ce matériel dans le cadre
d'applications dangereuses, il est de votre responsabilité de prendre toutes les mesures de secours, de sauvegarde, de redondance et autres mesures nécessaires à son utilisation dans
des conditions optimales de sécurité. Oracle Corporation et ses affiliés déclinent toute responsabilité quant aux dommages causés par l'utilisation de ce logiciel ou matériel pour des
applications dangereuses.
Oracle et Java sont des marques déposées d'Oracle Corporation et/ou de ses affiliés. Tout autre nom mentionné peut correspondre à des marques appartenant à d'autres propriétaires
qu'Oracle.
Intel et Intel Xeon sont des marques ou des marques déposées d'Intel Corporation. Toutes les marques SPARC sont utilisées sous licence et sont des marques ou des marques
déposées de SPARC International, Inc. AMD, Opteron, le logo AMD et le logo AMD Opteron sont des marques ou des marques déposées d'Advanced Micro Devices. UNIX est une
marque déposée de The Open Group.
Ce logiciel ou matériel et la documentation qui l'accompagne peuvent fournir des informations ou des liens donnant accès à des contenus, des produits et des services émanant de
tiers. Oracle Corporation et ses affiliés déclinent toute responsabilité ou garantie expresse quant aux contenus, produits ou services émanant de tiers, sauf mention contraire stipulée
dans un contrat entre vous et Oracle. En aucun cas, Oracle Corporation et ses affiliés ne sauraient être tenus pour responsables des pertes subies, des coûts occasionnés ou des
dommages causés par l'accès à des contenus, produits ou services tiers, ou à leur utilisation, sauf mention contraire stipulée dans un contrat entre vous et Oracle.
Accès aux services de support Oracle
Les clients Oracle qui ont souscrit un contrat de support ont accès au support électronique via My Oracle Support. Pour plus d'informations, visitez le site http://www.oracle.com/
pls/topic/lookup?ctx=acc&id=info ou le site http://www.oracle.com/pls/topic/lookup?ctx=acc&id=trs si vous êtes malentendant.
Page 5
Contents
Using This Documentation ............. ................ ................ ................ ................ ... 11
Index ............. ................ ................ ................ ................ ................ ................ ... 257
10Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 11
Using This Documentation
■
Overview – This service manual explains how to remove and replace parts in the Oracle
Exadata Storage Server X5-2 High Capacity, and how to maintain and troubleshoot the
system.
■
Audience – This guide is intended for trained technicians and authorized service personnel
who have been instructed on the hazards within the equipment and qualified to remove and
replace hardware.
■
Required knowledge – Advanced experience troubleshooting and replacing hardware.
Product Documentation Library
Documentation and resources for this product and related products are available at http://www.
oracle.com/goto/x5-2l/docs.
Feedback
Provide feedback about this documentation at http://www.oracle.com/goto/docfeedback.
Using This Documentation11
Page 12
12Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 13
About the Oracle Exadata Storage Server X5-2
High Capacity
These sections describe the controls, connectors, LEDs, system components, and replaceable
components of the server.
Note - Always update the server with the latest firmware, drivers, and other hardware-related
software by downloading the latest software release package when you first receive the server,
and for every new software release.
DescriptionLinks
Review the product description.“Product Description” on page 13
Review the controls and connectors on the server.“About Controls and Connectors” on page 14
Review system components.“About System Components” on page 16
Product Description
The Oracle Exadata Storage Server X5-2 High Capacity is an enterprise-class, two rack unit
(2U) server. It supports the following components:
■
Two Intel E5-2630, 8-core, 2.4 GHz, 85W processors.
■
Four 16-GB DIMMs and four 8-GB DIMMs for a total of 96 GB of memory. Only
RDIMMs are supported.
■
Six PCIe Gen3 slots.
■
Twelve 3.5-inch hot-pluggable SAS HDDs on the front panel.
■
Two hot-pluggable, redundant 1000W power supplies.
■
An on-board Oracle Integrated Lights Out Manager (Oracle ILOM) service processor (SP).
About the Oracle Exadata Storage Server X5-2 High Capacity13
Page 14
About Controls and Connectors
About Controls and Connectors
The following section describe the controls, indicators, connectors, and drives located on the
front and rear panels.
■
“Front Panel Components” on page 14
■
“Rear Panel Components and Cable Connections” on page 15
Related Information
■
“About System Components” on page 16
■
“Illustrated Parts Breakdown” on page 16
■
“Troubleshooting Using the Server Front and Rear Panel Status Indicators ” on page 26
Front Panel Components
Figure
Legend
1Locator LED/Locator button: white
14Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
14System status LEDs: Locator/Button: white; Service Required: amber; Power/OK: green
15 - 16Power supply unit 1 (PS1) and power supply unit 0 (PS0)
Description
10/100/1000BASE-T port (NET MGT)
Related Information
■
“Troubleshooting Using the Server Front and Rear Panel Status Indicators ” on page 26
■
“Disconnect Cables From the Server” on page 54
■
“Reconnect Power and Data Cables” on page 167
■
“Identifying the Server Ports” on page 169
About System Components
These sections describe the components of the server:
■
“Illustrated Parts Breakdown” on page 16
■
“Field-Replaceable Units” on page 18
■
“Field-Replaceable Units” on page 18
Related Information
■
“Servicing Components That Do Not Require Server Power-Off” on page 61
■
“Servicing Components That Require Server Power-Off” on page 77
Illustrated Parts Breakdown
The following figure identifies the major components of the server.
16Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 17
About System Components
Figure
Description
Legend
1Twelve 3.5-inch storage drives
2Front disk backplane for twelve 3.5-inch drives
3Left LED indicator module
About the Oracle Exadata Storage Server X5-2 High Capacity17
Page 18
About System Components
Figure
Legend
4System chassis
5Motherboard assembly
6System battery
7Fan modules
8Processors and heatsinks
9DIMMs
10Air baffle
11Top cover
12Oracle Storage 12 Gb/s SAS PCIe RAID HBA and super capacitor
13PCIe cards
14I/O and internal USB card
15Internal USB drive
16Rear-mounted storage drives
17Rear storage drive backplane
18Power supplies
19Right LED indicator module
Description
Field-Replaceable Units
The following table lists the field-replaceable units in the server and directs you to the
replacement instructions.
CRUDescriptionReplacement Instructions
BatteryLithium coin-cell battery that powers the CMOS BIOS
DIMMsAdd or replace memory in the system.“Servicing the DIMMs” on page 77
Storage drivesReplace one of the twelve 3.5-inch hot-pluggable SAS
Fan modulesContains four fan modules for cooling the server
PCIe cardsAdd-in cards.“Servicing PCIe Cards” on page 86
Power supply unit
(PSU)
Processor and
heatsink
and real-time clock.
HDDs.
components.
Two fully redundant AC-powered power supplies.“Servicing Power Supplies” on page 71
The processor that carries out the instructions of the
system.
“Servicing the Battery” on page 96
“Servicing Storage Drives and Rear
Drives” on page 61
“Servicing Fan Modules” on page 66
“Servicing Processors” on page 99
18Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 19
CRUDescriptionReplacement Instructions
Disk backplanesProvide power and communications connectors for
SAS cablesProvide signals between the front disk backplane and
Oracle Storage 12
Gb SAS PCIe RAID
HBA
Front LED/USB
indicator modules
Motherboard
assembly
storage drives.
the Oracle Storage 12 Gb SAS PCIe RAID HBA,
and from the front disk backplane to the rear disk
backplane.
Located in PCIe slot 6, the Sun Storage 12 Gb SAS
PCIe RAID HBA manages SAS storage drives.
Contain the push-button circuitry and LEDs that are
displayed on the bezel of the chassis.
Provides connectors for the DIMMs, processors, PCIe
risers, and other components.
“Servicing the Storage Drive Backplanes” on page 112
“Servicing SAS Cables” on page 118
“Servicing the Oracle Storage 12 Gb/s SAS PCIe RAID
HBA Card” on page 123
“Servicing the Front LED/USB Indicator
Modules” on page 130
“Servicing the Motherboard Assembly” on page 145
Related Information
■
“Illustrated Parts Breakdown” on page 16
■
“Servicing Components That Do Not Require Server Power-Off” on page 61
■
“Servicing Components That Require Server Power-Off” on page 77
About System Components
About the Oracle Exadata Storage Server X5-2 High Capacity19
Page 20
20Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 21
Troubleshooting and Diagnostics
This section includes information about troubleshooting hardware component faults for the
Oracle Exadata Storage Server X5-2 High Capacity. It contains the following topics.
DescriptionLink
Maintenance-related information and procedures that
you can use to troubleshoot and repair server hardware
issues.
Information about software and firmware diagnostic
tools that you can use to isolate problems, monitor the
server, and exercise the server subsystems.
Information about attaching devices to the server to
perform troubleshooting.
Information about contacting Oracle support.“Getting Help” on page 44
Troubleshooting Server Component Hardware Faults
“Troubleshooting Server Component Hardware
Faults” on page 21
“Troubleshooting With Diagnostic Tools” on page 36
“Attaching Devices to the Server” on page 38
This section contains maintenance-related information and procedures that you can use to
troubleshoot and repair server hardware issues. The following topics are covered.
DescriptionSection Links
Troubleshooting overview information and procedure.“Troubleshooting Server Hardware
Resources for troubleshooting and diagnostic
information.
Discerning the server state using the front and rear panel
indicators.
Causes, actions, and preventative measures for problems
related to the cooling subsystem.
Causes, actions, and preventative measures for problems
related to the power subsystem.
Faults” on page 22
“Troubleshooting and Diagnostic
Information” on page 26
“Troubleshooting Using the Server Front and Rear Panel
Status Indicators ” on page 26
“Troubleshooting System Cooling
Issues” on page 31
“Troubleshooting Power Issues” on page 33
Troubleshooting and Diagnostics21
Page 22
Troubleshooting Server Component Hardware Faults
Troubleshooting Server Hardware Faults
When a server hardware fault event occurs the system lights the Service Required LED and
captures the event in the system event log (SEL). If you have set up notifications through
Oracle ILOM, you also receive an alert through the notification method you chose. When you
become aware of a hardware fault, you should address it immediately.
To investigate a hardware fault, see the following:
■
“Basic Troubleshooting Process” on page 22
■
“Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface” on page 23
Basic Troubleshooting Process
Use the following process to address a hardware fault (for the step-by-step procedure, see
“Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface” on page 23):
1. Identify the server subsystem containing the fault.
You can use Oracle ILOM to identify the failed component.
2. Prepare the server for service using Oracle ILOM.
If you have determined that the hardware fault requires service (physical access to the
server), use Oracle ILOM to power off the server, activate the Locate LED, and take the
server offline.
3. Prepare the service work space.
Before servicing the server, prepare the work space, ensuring ESD protection for the server
and components.
See “Preparing for Service” on page 47.
4. Service components.
To service the components, see the removal, installation, and replacement procedures in this
document.
Note - A component designated as a field-replaceable unit (FRU) must be replaced by
Oracle Service personnel. Contact Oracle Service.
5. Clear the fault in Oracle ILOM.
Depending on the component, you might need to clear the fault in Oracle ILOM. Generally,
components that have a FRU ID clear the fault automatically.
22Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 23
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Related Information
■
See “Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface” on page 23
Troubleshoot Hardware Faults Using the Oracle ILOM Web
Interface
Note - The screens shown in this procedure might differ from those for your server.
This procedure uses the basic troubleshooting steps described in “Basic Troubleshooting
Process” on page 22.
Use this procedure to troubleshoot hardware faults with the Oracle ILOM web interface and, if
necessary, prepare the server for service.
Note - This procedure provides one basic approach to troubleshooting hardware faults. It
uses a combination of the Oracle ILOM web interface and command-line interface (CLI).
However, the procedure can be performed using only the Oracle ILOM CLI interface. For
more information about the Oracle ILOM web interface and CLI, refer to the Oracle ILOM
documentation.
Before beginning this procedure, obtain the latest version of the system product notes.
1.
On a system with network access to the server, open a browser window.
2.
In the web browser address bar, type the IP address or host name of the service
processor (SP), and then press Enter.
The Oracle ILOM login page appears.
3.
Log in to the Oracle ILOM web interface.
Enter a user name (with administrator privileges) and password at the log-in screen. The Oracle
ILOM Summary Information page appears.
The Status section of the Summary screen provides information about the server subsystems,
including:
■
Processors
■
Memory
■
Power
■
Cooling
Troubleshooting and Diagnostics23
Page 24
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
■
Storage
■
Networking
4.
In the Status section of the Summary Information screen, identify the server
subsystem that requires service.
In the above example, the Status screen shows that the Memory subsystem requires service.
This indicates that a hardware component within the subsystem is in a fault state.
5.
To identify the component, click on Memory in the Status section.
24Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 25
The Memory screen appears.
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
The above example shows the Memory subsystem screen and indicates that DIMM 8 on CPU 0
has an uncorrectable ECC fault.
6.
To get more information, click one of the Open Problems links.
The Open Problems screen provides detailed information, such as the time the event occurred,
the component and subsystem name, and a description of the issue. It also includes a link to an
Oracle KnowledgeBase article.
Tip - The System Log provides a chronological list of all the system events and faults that have
occurred since the log was last reset and includes additional information, such as severity levels
and error counts. The System Log also includes information on devices not reported in the
Subsystem Summary screen. To access the System Log, click the System Log link.
In this example, the hardware fault with DIMM 8 of CPU 0 requires local/physical access to the
server.
7.
To prepare the server for service, see “Preparing for Service” on page 47.
After servicing the component, you might need to clear the fault in Oracle ILOM. For more
information, refer the service procedure for the component.
8.
Service the component.
Troubleshooting and Diagnostics25
Page 26
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Troubleshooting and Diagnostic Information
The following table lists diagnostic and troubleshooting-related procedures and references that
can assist you with resolving server issues.
DescriptionLink
Diagnostic information for the x86 servers, including
procedures for performing runtime and firmware-based
tests, using Oracle ILOM, and running U-Boot and UEFI
Diagnostics tests to exercise the system and isolate subtle
and intermittent hardware-related problems.
Administrative information for the Oracle Server X5
series servers, including information about how to use
the Oracle ILOM system event log (SEL) to identify a
problem's possible source.
Information about troubleshooting front and rear panel
status indicators.
Information about how to manage server hardware faults
using the Oracle ILOM Fault Management Shell.
Oracle x86 Servers Diagnostics, Applications, and
Utilities Guide for Servers with Oracle ILOM 3.1 and
Oracle ILOM 3.2.x
Oracle X5 Series Servers Administration Guide
“Troubleshooting Using the Server Front and Rear Panel
Status Indicators ” on page 26
“Managing Server Hardware Faults Through the Oracle
ILOM Fault Management Shell” on page 36
Troubleshooting Using the Server Front and Rear
Panel Status Indicators
These sections describe the status indicators (LEDs) located on the front and rear of the server,
including those found on components and ports. This section includes the following topics:
■
“Server Boot Process and Normal Operating State Indicators” on page 27
■
“Server System-Level Status Indicators” on page 27
■
“Server Fan Status Indicators” on page 28
■
“Storage and Boot Drive Indicators” on page 29
■
“Power Supply Status Indicators” on page 29
■
“Network Management Port Status Indicators” on page 30
■
“Ethernet Ports Status Indicators” on page 30
■
“Motherboard Status Indicators” on page 31
Related Information
■
“Front Panel Components” on page 14
■
“Rear Panel Components and Cable Connections” on page 15
26Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 27
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Server Boot Process and Normal Operating State Indicators
A normal server boot process involves two indicators, the service processor SP OK LED
indicator and the Power/OK LED indicator.
When AC power is connected to the server, the server boots into standby power mode:
1. The SP OK LED blinks slowly (0.5 seconds on, 0.5 seconds off) while the SP is starting,
and the main Power/OK LED remains off until the SP is ready.
2. After a few minutes, the main Power/OK LED slowly flashes the standby blink pattern (0.1
seconds on, 2.9 seconds off), indicating that the SP (and Oracle ILOM) is ready for use. In
standby power mode, the server is not initialized or fully powered on at this point.
When powering on the server (either by the Power button or Oracle ILOM), the server boots to
full power mode:
1. The Power/OK LED blinks slowly (0.5 seconds on, 0.5 seconds off), and the SP OK LED
remains lit (no blinking).
2. When the server has successfully booted, the Power/OK LED remains lit. When the Power/
OK LED and the SP OK LED indicators remain lit, the server is in full power mode.
TABLE 1
Status
Indicator
Name
Locator LED
and button
Service
Required
Note - The green Power/OK LED indicator and the green SP OK indicator remain lit (no
blinking) when the server is in a normal operating state.
Server System-Level Status Indicators
There are seven system-level status indicators (LEDs), some of which are located on both the
server front panel and the server back panel. The following table describes these indicators.
Server System-Level Status Indicators
IconColorState and Meaning
WhiteIndicates the location of the server.
■ OFF – Server is operating normally.
■ FAST BLINK – Use Oracle ILOM to activate this LED indicator to enable you to locate a
particular system quickly and easily.
■ Pressing the Locate button will toggle the LED indicator fast blink on or off.
AmberIndicates the fault state of the server.
■ OFF – The server is operating normally.
Troubleshooting and Diagnostics27
Page 28
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Status
Indicator
Name
Power/OKGreenIndicates the operational state of the chassis.
SP OKGreenIndicates the state of the service processor.
Top FanTopAmberIndicates that one or more of the internal fan modules have failed.
Rear Power
Supply Fault
System Over
Temperature
Warning
IconColorState and Meaning
■ STEADY ON – A fault is present on the server. This LED indicator lights whenever a fault
indicator lights for a replaceable component on the server.
Note - When this LED indicator is lit, a system console message might appear that includes a
recommended service action.
■ OFF – AC power is not present or the Oracle ILOM boot is not complete.
■ STANDBY BLINK – Standby power is on, but the chassis power is off and the Oracle
ILOM SP is running.
■ SLOW BLINK – Startup sequence has been initiated on the host. This pattern should begin
soon after you power on the server. This status indicates either: power-on self-test (POST)
code checkpoint tests are running on the server host system, or the host is transitioning from
the powered-on state to the standby state on shutdown.
■ STEADY ON – The server is powered on, and all host POST code checkpoint tests
are complete. The server is in one of the following states: the server host is booting the
operating system (OS), or the server host is running the OS.
■ OFF – Service processor (SP) is not running.
■ SLOW BLINK – SP is booting.
■ STEADY ON – SP is fully operational.
■ OFF – Indicates steady state; no service is required.
■ STEADY ON – Indicates service required.
RearAmberIndicates that one of the server power supplies has failed.
■ OFF – Indicates steady state; no service is required.
■ STEADY ON – Indicates service required; service the power supply.
AmberIndicates a warning for an overtemperature condition.
■ OFF – Normal operation; no service is required.
■ STEADY ON – The system is experiencing an overtemperature warning condition.
Note - This is a warning indication, not a fatal overtemperature. Failure to correct this might
result in the system overheating and shutting down unexpectedly.
Server Fan Status Indicators
Each fan module has one status indicator (LED). The LEDs are located on the chassis midwall adjacent to and aligned with the fan modules and are visible when the top cover fan door is
open.
28Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 29
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
TABLE 2
Status
Indicator
Name
Fan StatusNoneAmber■ Off – The fan module is correctly installed and operating within specification.
Server Fan Status Indicators
IconColorState and Meaning
■ Amber – The fan module is faulty. The front TOP FAN LED and the front and rear
panel Service Required LEDs are also lit if the system detects a fan module fault.
Storage and Boot Drive Indicators
There are three status indicators (LEDs) on each drive.
TABLE 3
Status
Indicator Name
OK/ActivityGreen■ OFF – Power is off or installed drive is not recognized by the system.
Service
Required
Server Front Storage and Boot Disk Drive Indicators
IconColorState and Meaning
†
■ STEADY ON – The drive is engaged and is receiving power.
■ RANDOM BLINKING – There is disk activity. Status indicator LED blinks on and off to
indicate activity.
Amber■ OFF – The storage drive is operating normally.
■ STEADY ON – The system has detected a fault with the storage drive.
OK to RemoveBlue■ STEADY ON – The storage drive can be removed safely during a hot-plug operation.
■ OFF – The storage drive has not been prepared for removal.
†
If a rear backplane hard disk's OK/Activity LED shuts off, it may also indicate that a rear hard disk has spun down into a temporary "suspend" mode, in which
case the disk would again spin up when coming out of suspend mode.
Power Supply Status Indicators
There are two status indicators (LEDs) on each power supply. These indicators are visible from
the rear of the server.
TABLE 4
Status
Indicator
Name
AC OK/ DC
OK
Server Power Supply Indicators
IconColorState and Meaning
Green■ OFF – No AC power is present.
■ SLOW BLINK – Normal operation. Input power is within
specification. DC output voltage is not enabled.
Troubleshooting and Diagnostics29
Page 30
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Status
Indicator
Name
Service
Required
IconColorState and Meaning
■ STEADY ON – Normal operation. Input AC power and DC
output voltage are within specification.
Amber■ OFF – Normal operation. No service action is required.
■ STEADY ON – The power supply (PS) has detected a PS fan
failure, PS overtemperature, PS over current, or PS over or under
voltage.
Network Management Port Status Indicators
The server has one 10/100/1000BASE-T Ethernet management domain interface, labeled NET
MGT. There are two status indicators (LEDs) on this port. These indicators are visible from the
rear of the server.
TABLE 5
Status
Indicator
Name
ActivityTop rightGreen■ ON – Link up.
Link speedTop leftGreen■ Green ON – 1000BASE-T link.
Network Management Port Status Indicators
LocationColorState and Meaning
■ OFF – No link or down link.
■ FLASHING – Packet activity.
■ OFF – 10/100BASE-T link.
Ethernet Ports Status Indicators
The server has four Gigabit Ethernet ports (NET 3, NET 2, NET 1, and NET 0). There are two
status indicators on each port. These indicators (LEDs) are visible from the rear of the server.
TABLE 6
Status
Indicator Name
ActivityTop leftGreen■ ON – No function.
Link speedTop rightBicolored: Amber/
30Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Gigabit Ethernet Port Status Indicators
LocationColorState and Meaning
■ OFF – No activity.
■ FLASHING – Packet activity.
Green
■ OFF – 100BASE-T link.
■ Amber ON – 1000BASE-T link.
■ Green ON – 10GBBASE-T link.
■ FLASHING – No function.
Page 31
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Motherboard Status Indicators
The motherboard contains the following status indicators (LEDs).
■
DIMM Fault Status Indicators: Each of the 24 DIMM sockets on the motherboard has
an amber fault status indicator (LED) associated with it. If Oracle ILOM determines that
a DIMM is faulty, pressing the Fault Remind button on the motherboard I/O card signals
the service processor to light the fault LED associated with the failed DIMM. For more
information on DIMM fault status indicators and the location of the Fault Remind Button,
see “Servicing the DIMMs” on page 77.
■
Processor Fault Status Indicators: The motherboard includes a fault status indicator
(LED) adjacent to each of the two processor sockets. These LEDs indicate when a processor
fails. Pressing the Fault Remind button on the motherboard I/O card signals the service
processor to light the fault status indicators associated with the failed processors. For more
information on processor fault status indicators and the location of the Fault Remind Button,
see “Servicing Processors” on page 99.
■
Fault Remind Status Indicator: This status indicator (LED) is located next to the Fault
Remind button and is powered from the super capacitor that powers the fault LEDs on
the motherboard. This LED lights to indicate that the fault remind circuitry is working
properly in cases where no components have failed and, as a result, none of the component
fault LEDs illuminate. For more information on the fault remind status indicator and
the location of the Fault Remind Button, see “Using the Storage Server Fault Remind
Button” on page 80.
■
STBY PWRGD Status Indicator: This green status indicator (LED) is labeled STBY
PWRGD and is located on the motherboard near the rear of the server. This LED lights
to inform a service technician that the motherboard is receiving standby power from at
least one of the power supplies. This LED is provided to help prevent service actions on
the server internal components while the AC power cords are installed and power is being
supplied to the server.
Note - The service instructions for all internal components require that all AC power be
removed from the power supplies prior to the server top cover being removed.
Troubleshooting System Cooling Issues
Maintaining the proper internal operating temperature of the server is crucial to the health of the
server. To prevent server shutdown and damage to components, address over temperature and
hardware-related issues as soon as they occur. If your server has a temperature-related fault, the
cause of the problem might be:
Troubleshooting and Diagnostics31
Page 32
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
■
“External Ambient Temperature Too High” on page 32
■
“Airflow Blockage” on page 32
■
“Cooling Areas Compromised” on page 32
■
“Hardware Component Failure” on page 33
External Ambient Temperature Too High
Server component cooling relies on the movement of cool air through the server. The cool air is
pulled into the server from its external environment. If the ambient temperature of the server's
external environment is too high, cooling does not occur, and the internal temperature of the
server and its components increases. This can cause poor server performance or a failure of one
or more components.
Action: Check the ambient temperature of the server space against the environmental
specifications for the server. If the temperature is not within the required operating range,
remedy the situation immediately.
Prevention: Periodically check the ambient temperature of the server space to ensure that it
is within the required range, especially if you have made any changes to the server space (for
example, added additional servers). The temperature must be consistent and stable.
Airflow Blockage
The server cooling system uses fans to pull cool air in from the server front intake vents and
exhaust warm air out the server back panel vents. If the front or back vents are blocked, the
airflow through the server is disrupted and the cooling system fails to function properly causing
the server internal temperature to rise.
Action: Inspect the server front and back panel vents for blockage from dust or debris.
Additionally, inspect the server interior for improperly installed components or cables that can
block the flow of air through the server.
Prevention: Periodically inspect and clean the server vents using a vacuum cleaner. Ensure that
all components, such as cards, cable, fans, air baffles and dividers are properly installed. Never
operate the server without the top cover installed.
Cooling Areas Compromised
To function properly, the server has cooling areas that are maintained by an air baffle,
component filler panels, and the server top cover. These server components need to be in place
32Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 33
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
for the server to function as a sealed system. If internal cooling areas are compromised, the
server cooling system, which relies on the movement of cool air through the server, cannot
function properly, and the airflow inside the server becomes chaotic and non-directional.
Action: Inspect the server interior to ensure that the air baffle is properly installed. Ensure that
all external-facing slots (storage drive, DVD, PCIe) are occupied with either a component or a
component filler panel. Ensure that the server top cover is in place and sits flat and snug on top
of the server.
Prevention: When servicing the server, ensure that the air baffle is installed correctly and that
the server has no unoccupied external-facing slots. Never operate the server without the top
cover installed.
Hardware Component Failure
Components, such as power supplies and fan modules, are an integral part of the server cooling
system. When one of these components fails, the server internal temperature can rise. This rise
in temperature can cause other components to enter into an over-temperature state. Additionally,
some components, such as processors, might overheat when they are failing, which can also
generate an over-temperature event.
To reduce the risk related to component failure, power supplies and fan modules are installed
in pairs to provide redundancy. Redundancy ensures that if one component in the pair fails,
the other functioning component can continue to maintain the subsystem. For example, power
supplies serve a dual function; they provide both power and airflow. If one power supply fails,
the other functioning power supply can maintain both the power and the cooling subsystems.
Action: Investigate the cause of the over-temperature event, and replace failed components
immediately. For hardware troubleshooting information, see “Troubleshooting Server Hardware
Faults” on page 22.
Prevention: Component redundancy is provided to allow for component failure in critical
subsystems, such as the cooling subsystem. However, once a component in a redundant
system fails, the redundancy no longer exists, and the risk for server shutdown and component
failures increases. Therefore, it is important to maintain redundant systems and replace failed
components immediately.
Troubleshooting Power Issues
If your server does not power on, the cause of the problem might be:
■
“AC Power Connection” on page 34
Troubleshooting and Diagnostics33
Page 34
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
■
“Power Supplies (PSUs)” on page 34
■
“Top Cover” on page 35
AC Power Connection
The AC power cords are the direct connection between the server power supplies and the power
sources. The server power supplies need separate stable AC circuits. Insufficient voltage levels
or fluctuations in power can cause server power problems. The power supplies are designed to
operate at a particular voltage and within an acceptable range of voltage fluctuations (refer to ).
Action: Check that both AC power cords are connected to the server. Check that the correct
power is present at the outlets and monitor the power to verify that it is within the acceptable
range. You can verify proper connection and operation by checking the power supply (PS)
indicator panels, which are located at the back of the server on the power supplies. Lit green AC
OK indicators show a properly functioning power supply. An amber AC OK indicator indicates
that the AC power to the power supply is insufficient.
Prevention: Use the AC power cord Velcro retaining clips and position the cords to minimize
the risk of accidental disconnection. Ensure that the AC circuits that supply power to the server
are stable and not overburdened.
Power Supplies (PSUs)
The server power supplies (PSUs) provide the necessary server voltages from the AC power
outlets. If the power supplies are inoperable, unplugged, or disengaged from the internal
connectors, the server cannot power on.
Note - The Velcro straps shown in the following figure should be used to secure the power cord
connectors to the rear of the power supplies. The Velcro retaining straps minimize the risk of
accidental disconnection.
34Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 35
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Action: Check that the AC cables are connected to both power supplies. Check that the power
supplies are operational (the PSU indicator panel should have a lit green AC OK indicator).
Ensure that the power supply is properly installed. A power supply that is not fully engaged
with its internal connector does not have power applied and does not have a lit green AC OK
indicator.
Prevention: When a power supply fails, replace it immediately. To ensure redundancy, the
server has two power supplies. This redundant configuration prevents server downtime, or
an unexpected shutdown, due to a failed power supply. The redundancy allows the server to
continue to operate if one of the power supplies fails. However, when a server is being powered
by a single power supply, the redundancy no longer exists, and the risk for downtime or an
unexpected shutdown increases. When installing a power supply, ensure that it is fully seated
and engaged with its connector inside the drive bay. A properly installed power supply has a lit
green AC OK indicator.
Top Cover
The server top cover is an important component for a properly functioning server. The top cover
is vital to the server cooling subsystem because it helps to maintain the pressure areas within
the server, and the top cover also protects against accidental exposure to hazardous voltages
Troubleshooting and Diagnostics35
Page 36
Troubleshooting With Diagnostic Tools
and damage to internal components. For these reasons, the server top cover is interlocked to
the server power. The interlock is created by a switch that is mounted inside the server on the
housing for the power supply. The switch engages when the cover is installed, allowing power
to be applied to the server. When the cover is not installed, the server cannot power on. If the
server is powered on when the cover is removed, power is immediately switched off.
Action: If the server does not power on, check that switch is intact and properly aligned. Ensure
that the server top cover is in place and sits flat and snug on top of the server. Ensure that the
interlock switch components have not been damaged, removed, or misaligned. For instructions
for properly installing the top cover, see “Install the Server Top Cover” on page 161.
Prevention: After removing the top cover, take care that it does not get bent. When servicing
the server, take care that the internally mounted interlock switch component does not get
damaged or misaligned. Never operate the server without the top cover installed.
Managing Server Hardware Faults Through the
Oracle ILOM Fault Management Shell
The Oracle ILOM Fault Management Shell enables you to view and manage fault activity on a
managed servers and other types of devices.
For more information about how to use the Oracle ILOM Fault Management Shell, refer to the
Oracle ILOM User's Guide for System Monitoring and Diagnostics Firmware Release 3.2.x in
the Oracle Integrated Lights Out Manager (ILOM) 3.2 Documentation Library at http://www.
oracle.com/goto/ilom/docs.
Troubleshooting With Diagnostic Tools
The server and its accompanying software and firmware contain diagnostic tools and features
that can help you isolate component problems, monitor the status of a functioning system,
and exercise one or more subsystem to disclose more subtle or intermittent hardware-related
problems.
Each diagnostic tool has its own specific strength and application. Review the tools listed in
this section and determine which tool might be best to use for your situation. Once you have
determined the tool to use, you can access it locally, while at the server, or remotely.
■
“Diagnostic Tools” on page 37
36Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 37
Troubleshooting With Diagnostic Tools
■
“Diagnostic Tool Documentation” on page 38
Diagnostic Tools
The selection of diagnostic tools available for your server range in complexity from a
comprehensive validation test suite (Oracle VTS) to a chronological event log (Oracle ILOM
System Log). The selection of diagnostic tools also includes standalone software packages,
firmware-based tests, and hardware-based LED indicators.
The following table summarizes the diagnostic tools that you can use when troubleshooting or
monitoring your server.
Diagnostic Tool TypeWhat It DoesAccessibilityRemote Capability
Oracle ILOMSP firmwareMonitors environmental
Preboot MenuSP firmware Enables you to restore some
Hardware-based
LED indicators
Power-On SelfTest (POST)
U-BootSP firmwareInitializes and test aspects of
UEFI
Diagnostics
Hardware
and SP
firmware
Host
firmware
SP firmware The UEFI diagnostics can
condition and component
functionality sensors, generates
alerts, performs fault isolation,
and provides remote access.
Oracle ILOM default settings
when Oracle ILOM is not
accessible.
Indicates status of overall
system and particular
components.
Tests core components of
system: CPUs, memory,
and motherboard I/O bridge
integrated circuits.
the service processor (SP) prior
to booting the Oracle ILOM SP
and operating system. Tests SP
memory, SP, network devices
and I/O devices.
test and detect problems on
Can function on either
standby power mode or
full power mode and is
not OS dependent.
Can function on standby
power and when
operating system is not
running.
Available when system
power is available.
Runs on startup.
Available when the
operating system is not
running.
Can function on standby
power and when
operating system is not
running.
You can use either
the Oracle ILOM
Designed for
remote and local
access.
Local, but remote
serial access is
possible if the
SP serial port is
connected to a
network-accessible
terminal server.
Local, but sensor
and indicators are
accessible from
Oracle ILOM
web interface or
command-line
interface (CLI).
Local, but can be
accessed through
Oracle ILOM
Remote System
Console Plus.
Local, but remote
serial access is
possible if the
SP serial port is
connected to a
network-accessible
terminal server.
Remote access
through Oracle
Troubleshooting and Diagnostics37
Page 38
Attaching Devices to the Server
Diagnostic Tool TypeWhat It DoesAccessibilityRemote Capability
Oracle Linux
commands
Diagnostic Tool Documentation
The following table identifies where you can find more information about diagnostic tools.
Diagnostic ToolDocumentationLocation
Oracle ILOMOracle Integrated Lights Out Manager 3.2
Preboot MenuOracle x86 Servers Diagnostics Guide
System indicators and
sensors
U-Boot or UEFI DiagsOracle x86 Servers Diagnostics Guide
Oracle VTSOracle VTS software and documentation
all processors, memory, disk
drives, and network ports.
Operating
system
software
Displays various kinds of
system information.
Documentation Library
This document“Troubleshooting Using the Server
web interface or the
command-line interface
(CLI) to run UEFI
diagnostics.
Requires operating
system.
http://www.oracle.com/goto/ilom/docs
http://www.oracle.com/goto/
x86admindiag/docs
Front and Rear Panel Status Indicators
” on page 26
http://www.oracle.com/goto/
x86admindiag/docs
http://docs.oracle.com/cd/E19719-01/
index.html
ILOM Remote
System Console
Plus.
Local, and over
network.
Attaching Devices to the Server
The following sections contain procedural information for attaching devices to the server so you
can access diagnostic tools when troubleshooting and servicing the server:
■
“Attach Devices to the Server” on page 39
■
“Rear Panel Connector Locations” on page 39
■
“Configuring Serial Port Sharing” on page 41
■
“Ethernet Port Boot Order and Device Naming” on page 42
■
“Rear Panel Pinhole Switches” on page 43
38Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 39
Attach Devices to the Server
Attach Devices to the Server
This procedure explains how to connect devices to the server (remotely and locally), so that you
can interact with the service processor (SP) and the server console.
1.
Connect an Ethernet cable to the Gigabit Ethernet (NET) connectors as needed
for OS support.
See “Rear Panel Connector Locations” on page 39.
2.
To connect to the service processor's Oracle ILOM over the network, connect an
Ethernet cable to the Ethernet port labeled NET MGT.
See “Rear Panel Connector Locations” on page 39.
3.
To access the Oracle ILOM command-line interface (CLI) locally using the
management port, connect a serial null modem cable to the RJ-45 serial port
labeled SER MGT.
See “Rear Panel Connector Locations” on page 39.
4.
To interact with the system console locally, connect a mouse and keyboard to
the USB connectors and a monitor to the DB-15 video connector.
See “Rear Panel Connector Locations” on page 39.
Rear Panel Connector Locations
The following illustration shows and describes the locations of the back panel connectors. Use
this information to set up the server, so that you can access diagnostic tools and manage the
server during service.
Troubleshooting and Diagnostics39
Page 40
Attach Devices to the Server
No. Cable Port or Expansion SlotDescription
1Power supply 0 input power
The server has two power supply connectors, one for each power supply.
Power supply 1 input power
Do not attach power cables to the power supplies until you have finished
connecting the data cables to the server. The server goes into standby power
mode, and the Oracle ILOM service processor initializes when the AC
power cables are connected to the power source. System messages might
be lost after 60 seconds if the server is not connected to a terminal, PC, or
workstation.
Note - Oracle ILOM will signal a fault on any installed power supply that
is not connected to an AC power source, since it might indicate a loss of
redundancy.
2Network management port (NET MGT)The service processor NET MGT port is the optional connection to the
Oracle ILOM service processor. The NET MGT port is configured by default
to use Dynamic Host Configuration Protocol (DHCP). The service processor
NET MGT port uses an RJ-45 cable for a 10/100/1000BASE-T connection.
3Serial management port (SER MGT)The service processor SER MGT port uses an RJ-45 cable and is the default
connection to the Oracle ILOM service processor. This port supports local
connections to the server and only recognizes Oracle ILOM command-line
interface (CLI) commands. Typically you connect a terminal or terminal
emulator to this port.
Note - This port does not support network connections.
4Ethernet ports (NET 3, NET 2, NET 1, NET 0)The four 10-Gigabit Ethernet ports enable you to connect the system to the
network.
5, 6 USB ports (USB 0, USB 1)The two USB ports support hot-plugging. You can connect and disconnect
USB cables and peripheral devices while the server is running without
affecting system operations.
40Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 41
Assign Serial Port Output Using the CLI
No. Cable Port or Expansion SlotDescription
7Video port (VGA)Use a 15-pin video cable to connect a VGA video device to the server.
Optionally, you can connect to the VGA port when installing the operating
system.
Configuring Serial Port Sharing
By default, the SP console (SER MGT) port sends serial port output from the server. Using
Oracle ILOM, you can specify that the host console (COM1) be assigned as owner of the server
serial port output.
You should set up the network on the SP before attempting to change the serial port owner to
the host server. If the network is not set up, and you switch the serial port owner to the host
server, you will be unable to connect using the CLI or web interface to change the serial port
owner back to the SP. To return the serial port owner setting to the SP, you need to restore
access to the serial port on the server. For more details about restoring access to the server
port on your server, see the Oracle Integrated Lights Out Manager (ILOM) 3.2 Documentation
Library at http://www.oracle.com/goto/ilom/docs.
You can assign serial port output using either the Oracle ILOM web interface or the commandline interface (CLI). For instructions, see the following sections:
■
“Assign Serial Port Output Using the CLI” on page 41
■
“Assign Serial Port Output Using the Web Interface” on page 42
Assign Serial Port Output Using the CLI
1.
Open an SSH session and at the command line log in to the SP Oracle ILOM CLI.
Log in as a user with root or administrator privileges. For example:
ssh root@ipaddress
Where ipaddress is the IP address of the server SP.
For more information, see “Accessing Oracle ILOM” in the Oracle X5 Series ServersAdministration Guide.
The Oracle ILOM CLI prompt (->) appears.
2.
To set the serial port owner, type:
-> set /SP/serial/portsharing owner=host
Troubleshooting and Diagnostics41
Page 42
Assign Serial Port Output Using the Web Interface
Note - The serial port sharing value by default is owner=SP.
3.
Connect a serial host to the server.
Assign Serial Port Output Using the Web Interface
1.
Log in to the SP Oracle ILOM web interface.
To log in, open a web browser and direct it using the IP address of the server SP. Log in as root
or a user with administrator privileges. See “Accessing Oracle ILOM” in the Oracle X5 SeriesServers Administration Guide.
The Summary Information screen appears.
2.
In the Oracle ILOM web interface, select ILOM Administration → Connectivity
from the navigation menu on the left side of the screen.
3.
Select the Serial Port tab.
The Serial Port Settings page appears.
Note - The serial port sharing setting by default is Service Processor.
4.
In the Serial Port Settings page, select HostServer as the serial port owner.
5.
Click Save for the changes to take effect.
6.
Connect a serial host to the server.
Ethernet Port Boot Order and Device Naming
This section contains information about the boot order and device naming for the four 10Gigabit Ethernet ports on the rear panel of the server. For location information, see “Rear Panel
Connector Locations” on page 39. From right to left, the ports are numbered NET 0 to NET
3.
Ethernet Port Boot Order
The order in which the BIOS detects the Ethernet ports during server boot is listed below.
42Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 43
Assign Serial Port Output Using the Web Interface
Note - You can change the boot priority using the Boot Device Priority screen available in the
Boot menu of the BIOS Setup Utility.
1. NET 0
2. NET 1
3. NET 2
4. NET 3
Ethernet Port Device Naming
Note - Naming used by the interfaces might vary from that listed below depending on which
devices are installed in the system.
The device naming for the Ethernet interfaces is reported differently by different interfaces and
operating systems. The following table shows BIOS (physical) and operating system (logical)
naming conventions used for each interface. These naming conventions might vary depending
on conventions of your operating system and which devices are installed in the server.
PortBIOSSolarisLinuxWindows (example default
NET 03A00ixgbe 0eth 0Ethernet
NET 13A01ixgbe 1eth 1Ethernet 2
NET 28200ixgbe 2eth 2Ethernet 3
NET 38201ixgbe 3eth 3Ethernet 4
names, see note below)
Rear Panel Pinhole Switches
This section shows the location of the rear panel pinhole switches.
Troubleshooting and Diagnostics43
Page 44
Getting Help
Figure LegendDescription
1SP Reset
2Host Warm Reset
3Non-maskable interrupt
†
Oracle Service use only.
Getting Help
The following sections describe how to get additional help to resolve server-related problems.
■
“Contacting Support” on page 45
■
“Locating the Chassis Serial Number” on page 45
(NMI)
†
44Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 45
Getting Help
Contacting Support
If the troubleshooting procedures in this chapter fail to solve your problem, use the following
table to collect information that you might need to communicate to support personnel.
System Configuration
Information Needed
Service contract number
System model
Operating environment
System serial number
Peripherals attached to the system
Email address and phone number
for you and a secondary contact
Street address where the system is
located
Superuser password
Summary of the problem and
the work being done when the
problem occurred
Other Useful Information
IP address
Server name (system host name)
Network or internet domain name
Proxy server configuration
Your Information
Related Information
■
“Locating the Chassis Serial Number” on page 45
Locating the Chassis Serial Number
You might need to have your server serial number when you ask for service on your system.
Record this number for future use. Use one of the following resources or methods to locate your
server serial number.
Troubleshooting and Diagnostics45
Page 46
Getting Help
■
The serial number is located on the Radio-frequency Identification (RFID) label on the
bottom left side of the front panel bezel, below the general status LEDs.
For illustrations of the storage server front panel, see “About Controls and
Connectors” on page 14.
■
The serial number is recorded on the service label that is attached to the top cover of the
system.
■
The serial number is recorded on the yellow Customer Information Sheet (CIS) that is
attached to your storage server packaging.
■
Using Oracle ILOM:
■
From the command-line interface (CLI), type the command: show/System.
■
From the web interface, view the serial number on the System Information screen.
46Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 47
Preparing for Service
These sections describe safety considerations and provide prerequisite procedures and
information about replacing components within the server.
DescriptionLinks
Understand the safety precautions, understand the
safety symbols, and take ESD precautions prior to
removing or installing parts in the server.
Understand the key identity properties auto-update
feature.
Assemble the required tools.“Required Tools” on page 50
Before working with components within the server,
power down the server and prepare for servicing.
Related Information
■
“Returning the Server to Operation” on page 161
“Safety Precautions” on page 47
“Safety Symbols” on page 48
“Electrostatic Discharge Safety” on page 48
“FRU Key Identity Properties (KIP) Automated
Update” on page 49
“Preparing the Server for Component
Replacement” on page 50
Safety Precautions
For your protection, observe the following safety precautions when setting up your equipment:
■
Follow all standard cautions, warnings, and instructions marked on the equipment and
described in Oracle Exadata Storage Server X5-2 High Capacity Safety and ComplianceGuide and Important Safety Information for Oracle's Hardware Systems.
■
Ensure that the voltage and frequency of your power source match the voltage and
frequency inscribed on the equipment electrical rating label.
■
Follow the electrostatic discharge safety practices as described in “Electrostatic Discharge
Safety” on page 48.
■
Disconnect both power supply cords before servicing components.
Preparing for Service47
Page 48
Safety Symbols
Related Information
■
“Safety Symbols” on page 48
■
“Electrostatic Discharge Safety” on page 48
Safety Symbols
The following symbols might appear in this document. Note their meanings.
Caution - Risk of personal injury or equipment damage. To avoid personal injury or
equipment damage, follow the instructions.
Caution - Hot surface. Avoid contact. Surfaces are hot and might cause personal injury if
touched.
Caution - Hazardous voltages are present. To reduce the risk of electric shock and danger to
personal health, follow the instructions.
Related Information
■
“Safety Precautions” on page 47
■
“Electrostatic Discharge Safety” on page 48
Electrostatic Discharge Safety
Devices that are sensitive to electrostatic discharge (ESD), such as the motherboard, PCIe cards,
drives, processors, and memory DIMMs require special handling.
Caution - The boards and drives contain electronic components that are extremely sensitive to
static electricity. Ordinary amounts of static electricity from clothing or the work environment
can destroy components. Do not touch the components along their connector edges.
Do the following when handling ESD-sensitive components:
■
Use an antistatic wrist strap.
48Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Wear an antistatic wrist strap and use an antistatic mat when handling components such as
drive assemblies, boards, or cards. When servicing or removing server components, attach
an antistatic strap to your wrist and then to a metal area on the chassis. Then disconnect
the power cords from the server. Following this practice equalizes the electrical potentials
between you and the server.
Note - An antistatic wrist strap is not included in the Accessory Kit for the server. However,
antistatic wrist straps are included with options and components.
■
Use an antistatic mat.
Place ESD-sensitive components such as the motherboard, memory DIMMS, and other
printed circuit board (PCB) cards on an antistatic mat. The following items can be used as
an antistatic mat:
■
An antistatic bag used to wrap an Oracle replacement part
■
An Oracle ESD mat (orderable item)
■
A disposable ESD mat (shipped with some replacement parts or optional system
components)
Related Information
■
“Safety Precautions” on page 47
■
“Safety Symbols” on page 48
■
“Preparing the Server for Component Replacement” on page 50
Oracle ILOM includes a key identity properties (KIP) auto-update feature that ensures product
information that is used for service entitlement and warranty coverage is accurately maintained
by the server at all times, including during hardware replacement activities.
The KIP includes the server product name, product part number (PPN), and product serial
number (PSN). The KIP are stored in the FRUID (field-replaceable unit identifiers) container of
the three server FRUs that are designated quorum members.
The quorum members include:
■
Disk backplane (DBP), designated as a primary quorum member.
Preparing for Service49
Page 50
Required Tools
■
Motherboard (MB), designated as a backup quorum member.
■
Power supply (PS), designated as a backup quorum member.
When a server FRU that contains the KIP is removed and a replacement component is installed,
the KIP of the replacement component is programmed by Oracle ILOM to contain the same
KIP as the other two components.
Note - Only one of the quorum members can be replaced at a time. Automated updates can only
be completed when two of the three quorum members contain matching key identity properties.
Related Information
■
“Servicing Power Supplies” on page 71
■
“Servicing the Storage Drive Backplanes” on page 112
■
“Servicing the Motherboard Assembly” on page 145
Required Tools
The server can be serviced with the following tools:
■
Antistatic wrist strap
■
Antistatic mat
■
No. 2 Phillips screwdriver
Related Information
■
“Preparing for Service” on page 47
■
“Servicing Components That Do Not Require Server Power-Off” on page 61
■
“Servicing Components That Require Server Power-Off” on page 77
Preparing the Server for Component Replacement
Note - When you are replacing the storage drives or power supplies, not all of these procedures
are necessary. See the replacement procedures for those components for more information.
50Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 51
Preparing the Server for Component Replacement
Before you can remove and install components that are inside the server, you must perform the
procedures in the following sections:
■
“Powering Down the Server” on page 51
■
“Disconnect Cables From the Server” on page 54
■
“Extend the Server to the Maintenance Position” on page 55
■
“Remove the Server From the Rack” on page 56
■
“Take Antistatic Measures” on page 57
■
“Remove the Server Top Cover” on page 58
■
“Remove the Fan Assembly Door From the Server” on page 59
Related Information
■
“Returning the Server to Operation” on page 161
Powering Down the Server
Determine how you want to power down the server by reviewing the options in the following
table.
DescriptionLink
If the server is not responding, or you must shut down the
server quickly, perform an immediate shutdown.
■ “Power Down the Server for
Immediate Shutdown Using the Power
Button” on page 52
■ “Power Down the Server for Immediate
Shutdown Using the Oracle CLI” on page 52
■ “Power Down the Server for Immediate
Shutdown Using the Oracle ILOM Web
Interface” on page 53
Related Information
■
“Power On the Server” on page 167
Preparing for Service51
Page 52
Power Down the Server for Immediate Shutdown Using the Power Button
Power Down the Server for Immediate Shutdown Using the
Power Button
Caution - This procedure quickly forces the server main power off. You might corrupt your
system data during an immediate power-down, so use this procedure to power down the server
only after attempting the graceful power-down procedure.
1.
Press and hold the Power button for four seconds to force the main power off
and to enter standby power mode.
When main power is off, the Power/OK LED on the front panel begins flashing, indicating that
the server is in standby power mode. See “Server System-Level Status Indicators” on page 27.
2.
Disconnect the power cords and data cables from the server.
See “Disconnect Cables From the Server” on page 54.
Caution - When you power down the server using the Power button, the server enters standby
power mode. Power is still directed to the service processor remote management subsystem and
power supply fans. To completely power off the server, you must disconnect the power cords
from the power supplies.
Related Information
■
“Server System-Level Status Indicators” on page 27
■
“Power On the Server” on page 167
Power Down the Server for Immediate Shutdown Using the
Oracle CLI
Caution - This procedure quickly forces the server main power off. You might corrupt your
system data during an immediate power-down, so use this procedure to power down the server
only after attempting the graceful power-down procedure.
1.
Log in to the Oracle ILOM command-line interface (CLI) using an Administrator
account.
Oracle ILOM displays the default command prompt (->), indicating that you have successfully
logged in to Oracle ILOM.
52Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 53
Power Down the Server for Immediate Shutdown Using the Oracle ILOM Web Interface
2.
From the CLI prompt, type the following command:
-> stop -f /System
The server powers down immediately.
3.
Disconnect the power and data cables from the server.
See “Disconnect Cables From the Server” on page 54.
Caution - When you power down the server using Oracle ILOM, the server enters standby
power mode. Power is still directed to the service processor remote management subsystem and
power supply fans. To completely power off the server, you must disconnect the power cords
from the power supplies.
Related Information
■
“Power Down the Server for Immediate Shutdown Using the Power Button” on page 52
■
“Power Down the Server for Immediate Shutdown Using the Oracle ILOM Web
Interface” on page 53
■
“Power On the Server” on page 167
Power Down the Server for Immediate Shutdown Using the
Oracle ILOM Web Interface
Caution - This procedure quickly forces the server main power off. You might corrupt your
system data during an immediate power-down, so use this procedure to power down the server
only after attempting the graceful power-down procedure.
1.
Log in to the Oracle ILOM web interface using an Administrator account.
The Oracle ILOM web interface System Information → Summary Information page appears.
2.
In the left pane, click Host Management → Power Control, and click Immediate
Power Off in the Select Action list.
3.
Click Save, and then click OK.
The server powers down immediately.
4.
Disconnect the power and data cables from the server.
See “Disconnect Cables From the Server” on page 54.
Preparing for Service53
Page 54
Disconnect Cables From the Server
Caution - When you power down the server using Oracle ILOM, the server enters standby
power mode. Power is still directed to the service processor remote management subsystem and
power supply fans. To completely power off the server, you must disconnect the power cords
from the power supplies.
Related Information
■
“Power Down the Server for Immediate Shutdown Using the Power Button” on page 52
■
“Power Down the Server for Immediate Shutdown Using the Oracle CLI” on page 52
■
“Power On the Server” on page 167
Disconnect Cables From the Server
Caution - The system supplies standby power to the circuit boards even when the system is
powered off.
1.
Depending on the components that you are servicing, either extend the server to
the maintenance position, or remove the server from the rack.
2.
Power down the server.
See “Powering Down the Server” on page 51.
3.
Disconnect the power cords from the rear of the server.
4.
Disconnect all data cables from the rear of the server.
5.
Detach and remove cables from the cable management device.
Related Information
■
“Rear Panel Components and Cable Connections” on page 15
■
“Powering Down the Server” on page 51
■
“Extend the Server to the Maintenance Position” on page 55
■
“Remove the Server From the Rack” on page 56
■
“Reconnect Power and Data Cables” on page 167
54Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 55
Extend the Server to the Maintenance Position
Extend the Server to the Maintenance Position
The following components can be serviced with the server in the maintenance position:
■
Storage drives
■
Fan modules
■
Power supplies
■
DVD drive
■
DDR4 DIMMs
■
PCIe cards and cables
■
SAS cables
■
NVMe cables
■
Internal USB drives
■
System battery
■
Processors
■
Disk backplanes
■
Front indicator modules
■
Motherboard
1.
To prevent the rack from tipping forward when the server is extended, extend all
rack anti-tilt devices.
For instructions for stabilizing the rack, refer to “Stabilize the Rack for Installation” in Oracle
Exadata Storage Server X5-2 High Capacity Installation Guide.
2.
Verify that no cables will be damaged or will interfere when the server is
extended.
Although the cable management arm (CMA) that is supplied with the server is hinged to
accommodate extending the server, you should ensure that all cables and cords are free from
obstruction by other devices.
3.
From the front of the server, open and hold the left and right release latch covers
in the open position.
When in an open position, the release latch covers engage the slide-rail release latches [1].
Preparing for Service55
Page 56
Remove the Server From the Rack
Caution - Deploy any rack anti-tilt mechanism before releasing the slide-rail release latches.
4.
While the release latch covers are in the open position, slowly pull the server
forward until the slide-rails latch into a locked position [2].
The server is now in the extended maintenance position.
Related Information
■
“Disconnect Cables From the Server” on page 54
■
“Remove the Server From the Rack” on page 56
■
“Reinstall the Server in the Rack” on page 164
Remove the Server From the Rack
Caution - The server weighs approximately 63 pounds (28.5 kg). Two people are required to
unmount and carry the chassis.
1.
Disconnect all the cables and power cords from the server.
56Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 57
Take Antistatic Measures
2.
Remove the cable management arm (CMA).
For instructions for removing the CMA, see the “Remove the Cable Management Arm” in
Oracle Exadata Storage Server X5-2 High Capacity Installation Guide.
3.
Extend the server to the maintenance position.
See “Extend the Server to the Maintenance Position” on page 55.
4.
From the front of the server, pull the green slide-rail release tabs toward the front
of the server, and pull the server out of the rack until is free of the rack rails [1
and 2].
A slide-rail release tab is located on each slide-rail.
5.
Set the server on a sturdy work surface.
Related Information
■
“Disconnect Cables From the Server” on page 54
■
“Extend the Server to the Maintenance Position” on page 55
■
“Reinstall the Server in the Rack” on page 164
Take Antistatic Measures
1.
Prepare an antistatic surface on which to set parts during removal and
installation.
Place electrostatic discharge (ESD-sensitive components on an antistatic mat. The following
items can be used as an antistatic mat:
Preparing for Service57
Page 58
Remove the Server Top Cover
■
An antistatic bag used to wrap a replacement part
■
An Oracle ESD mat (orderable item)
■
A disposable ESD mat (shipped with some replacement parts or optional system
components)
2.
Attach an antistatic wrist strap.
When servicing or removing server components, attach an antistatic strap to your wrist and then
to a metal area on the chassis. Then perform procedures for servicing the server.
Note - An antistatic wrist strap is not included in the Accessory Kit for the server. However,
antistatic wrist straps are included with options and components.
Related Information
■
“Extend the Server to the Maintenance Position” on page 55
■
“Remove the Server From the Rack” on page 56
■
“Remove the Server Top Cover” on page 58
■
“Remove the Fan Assembly Door From the Server” on page 59
■
“Remove Antistatic Measures” on page 164
Remove the Server Top Cover
Caution - If you remove the top cover without first removing the AC power cords, then not
only will the server host shut down, but also this condition will be flagged as a system fault
(that is, the fault LEDs will light).
Note - Servicing some components requires that the top cover be removed.
1.
Ensure that AC power cords are disconnected from the server power supplies.
2.
Unlatch the top cover [1].
58Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 59
Remove the Fan Assembly Door From the Server
Press the green release button on the top of the server cover.
3.
While pressing the top cover release button, slide the cover toward the rear of
the server [2].
4.
If necessary, open the fan assembly door.
5.
Lift up and remove the top cover [2].
Related Information
■
“Take Antistatic Measures” on page 57
■
“Remove the Fan Assembly Door From the Server” on page 59
■
“Install the Server Top Cover” on page 161
Remove the Fan Assembly Door From the Server
Before you service the server's storage drive backplane, you should first remove the server's
front fan assembly door.
1.
Ensure that the AC power cords are disconnected from the server power
supplies.
Preparing for Service59
Page 60
Remove the Fan Assembly Door From the Server
2.
Use a No. 2 Phillips screwdriver to remove the two screws on each side of the
chassis and the three screws on top of the chassis. [1].
3.
Slide the fan assembly door toward the rear of the server, and lift and remove the
door from the chassis [2].
Related Information
■
“Take Antistatic Measures” on page 57
■
“Remove the Server Top Cover” on page 58
■
“Install the Server Top Cover” on page 161
■
“Install the Fan Assembly Door” on page 163
60Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 61
Servicing Components That Do Not Require
Server Power-Off
These sections describe how to service components that do not require you to power
off the server. For more information about server components, see “Illustrated Parts
Breakdown” on page 16.
DescriptionLinks
Service storage and boot drives.“Servicing Storage Drives and Rear Drives” on page 61
Service fan modules.“Servicing Fan Modules” on page 66
Service power supplies.“Servicing Power Supplies” on page 71
Related Information
■
“Servicing Components That Require Server Power-Off” on page 77
Servicing Storage Drives and Rear Drives
These sections describe how to remove and install storage drives.
■
“Storage Drives Hot-Plug Conditions” on page 62
■
“Remove a Storage Drive” on page 62
■
“Install a Storage Drive” on page 65
Related Information
■
“Servicing SAS Cables” on page 118
Servicing Components That Do Not Require Server Power-Off61
Page 62
Remove a Storage Drive
Storage Drives Hot-Plug Conditions
The hard disk drives (HDDs) in the server are hot-pluggable, but this capability depends on how
the drives are configured. To hot-plug a drive you must be able to take the drive offline before
you can remove it. When you take the drive offline, you prevent any application from accessing
the drive and remove the logical software links to the drive.
Note - Replacing a drive does not require removing the server from a rack.
Related Information
■
■
■
“Storage and Boot Drive Indicators” on page 29
“Remove a Storage Drive” on page 62
“Install a Storage Drive” on page 65
Remove a Storage Drive
1.
Prepare the system for the drive removal.
Refer to “Take Antistatic Measures” on page 57.
2.
Type the operating system commands required to stop the server from using the
drive.
The exact commands required depend on the configuration of your drives. Unmount file
systems, or issue RAID commands as needed.
Note - The blue OK to Remove status indicator LED on the drive may or may not light, as
support for this varies depending on the operating system in use.
3.
Identify the location of the drive that you want to remove.
62Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 63
Remove a Storage Drive
The following illustrations show the location of the drives and the internal system software
designations for the drives. For information on storage drive status indicators (LEDs), refer to
“Storage and Boot Drive Indicators” on page 29.
Figure
Legend
1Storage drive 8 (HDD)7Storage drive 10 (HDD)
2Storage drive 4 (HDD)8Storage drive 6 (HDD)
3Storage drive 0 (HDD)9Storage drive 2 (HDD)
4Storage drive 9 (HDD)10Storage drive 11 (HDD)
5Storage drive 5 (HDD)11Storage drive 7 (HDD)
6Storage drive 1 (HDD)12Storage drive 3 (HDD)
DescriptionFigure
Legend
Description
Servicing Components That Do Not Require Server Power-Off63
Page 64
Remove a Storage Drive
4.
On the drive you plan to remove, push the latch release button to open the drive
latch [1, 2].
Figure
Description
Legend
1Pressing the latch release button.
2Opening the latch.
Caution - The latch is not an ejector. Do not open the latch too far to the right. Doing so can
damage the latch.
64Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 65
5.
Grasp the latch and pull the drive out of the drive slot.
Install a Storage Drive
6.
Consider your next steps:
■
If you are replacing the drive, continue to “Install a Storage Drive” on page 65.
Caution - Whenever you remove a storage drive, you should replace it with another storage
drive; otherwise, the server might overheat due to improper airflow.
Related Information
■
“Storage and Boot Drive Indicators” on page 29
■
“Storage Drives Hot-Plug Conditions” on page 62
■
“Install a Storage Drive” on page 65
Install a Storage Drive
1.
Remove the replacement drive from its packaging, and place the drive on an
antistatic mat.
2.
If necessary, remove the drive filler panel.
Servicing Components That Do Not Require Server Power-Off65
Page 66
Servicing Fan Modules
The system might have filler panels covering unoccupied drive slots.
3.
Align the replacement drive with the drive slot.
The drive is physically addressed according to the slot in which it is installed. It is important to
install a replacement drive in the same slot as the drive that was removed.
4.
Slide the drive into the slot until the drive is fully seated.
5.
Close the drive latch to lock the drive in place.
6.
Perform administrative procedures to reconfigure the drive.
Related Information
■
“Storage and Boot Drive Indicators” on page 29
■
“Storage Drives Hot-Plug Conditions” on page 62
■
“Remove a Storage Drive” on page 62
Servicing Fan Modules
The fan modules are located in the front of the server. See the following procedures:
66Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 67
Remove a Fan Module
■
“Remove a Fan Module” on page 67
■
“Install a Fan Module” on page 70
Related Information
■
“Remove the Server Top Cover” on page 58
Remove a Fan Module
You do not have to power off the server to service fan modules.
You should not begin this procedure unless you have a replacement fan module and you are
ready to install it right away.
1.
Extend the server to the maintenance position.
See “Extend the Server to the Maintenance Position” on page 55.
2.
To access the fan modules, open the top cover fan assembly door.
Caution - Close the top cover fan assembly door within 60 seconds to maintain adequate
airflow to properly cool the server. Leaving the door open for more than 60 seconds, while the
server is running, will cause the server to automatically shut down.
3.
Identify the faulty fan module.
Servicing Components That Do Not Require Server Power-Off67
Page 68
Remove a Fan Module
Each fan module has a fan status indicator (LED) that is located next to the module. If the LED
is amber, the fan has failed. The location of the fan status LED is shown in the following figure.
LED Color and State Meaning
Amber – The fan module is faulty. The front Top Fan LED and the front and rear panel Service Required LEDs are
also lit if the system detects a fan module fault.
Green – Indicates that the fan module is correctly installed and operating within specification.
68Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 69
Remove a Fan Module
4.
Use a No. 2 Phillips screwdriver to loosen the captive screw that secures the
faulty fan module in the chassis [1].
5.
Grasp both the captive screw and the opposite end of the module and lift the fan
module straight up and out of the chassis, and set it aside on an antistatic mat
[2].
Caution - When removing a fan module, do not rock it back and forth. Rocking the fan module
can cause damage to the motherboard connectors.
Caution - Do not service any other components in the fan compartment unless the system is
shut down and the power cords are removed.
6.
Consider your next step:
■
If you removed the fan assembly as part of another procedure, return to that procedure.
■
Otherwise, continue to “Install a Fan Module” on page 70.
Servicing Components That Do Not Require Server Power-Off69
Page 70
Install a Fan Module
1.
2.
Related Information
■
“Install a Fan Module” on page 70
Install a Fan Module
Remove the replacement fan module from its packaging, and place it on an
antistatic mat.
With the top cover fan assembly door open, install the replacement fan module
into the server [1].
The fan modules are keyed to ensure that they are installed in the correct orientation.
3.
Press down on the fan module and apply firm pressure to fully seat the fan
module.
70Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 71
Servicing Power Supplies
4.
Verify that the fan module status LED on the replacement fan module is lit green.
5.
Using your thumb and forefinger, tighten the captive screw to secure the fan
module to the chassis. Then use a No. 2 Phillips screwdriver to tighten the screw
an additional 1/4 turn to secure the fan module to the chassis [2].
6.
Close the top cover fan assembly door.
7.
Verify that the Top Fan Fault LED on the front of the server and the Service
Required LEDs on the front and rear of the server are extinguished.
See “Server System-Level Status Indicators” on page 27 for more information about identifying
and interpreting system status indicators.
8.
Consider your next step:
■
If you installed the fan module as part of another procedure, return to that procedure.
■
Otherwise, return the server to operation. See “Returning the Server to
Operation” on page 161.
Related Information
■
“Remove a Fan Module” on page 67
Servicing Power Supplies
The server's redundant power supplies support concurrent maintenance, which enables you to
remove and replace a power supply without shutting down the server, provided that the other
power supply is online and working.
The server supports model A258 (1000 watt) power supplies. The A258 power supply unit
(PSU) provides conversion from the AC lines to the system, accepting ranges from 100-240
volts AC (VAC). These PSUs are designed to be hot-swappable, and in most cases will provide
fully redundant “1+1” power, allowing the system to suffer the loss of a PSU or an AC feed
with no loss to system availability.
In maximally configured systems, it is possible that the worst-case power consumption of the
system could exceed the capacity of a single PSU. The PSUs provide an over-subscription
mode, which allows the system to operate with fault-tolerance even with modest excursions
beyond the rated capacity of a single PSU. This over-subscription support is accomplished
using hardware signaling between the PSU and motherboard circuitry, which can force the
Servicing Components That Do Not Require Server Power-Off71
Page 72
Servicing Power Supplies
system to throttle CPU and memory power in the event that a PSU is lost. The resulting power
savings will be enough to allow the system to continue to run (in a lower-performance state)
until the power problem is resolved.
The following status indicators (LEDs) are lit when a power supply fault is detected:
■
■
■
If a power supply fails and you do not have a replacement available, leave the failed power
supply installed to ensure proper airflow in the server. For more information, refer to the
following sections:
■
■
■
Related Information
Front and rear Service Required LEDs
Amber Service Required LED on the faulty power supply
Rear Power Supply Fault LED located on the server front panel
“Power Supply Status Indicators” on page 29
“Remove a Power Supply” on page 73
“Install a Power Supply” on page 75
■
“Troubleshooting Using the Server Front and Rear Panel Status Indicators ” on page 26
■
“Powering Down the Server” on page 51
72Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 73
Remove a Power Supply
Remove a Power Supply
1.
Identify which power supply requires replacement; either PS0 [1] or PSU1 [2].
A lit amber Service Required LED on a power supply indicates that a failure was detected.
You can also use the Oracle ILOM showfaulty command at the Oracle ILOM prompt (->) to
identify a power supply failure.
Alternatively, to list all known faults in the server, log into the Oracle Solaris OS and issue the
fmadm faulty command, or log into the Oracle ILOM service processor from the Oracle ILOM
Fault Management Shell and issue the fmadm faulty command.
Note - The fans of a failed power supply might still be spinning when the system is powered on.
The fans will stop spinning when you disconnect the power cords.
2.
Gain access to the rear of the server where the faulty power supply is located.
3.
Disconnect both cable management arm (CMA) left-side connectors and move
the CMA out of the way.
Servicing Components That Do Not Require Server Power-Off73
Page 74
Remove a Power Supply
For instructions for disconnecting the CMA left-side connectors, see “Remove the Cable
Management Arm” in Oracle Exadata Storage Server X5-2 High Capacity Installation Guide.
Caution - When disconnecting the CMA left-side connectors, be sure to use your arm to
support the CMA so that it does not hang down under its own weight and stress the right-side
connectors; otherwise, the CMA might be damaged. You must continue to support the CMA
until you have reconnected both of the left-side connectors.
4.
Disconnect the power cord from the faulty power supply.
5.
Grasp the power supply handle and push the power supply latch to the left [1].
6.
Pull the power supply out of the chassis [2].
Caution - Whenever you remove a power supply, you should replace it with another power
supply; otherwise, the server might overheat due to improper airflow.
7.
Continue to “Install a Power Supply” on page 75.
Related Information
■
“Rear Panel Components and Cable Connections” on page 15
■
“Power Supply Status Indicators” on page 29
■
“Install a Power Supply” on page 75
74Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 75
Install a Power Supply
Install a Power Supply
Caution - Always replace the failed power supply with the same type of power supply.
1.
Remove the replacement power supply from its packaging, and place it on an
antistatic mat.
2.
Align the replacement power supply with the empty power supply slot.
3.
Slide the power supply into the slot until it is fully seated.
You will hear an audible click when the power supply fully seats.
4.
Reconnect the power cord to the power supply.
5.
Verify that the amber Service Required LED on the replaced power supply and
the Service Required LEDs are not lit on the front and rear panels.
Servicing Components That Do Not Require Server Power-Off75
Page 76
Install a Power Supply
6.
Note - After you have replaced Power Supply 0, you must reset the ILOM service processor
(SP) to propagate the FRU top-level indicator (TLI) data to the new power supply. For
instructions on resetting the SP, see the Oracle Integrated Lights Out Manager (ILOM) 3.2Configuration and Maintenance Guide at http://www.oracle.com/goto/ilom/docs. Power
Supply 1 does not contain FRU TLI data, and therefore does not require an SP reset after
replacement.
Reconnect the two CMA left-side connectors.
For instructions for reconnecting the CMA left-side connectors, see “Install the Cable
Management Arm” in Oracle Exadata Storage Server X5-2 High Capacity Installation Guide.
Related Information
■
“Rear Panel Components and Cable Connections” on page 15
■
“Power Supply Status Indicators” on page 29
■
“Remove a Power Supply” on page 73
76Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 77
Servicing Components That Require Server
Power-Off
These sections describe how to service components that require you to power off the server. For
more information about server components, refer to “Illustrated Parts Breakdown” on page 16.
DescriptionLinks
Service the DIMMs.“Servicing the DIMMs” on page 77
Service the PCIe cards.“Servicing PCIe Cards” on page 86
Service the internal USB flash drives.“Servicing the Internal USB Flash Drive” on page 92
Service the battery.“Servicing the Battery” on page 96
Service the processors.“Servicing Processors” on page 99
Service the front and rear storage drive backplanes.“Servicing the Storage Drive Backplanes” on page 112
Service SAS cables.“Servicing SAS Cables” on page 118
Service the Sun Storage 12 Gb SAS PCIe RAID HBA
card and super capacitor.
Service the front indicator LED/USB modules.“Servicing the Front LED/USB Indicator
Service the motherboard assembly.“Servicing the Motherboard Assembly” on page 145
“Servicing the Oracle Storage 12 Gb/s SAS PCIe RAID
HBA Card” on page 123
Modules” on page 130
Related Information
■
“Servicing Components That Do Not Require Server Power-Off” on page 61
Servicing the DIMMs
The Oracle Exadata Storage Server X5-2 High Capacity supports a DDR4 DIMM configuration
that includes dual-rank (DR) and single-rank (SR) DDR4 DIMMs.
Servicing Components That Require Server Power-Off77
Page 78
Servicing the DIMMs
Caution - These procedures require that you handle components that are sensitive to
electrostatic discharge. This sensitivity can cause the components to fail. To avoid
damage, ensure that you follow antistatic practices as described in “Electrostatic Discharge
Safety” on page 48.
Caution - Ensure that all power is removed from the server before removing or installing
DIMMs, or damage to the DIMMs might occur. You must disconnect all power cables from the
system before performing these procedures.
The following sections provide information to assist you when replacing a DIMM or upgrading
DIMMs:
■
“DIMM and Processor Physical Layout” on page 78
■
“DIMM Population Rules” on page 79
■
“DIMM Rank Classification Labels” on page 79
■
“Inconsistencies Between DIMM Fault Indicators and the BIOS Isolation of Faulty
DIMMs” on page 80
■
“Using the Storage Server Fault Remind Button” on page 80
■
“Identify and Remove a Faulty DIMM” on page 81
■
“Install a DIMM” on page 84
Related Information
■
“Servicing Processors” on page 99
DIMM and Processor Physical Layout
The physical layout of the DIMMs and processor(s) is shown in the following figure. When
viewing the server from the front, processor 0 (P0) is on the left. Notice that each processor, P0
and P1, have four memory channels that are labeled, from left to right, Ch C, Ch D, Ch B, and
Ch A.
78Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 79
Related Information
■
“DIMM Population Rules” on page 79
■
“DIMM Rank Classification Labels” on page 79
■
“Inconsistencies Between DIMM Fault Indicators and the BIOS Isolation of Faulty
DIMMs” on page 80
Servicing the DIMMs
DIMM Population Rules
The population rules for adding DIMMs to the server are as follows:
■
The server supports four 16-GB dual-rank (DR) Registered DIMMs (RDIMMs) on
Processor 0, and four 8-GB single-rank (SR) Registered DIMMs (RDIMMs) on Processor
1.
■
Within a memory channel, DIMMs must be populated in the black sockets. The storage
server does not support DIMMs in the white sockets or the black sockets with white tabs.
DIMM Rank Classification Labels
DIMMs come in a variety of ranks: single, dual, or quad. Each DIMM is shipped with a label
identifying its rank classification. The following table identifies the label corresponding to each
DIMM rank classification:
Servicing Components That Require Server Power-Off79
Page 80
Servicing the DIMMs
TABLE 7
Rank ClassificationLabel
Quad-rank LRDIMM (not supported)4Rx4
Dual-rank RDIMM2Rx4
Single-rank RDIMM1Rx4
Note - The Oracle Exadata Storage Server X5-2 High Capacity does not support Quad-Rank
LRDIMMs.
DIMM Rank Classification Labels
Inconsistencies Between DIMM Fault Indicators
and the BIOS Isolation of Faulty DIMMs
When a single DIMM is marked as failed by Oracle ILOM (for example, fault.memory.
intel.dimm.training-failed is listed in the SP Event Log), BIOS might disable the entire
memory channel that contains the failed DIMM, up to three DIMMs. As a result, none of the
memory installed in the disabled channel will be available to the operating system. However,
when the Fault Remind button is pressed, only the fault status indicator (LED) associated with
the failed DIMM lights. The fault LEDs for the other DIMMs in the memory channel remain
off. Therefore, you can correctly identify the failed DIMM.
Using the Storage Server Fault Remind Button
When the server Fault Remind button is pressed, an LED located next to the Fault Remind
button lights green to indicate that there is sufficient voltage present in the fault remind circuit
to light any fault LEDs that were lit due to a failure. If this LED does not light when you press
the Fault Remind button, it is likely that the capacitor powering the fault remind circuit has lost
its charge. This can happen if the Fault Remind button is pressed for several minutes with fault
LEDs lit or if power has been removed from the server for more than 15 minutes.
The following figure shows the location of the Fault Remind button.
80Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 81
Identify and Remove a Faulty DIMM
Identify and Remove a Faulty DIMM
1.
Prepare the server for service.
a.
Power off the server, and disconnect the power cords from the server power
supplies.
See “Powering Down the Server” on page 51.
b.
Extend the server into the maintenance position.
See “Extend the Server to the Maintenance Position” on page 55.
c.
Attach an antistatic wrist strap to your wrist and then to a metal area on the
chassis.
See “Electrostatic Discharge Safety” on page 48.
Servicing Components That Require Server Power-Off81
Page 82
Identify and Remove a Faulty DIMM
d.
Remove the server top cover.
See “Remove the Server Top Cover” on page 58.
e.
Swivel the air baffle into the upright position to allow access to DIMMs.
2.
Identify and note the location of the faulty DDR4 DIMM by pressing the Fault
Remind button on the motherboard I/O card.
See “Using the Storage Server Fault Remind Button” on page 80.
Faulty DIMMs are identified with a corresponding amber LED on the motherboard.
■
If the DIMM fault LED is off, then the DIMM is operating properly.
■
If the DIMM fault LED is on (amber), then the DIMM is faulty and should be replaced [1].
82Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 83
3.
To remove the faulty DIMM, do the following:
a.
Rotate both DIMM socket ejectors outward as far as they will go.
The DIMM is partially ejected from the socket.
b.
Carefully lift the DIMM straight up to remove it from the socket.
Identify and Remove a Faulty DIMM
4.
Replace each faulty DIMM with either another DIMM of the same rank size (dualrank or single-rank) or leave the socket empty.
For DIMM replacement instructions, see “Install a DIMM” on page 84.
Related Information
■
“DIMM and Processor Physical Layout” on page 78
■
“DIMM Population Rules” on page 79
■
“DIMM Rank Classification Labels” on page 79
■
“Install a DIMM” on page 84
Servicing Components That Require Server Power-Off83
Page 84
Install a DIMM
Install a DIMM
1.
Unpack the replacement DDR4 DIMM and place it on an antistatic mat.
2.
Ensure that the replacement DDR4 DIMM matches the size of the DIMM it is
replacing.
For DIMM socket population rules, see “DIMM Population Rules” on page 79.
3.
Install a DIMM.
a.
Ensure that the ejector tabs are in the open position.
b.
Align the notch in the replacement DIMM with the connector key in the
connector socket.
The notch ensures that the DIMM is oriented correctly.
c.
Push the DDR4 DIMM into the connector socket until the ejector tabs lock
the DIMM in place.
84Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 85
Install a DIMM
If the DIMM does not easily seat into the connector socket, verify that the notch in the
DIMM is aligned with the connector key in the connector socket. If the notch is not
aligned, damage to the DIMM might occur.
4.
Repeat Step 3 until all replacement DIMMs are installed.
5.
Return the server to operation:
a.
Lower the air baffle to the installed position.
b.
Install the server top cover.
See “Install the Server Top Cover” on page 161.
c.
Return the server to the normal rack position.
See “Return the Server to the Normal Rack Position” on page 165.
d.
Reconnect the power cords and data cables to the server.
Servicing Components That Require Server Power-Off85
Page 86
Servicing PCIe Cards
6.
See “Reconnect Power and Data Cables” on page 167.
e.
Power on the server.
See “Power On the Server” on page 167.
Verify that the AC OK LED is lit.
(Optional) Use Oracle ILOM to clear server DDR4 DIMM faults.
DDR4 DIMM faults are automatically cleared after a new memory DIMM has been installed.
If you need to manually clear DDR4 DIMM faults, refer to the Oracle Integrated Lights Out
Manager (ILOM) 3.2 Documentation Library at: http://www.oracle.com/goto/ilom/docs.
Related Information
■
“DIMM and Processor Physical Layout” on page 78
■
“DIMM Population Rules” on page 79
■
“DIMM Rank Classification Labels” on page 79
■
“Identify and Remove a Faulty DIMM” on page 81
Servicing PCIe Cards
These sections describe how to service and install PCIe cards. Refer to your PCIe card
documentation for complete software and cabling information about your card.
Caution - The procedures in this section should not be used to service the Oracle Storage 12 Gb
SAS PCIe RAID HBA card that is located in PCIe slot 6.
Caution - These procedures require that you handle components that are sensitive to
electrostatic discharge. This sensitivity can cause the components to fail. To avoid
damage, ensure that you follow antistatic practices as described in “Take Antistatic
Measures” on page 57.
Caution - Ensure that all power is removed from the server before removing or installing
PCIe cards. You must disconnect all power cables from the system before performing these
procedures.
This section covers the following topics:
■
“PCIe Slot Locations” on page 87
86Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 87
Servicing PCIe Cards
■
“Remove a PCIe Card” on page 88
■
“Install a PCIe Card” on page 90
Related Information
■
“Servicing the Oracle Storage 12 Gb/s SAS PCIe RAID HBA Card” on page 123
PCIe Slot Locations
There are six PCIe slots available for optional PCIe cards. The following figure shows the PCIe
slot numbering.
Figure
Description
Legend
1PCIe slot 1 (Contains the Oracle Flash Accelerator F160 NVMe PCIe Add In Card.)
2PCIe slot 2 (Contains the Oracle Flash Accelerator F160 NVMe PCIe Add In Card.)
3PCIe slot 3 (Contains the Sun Dual Port QDR InfiniBand Host Channel Adapter for PCIExpress Gen 3.)
4PCIe slot 4 (Contains the Oracle Flash Accelerator F160 NVMe PCIe Add In Card.)
5PCIe slot 5 (Contains the Oracle Flash Accelerator F160 NVMe PCIe Add In Card.)
6PCIe slot 6 (Contains the Oracle Storage 12 Gb/s SAS PCIe RAID HBA card)
Related Information
■
“Remove a PCIe Card” on page 88
■
“Install a PCIe Card” on page 90
Servicing Components That Require Server Power-Off87
Page 88
Remove a PCIe Card
1.
Remove a PCIe Card
Prepare the server for service.
a.
Power off the server and disconnect the power cords from the power
supplies.
See “Powering Down the Server” on page 51.
b.
Extend the server to the maintenance position.
See “Extend the Server to the Maintenance Position” on page 55.
c.
Attach an antistatic wrist strap.
See “Electrostatic Discharge Safety” on page 48.
d.
Remove the server top cover.
See “Remove the Server Top Cover” on page 58.
e.
Swivel the air baffle into the upright position to allow access to PCIe cards.
2.
Locate the PCIe card that you want to remove.
See “PCIe Slot Locations” on page 87.
3.
If necessary, make a note of where the PCIe cards are installed.
4.
Unplug all data cables from the PCIe card [1].
88Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 89
Remove a PCIe Card
5.
Rotate the PCIe card locking mechanism [1], and then lift up on the PCIe card to
disengage it from the motherboard connectors [2].
6.
Place the PCIe card on an antistatic mat.
Related Information
■
“PCIe Slot Locations” on page 87
■
“Install a PCIe Card” on page 90
Servicing Components That Require Server Power-Off89
Page 90
Install a PCIe Card
1.
2.
3.
Install a PCIe Card
Unpack the replacement PCIe card, and place it on an antistatic mat.
Locate the proper PCIe slot for the card you are replacing.
If necessary, remove the PCIe filler panel from the slot.
Note - Save this filler panel in case you ever need to remove the PCIe card from the system.
90Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 91
4.
Insert the PCIe card into the correct slot [1], and rotate the PCIe locking
mechanism to secure the PCIe card in place [2].
Install a PCIe Card
5.
Reconnect the cables to the PCIe card that you unplugged during the removal
procedure [2].
6.
Return the server to operation.
a.
Lower the air baffle to the installed position.
Servicing Components That Require Server Power-Off91
Page 92
Servicing the Internal USB Flash Drive
b.
Install the top cover.
See “Install the Server Top Cover” on page 161.
c.
Return the server to the normal rack position.
See “Return the Server to the Normal Rack Position” on page 165.
d.
Connect any required data cables to the PCIe card.
Route data cables through the cable management arm.
e.
Reconnect the power cords to the power supplies and power on the server.
See “Reconnect Power and Data Cables” on page 167 and “Power On the
Server” on page 167. Verify that the AC OK LED is lit.
7.
Use Oracle ILOM to clear any server PCIe card faults.
You can use the Oracle ILOM web interface or the command-line interface (CLI) to manually
clear faults. For information on how to use the Oracle ILOM web interface or the CLI to clear
server faults, see the Oracle Integrated Lights Out Manager (ILOM) 3.2 Documentation Library
at:
http://www.oracle.com/goto/ilom/docs.
8.
To determine whether additional steps are required to complete the installation
of the PCIe card, refer to the storage server product notes for the type of PCIe
card you installed.
Related Information
■
“PCIe Slot Locations” on page 87
■
“Remove a PCIe Card” on page 88
Servicing the Internal USB Flash Drive
Caution - These procedures require that you handle components that are sensitive
to electrostatic discharge. This sensitivity can cause the component to fail. To avoid
damage, ensure that you follow antistatic practices as described in “Take Antistatic
Measures” on page 57.
92Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 93
Remove the Internal USB Flash Drive
Caution - Ensure that all power is removed from the server before removing or installing a
flash drive or damage to the drive might occur. You must disconnect all power cables from the
system before performing these procedures.
This section covers the following topics:
■
“Remove the Internal USB Flash Drive” on page 93
■
“Install the Internal USB Flash Drive” on page 94
Related Information
■
“USB Ports” on page 174
Remove the Internal USB Flash Drive
1.
Prepare the server for service.
a.
Power off the server and disconnect the power cords from the server power
supplies.
See “Powering Down the Server” on page 51.
b.
Extend the server into maintenance position.
See “Extend the Server to the Maintenance Position” on page 55.
c.
Attach an antistatic wrist strap to your wrist, and then to a metal area on the
chassis.
See “Electrostatic Discharge Safety” on page 48.
d.
Remove the server top cover.
See “Remove the Server Top Cover” on page 58.
Servicing Components That Require Server Power-Off93
Page 94
Install the Internal USB Flash Drive
2.
To remove the USB flash drive, grasp the flash drive and pull it from the slot.
Related Information
■
“Install the Internal USB Flash Drive” on page 94
Install the Internal USB Flash Drive
1.
Unpack the replacement USB flash drive.
94Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 95
2.
Insert the flash drive into the USB slot.
Install the Internal USB Flash Drive
3.
Return the server to operation.
a.
Install the server top cover.
See “Install the Server Top Cover” on page 161.
b.
Return the server to the normal rack position.
See “Return the Server to the Normal Rack Position” on page 165.
c.
Reconnect the power cords to the server power supplies and power on the
server.
See “Reconnect Power and Data Cables” on page 167 and “Power On the
Server” on page 167. Verify that the AC OK LED is lit.
Related Information
■
“Remove the Internal USB Flash Drive” on page 93
Servicing Components That Require Server Power-Off95
Page 96
Servicing the Battery
Servicing the Battery
The real-time clock (RTC) battery maintains system time when the server is powered off and
a time server is unavailable. If the server fails to maintain the proper time when the system is
powered off and not connected to a network, replace the battery.
Caution - Ensure that all power is removed from the server before removing or installing
the battery. You must disconnect the power cables from the system before performing this
procedure.
This section covers the following topics:
■
“Remove the Battery” on page 96
■
“Install the Battery” on page 97
Remove the Battery
1.
Prepare the server for service.
a.
Power off the server and disconnect the power cords from the server power
supplies.
See “Powering Down the Server” on page 51.
b.
Extend the server into the maintenance position.
See “Extend the Server to the Maintenance Position” on page 55.
c.
Attach an antistatic wrist strap to your wrist, and then to a metal area on the
chassis.
See “Electrostatic Discharge Safety” on page 48.
d.
Remove the server top cover.
See “Remove the Server Top Cover” on page 58.
e.
Swivel the air baffle into the upright position to allow access to the battery.
96Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 97
Install the Battery
2.
To dislodge the battery from its retainer, gently push the top edge of the battery
away from the retainer.
3.
Lift the battery up and out of its retainer.
Related Information
■
“Install the Battery” on page 97
Install the Battery
1.
Unpack the replacement battery.
2.
Press the new battery into the battery retainer with the positive side (+) facing
the server chassis side wall.
Servicing Components That Require Server Power-Off97
Page 98
Install the Battery
3.
4.
Note - If the service processor is configured to synchronize with a network time server using
the Network Time Protocol (NTP), the Oracle ILOM SP clock will be reset as soon as the
server is powered on and connected to the network; otherwise, proceed to the next step.
If the service processor is not configured to use NTP, you must do one of the
following:
■
Reset the Oracle ILOM SP clock using the Oracle ILOM CLI or the web interface.
For instructions, refer to the Oracle Integrated Lights Out Manager (ILOM) 3.2
Documentation Library at: http://www.oracle.com/goto/ilom/docs.
■
Use the BIOS Setup Utility to reprogram the host clock.
For the BIOS Main menu that supports this procedure, refer to the Oracle X5 Series ServersAdministration Guide at http://www.oracle.com/goto/x86admindiag/docs.
Return the server to operation.
a.
Lower the air baffle to the installed position.
b.
Install the server top cover.
See “Install the Server Top Cover” on page 161.
c.
Return the server to the normal rack position.
See “Return the Server to the Normal Rack Position” on page 165.
d.
Reconnect the power cords to the server power supplies, and power on the
server.
See “Reconnect Power and Data Cables” on page 167 and “Power On the
Server” on page 167. Verify that the AC OK LED is lit.
Related Information
■
“Remove the Battery” on page 96
98Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Page 99
Servicing Processors
Caution - Ensure that all power is removed from the server before removing or installing a
processor. You must disconnect the power cables from the system before performing these
procedures.
Caution - This procedure requires that you handle components that are sensitive to electrostatic
discharge. This sensitivity can cause the component to fail. To avoid damage, ensure that
you follow electrostatic discharge safety measures and antistatic practices. See “Electrostatic
Discharge Safety” on page 48.
This section covers the following topics:
■
“Processor Removal and Replacement Tool” on page 99
■
“Remove a Processor” on page 100
■
“Install a Processor” on page 106
Servicing Processors
Related Information
■
“Servicing the DIMMs” on page 77
Processor Removal and Replacement Tool
Caution - The correct color-coded processor removal and replacement tool (part number
G29477-002 or later) must be used to remove and replace a processor; otherwise, the processor
or the processor socket might be damaged.
The Oracle Exadata Storage Server X5-2 High Capacity uses two Intel E5-2630, 8-core, 2.4
GHz, 85W processors. When replacing the processor, you must use the specifically designed
green color-coded processor removal and replacement tool.
Note - The green processor removal and replacement tool is included in the box with the 8-
core replacement processor. Additionally, the correct removal and replacement tool ships with
replacement motherboards.
Servicing Components That Require Server Power-Off99
Page 100
Remove a Processor
1.
Remove a Processor
Caution - The green color-coded processor removal and replacement tool (part number
G29477-002 or later) must be used to remove the 8-core processor; otherwise, the processor or
the processor socket might be damaged.
Caution - Be careful not to touch the processor socket pins. The processor socket pins are
very fragile. A light touch can bend the processor socket pins beyond repair. Always use the
processor removal/replacement tool to remove and replace the processor.
Prepare the server for service.
a.
Power off the server and disconnect the power cords from the power
supplies.
See “Powering Down the Server” on page 51.
b.
Attach an antistatic wrist strap to your wrist, and then to a metal area on the
chassis.
See “Take Antistatic Measures” on page 57.
c.
Extend the server to the maintenance position.
See “Extend the Server to the Maintenance Position” on page 55
d.
Remove the server top cover.
See “Remove the Server Top Cover” on page 58.
e.
Swivel the air baffle into the upright position to allow access to the
processors.
2.
Identify the location of the faulty processor by pressing the Fault Remind button
on the motherboard I/O card.
Note - When the Fault Remind button is pressed, an LED located next to the Fault Remind
button lights green to indicate that there is sufficient voltage present in the fault remind circuit
to light any fault LEDs that were lit due to a failure. If this LED fails to light when you press
the Fault Remind button, it is likely that the capacitor powering the fault remind circuit has
lost its charge. This can happen if the Fault Remind button is pressed for a long time with fault
LEDs lit or if power has been removed from the server for more than 15 minutes.
100Oracle Exadata Storage Server X5-2 High Capacity Service Manual • January 2018
Loading...
+ hidden pages
You need points to download manuals.
1 point = 1 manual.
You can buy points or you can get point for every manual you upload.