This software and related documentation are provided under a license agreement containing restrictions on use and disclosure and are protected by intellectual property laws. Except
as expressly permitted in your license agreement or allowed by law, you may not use, copy, reproduce, translate, broadcast, modify, license, transmit, distribute, exhibit, perform,
publish, or display any part, in any form, or by any means. Reverse engineering, disassembly, or decompilation of this software, unless required by law for interoperability, is
prohibited.
The information contained herein is subject to change without notice and is not warranted to be error-free. If you find any errors, please report them to us in writing.
If this is software or related documentation that is delivered to the U.S. Government or anyone licensing it on behalf of the U.S. Government, then the following notice is applicable:
U.S. GOVERNMENT END USERS: Oracle programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation,
delivered to U.S. Government end users are "commercial computer software" pursuant to the applicable Federal Acquisition Regulation and agency-specific supplemental
regulations. As such, use, duplication, disclosure, modification, and adaptation of the programs, including any operating system, integrated software, any programs installed on the
hardware, and/or documentation, shall be subject to license terms and license restrictions applicable to the programs. No other rights are granted to the U.S. Government.
This software or hardware is developed for general use in a variety of information management applications. It is not developed or intended for use in any inherently dangerous
applications, including applications that may create a risk of personal injury. If you use this software or hardware in dangerous applications, then you shall be responsible to take all
appropriate fail-safe, backup, redundancy, and other measures to ensure its safe use. Oracle Corporation and its affiliates disclaim any liability for any damages caused by use of this
software or hardware in dangerous applications.
Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners.
Intel and Intel Xeon are trademarks or registered trademarks of Intel Corporation. All SPARC trademarks are used under license and are trademarks or registered trademarks of
SPARC International, Inc. AMD, Opteron, the AMD logo, and the AMD Opteron logo are trademarks or registered trademarks of Advanced Micro Devices. UNIX is a registered
trademark of The Open Group.
This software or hardware and documentation may provide access to or information about content, products, and services from third parties. Oracle Corporation and its affiliates are
not responsible for and expressly disclaim all warranties of any kind with respect to third-party content, products, and services unless otherwise set forth in an applicable agreement
between you and Oracle. Oracle Corporation and its affiliates will not be responsible for any loss, costs, or damages incurred due to your access to or use of third-party content,
products, or services, except as set forth in an applicable agreement between you and Oracle.
Access to Oracle Support
Oracle customers that have purchased support have access to electronic support through My Oracle Support. For information, visit http://www.oracle.com/pls/topic/lookup?
ctx=acc&id=info or visit http://www.oracle.com/pls/topic/lookup?ctx=acc&id=trs if you are hearing impaired.
Ce logiciel et la documentation qui l'accompagne sont protégés par les lois sur la propriété intellectuelle. Ils sont concédés sous licence et soumis à des restrictions d'utilisation et
de divulgation. Sauf stipulation expresse de votre contrat de licence ou de la loi, vous ne pouvez pas copier, reproduire, traduire, diffuser, modifier, accorder de licence, transmettre,
distribuer, exposer, exécuter, publier ou afficher le logiciel, même partiellement, sous quelque forme et par quelque procédé que ce soit. Par ailleurs, il est interdit de procéder à toute
ingénierie inverse du logiciel, de le désassembler ou de le décompiler, excepté à des fins d'interopérabilité avec des logiciels tiers ou tel que prescrit par la loi.
Les informations fournies dans ce document sont susceptibles de modification sans préavis. Par ailleurs, Oracle Corporation ne garantit pas qu'elles soient exemptes d'erreurs et vous
invite, le cas échéant, à lui en faire part par écrit.
Si ce logiciel, ou la documentation qui l'accompagne, est livré sous licence au Gouvernement des Etats-Unis, ou à quiconque qui aurait souscrit la licence de ce logiciel pour le
compte du Gouvernement des Etats-Unis, la notice suivante s'applique :
U.S. GOVERNMENT END USERS: Oracle programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation,
delivered to U.S. Government end users are "commercial computer software" pursuant to the applicable Federal Acquisition Regulation and agency-specific supplemental
regulations. As such, use, duplication, disclosure, modification, and adaptation of the programs, including any operating system, integrated software, any programs installed on the
hardware, and/or documentation, shall be subject to license terms and license restrictions applicable to the programs. No other rights are granted to the U.S. Government.
Ce logiciel ou matériel a été développé pour un usage général dans le cadre d'applications de gestion des informations. Ce logiciel ou matériel n'est pas conçu ni n'est destiné à être
utilisé dans des applications à risque, notamment dans des applications pouvant causer un risque de dommages corporels. Si vous utilisez ce logiciel ou ce matériel dans le cadre
d'applications dangereuses, il est de votre responsabilité de prendre toutes les mesures de secours, de sauvegarde, de redondance et autres mesures nécessaires à son utilisation dans
des conditions optimales de sécurité. Oracle Corporation et ses affiliés déclinent toute responsabilité quant aux dommages causés par l'utilisation de ce logiciel ou matériel pour des
applications dangereuses.
Oracle et Java sont des marques déposées d'Oracle Corporation et/ou de ses affiliés. Tout autre nom mentionné peut correspondre à des marques appartenant à d'autres propriétaires
qu'Oracle.
Intel et Intel Xeon sont des marques ou des marques déposées d'Intel Corporation. Toutes les marques SPARC sont utilisées sous licence et sont des marques ou des marques
déposées de SPARC International, Inc. AMD, Opteron, le logo AMD et le logo AMD Opteron sont des marques ou des marques déposées d'Advanced Micro Devices. UNIX est une
marque déposée de The Open Group.
Ce logiciel ou matériel et la documentation qui l'accompagne peuvent fournir des informations ou des liens donnant accès à des contenus, des produits et des services émanant de
tiers. Oracle Corporation et ses affiliés déclinent toute responsabilité ou garantie expresse quant aux contenus, produits ou services émanant de tiers, sauf mention contraire stipulée
dans un contrat entre vous et Oracle. En aucun cas, Oracle Corporation et ses affiliés ne sauraient être tenus pour responsables des pertes subies, des coûts occasionnés ou des
dommages causés par l'accès à des contenus, produits ou services tiers, ou à leur utilisation, sauf mention contraire stipulée dans un contrat entre vous et Oracle.
Accès aux services de support Oracle
Les clients Oracle qui ont souscrit un contrat de support ont accès au support électronique via My Oracle Support. Pour plus d'informations, visitez le site http://www.oracle.com/
pls/topic/lookup?ctx=acc&id=info ou le site http://www.oracle.com/pls/topic/lookup?ctx=acc&id=trs si vous êtes malentendant.
Page 5
Contents
Using This Documentation ............. ................ ................ ................ ................ ... 13
Index ............. ................ ................ ................ ................ ................ ................ ... 279
11
Page 12
12Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 13
Using This Documentation
■
Overview – Describes how to troubleshoot and maintain Oracle Exadata Storage Server
X7-2.
■
Audience – Technicians, system administrators, authorized service providers, and trained
hardware service personnel who have been instructed on the hazards within the equipment
and are qualified to remove and replace hardware.
■
Required knowledge – Advanced experience troubleshooting and replacing hardware.
Product Documentation Library
Documentation and resources for this product and related products are available at https://
www.oracle.com/goto/x7-2l/docs.
Feedback
Provide feedback about this documentation at http://www.oracle.com/goto/docfeedback.
Using This Documentation13
Page 14
14Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 15
About the Oracle Exadata Storage Server X7-2
These sections describe the controls, connectors, LEDs, system components, and replaceable
components of the server:
■
“Product Description” on page 15
■
“Front and Back Panel Components” on page 16
■
“System Components” on page 21
Note - All system firmware and software for Exadata storage servers is automatically
maintained by the Exadata update process. Do not manually update firmware or software unless
directed by Oracle Support.
Product Description
The Oracle Exadata Storage Server X7-2 is an enterprise-class, two rack unit (2U) server that is
available in two configurations: Extreme Flash (EF) and High Capacity (HC). They both use the
same server but they are provisioned differently, as described in the following table.
System Components
Two Intel Xeon 4114, 10-core, 2.2 GHz, 85W processors
Twelve 16 GB 2666 MT/sec DDR4 DIMMs (192 GB)
Optionally, customers can expand system memory by upgrading to twelve 64 GB DDR4 DIMMs (768 GB) or
twenty-four 64 GB DDR4 DIMMs (1.5 TB)
11(Optional) Oracle Storage 12 Gb SAS PCIe RAID HBA, Internal card and super capacitor (HC only)
22Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 23
System Components
Figure
Description
Legend
12PCIe cards
13PCIe plastic walls
14System battery
15Power supplies
16Air baffle
17DIMMs
Replaceable Units
The following table lists the replaceable units in the server and directs you to the replacement
instructions.
Note - You can remove and replace hot-serviceable components while the server module is
operating in Main power mode. Service cold-service components when the server module is
powered off and, in most cases, when you remove them from the server.
Motherboard assemblyProvides connectors for the DIMMs, processors,
Located in PCIe slot 11, the Oracle Storage 12
Gb SAS PCIe RAID HBA, Internal card manages
SAS storage drives (HC only).
Contains the push-button circuitry and LEDs that
are displayed on the bezel of the chassis.
PCIe risers, and other components.
“Servicing the Internal
HBA Card and HBA Super
Capacitor” on page 159
“Servicing the Front LED
Indicator Module” on page 145
“Servicing the Motherboard
Assembly” on page 173
24Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 25
Troubleshooting and Diagnostics
This section includes information about troubleshooting hardware component faults for the
Oracle Exadata Storage Server X7-2. It contains the following topics:
■
“Troubleshooting Server Component Hardware Faults” on page 25
■
“Troubleshooting With Diagnostic Tools” on page 39
■
“Attaching Devices to the Server” on page 41
■
“Getting Help” on page 46
For more information about server troubleshooting and diagnostics, refer to the Diagnostics andTroubleshooting Guide for Oracle x86 Servers with Oracle ILOM 4.0 at http://www.oracle.
com/goto/x86admindiag/docs.
Troubleshooting Server Component Hardware Faults
This section contains maintenance-related information and procedures to troubleshoot and
repair server hardware issues. The following topics are covered.
■
“Troubleshooting Server Hardware Faults” on page 25
■
“Troubleshooting and Diagnostic Information” on page 29
■
“Troubleshooting Using the Server Front and Back Panel Status Indicators” on page 30
■
“Troubleshooting System Cooling Issues” on page 36
■
“Troubleshooting Power Issues” on page 37
Troubleshooting Server Hardware Faults
When a server hardware fault event occurs, the system lights the Fault-Service Required LED
and captures the event in the Oracle ILOM event log. If you set up notifications through Oracle
ILOM, you also receive an alert through the notification method you chose. When you become
aware of a hardware fault, address it immediately.
Troubleshooting and Diagnostics25
Page 26
Basic Troubleshooting Process
To investigate a hardware fault, see the following:
■
“Basic Troubleshooting Process” on page 26
■
“Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface” on page 27
Basic Troubleshooting Process
Use the following process to address a hardware fault. For the step-by-step procedure, see
“Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface” on page 27.
1.
Identify the server subsystem containing the fault.
You can use Oracle ILOM to identify the failed component.
2.
Review the Oracle Server X7-2 Product Notes and the Oracle Exadata DatabaseMachine Release Notes at https://docs.oracle.com/cd/E80920_01/index.htm.
The release notes contain up-to-date information about the server, including hardware-related
issues.
3.
Prepare the server for service using Oracle ILOM.
■
Prepare for hot service if the component supports it.
To hot service components (such as storage drives, fan modules, and power supplies), see
the removal, installation, and replacement procedures in this document.
■
Prepare for cold service if hot service is not supported for the component.
■
Use Oracle to power off the system or server module.
■
Use Oracle ILOM to set the OK to Remove status for a server module that you must
remove. This prevents any power or update processes from being initiated before you
can remove the server module from the chassis.
4.
Prepare the service work space.
Before servicing the server, prepare the work space, ensuring Electrostatic Discharge Safety
(ESD) protection for the server and components.
See “Preparing for Service” on page 49.
5.
Service the components.
To service the components, see the removal, installation, and replacement procedures in this
document.
6.
Clear the fault in Oracle ILOM.
26Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 27
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Depending on the component, you might need to clear the fault in Oracle ILOM. Generally,
components that have a FRU ID clear the fault automatically.
Related Information
■
“Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface” on page 27
Troubleshoot Hardware Faults Using the Oracle ILOM Web
Interface
Note - The screens shown in this procedure might differ from those for your server.
This procedure uses the basic troubleshooting steps described in “Basic Troubleshooting
Process” on page 26.
Use this procedure to troubleshoot hardware faults with the Oracle ILOM web interface and, if
necessary, prepare the server for service.
Note - This procedure provides one basic approach to troubleshooting hardware faults. It uses
the Oracle ILOM web interface. However, you can perform procedure using the Oracle ILOM
command-line interface (CLI). For more information about the Oracle ILOM web interface and
CLI, refer to the Oracle ILOM documentation.
1.
Log in to the server SP Oracle ILOM web interface.
a.
Open a browser and direct it using the IP address of the server SP.
b.
At the Login screen, enter a user name (with administrator privileges) and
password.
The Summary Information page appears. The Status section of the Summary Information
page provides information about the server subsystems, including:
■
Processors
■
Memory
■
Power
■
Cooling
■
Storage
■
Networking
Troubleshooting and Diagnostics27
Page 28
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
2.
In the Status section of the Oracle ILOM Summary Information page, identify the
server subsystem that requires service.
In the following example, the Status page shows that the Processor subsystem requires service,
which indicates that a hardware component in the subsystem is in a fault state.
3.
To identify the component, in the Status section, click Processors.
The Oracle ILOM Processors page appears.
The following example shows the Processors subsystem section, which indicates that CPU 0
has a fault.
28Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 29
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
4.
To get more information, click the Open Problems link.
The Open Problems page provides detailed information, such as the time the event occurred,
component and subsystem name, and description of the issue. It also includes a link to an
Oracle Knowledge Base article.
In this example, the hardware fault with CPU 0 requires local/physical access to the server.
Tip - The System Log provides a chronological list of all system events and faults that occurred
since the log was last reset, and includes additional information, such as severity levels and
error counts. The System Log also includes information on the devices not reported in the
Status section. To access the System Log, in the left panel, click System Log.
5.
Before going to the server, review the Oracle Server X7-2L Product Notes for
information related to the issue or the component. To access the Product Notes,
go to http://www.oracle.com/goto/x7-2l/docs.
The Product Notes contain up-to-date information about the server, including hardware-related
issues.
6.
To prepare the server for service, see “Preparing for Service” on page 49.
After servicing the component, you might need to clear the fault in Oracle ILOM. For more
information, refer to the service procedure for the component.
7.
Service the component.
Managing Server Hardware Faults Through the
Oracle ILOM Fault Management Shell
The Oracle ILOM Fault Management Shell enables you to view and manage fault activity on
managed servers and other types of devices.
For more information about how to use the Oracle ILOM Fault Management Shell, refer to the
Oracle ILOM User's Guide for System Monitoring and Diagnostics Firmware Release 4.0.x in
the Oracle Integrated Lights Out Manager (ILOM) 4.0 Documentation Library at http://www.
oracle.com/goto/ilom/docs.
Troubleshooting and Diagnostic Information
The following list contains diagnostic and troubleshooting-related procedures and references
that can assist you with resolving server issues.
Troubleshooting and Diagnostics29
Page 30
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
■
Diagnostics and Troubleshooting Guide for Oracle x86 Servers with Oracle ILOM 4.0.x at
http://www.oracle.com/goto/x86admindiag/docs
■
Oracle X7 Series Servers Administration Guide
■
“Troubleshooting Using the Server Front and Back Panel Status Indicators” on page 30
■
“Managing Server Hardware Faults Using the Oracle ILOM Fault Management
Shell” on page 39
Troubleshooting Using the Server Front and Back
Panel Status Indicators
These sections describe the status indicators (LEDs) located on the front and back of the server,
including those found on components and ports. This section includes the following topics:
■
“Server Boot Process and Normal Operating State Indicators” on page 30
■
“Server System-Level Status Indicators” on page 31
■
“Server Fan Status Indicators” on page 33
■
“Storage Drive Status Indicators” on page 33
■
“Power Supply Status Indicators” on page 34
■
“Network Management Port Status Indicators” on page 34
■
“Ethernet Port Status Indicators” on page 34
■
“Motherboard Status Indicators” on page 35
Related Information
■
“EF Front Panel Components” on page 17
■
“EF Back Panel Components and Cable Connections” on page 18
■
“HC Front Panel Components” on page 19
■
“HC Back Panel Components and Cable Connections” on page 20
Server Boot Process and Normal Operating State Indicators
A normal server boot process involves two indicators, the service processor SP OK LED
indicator and the System OK LED indicator.
When you connect AC power to the server, the server boots into standby power mode:
1. The SP OK LED blinks slowly (0.5 seconds on, 0.5 seconds off) while the SP is starting,
and the main System OK LED remains off until the SP is ready.
30Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 31
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
2. After a few minutes, the main System OK LED slowly flashes the standby blink pattern
(0.1 seconds on, 2.9 seconds off), indicating that the SP is ready for use. In Standby power
mode, the server is not initialized or fully powered on at this point.
When powering on the server (either by the On/Standby button or Oracle ILOM), the server
boots to Main power mode:
1. The System OK LED blinks slowly (0.5 seconds on, 0.5 seconds off), and the SP OK LED
remains lit (no blinking).
2. When the server successfully boots, the System OK LED remains lit. When the System OK
LED and the SP OK LED indicators remain lit, the server is in Main power mode.
Note - The green System OK LED indicator and the green SP OK indicator remain lit (no
blinking) when the server is in a normal operating state.
Server System-Level Status Indicators
There are seven system-level status indicators (LEDs), some of which are located on both the
server front panel and the back panel. For the location of the status indicators, see “Front and
Back Panel Components” on page 16. The following table describes these indicators.
Status Indicator
Name
Locate Button/
LED
Fault-Service
Required
System OKGreenIndicates the operational state of the chassis.
IconColorState and Meaning
WhiteIndicates the location of the server.
■ OFF – Server is operating normally.
■ FAST BLINK (250 ms) – Use Oracle ILOM to
activate this LED indicator to enable you to locate a
particular system quickly and easily.
■ Pressing the Locate button toggles the LED indicator
fast blink on or off.
AmberIndicates the fault state of the server.
■ OFF – The server is operating normally.
■ STEADY ON – A fault is present on the server. This
LED indicator lights whenever a fault indicator lights
for a replaceable component on the server.
Note - When this LED indicator is lit, a system console
message might appear that includes a recommended
service action.
Troubleshooting and Diagnostics31
Page 32
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Status Indicator
Name
IconColorState and Meaning
■ OFF – AC power is not present or the Oracle ILOM
boot is not complete.
■ STANDBY BLINK (on for 100 ms, off for 2900 ms)
– Standby power is on, but the chassis power is off
and the Oracle ILOM SP is running.
■ SLOW BLINK (1000 ms) – Startup sequence was
initiated on the host. This pattern begins soon after
you power on the server. This status indicates either:
power-on self-test (POST) code checkpoint tests
are running on the server host system, or the host
is transitioning from the powered-on state to the
standby state on shutdown.
■ STEADY ON – The server is powered on, and all
host POST code checkpoint tests are complete. The
server is in one of the following states: the server
host is booting the operating system (OS), or the
server host is running the OS.
SP OKGreenIndicates the state of the service processor.
■ OFF – Service processor (SP) is not running.
■ SLOW BLINK – SP is booting.
■ STEADY ON – SP is fully operational.
Top FanAmberIndicates that one or more of the internal fan modules
failed.
■ OFF – Indicates steady state; no service is required.
■ STEADY ON – Indicates service required.
Back Power
AmberIndicates that one of the server power supplies failed.
Supply Fault
■ OFF – Indicates steady state; no service is required.
■ STEADY ON – Indicates service required; service
the power supply.
System Over
Back
AmberIndicates a warning for an overtemperature condition.
Temperature
Warning
■ OFF – Normal operation; no service is required.
■ STEADY ON – The system is experiencing an
overtemperature warning condition.
Note - This is a warning indication, not a fatal
overtemperature. Failure to correct this might result in
the system overheating and shutting down unexpectedly.
DO NOT
WhiteIndicates that the system is not ready to service.
SERVICE
■ OFF – Normal operation.
■ STEADY ON – The system is not ready for service.
32Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 33
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Status Indicator
Name
IconColorState and Meaning
Note - The DO NOT SERVICE indicator is application
specific. This indicator is illuminated only on demand by
the Host application.
Server Fan Status Indicators
Each fan module has one status indicator (LED). The LEDs are located on the chassis midwall adjacent to and aligned with the fan modules, and are visible when the server top cover is
removed.
Status Indicator
Name
Fan StatusAmber■ Off – The fan module is correctly installed and
IconColorState and Meaning
operating within specification.
■ Amber – The fan module is faulty. The front TOP
FAN LED and the front and back panel Fault-Service
Required LEDs are also lit if the system detects a fan
module fault.
Storage Drive Status Indicators
There are three status indicators (LEDs) on each drive, which applies to only the HC server.
Status Indicator
Name
OK/ActivityGreen■ OFF – Power is off or installed drive is not
Fault-Service
Required
OK to RemoveBlue■ STEADY ON – The storage drive can be removed
IconColorState and Meaning
recognized by the system.
■ STEADY ON – The drive is engaged and is
receiving power.
■ RANDOM BLINK – There is disk activity. Status
indicator LED blinks on and off to indicate activity.
Amber■ OFF – The storage drive is operating normally.
■ STEADY ON – The system detected a fault with the
storage drive.
safely during a hot-plug operation.
■ OFF – The storage drive was not prepared for
removal.
Troubleshooting and Diagnostics33
Page 34
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Power Supply Status Indicators
There are two status indicators (LEDs) on each power supply. These indicators are visible from
the back of the server.
Status Indicator
Name
AC OK/ DC OKGreen■ OFF – No AC power is present.
Fault-Service
Required
IconColorState and Meaning
■ SLOW BLINK – Normal operation. Input power
is within specification. DC output voltage is not
enabled.
■ STEADY ON – Normal operation. Input AC power
and DC output voltage are within specification.
Amber■ OFF – Normal operation. No service action is
required.
■ STEADY ON – The power supply (PS) detected a PS
fan failure, PS overtemperature, PS over current, or
PS over or under voltage.
Network Management Port Status Indicators
The server has one 10/100/1000BASE-T Ethernet management domain interface, labeled NET
MGT. There are two status indicators (LEDs) on this port. These indicators are visible from the
back of the server.
Status Indicator
Name
ActivityTop rightGreen■ ON – Link up.
Link speedTop leftGreen■ ON – 1000BASE-T link.
LocationColorState and Meaning
■ OFF – No link or down link.
■ BLINKING – Packet activity.
■ OFF – 10/100BASE-T link.
Ethernet Port Status Indicators
The server has one Gigabit Ethernet port (NET 0). There are two status indicators (LEDs) that
are visible from the back of the server.
34Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 35
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Status Indicator
Name
ActivityTop leftGreen■ ON – Link up.
Link speedTop rightBi-colored:
LocationColorState and Meaning
■ OFF – No activity.
■ BLINKING – Packet activity.
Amber/Green
■ OFF – 100BASE-T link (if link up).
■ Green ON – 1000BASE-T link.
Motherboard Status Indicators
The motherboard contains the following status indicators (LEDs).
Status IndicatorDescription
DIMM Fault Status
Indicators
Processor Fault
Status Indicators
Fault Remind Status
Indicator
STBY PWRGD
Status Indicator
PCIe Hot-Plug
Status Indicators
■ Each 24 DIMM socket on the motherboard has an amber fault status indicator (LED)
associated with it.
■ If Oracle ILOM determines that a DIMM is faulty, pressing the Fault Remind button on
the motherboard I/O card signals the service processor to light the fault LED associated
with the failed DIMM.
■ For more information on DIMM fault status indicators and the location of the Fault
Remind button, see “Servicing the DIMMs” on page 101.
■ The motherboard includes a fault status indicator (LED) adjacent to each of the two
processor sockets.
■ These LEDs indicate when a processor fails.
■ Pressing the Fault Remind button on the motherboard I/O card signals the service
processor to light the fault status indicators associated with the failed processors.
■ For more information on processor fault status indicators and the location of the Fault
Remind button, see “Servicing Processors” on page 123.
■ This status indicator (LED) is located next to the Fault Remind button and is powered
from the super capacitor that powers the fault LEDs on the motherboard.
■ This LED lights to indicate that the fault remind circuitry is working properly in
cases where no components failed and, as a result, none of the component fault LEDs
illuminate.
■ For more information on the fault remind status indicator and the location of the Fault
Remind button, see “Using the Server Fault Remind Button” on page 104.
■ This green status indicator (LED) is labeled STBY PWRGD and is located on the
motherboard near the back of the server.
■ This LED lights to inform a service technician that the motherboard is receiving Standby
power from at least one of the power supplies.
■ This LED helps prevent service actions on the server internal components while the AC
power cords are installed and power is supplied to the server.
■ Each PCIe card on the motherboard has a green power indicator and an amber attention
status indicator (LED) associated with it.
■ If the green power LED is on and the amber attention LED is off, then the SSD is
operating properly.
Troubleshooting and Diagnostics35
Page 36
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Status IndicatorDescription
■ If the amber attention LED is on, verify the state of the SSD in the Oracle Exadata
Storage Server Cell Software to determine if the PCIe card is faulty and needs
replacement.
Troubleshooting System Cooling Issues
Maintaining the proper internal operating temperature of the server is crucial to the health of the
server. To prevent server shutdown and damage to components, address over temperature and
hardware-related issues as soon as they occur. If your server has a temperature-related fault, use
the information in the following table to troubleshoot the issue.
Cooling IssueDescriptionActionPrevention
External
Ambient
Temperature
Too High
Airflow
Blockage
Cooling Areas
Compromised
The server fans pull cool air
into the server from its external
environment. If the ambient
temperature is too high, the
internal temperature of the
server and its components
increases. This can cause poor
performance and component
failure.
The server cooling system
uses fans to pull cool air in
from the server front intake
vents and exhaust warm air out
the server back panel vents.
If the front or back vents are
blocked, the airflow through
the server is disrupted and
the cooling system fails to
function properly causing the
server internal temperature to
rise.
The air baffle, component filler
panels, and server top cover
maintain and direct the flow
of cool air through the server.
These server components must
be in place for the server to
function as a sealed system.
If these components are
not installed correctly, the
airflow inside the server can
become chaotic and nondirectional, which can cause
Verify the ambient temperature
of the server space against the
environmental specifications
for the server. If the
temperature is not within
the required operating
range, remedy the situation
immediately.
Inspect the server front
and back panel vents for
blockage from dust or debris.
Additionally, inspect the
server interior for improperly
installed components or cables
that can block the flow of air
through the server.
Inspect the server interior to
ensure that the air baffle is
properly installed. Ensure
that all external-facing
slots (storage drive, PCIe)
are occupied with either a
component or a component
filler panel. Ensure that the
server top cover is in place and
sits flat and snug on top of the
server.
Periodically verify the ambient
temperature of the server space
to ensure that it is within the
required range, especially if
you made any changes to the
server space (for example,
added additional servers). The
temperature must be consistent
and stable.
Periodically inspect and clean
the server vents using an ESD
certified vacuum cleaner.
Ensure that all components,
such as cards, cables, fans,
air baffles and dividers are
properly installed. Never
operate the server without the
top cover installed.
When servicing the server,
ensure that the air baffle is
installed correctly and that
the server has no unoccupied
external-facing slots. Never
operate the server without the
top cover installed.
36Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 37
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Cooling IssueDescriptionActionPrevention
server components to overheat
and fail.
Hardware
Component
Failure
■ Components, such as
power supplies and fan
modules, are an integral
part of the server cooling
system. When one of these
components fails, the
server internal temperature
can rise. This rise in
temperature can cause
other components to enter
into an over-temperature
state. Additionally, some
components, such as
processors, might overheat
when they are failing,
which can also generate an
over-temperature event.
■ To reduce the risk
related to component
failure, power supplies
and fan modules are
installed in pairs to
provide redundancy.
Redundancy ensures
that if one component in
the pair fails, the other
functioning component can
continue to maintain the
subsystem. For example,
power supplies serve a dual
function; they provide both
power and airflow. If one
power supply fails, the
other functioning power
supply can maintain both
the power and the cooling
subsystems.
Investigate the cause of the
overtemperature event, and
replace failed components
immediately. For hardware
troubleshooting information,
see “Troubleshooting
Server Hardware
Faults” on page 25.
Component redundancy
is provided to allow for
component failure in critical
subsystems, such as the
cooling subsystem. However,
once a component in a
redundant system fails,
the redundancy no longer
exists, and the risk for server
shutdown and component
failures increases. Therefore,
it is important to maintain
redundant systems and
replace failed components
immediately.
Troubleshooting Power Issues
If your server does not power on, use the information in the following table to troubleshoot the
issue.
Troubleshooting and Diagnostics37
Page 38
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Power IssueDescriptionActionPrevention
AC Power
Connection
The AC power cords are the
direct connection between the
server power supplies and the
power sources. The server
power supplies need separate
stable AC circuits. Insufficient
voltage levels or fluctuations
in power can cause server
power problems. The power
supplies are designed to
operate at a particular voltage
and within an acceptable range
of voltage fluctuations (refer to
“Electrical Requirements” in
Oracle Servers X7-2 and X72L Installation GuideOracle
Exadata Storage Server
X7-2 EF and HC Installation
Guide).
Power
Supplies
(PSUs)
The server power supply
units (PSUs) provide the
necessary server voltages from
the AC power outlets. If the
power supplies are inoperable,
unplugged, or disengaged from
the internal connectors, the
server cannot power on.
Note - Use the Velcro straps
on the back of the server
to secure the power cord
connectors to the back of
the power supplies. The
Velcro retaining straps
minimize the risk of accidental
disconnection.
Top CoverThe server top cover maintains
the air pressures inside the
server, prevents accidental
exposure to hazardous
voltages, and protects internal
components from physical and
environmental damage.
Verify that both AC power
cords are connected to the
server. Verify that the correct
power is present at the outlets
and monitor the power to
verify that it is within the
acceptable range. You can
verify proper connection and
operation by verifying the
power supply (PS) indicator
panels, which are located at
the back of the server on the
power supplies. Lit green
AC OK indicators show a
properly functioning power
supply. An amber AC OK
indicator indicates that the AC
power to the power supply is
insufficient.
Verify that the AC cables
are connected to both power
supplies. Verify that the power
supplies are operational (the
PSU indicator panel must have
a lit green AC OK indicator).
Ensure that the power supply
is properly installed. A power
supply that is not fully engaged
with its internal connector does
not have power applied and
does not have a lit green AC
OK indicator.
Do not operate the server
without the top cover
installed unless you are
hot-plugging a fan module,
and then ensure that you
complete the operation and
replace the cover within 20
seconds. For instructions
for properly replacing fan
Use the AC power cord Velcro
retaining clips and position
the cords to minimize the risk
of accidental disconnection.
Ensure that the AC circuits that
supply power to the server are
stable and not overburdened.
When a power supply fails,
replace it immediately. To
ensure redundancy, the server
has two power supplies. This
redundant configuration
prevents server downtime,
or an unexpected shutdown,
due to a failed power supply.
The redundancy allows the
server to continue to operate
if one of the power supplies
fails. However, when a
server is being powered by
a single power supply, the
redundancy no longer exists,
and the risk for downtime
or an unexpected shutdown
increases. When installing a
power supply, ensure that it is
fully seated and engaged with
its connector inside the drive
bay. A properly installed power
supply has a lit green AC OK
indicator.
Be careful to avoid bending
or otherwise warping the top
cover.
38Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 39
Troubleshooting With Diagnostic Tools
Power IssueDescriptionActionPrevention
modules, see “Servicing Fan
Modules” on page 73.
For instructions for installing
the server top cover, see
“Install the Server Top
Cover” on page 189.
Managing Server Hardware Faults Using the
Oracle ILOM Fault Management Shell
The Oracle ILOM Fault Management Shell enables you to view and manage fault activity on
managed servers and other types of devices.
For more information about how to use the Oracle ILOM Fault Management Shell, refer to the
Oracle ILOM User's Guide for System Monitoring and Diagnostics Firmware Release 4.0.x in
the Oracle Integrated Lights Out Manager (ILOM) 4.0 Documentation Library at http://www.
oracle.com/goto/ilom/docs.
Troubleshooting With Diagnostic Tools
The server and its accompanying software and firmware contain diagnostic tools and features
that can help you isolate component problems, monitor the status of a functioning system,
and exercise one or more subsystems to disclose more subtle or intermittent hardware-related
problems.
Each diagnostic tool has its own strength and application. Review the tools listed in this section
and determine which tool might be best to use for your situation. After you determine the tool to
use, you can access it locally, while at the server, or remotely.
■
“Diagnostic Tools” on page 39
■
“Diagnostic Tool Documentation” on page 41
Diagnostic Tools
The selection of diagnostic tools available for your server range in complexity from a
comprehensive validation test suite (Oracle VTS) to a chronological event log (Oracle ILOM
Troubleshooting and Diagnostics39
Page 40
Troubleshooting With Diagnostic Tools
System Log). The selection of diagnostic tools also includes standalone software packages,
firmware-based tests, and hardware-based LED indicators.
The following table summarizes the diagnostic tools that you can use when troubleshooting or
monitoring your server.
Diagnostic ToolTypeWhat It DoesAccessibilityRemote Capability
Oracle ILOMSP firmwareMonitors environmental
Hardware-based
LED indicators
Power-On SelfTest (POST)
UEFI
Diagnostics
HWdiagSP firmwareThe HWdiag utility checks
Oracle Solaris
commands
Oracle Linux
commands
Oracle VTSDiagnostic
condition and component
functionality sensors,
generates alerts, performs
fault isolation, and provides
remote access.
Hardware and
SP firmware
Host firmwareTests core components of
SP firmwareUEFI diagnostics can test
Operating
system software
Operating
system software
tool standalone
software
Indicates status of overall
system and particular
components.
system: CPUs, memory,
and motherboard I/O bridge
integrated circuits.
and detect problems on all
processors, memory, disk
drives, and network ports.
the status of a system and
its components. You run the
HWdiag utility within the
Oracle ILOM Diag shell.
Displays various kinds of
system information.
Displays various kinds of
system information.
Exercises and stresses the
system, running tests in
parallel.
Can function on
either Standby power
mode or Main power
mode and is not OS
dependent.
Available when
system power is
available.
Runs on startup.
Available when the
operating system is
not running.
You can use either
the Oracle ILOM
web interface or
the commandline interface
(CLI) to run UEFI
diagnostics.
Can function on
Standby power and
when operating
system is not
running.
Requires operating
system.
Requires operating
system.
Requires the Solaris
operating system.
Install Oracle VTS
software separately.
Designed for remote
and local access.
Local, but sensor
and indicators are
accessible from
Oracle ILOM
web interface or
command-line
interface (CLI).
Local, but can be
accessed through
Oracle ILOM
Remote System
Console Plus.
Remote access
through Oracle
ILOM Remote
System Console
Plus.
Local, but remote
serial access is
possible if the
SP serial port is
connected to a
network-accessible
terminal server.
Local, and over
network.
Local, and over
network.
View and control
over network.
40Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 41
Diagnostic Tool Documentation
The following table identifies where you can find more information about diagnostic tools.
Diagnostic ToolDocumentationLocation
Oracle ILOMOracle Integrated Lights Out
Manager 4.0 Documentation Library
System indicators and sensorsThis document“Troubleshooting Using the Server
UEFI DiagsDiagnostics and Troubleshooting
Guide for Oracle x86 Servers with
Oracle ILOM 4.0
Oracle VTSOracle VTS software and
documentation
Attaching Devices to the Server
Attaching Devices to the Server
http://www.oracle.com/goto/
ilom/docs
Front and Back Panel Status
Indicators” on page 30
http://www.oracle.com/goto/
x86admindiag/docs
https://docs.oracle.com/cd/
E83848_01/
The following sections contain procedural information for attaching devices to the server so you
can access diagnostic tools when troubleshooting and servicing the server:
■
“Attach Devices to the Server” on page 41
■
“Back Panel Connector Locations” on page 42
■
“Configuring Serial Port Sharing” on page 43
■
“Ethernet Device Naming” on page 45
■
“Back Panel Pinhole Switches” on page 45
Attach Devices to the Server
This procedure explains how to connect devices to the server (remotely and locally), so that you
can interact with the service processor (SP) and the server console.
1.
Connect an Ethernet cable to the Gigabit Ethernet (NET) connector as needed for
OS support.
See “Back Panel Connector Locations” on page 42.
Troubleshooting and Diagnostics41
Page 42
Attach Devices to the Server
2.
To connect to the Oracle ILOM service processor over the network, connect an
Ethernet cable to the Ethernet port labeled NET MGT.
See “Back Panel Connector Locations” on page 42.
3.
To access the Oracle ILOM command-line interface (CLI) locally, connect a serial
null modem cable to the RJ-45 serial port labeled SER MGT.
To access the system console, connect the RJ-45 cable to a terminal or terminal emulator, log on
to Oracle ILOM, and type start /HOST/console.
See “Back Panel Connector Locations” on page 42.
Back Panel Connector Locations
The following illustration shows and describes the locations of the back panel connectors. Use
this information to set up the server, access diagnostic tools, and manage the server during
service.
No. Cable Port or Expansion SlotDescription
1Power supply 0 input powerThe server has two power supply connectors, one for each power supply.
42Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 43
Attach Devices to the Server
No. Cable Port or Expansion SlotDescription
Power supply 1 input powerDo not attach power cables to the power supplies until you finish connecting
the data cables to the server. The server goes into Standby power mode, and
the Oracle ILOM service processor initializes when the AC power cables are
connected to the power source. System messages might be lost after 60 seconds
if the server is not connected to a terminal, PC, or workstation.
Note - Oracle ILOM signals a fault on any installed power supply that is not
connected to an AC power source, since it might indicate a loss of redundancy.
2Network management port (NET MGT)The service processor NET MGT port is the optional connection to the Oracle
3Ethernet port (NET 0)The Ethernet port enables you to connect the system to the network. The Ethernet
4USB portThe USB port supports hot-plugging. You can connect and disconnect a USB
5Serial management port (SER MGT)The service processor SER MGT port uses an RJ-45 cable and terminal (or
ILOM service processor. The NET MGT port is configured by default to use
Dynamic Host Configuration Protocol (DHCP). The service processor NET
MGT port uses an RJ-45 cable for a 10/100/1000BASE-T connection.
port uses an RJ-45 cable for a 10/100/1000BASE-T connection.
cable or a peripheral device while the server is running without affecting system
operations.
emulator) to provide access to the Oracle ILOM command-line interface (CLI).
Using Oracle ILOM, you can configure it to connect to the system console.
Note - This port does not support network connections.
Configuring Serial Port Sharing
By default, the service processor (SP) controls the serial management (SER MGT) port and
uses it to redirect the host serial console output. Using Oracle ILOM, you can assign the host
console (COM1) as owner of the SER MGT port output, which allows the host console to
output information directly to the SER MGT port. Serial port sharing is useful for Windows
kernel debugging because you can view non-ASCII character traffic output from the host
console.
Set up the network on the SP before attempting to change the serial port owner to the host
server. If the network is not set up first, and you switch the serial port owner to the host
server, you cannot connect using the CLI or web interface to change the serial port owner
back to the SP. To return the serial port owner setting to the SP, restore access to the serial
port on the server. For details, refer to the Oracle Integrated Lights Out Manager (ILOM) 4.0
Documentation Library at http://www.oracle.com/goto/ilom/docs.
If you accidentally lose access to Oracle ILOM, contact Oracle Service and follow the process
to return the serial port ownership back to the SP.
You can assign serial port output using either Oracle ILOM CLI or web interface, as described
in the following sections:
■
“Assign Serial Port Output Using the Oracle ILOM CLI” on page 44
Troubleshooting and Diagnostics43
Page 44
Assign Serial Port Output Using the Oracle ILOM CLI
■
“Assign Serial Port Output Using the Oracle ILOM Web Interface” on page 44
Assign Serial Port Output Using the Oracle ILOM CLI
1.
Open an SSH session, and at the command line, log in to the SP Oracle ILOM
CLI.
Log in as a user with root or administrator privileges. For example:
ssh root@ipaddress
The ipaddress is the IP address of the server SP.
For more information, refer to “Accessing Oracle ILOM” in the Oracle X7 Series Servers
Administration Guide.
The Oracle ILOM CLI prompt (->) appears.
2.
To set the serial port owner, type:
-> set /System/SP/serial/portsharing owner=host
Note - The serial port sharing value, by default, is owner=SP.
3.
Connect a serial host to the server.
Assign Serial Port Output Using the Oracle ILOM Web
Interface
1.
Log in to the SP Oracle ILOM web interface.
Open a web browser and direct it using the IP address of the server SP. Log in as root or a
user with administrator privileges. Refer to “Accessing Oracle ILOM” in the Oracle X7 SeriesServers Administration Guide.
The Summary Information page appears.
2.
In the Oracle ILOM web interface, on the navigation menu on the left side of the
screen, select ILOM Administration → Connectivity.
3.
Click the Serial Port tab.
The Serial Port Settings page appears.
Note - The serial port sharing setting, by default, is Service Processor.
44Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 45
Assign Serial Port Output Using the Oracle ILOM Web Interface
4.
In the Serial Port Settings page, select HostServer as the serial port owner.
5.
Click Save for the changes to take effect.
6.
Connect a serial host to the server.
Ethernet Device Naming
This section contains information about the device naming for the one 10-Gigabit Ethernet port
(labeled NET 0) on the back panel of the server. For location information, see “Back Panel
Connector Locations” on page 42.
Ethernet Port Device Naming
The device naming for the Ethernet interface is reported differently by different interfaces
and operating systems. The following table shows the BIOS (physical) and operating system
(logical) naming convention for the interface. This naming convention might vary depending on
the conventions of your operating system and which devices are installed in the server.
Note - Naming used by the interfaces might vary from that listed below depending on which
devices are installed in the system.
PortSolarisLinuxWindows
NET 0igb0■ Linux 6 - eth0
■ Linux 7 - enp1
Ethernet
Note - For Windows, port naming such as
Ethernet is used by default. Windows allows
you to rename the ports to meet applicationspecific needs.
Back Panel Pinhole Switches
Oracle Service personnel might instruct you to use the back panel pinhole switches to reset the
NMI, Host Warm Reset, or SP, as shown in the following figure.
Troubleshooting and Diagnostics45
Page 46
Getting Help
CalloutIconDescription
1Non-maskable interrupt
2Host Warm Reset
3SP Reset
†
Oracle Service use only.
Getting Help
The following sections describe how to get additional help to resolve server problems.
■
“Contacting Support” on page 47
■
“Locating the System Serial Number” on page 47
■
“Locating the Exadata Database Machine Rack Serial Numbers and JobID
Label” on page 48
(NMI)
†
46Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 47
Getting Help
Contacting Support
If the troubleshooting procedures in this chapter do not solve your problem, use the following
table to collect information that you might need to communicate to support personnel.
System Configuration
Information Needed
Service contract number
System model
Operating environment
System serial number
Peripherals attached to the system
Email address and phone number
for you and a secondary contact
Street address where the system is
located
Superuser password
Summary of the problem and
the work being done when the
problem occurred
Other Useful Information
IP address
Server name (system host name)
Network or internet domain name
Proxy server configuration
Related Information
Your Information
■
“Locating the System Serial Number” on page 47
Locating the System Serial Number
When you ask for service on your system, you might need to have your server serial number.
Record this number for future use. Use one of the following methods to locate your server serial
number:
■
On the front panel of the server, look to the left side of the chassis and below the status
LEDs.
Troubleshooting and Diagnostics47
Page 48
Getting Help
■
Locate the Customer Information Sheet (CIS) attached to your server packaging. This sheet
includes the serial number.
■
From the Oracle ILOM web interface, go to the System Information → SummaryInformation page in the Oracle ILOM web interface.
■
From the Oracle ILOM CLI, type show /System.
Locating the Exadata Database Machine Rack
Serial Numbers and JobID Label
During a service call, you might need the serial numbers and the JobID label from the Exadata
Database Machine rack. Record these numbers for future use. You can find the system serial
numbers and JobID label in the following physical locations:
■
Look for the Large System Serial Number label and JobID label on the front of the rack,
inside the door, on the left vertical strut starting at rack unit 23 (U23) and going down.
■
Look for the Small System Serial Number label on the top left portion of the cabinet from
the rear above the power distribution unit (PDU).
Related Information
■
“EF Front Panel Components” on page 17
■
“HC Front Panel Components” on page 19
48Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 49
Preparing for Service
These sections describe safety considerations and provide prerequisite procedures and
information about replacing components in the server:
■
“Safety Precautions” on page 49
■
“Safety Symbols” on page 50
■
“Electrostatic Discharge Safety” on page 50
■
“Key Identity Properties (KIP) Automated Update” on page 51
■
“Required Tools” on page 52
■
“Preparing the Server for Component Replacement” on page 52
Related Information
■
“Returning the Server to Operation” on page 187
Safety Precautions
For your protection, observe the following safety precautions when setting up your equipment:
■
Follow all standard cautions, warnings, and instructions marked on the equipment and
described in the Oracle Engineered System Safety and Compliance Guide, ComplianceModel No.: ESY27 and Important Safety Information for Oracle's Hardware Systems.
■
Ensure that the voltage and frequency of your power source match the voltage and
frequency inscribed on the equipment electrical rating label.
■
Follow the electrostatic discharge safety practices, as described in “Electrostatic Discharge
Safety” on page 50.
■
Disconnect both power supply cords (if necessary) before servicing components.
Related Information
■
“Safety Symbols” on page 50
Preparing for Service49
Page 50
Safety Symbols
■
“Electrostatic Discharge Safety” on page 50
Safety Symbols
The following symbols might appear in this document. Note their meanings.
Caution - Risk of personal injury or equipment damage. To avoid personal injury or
equipment damage, follow the instructions.
Caution - Hot surface. Avoid contact. Surfaces are hot and might cause personal injury, if
touched.
Caution - Hazardous voltages are present. To reduce the risk of electric shock and danger to
personal health, follow the instructions.
Related Information
■
“Safety Precautions” on page 49
■
“Electrostatic Discharge Safety” on page 50
Electrostatic Discharge Safety
Devices that are sensitive to electrostatic discharge (ESD), such as the motherboard, PCIe
cards, drives, processors, and memory DIMMs require special handling. See “Contacting
Support” on page 47
Caution - The boards and drives contain electronic components that are extremely sensitive to
static electricity. Ordinary amounts of static electricity from clothing or the work environment
can destroy components. Do not touch the components along their connector edges.
Do the following when handling ESD-sensitive components:
■
Use an antistatic wrist strap.
Wear an antistatic wrist strap when handling components such as drive assemblies, boards,
or cards. When servicing or removing server components, disconnect the power cords from
50Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 51
Key Identity Properties (KIP) Automated Update
the server, and attach an antistatic strap to your wrist and then to a metal area on the chassis.
Then disconnect the power cords from the server. Following this practice equalizes the
electrical potentials between you and the server.
Note - An antistatic wrist strap is not included in the Accessory Kit for the server. However,
antistatic wrist straps are included with components.
■
Use an antistatic mat.
Place ESD-sensitive components such as the motherboard, DIMMS (memory modules),
and other printed circuit board (PCB) cards on an antistatic mat. You can use the following
items as an antistatic mat:
■
An antistatic bag used to wrap an Oracle replacement part
■
An Oracle ESD mat (orderable item)
■
A disposable ESD mat (shipped with some replacement parts or optional system
components)
Related Information
■
“Safety Precautions” on page 49
■
“Safety Symbols” on page 50
■
“Preparing the Server for Component Replacement” on page 52
■
“Returning the Server to Operation” on page 187
Key Identity Properties (KIP) Automated Update
Oracle ILOM includes a key identity properties (KIP) auto-update feature that ensures product
information that is used for service entitlement and warranty coverage is accurately maintained
by the server at all times, including during hardware replacement activities.
The KIP includes the server product name, product part number (PPN), and product serial
number (PSN). The KIP is stored in the replaceable unit identifiers) container of the three server
replaceable units that are designated quorum members.
The quorum members include:
■
Disk backplane (DBP), designated as a primary quorum member.
■
Motherboard (MB), designated as a backup quorum member.
■
Power supply (PS), designated as a backup quorum member.
Preparing for Service51
Page 52
Required Tools
When you remove and replace a server FRU that contains the KIP, Oracle ILOM programs the
KIP of the replacement component to contain the same KIP as the other two components.
Note - Replace only one quorum member at a time. Automated updates are completed only
when two of the three quorum members contain matching key identity properties.
Related Information
■
“Servicing Power Supplies” on page 81
■
“Servicing the Disk Backplane” on page 137
■
“Servicing the Motherboard Assembly” on page 173
Required Tools
The server can be serviced with the following tools:
■
Antistatic wrist strap
■
Antistatic mat
■
No. 2 Phillips screwdriver
■
Torx (6 lobe) T15, T25, and T30 screwdrivers
■
12.0 in-lbs/ft (inch-pounds/foot) torque driver
Related Information
■
“Preparing for Service” on page 49
Preparing the Server for Component Replacement
Note - When you replace the storage drives or power supplies, not all of these procedures are
necessary. For more information, see the replacement procedures for those components.
Before you can remove and install components that are inside the server, you must perform the
procedures in the following sections:
52Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 53
Preparing the Server for Component Replacement
■
“Powering Down the Server” on page 53
■
“Disconnect Cables From the Server” on page 59
■
“Extend the Server to the Maintenance Position” on page 60
■
“Remove the Server From the Rack” on page 62
■
“Take Antistatic Measures” on page 63
■
“Remove the Server Top Cover” on page 64
Related Information
■
“Returning the Server to Operation” on page 187
Powering Down the Server
Determine how you want to power down the server by reviewing the options in the following
table.
Note - Some server components are hot-pluggable, such as storage drives, power supplies, fan
modules, and M.2 flash SSDs. Components that are hot-pluggable do not require the server to
be powered down when performing service procedures.
DescriptionLink
Power down the server gracefully to prevent data from
being corrupted. Performing a graceful shutdown ensures
that the system is ready for restart.
If the server is not responding, or you must shut down
the server quickly, perform an immediate shutdown.
■ “Power Down the Server Gracefully Using the
Oracle ILOM CLI” on page 54
■ “Power Down the Server Gracefully Using the
Oracle ILOM Web Interface” on page 55
■ “Power Down the Server Gracefully Using the On/
Standby Button” on page 56
■ “Power Down the Server for Immediate Shutdown
Using the Oracle ILOM CLI” on page 57
■ “Power Down the Server for Immediate
Shutdown Using the Oracle ILOM Web
Interface” on page 58
■ “Power Down the Server for Immediate Shutdown
Using the On/Standby Button” on page 59
Related Information
■
“Power On the Server” on page 193
Preparing for Service53
Page 54
Power Down the Server Gracefully Using the Oracle ILOM CLI
Power Down the Server Gracefully Using the Oracle ILOM CLI
Performing a graceful shutdown ensures that all of your data is saved and the system is ready
for restart.
1.
Log in to the server as superuser or equivalent.
Depending on the nature of the problem, you might want to view the system status or the log
files or run diagnostics before you shut down the system. For log file information, refer to the
Oracle Integrated Lights Out Manager (ILOM) 4.0 Documentation Collection at http://www.
oracle.com/goto/ilom/docs.
2.
Notify affected users that the server is going to be powered down.
3.
Save any open files, and quit all running applications.
For specific information about these processes, refer to your application documentation.
4.
Log in to the Oracle ILOM command-line interface (CLI) using an Administrator
account.
For instructions, refer to the Oracle Exadata Database Machine Installation and Configuration
Guide at https://docs.oracle.com/cd/E80920_01/index.htm.
5.
At the Oracle ILOM prompt, shut down the operating system:
-> stop /System
Note - The stop /System command does not guarantee a system shutdown. This command
is only a request to the host operating system (OS) to perform a system shutdown. To ensure
a system shutdown, see “Power Down the Server for Immediate Shutdown Using the Oracle
ILOM CLI” on page 57.
If the system is running Oracle Solaris, refer to the Oracle Solaris system administration
documentation for additional information.
The host server performs an orderly power shutdown to Standby power mode, and the
System OK LED on the front panel begins blinking. See “Server System-Level Status
Indicators” on page 31.
6.
Disconnect the power and cables from the server.
See “Disconnect Cables From the Server” on page 59.
54Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 55
Power Down the Server Gracefully Using the Oracle ILOM Web Interface
Caution - When you power down the server using Oracle ILOM, the server enters Standby
power mode. Power is still directed to the service processor remote management subsystem and
power supply fans. To completely power off the server, disconnect the power cords from the
power supplies.
Related Information
■
“Power Down the Server Gracefully Using the Oracle ILOM Web Interface” on page 55
■
“Power Down the Server Gracefully Using the On/Standby Button” on page 56
■
“Power Down the Server for Immediate Shutdown Using the On/Standby
Button” on page 59
■
“Power On the Server” on page 193
Power Down the Server Gracefully Using the Oracle ILOM
Web Interface
Performing a graceful shutdown ensures that all of your data is saved and the system is ready
for restart.
1.
Log in to the server as superuser or equivalent.
Depending on the nature of the problem, you might want to view the system status or the log
files or run diagnostics before you shut down the system. For log file information, refer to the
Oracle Integrated Lights Out Manager (ILOM) 4.0 Documentation Collection at http://www.
oracle.com/goto/ilom/docs.
2.
Notify affected users that the server is going to be powered down.
3.
Save any open files, and quit all running applications.
For specific information about these processes, refer to your application documentation.
4.
Log in to the Oracle ILOM web interface using an Administrator account.
For instructions, refer to the Oracle Exadata Database Machine Installation and Configuration
Guide at https://docs.oracle.com/cd/E80920_01/index.htm.
5.
In the left pane, click Host Management → Power Control, and from the Select
Action list, click Graceful Shutdown and Power Off.
6.
Click Save, and click OK.
Preparing for Service55
Page 56
Power Down the Server Gracefully Using the On/Standby Button
The host server performs an orderly power shutdown to Standby power mode, and the
System OK LED on the front panel begins blinking. See “Server System-Level Status
Indicators” on page 31.
7.
Disconnect the power cords and data cables from the server.
See “Disconnect Cables From the Server” on page 59.
Caution - When you power down the server using Oracle ILOM, the server enters Standby
power mode. Power is still directed to the service processor remote management subsystem and
power supply fans. To completely power off the server, disconnect the power cords from the
power supplies.
Related Information
■
“Power Down the Server Gracefully Using the Oracle ILOM CLI” on page 54
■
“Power Down the Server Gracefully Using the On/Standby Button” on page 56
■
“Power Down the Server for Immediate Shutdown Using the On/Standby
Button” on page 59
■
“Power On the Server” on page 193
Power Down the Server Gracefully Using the On/Standby
Button
1.
Press and quickly release the On/Standby button on the front panel.
This action causes ACPI-enabled operating systems to perform an orderly shutdown of the
operating system. Servers not running ACPI-enabled operating systems shut down to Standby
power mode immediately.
When Main power is off, the System OK LED on the front panel begins blinking,
indicating that the server is in Standby power mode. See “Server System-Level Status
Indicators” on page 31.
2.
Disconnect the power cords and data cables from the server.
See “Disconnect Cables From the Server” on page 59.
Caution - When you power down the server using the On/Standby button, the server enters
Standby power mode. Power is still directed to the service processor remote management
subsystem and power supply fans. To completely power off the server, disconnect the power
cords from the power supplies.
56Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 57
Power Down the Server for Immediate Shutdown Using the Oracle ILOM CLI
Related Information
■
“Server System-Level Status Indicators” on page 31
■
“Power Down the Server Gracefully Using the Oracle ILOM CLI” on page 54
■
“Power Down the Server Gracefully Using the Oracle ILOM Web Interface” on page 55
■
“Power On the Server” on page 193
Power Down the Server for Immediate Shutdown Using the
Oracle ILOM CLI
Caution - This procedure quickly forces the server Main power off. An immediate power-off
might corrupt the system data. Therefore, use this procedure to power down the server only
after attempting the graceful power-down procedure.
1.
Log in to the Oracle ILOM command-line interface (CLI) using an Administrator
account.
Oracle ILOM displays the default command prompt (->), indicating that you have successfully
logged in to Oracle ILOM.
2.
From the CLI prompt, type the following command:
-> stop -f /System
The server powers down immediately to Standby power mode, and the System OK LED on the
front panel begins blinking. See “Server System-Level Status Indicators” on page 31.
3.
Disconnect the power and data cables from the server.
See “Disconnect Cables From the Server” on page 59.
Caution - When you power down the server using Oracle ILOM, the server enters Standby
power mode. Power is still directed to the service processor remote management subsystem and
power supply fans. To completely power off the server, you must disconnect the power cords
from the power supplies.
Related Information
■
“Power Down the Server for Immediate Shutdown Using the On/Standby
Button” on page 59
■
“Power Down the Server for Immediate Shutdown Using the Oracle ILOM Web
Interface” on page 58
Preparing for Service57
Page 58
Power Down the Server for Immediate Shutdown Using the Oracle ILOM Web Interface
■
“Power On the Server” on page 193
Power Down the Server for Immediate Shutdown Using the
Oracle ILOM Web Interface
Caution - This procedure quickly forces the server Main power off. An immediate power-off
might corrupt the system data. Therefore, use this procedure to power down the server only
after attempting the graceful power-down procedure.
1.
Log in to the Oracle ILOM web interface using an Administrator account.
The Oracle ILOM web interface System Information → Summary Information page appears.
2.
In the left pane, click Host Management → Power Control, and in the Select
Action list, click Immediate Power Off.
3.
Click Save, and click OK.
The server powers down immediately to Standby power mode, and the System OK LED on the
front panel begins blinking. See “Server System-Level Status Indicators” on page 31.
4.
Disconnect the power and data cables from the server.
See “Disconnect Cables From the Server” on page 59.
Caution - When you power down the server using Oracle ILOM, the server enters Standby
power mode. Power is still directed to the service processor remote management subsystem and
power supply fans. To completely power off the server, you must disconnect the power cords
from the power supplies.
Related Information
■
“Power Down the Server for Immediate Shutdown Using the On/Standby
Button” on page 59
■
“Power Down the Server for Immediate Shutdown Using the Oracle ILOM
CLI” on page 57
■
“Power On the Server” on page 193
58Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 59
Power Down the Server for Immediate Shutdown Using the On/Standby Button
Power Down the Server for Immediate Shutdown Using the
On/Standby Button
Caution - When you power down the server using Oracle ILOM, the server enters Standby
power mode. Power is still directed to the service processor remote management subsystem and
power supply fans. To completely power off the server, you must disconnect the power cords
from the power supplies.
1.
Press and hold the On/Standby button for four seconds to force the Main power
off and to enter Standby power mode.
When Main power is off, the System OK LED on the front panel begins blinking,
indicating that the server is in Standby power mode. See “Server System-Level Status
Indicators” on page 31.
2.
Disconnect the power cords and data cables from the server.
See “Disconnect Cables From the Server” on page 59.
Caution - When you power down the server using the On/Standby button, the server enters
Standby power mode. Power is still directed to the service processor remote management
subsystem and power supply fans. To completely power off the server, you must disconnect the
power cords from the power supplies.
Related Information
■
“Server System-Level Status Indicators” on page 31
■
“Power Down the Server for Immediate Shutdown Using the Oracle ILOM
CLI” on page 57
■
“Power Down the Server for Immediate Shutdown Using the Oracle ILOM Web
Interface” on page 58
■
“Power On the Server” on page 193
Disconnect Cables From the Server
Caution - The system supplies Standby power to the circuit boards even when the system is
powered off.
Preparing for Service59
Page 60
Extend the Server to the Maintenance Position
1.
Label all cables connected to the server.
2.
Power down the server.
See “Powering Down the Server” on page 53.
3.
Disconnect the power cords from the back of the server.
4.
Disconnect all data cables from the back of the server.
5.
If your rackmount kit includes a cable management device, remove the cables
from it.
6.
Depending on the components that you are servicing, either extend the server to
the maintenance position, or remove the server from the rack.
Related Information
■
“EF Back Panel Components and Cable Connections” on page 18
■
“Powering Down the Server” on page 53
■
“Extend the Server to the Maintenance Position” on page 60
■
“Remove the Server From the Rack” on page 62
■
“Reconnect Power and Data Cables” on page 192
Extend the Server to the Maintenance Position
You can service the following components with the server in the maintenance position:
■
Storage drives
■
Fan modules
■
Power supplies
■
DDR4 DIMMs
■
PCIe cards and cables
■
SAS cables
■
M.2 Flash drives
■
System battery
■
Processors
■
Disk backplane
■
Front indicator module
■
Motherboard
60Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 61
Extend the Server to the Maintenance Position
If the server is installed in a rack with extendable slide-rails, use this procedure to extend the
server to the maintenance position.
1.
To prevent the rack from tipping forward when the server is extended, extend all
rack anti-tilt mechanisms.
For instructions for stabilizing the rack, refer to “Stabilize the Rack” in Oracle Exadata Storage
Server X7-2 EF and HC Installation Guide. Also refer to the Oracle Rack Cabinet User's Guide
at https://docs.oracle.com/cd/E85660_01/index.html.
2.
Verify that no cables are damaged or are going to interfere when the server is
extended.
Although the cable management arm (CMA) that is supplied with the server is hinged to
accommodate extending the server, ensure that all cables and cords are capable of extending.
3.
From the front of the server, open and hold the left and right release latch covers
in the open position.
When in an open position, the release latch covers engage the slide-rail release latches [1].
Caution - Deploy any rack anti-tilt mechanism before releasing the slide-rail release latches.
4.
While the release latch covers are in the open position, slowly pull the server
forward until the slide-rails latch into a locked position [2].
Preparing for Service61
Page 62
Remove the Server From the Rack
The server is now in the extended maintenance position.
Related Information
■
“Disconnect Cables From the Server” on page 59
■
“Remove the Server From the Rack” on page 62
■
“Reinstall the Server Into the Rack” on page 190
Remove the Server From the Rack
Caution - The EF server weighs approximately 22.8 kg (50.2 lb). The HC server weighs
approximately 30.5 kg (67.1 lb). Two people are required to unmount and carry the chassis.
1.
Disconnect all the cables and power cords from the server.
2.
Remove the cable management arm (CMA).
For instructions for removing the CMA, see the “Remove the Cable Management Arm” in
Oracle Exadata Storage Server X7-2 EF and HC Installation Guide.
3.
Extend the server to the maintenance position.
See “Extend the Server to the Maintenance Position” on page 60.
4.
From the front of the server, pull the green slide-rail release tabs toward the front
of the server, and pull the server out of the rack until it is free of the rack rails [1
and 2].
62Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 63
A slide-rail release tab is located on each slide-rail.
5.
Set the server on a sturdy work surface.
Related Information
Take Antistatic Measures
■
“Disconnect Cables From the Server” on page 59
■
“Extend the Server to the Maintenance Position” on page 60
■
“Reinstall the Server Into the Rack” on page 190
Take Antistatic Measures
1.
Prepare an antistatic surface on which to set parts during removal and
installation.
Place electrostatic discharge (ESD)-sensitive components on an antistatic mat. You can use the
following items as an antistatic mat:
■
An antistatic bag used to wrap a replacement part
■
An Oracle ESD mat (orderable item)
■
A disposable ESD mat (shipped with some replacement parts or optional system
components)
2.
Attach an antistatic wrist strap.
When servicing or removing server components, attach an antistatic strap to your wrist and then
to a metal area on the chassis. Then perform the procedures for servicing the server.
Preparing for Service63
Page 64
Remove the Server Top Cover
Note - An antistatic wrist strap is not included in the Accessory Kit for the server. However,
antistatic wrist straps might be included with options and components.
Related Information
■
“Extend the Server to the Maintenance Position” on page 60
■
“Remove the Server From the Rack” on page 62
■
“Remove the Server Top Cover” on page 64
■
“Remove Antistatic Measures” on page 190
Remove the Server Top Cover
Note - Servicing some components requires that the top cover be removed.
1.
Ensure that AC power cords are disconnected from the server power supplies.
Note - Fan modules and M.2 flash SSDs are hot-pluggable and do not require AC power cables
to be disconnected from the server power supplies.
64Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 65
Remove the Server Top Cover
2.
If the release button latch is in the locked position, use a Torx T10 screwdriver to
turn the release button latch clockwise to the unlocked position.
3.
Unlatch the top cover [1].
Lift up on the release button on top of the server cover. Lifting the release button causes the
server cover to slide toward the rear of the chassis for easy removal.
4.
Lift up and remove the top cover [2].
Preparing for Service65
Page 66
Remove the Server Top Cover
Related Information
■
“Take Antistatic Measures” on page 63
■
“Install the Server Top Cover” on page 189
66Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 67
Servicing Storage Drives
This section describes how to service storage drives only in the HC server. The EF server
does not support storage drives. The storage drive slots in the EF server contain filler panels.
Storage drives are replaceable units that do not require you to power off the server. For
more information about replaceable units, see “Illustrated Parts Breakdown” on page 22 and
“Replaceable Units” on page 23.
■
“Storage Drives Hot-Plug Conditions” on page 67
■
“Storage Drive Locations and Numbering” on page 68
■
“Removing and Replacing a Storage Drive” on page 69
Related Information
■
“Servicing SAS Cables” on page 167
Storage Drives Hot-Plug Conditions
The hard-disk drives (HDDs) that are installed in the HC server are hot-pluggable, but this
capability depends on how you configure the drives. To hot-plug a drive, you must take the
drive offline before you can remove it. When you take the drive offline, you prevent any
application from accessing the drive and remove the logical software links to the drive.
The following conditions inhibit the ability to perform hot-plugging of a drive:
■
The drive provides the operating system, and the operating system is not mirrored on
another drive.
■
The drive cannot be logically isolated from the online operations of the server.
If either of the disk drive conditions applies, shut down the system before you replace the drive.
See “Powering Down the Server” on page 53.
Note - Replacing a drive does not require removing the server from a rack.
Servicing Storage Drives67
Page 68
Storage Drive Locations and Numbering
Related Information
■
“Storage Drive Status Indicators” on page 33
■
“Removing and Replacing a Storage Drive” on page 69
Storage Drive Locations and Numbering
The following illustration shows the location of the drives and the internal system software
designations for the drives. For information on storage drive status indicators (LEDs), see
“Storage Drive Status Indicators” on page 33.
CalloutDescription
1Storage drive 0 HDD
2Storage drive 4 HDD
3Storage drive 8 HDD
4Storage drive 1 HDD
5Storage drive 5 HDD
6Storage drive 9 HDD
7Storage drive 2 HDD
8Storage drive 6 HDD
9Storage drive 10 HDD
10Storage drive 3 HDD
68Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 69
CalloutDescription
11Storage drive 7 HDD
12Storage drive 11 HDD
Removing and Replacing a Storage Drive
The following sections describe how to remove and replace an HDD storage drive.
■
“Remove a Storage Drive” on page 69
■
“Install a Storage Drive” on page 71
Remove a Storage Drive
1.
Prepare the system for the drive removal.
Refer to “Take Antistatic Measures” on page 63.
Removing and Replacing a Storage Drive
2.
Identify the location of the drive that you want to remove.
For storage drive locations, see “Storage Drive Locations and Numbering” on page 68.
3.
Remove the storage drive.
If you are removing an HDD storage drive, type the required operating system commands to
stop the server from using the drive; otherwise, proceed to the next step.
The exact commands required depend on the configuration of your drives. Unmount file
systems or issue RAID commands, as needed.
Note - The blue OK to Remove status indicator LED on the drive might not light, as support for
this varies depending on the operating system in use. For information on storage drive status
indicators (LEDs), refer to “Storage Drive Status Indicators” on page 33.
Servicing Storage Drives69
Page 70
Remove a Storage Drive
4.
Push the latch release button to open the drive latch [1, 2].
CalloutDescription
1Pressing the latch release
2Opening the latch.
button.
Caution - The latch is not an ejector. Do not open the latch too far to the right. Doing so can
damage the latch.
70Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 71
5.
Grasp the latch and pull the drive out of the drive slot.
Install a Storage Drive
6.
Consider your next steps:
■
If you are replacing the drive, continue to “Install a Storage Drive” on page 71.
■
If you are not replacing the drive, install a drive slot filler in the empty drive slot to maintain
proper airflow, and perform administrative tasks to configure the server to operate without
the drive.
For information on how to install a storage drive filler panel, see “Remove and Install Filler
Panels” on page 187.
Related Information
■
“Storage Drive Status Indicators” on page 33
■
“Storage Drives Hot-Plug Conditions” on page 67
■
“Install a Storage Drive” on page 71
Install a Storage Drive
1.
Remove the replacement drive from its packaging, and place the drive on an
antistatic mat.
Servicing Storage Drives71
Page 72
Install a Storage Drive
2.
3.
4.
If necessary, remove the drive filler panel. See “Remove and Install Filler
Panels” on page 187.
Align the replacement drive with the drive slot.
The drive is physically addressed according to the slot in which it is installed. It is important to
install a replacement drive in the same slot as the drive that was removed.
Slide the drive into the slot until the drive is fully seated.
5.
Close the drive latch to lock the drive in place.
6.
Perform administrative procedures to reconfigure the drive.
The procedures that you perform at this point depend on how you configured your data. You
might need to partition the drive, create file systems, load data from backups, or have the drive
updated from a RAID configuration.
Related Information
■
“Storage Drive Status Indicators” on page 33
■
“Storage Drives Hot-Plug Conditions” on page 67
■
“Remove a Storage Drive” on page 69
72Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 73
Servicing Fan Modules
This section describes how to service fan modules. Fan modules are replaceable units that
do not require you to power off the server. For more information about replaceable units, see
“Illustrated Parts Breakdown” on page 22 and “Replaceable Units” on page 23.
The fan modules are located in the front of the server. See the following procedures:
■
“Remove a Fan Module” on page 73
■
“Install a Fan Module” on page 75
■
“Remove the Fan Tray” on page 77
■
“Install the Fan Tray” on page 78
Related Information
■
“Remove the Server Top Cover” on page 64
Remove a Fan Module
You do not have to power off the server to service fan modules.
Do not begin this procedure unless you have a replacement fan module and you are ready to
install it right away.
Caution - When removing and replacing a fan module in the Oracle Exadata Storage Server
X7-2, complete the entire procedure within 60 seconds to maintain adequate cooling within the
system. Remove and replace only one fan module at a time. Ensure that you have obtained the
replacement fan module and that is it is ready for installation before starting the replacement
procedure. Each fan module contains two fans, with one fan motor per fan. The two fan motors
provide separate tachometer signals so that the fan module reports two tach signals to Oracle
ILOM. Even if only one fan motor is faulted within the fan module, the Oracle ILOM service
processor detects that two fan motors have failed to spin while the fan module is removed.
If you do not replace the fan module within 20 seconds of removal, Oracle ILOM takes the
protective action to shut down the system to prevent thermal damage to the system.
Servicing Fan Modules73
Page 74
Remove a Fan Module
1.
Extend the server to the maintenance position.
See “Extend the Server to the Maintenance Position” on page 60.
2.
To access the fan modules, remove the server top cover.
See “Remove the Server Top Cover” on page 64.
3.
Identify the faulty fan module.
Each fan module has a fan status indicator (LED) that is located next to the module. If the LED
is amber, the fan has failed. The location of the fan status LED is shown in the following figure.
LED Color and State Meaning
Amber – The fan module is faulty. The front Top Fan LED and the front and back panel Fault-Service Required
LEDs also are lit if the system detects a fan module fault.
74Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 75
Install a Fan Module
4.
Using your forefinger and thumb, lift the fan module straight up and out of the
chassis and set it aside on an antistatic mat.
5.
Consider your next step:
■
If you removed the fan assembly as part of another procedure, return to that procedure.
■
Otherwise, continue to “Install a Fan Module” on page 75.
Related Information
■
“Install a Fan Module” on page 75
Install a Fan Module
1.
Remove the replacement fan module from its packaging, and place it on an
antistatic mat.
2.
With the server top cover removed, install the replacement fan module into the
server.
Servicing Fan Modules75
Page 76
Install a Fan Module
The fan modules are keyed to ensure that they are installed in the correct orientation.
3.
Press down on the fan module to fully seat the fan module.
4.
Verify that the fan module status indicator (LED) on the replacement fan module
is not illuminated.
5.
Install the server top cover.
See “Install the Server Top Cover” on page 189.
6.
Verify that the Top Fan Fault LED on the front of the server and the Fault-Service
Required LEDs on the front and back of the server are off.
See “Server System-Level Status Indicators” on page 31 for more information about identifying
and interpreting system status indicators.
7.
Consider your next step:
■
If you installed the fan module as part of another procedure, return to that procedure.
■
Otherwise, if you powered off the server, return the server to operation. See “Returning the
Server to Operation” on page 187.
Related Information
■
“Remove a Fan Module” on page 73
76Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 77
Remove the Fan Tray
Perform this procedure only when servicing the following replaceable units:
■
Disk backplane
■
SAS cables
■
Motherboard assembly
1.
Using a Torx T25 screwdriver, loosen the three spring-mounted screws that
secure the fan tray to the server chassis [1].
Remove the Fan Tray
2.
Lift the fan tray from the server [2].
3.
Continue with the next step in the service procedure.
Servicing Fan Modules77
Page 78
Install the Fan Tray
Related Information
■
“Remove a Fan Module” on page 73
■
“Servicing the Disk Backplane” on page 137
■
“Servicing SAS Cables” on page 167
■
“Servicing the Motherboard Assembly” on page 173
Install the Fan Tray
Perform this procedure only when servicing the following replaceable units:
■
Disk backplane
■
SAS cables
■
Motherboard assembly
78Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 79
1.
Lower the fan tray into the server [1].
Install the Fan Tray
2.
Using a Torx T25 screwdriver, tighten the three spring-mounted screws that
secure the fan tray to the server chassis [2].
3.
Continue with the next step in the service procedure.
Related Information
■
“Install a Fan Module” on page 75
■
“Servicing the Disk Backplane” on page 137
■
“Servicing SAS Cables” on page 167
■
“Servicing the Motherboard Assembly” on page 173
Servicing Fan Modules79
Page 80
80Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 81
Servicing Power Supplies
This section describes how to service power supplies. Power supplies are replaceable units that
do not require you to power off the server. For more information about replaceable units, see
“Illustrated Parts Breakdown” on page 22 and “Replaceable Units” on page 23.
The power supplies are located at the back of the server. See the following procedures:
■
“Power Supply Overview” on page 81
■
“Remove a Power Supply” on page 82
■
“Install a Power Supply” on page 84
Power Supply Overview
The server redundant power supplies support concurrent maintenance, which allows you to
remove and replace a power supply without shutting down the server, provided that the other
power supply is online and working.
The server supports model A266 (1200 watt) power supplies. The A266 power supply unit
(PSU) provides conversion from the AC lines to the system, accepting ranges from 100-240
volts AC (VAC). These PSUs are designed to be hot-pluggable, and in most cases, provide fully
redundant “1+1” power, allowing the system to suffer the loss of a PSU or an AC feed with no
loss to system availability.
In maximally configured systems, it is possible that the worst-case power consumption of the
system could exceed the capacity of a single PSU. The PSUs provide an over-subscription
mode, which allows the system to operate with fault-tolerance even with modest excursions
beyond the rated capacity of a single PSU. This over-subscription support is accomplished
using hardware signaling between the PSU and motherboard circuitry, which can force
the system to throttle processor (CPU) and memory power in the event that a PSU is lost.
The resulting power savings is enough to allow the system to continue to run (in a lowerperformance state) until the power problem is resolved.
The following status indicators (LEDs) are lit when a power supply fault is detected:
Servicing Power Supplies81
Page 82
Remove a Power Supply
■
Front and back Fault-Service Required LEDs
■
Amber Fault-Service Required LED on the faulty power supply
■
Back Power Supply Fault-Service Required LED located on the server front panel
For more information, see “Power Supply Status Indicators” on page 34.
If a power supply fails and you do not have a replacement available, leave the failed power
supply installed to ensure proper airflow in the server.
Related Information
■
“Troubleshooting Using the Server Front and Back Panel Status Indicators” on page 30
Remove a Power Supply
1.
Prepare the system for power supply removal.
Refer to “Take Antistatic Measures” on page 63.
2.
Identify which power supply requires replacement.
82Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 83
Remove a Power Supply
CalloutDescription
1Power supply 0
2Power supply 1
A lit amber Fault-Service Required LED on a power supply indicates that a failure was
detected. You also can use the Oracle ILOM showfaulty command at the Oracle ILOM
command-line prompt (->) to identify a power supply failure.
Alternatively, to list all known faults in the server, log in to Oracle Solaris and issue the fmadmfaulty command, or log in to the Oracle ILOM service processor from the Oracle ILOM Fault
Management Shell and issue the fmadm faulty command. For more information about how to
use the Oracle ILOM Fault Management Shell and supported commands, refer to the OracleILOM User's Guide for System Monitoring and Diagnostics Firmware Release 4.0.x in the
Oracle Integrated Lights Out Manager (ILOM) 4.0 Documentation Library at http://www.
oracle.com/goto/ilom/docs.
Note - The fans of a failed power supply might still be spinning when the system is powered on.
The fans stop spinning when you disconnect the power cords.
3.
Gain access to the rear of the server where the faulty power supply is located.
4.
If the cable management arm (CMA) is installed, disconnect both CMA left-side
connectors, and move the CMA out of the way.
For instructions for disconnecting the CMA left-side connectors, refer to “Remove the Cable
Management Arm” in Oracle Exadata Storage Server X7-2 EF and HC Installation Guide.
Caution - When disconnecting the CMA left-side connectors, be sure to use your arm to
support the CMA so that it does not hang down under its own weight and stress the right-side
connectors; otherwise, the CMA might be damaged. You must continue to support the CMA
until you have reconnected both of the left-side connectors.
5.
Disconnect the power cord from the faulty power supply.
Servicing Power Supplies83
Page 84
Install a Power Supply
6.
Grasp the power supply handle and push the power supply latch to the left.
7.
Pull the power supply out of the chassis.
Caution - When you remove a power supply, replace it with another power supply; otherwise,
the server might overheat due to improper airflow.
8.
Continue to “Install a Power Supply” on page 84.
Related Information
■
“EF Back Panel Components and Cable Connections” on page 18
■
“Power Supply Status Indicators” on page 34
■
“Install a Power Supply” on page 84
Install a Power Supply
Caution - Always replace the failed power supply with the same type model of power supply.
84Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 85
Install a Power Supply
1.
Remove the replacement power supply from its packaging, and place it on an
antistatic mat.
2.
Align the replacement power supply with the empty power supply slot.
3.
Slide the power supply into the slot until it is fully seated.
You hear an audible click when the power supply fully seats.
4.
Reconnect the power cord to the power supply.
5.
Verify that the amber Fault-Service Required LED on the replaced power supply
and the Fault-Service Required LEDs are not lit on the front and back panels.
Note - After you replace Power Supply 0, you might need to reset the Oracle ILOM service
processor (SP) to propagate the key identity properties (KIP) data to the new power supply. For
instructions on resetting the SP, refer to the Oracle Integrated Lights Out Manager (ILOM) 4.0Configuration and Maintenance Guide Firmware Release 4.0.x at http://www.oracle.com/
goto/ilom/docs. Power Supply 1 does not contain KIP data, and therefore does not require an
SP reset after replacement.
6.
If you disconnected the two CMA left-side connectors, reconnect the
connectors.
Servicing Power Supplies85
Page 86
Install a Power Supply
For instructions for reconnecting the CMA left-side connectors, refer to “Install the Cable
Management Arm (Optional)” in Oracle Exadata Storage Server X7-2 EF and HC Installation
Guide.
Related Information
■
“EF Back Panel Components and Cable Connections” on page 18
■
“Power Supply Status Indicators” on page 34
■
“Remove a Power Supply” on page 82
86Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 87
Servicing the Internal M.2 Flash SSDs
This section describes how to service M.2 flash solid-state drives (SSDs). M.2 flash SSDs
are replaceable units that do not require you to power off the server. For more information
about replaceable units, see “Illustrated Parts Breakdown” on page 22 and “Replaceable
Units” on page 23.
Caution - These procedures require that you handle components that are sensitive
to electrostatic discharge. This sensitivity can cause the component to fail. To avoid
damage, ensure that you follow antistatic practices as described in “Take Antistatic
Measures” on page 63.
This section covers the following procedures:
■
“Remove a Flash Riser Board” on page 87
■
“Install a Flash Riser Board” on page 89
■
“Identify and Remove an M.2 Flash SSD” on page 91
■
“Install an M.2 Flash SSD” on page 93
Related Information
■
“Servicing Storage Drives” on page 67
Remove a Flash Riser Board
The server can be equipped with up to two hot-pluggable flash riser boards. Each flash riser
board contains an M.2 flash SSD.
1.
Prepare the server for service.
a.
Extend the server into maintenance position.
See “Extend the Server to the Maintenance Position” on page 60.
Servicing the Internal M.2 Flash SSDs87
Page 88
Remove a Flash Riser Board
b.
Attach an antistatic wrist strap to your wrist, and then to a metal area on the
chassis. See “Take Antistatic Measures” on page 63.
c.
Remove the server top cover. See “Remove the Server Top
Cover” on page 64.
2.
To remove the flash riser board, do the following:
a.
Rotate both riser board socket ejectors outward as far as they go.
Note - The green power LED on the riser board is extinguished when you open the socket
ejectors.
b.
Carefully lift the riser board straight up to remove it from the sockets.
Place the riser board on an antistatic mat. If required, perform the procedures for removing
the M.2 flash SSD from the flash riser board. For instructions, see “Identify and Remove
an M.2 Flash SSD” on page 91.
Related Information
■
“Install a Flash Riser Board” on page 89
88Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 89
Install a Flash Riser Board
The system motherboard provides two slots (labeled 0 M.2 and 1 M.2) for flash riser boards.
When installing flash riser boards, install the first riser board into slot 0 M.2 and then into 1M.2.
1.
Unpack the replacement flash riser board and place it on an antistatic mat.
If required, ensure that you installed an M.2 flash SSD onto the riser board before installing the
riser board into the server. For instructions, see “Install an M.2 Flash SSD” on page 93.
2.
Install the flash riser board.
a.
Ensure that the ejector tabs are in the open position.
b.
Align the notch in the replacement riser board with the connector key in the
connector socket.
The notch ensures that the riser board is oriented correctly.
Install a Flash Riser Board
c.
Push the riser board into the connector socket until the riser board is
securely seated in the socket.
Servicing the Internal M.2 Flash SSDs89
Page 90
Install a Flash Riser Board
Caution - If the riser board does not easily seat into the connector socket, verify that the notch
in the riser board is aligned with the connector key in the connector socket. If the notch is not
aligned, damage to the riser board might occur.
d.
Rotate the riser board socket ejectors inward until the ejector tabs lock the
riser board in place.
3.
(Optional) Repeat Step 2 to install a second flash riser board into the server.
4.
Return the server to operation.
a.
Install the server top cover.
See “Install the Server Top Cover” on page 189.
b.
Return the server to the normal rack position.
See “Return the Server to the Normal Rack Position” on page 191.
Related Information
■
“Remove a Flash Riser Board” on page 87
90Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 91
Identify and Remove an M.2 Flash SSD
1.
Prepare the server for service.
a.
Extend the server into maintenance position.
See “Extend the Server to the Maintenance Position” on page 60.
b.
Attach an antistatic wrist strap to your wrist, and then to a metal area on the
chassis.
See “Take Antistatic Measures” on page 63.
c.
Remove the server top cover.
See “Remove the Server Top Cover” on page 64.
2.
Identify and note the location of the faulty M.2 flash SSD by pressing the Fault
Remind button on the motherboard.
See “Using the Server Fault Remind Button” on page 104.
Faulty SSDs are identified with a corresponding amber LED on the flash riser board.
Identify and Remove an M.2 Flash SSD
■
If the SSD fault LED is off, then the SSD is operating properly.
■
If the SSD fault LED is on (amber), then the SSD is faulty and must be replaced [1].
Note - The green Power LED indicator is illuminated when power is being applied to the
system. The green Power LED indicator is extinguished when you open the socket ejectors.
Servicing the Internal M.2 Flash SSDs91
Page 92
Identify and Remove an M.2 Flash SSD
3.
Perform the steps to remove the flash riser board containing the M.2 flash SSD
from the server.
For instructions, see “Remove a Flash Riser Board” on page 87. Place the riser board on an
antistatic mat.
4.
Using your fingers, push the plastic retainer clip through the flash riser board to
release the M.2 flash SSD from the riser board. [1].
5.
Remove the plastic retainer clip from the M.2 flash SSD.
6.
Remove the M.2 flash SSD from the riser board [2].
a.
Lift up on the end of the M.2 flash SSD where the plastic retainer clip was
removed by one-half to one inch.
b.
Gently slide the M.2 flash SSD card toward the back of the riser board and
out of the connector to disengage the SSD contacts from the riser board
socket.
Place the M.2 flash SSD on an antistatic mat.
Related Information
■
“Install an M.2 Flash SSD” on page 93
92Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 93
Install an M.2 Flash SSD
Note - Install the M.2 flash SSD only in the socket labeled SSD0.
1.
Unpack the replacement M.2 flash SSD.
2.
Install the M.2 flash SSD.
a.
Align and then insert the plastic retainer clip into the slot on the M.2 flash
SSD.
b.
Hold the M. 2 flash SSD so that the end of the card containing the plastic
retainer clip is approximately one-half to one inch above the riser board and
the other end is adjacent to the riser SSD socket [1].
Install an M.2 Flash SSD
c.
Gently slide the M.2 flash SSD forward and into the connector to engage the
flash SSD contacts with the riser board socket [2].
d.
Using your fingers, push down on the end of the card containing the plastic
retainer clip until the M.2 flash SSD snaps into place on the riser board [2].
You hear an audible click when the M.2 flash SSD is secured to the flash riser board.
Servicing the Internal M.2 Flash SSDs93
Page 94
Install an M.2 Flash SSD
3.
Perform the steps to install the M.2 flash riser board with the replacement flash
SSD into the server.
For instructions, see “Install a Flash Riser Board” on page 89.
4.
Return the server to operation.
a.
b.
Related Information
■
Install the server top cover.
See “Install the Server Top Cover” on page 189.
Return the server to the normal rack position.
See “Return the Server to the Normal Rack Position” on page 191.
“Identify and Remove an M.2 Flash SSD” on page 91
94Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 95
Servicing the Air Baffle
This section describes how to service the air baffle. The air baffle is a replaceable unit that
requires you to power off the server. For more information about replaceable units, see
“Illustrated Parts Breakdown” on page 22 and “Replaceable Units” on page 23.
Caution - To prevent the system from overheating, ensure that the air baffle is correctly
installed before powering on the system.
Caution - You must disconnect all power cables from the system before performing these
procedures.
This section covers the following procedures:
■
“Remove the Air Baffle” on page 95
■
“Install the Air Baffle” on page 97
Related Information
■
“Servicing the DIMMs” on page 101
■
“Servicing Processors” on page 123
Remove the Air Baffle
1.
Prepare the server for service.
a.
Power off the server, and disconnect the power cords from the server power
supplies.
See “Powering Down the Server” on page 53.
b.
Extend the server into the maintenance position. See “Take Antistatic
Measures” on page 63.
Servicing the Air Baffle95
Page 96
Remove the Air Baffle
2.
c.
Attach an antistatic wrist strap to your wrist, and then to a metal area on the
chassis. See “Take Antistatic Measures” on page 63
d.
Remove the server top cover. See “Remove the Server Top
Cover” on page 64.
Release the latch for the host bus adapter (HBA) super capacitor tray and rotate
the tray [1] in to the upright position [2].
96Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 97
3.
Remove the air baffle by lifting the baffle up and out of the server.
Install the Air Baffle
4.
Set aside the air baffle.
5.
Consider your next step:
■
If you removed the air baffle as part of another procedure, return to that procedure.
■
Otherwise, continue to “Install the Air Baffle” on page 97.
Related Information
■
“Install the Air Baffle” on page 97
Install the Air Baffle
1.
Remove the replacement air baffle from its packaging.
Servicing the Air Baffle97
Page 98
Install the Air Baffle
2.
Install the air baffle by placing it into the server and lowering it to its inserted
position.
3.
Rotate the HBA super capacitor tray down and in to its closed position.
98Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Page 99
You hear an audible click when the tray is properly secured to the air baffle.
Install the Air Baffle
4.
Consider your next step:
■
If you removed the air baffle as part of another procedure, return to that procedure.
■
Otherwise, continue to with this procedure.
5.
Return the server to operation:
a.
Install the server top cover.
See “Install the Server Top Cover” on page 189.
b.
Return the server to the normal rack position.
See “Return the Server to the Normal Rack Position” on page 191.
c.
Reconnect the power cords and data cables to the server.
See “Reconnect Power and Data Cables” on page 192.
d.
Power on the server.
See “Power On the Server” on page 193.
Verify that the power supply AC OK LED is lit.
Servicing the Air Baffle99
Page 100
Install the Air Baffle
Related Information
■
“Remove the Air Baffle” on page 95
100Oracle Exadata Storage Server X7-2 EF and HC Service Manual • May 2018
Loading...
+ hidden pages
You need points to download manuals.
1 point = 1 manual.
You can buy points or you can get point for every manual you upload.