This software and related documentation are provided under a license agreement containing restrictions on use and disclosure and are protected by intellectual property laws. Except
as expressly permitted in your license agreement or allowed by law, you may not use, copy, reproduce, translate, broadcast, modify, license, transmit, distribute, exhibit, perform,
publish, or display any part, in any form, or by any means. Reverse engineering, disassembly, or decompilation of this software, unless required by law for interoperability, is
prohibited.
The information contained herein is subject to change without notice and is not warranted to be error-free. If you find any errors, please report them to us in writing.
If this is software or related documentation that is delivered to the U.S. Government or anyone licensing it on behalf of the U.S. Government, then the following notice is applicable:
U.S. GOVERNMENT END USERS: Oracle programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation,
delivered to U.S. Government end users are "commercial computer software" pursuant to the applicable Federal Acquisition Regulation and agency-specific supplemental
regulations. As such, use, duplication, disclosure, modification, and adaptation of the programs, including any operating system, integrated software, any programs installed on the
hardware, and/or documentation, shall be subject to license terms and license restrictions applicable to the programs. No other rights are granted to the U.S. Government.
This software or hardware is developed for general use in a variety of information management applications. It is not developed or intended for use in any inherently dangerous
applications, including applications that may create a risk of personal injury. If you use this software or hardware in dangerous applications, then you shall be responsible to take all
appropriate fail-safe, backup, redundancy, and other measures to ensure its safe use. Oracle Corporation and its affiliates disclaim any liability for any damages caused by use of this
software or hardware in dangerous applications.
Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners.
Intel and Intel Xeon are trademarks or registered trademarks of Intel Corporation. All SPARC trademarks are used under license and are trademarks or registered trademarks of
SPARC International, Inc. AMD, Opteron, the AMD logo, and the AMD Opteron logo are trademarks or registered trademarks of Advanced Micro Devices. UNIX is a registered
trademark of The Open Group.
This software or hardware and documentation may provide access to or information about content, products, and services from third parties. Oracle Corporation and its affiliates are
not responsible for and expressly disclaim all warranties of any kind with respect to third-party content, products, and services unless otherwise set forth in an applicable agreement
between you and Oracle. Oracle Corporation and its affiliates will not be responsible for any loss, costs, or damages incurred due to your access to or use of third-party content,
products, or services, except as set forth in an applicable agreement between you and Oracle.
Access to Oracle Support
Oracle customers that have purchased support have access to electronic support through My Oracle Support. For information, visit http://www.oracle.com/pls/topic/lookup?
ctx=acc&id=info or visit http://www.oracle.com/pls/topic/lookup?ctx=acc&id=trs if you are hearing impaired.
Ce logiciel et la documentation qui l'accompagne sont protégés par les lois sur la propriété intellectuelle. Ils sont concédés sous licence et soumis à des restrictions d'utilisation et
de divulgation. Sauf stipulation expresse de votre contrat de licence ou de la loi, vous ne pouvez pas copier, reproduire, traduire, diffuser, modifier, accorder de licence, transmettre,
distribuer, exposer, exécuter, publier ou afficher le logiciel, même partiellement, sous quelque forme et par quelque procédé que ce soit. Par ailleurs, il est interdit de procéder à toute
ingénierie inverse du logiciel, de le désassembler ou de le décompiler, excepté à des fins d'interopérabilité avec des logiciels tiers ou tel que prescrit par la loi.
Les informations fournies dans ce document sont susceptibles de modification sans préavis. Par ailleurs, Oracle Corporation ne garantit pas qu'elles soient exemptes d'erreurs et vous
invite, le cas échéant, à lui en faire part par écrit.
Si ce logiciel, ou la documentation qui l'accompagne, est livré sous licence au Gouvernement des Etats-Unis, ou à quiconque qui aurait souscrit la licence de ce logiciel pour le
compte du Gouvernement des Etats-Unis, la notice suivante s'applique :
U.S. GOVERNMENT END USERS: Oracle programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation,
delivered to U.S. Government end users are "commercial computer software" pursuant to the applicable Federal Acquisition Regulation and agency-specific supplemental
regulations. As such, use, duplication, disclosure, modification, and adaptation of the programs, including any operating system, integrated software, any programs installed on the
hardware, and/or documentation, shall be subject to license terms and license restrictions applicable to the programs. No other rights are granted to the U.S. Government.
Ce logiciel ou matériel a été développé pour un usage général dans le cadre d'applications de gestion des informations. Ce logiciel ou matériel n'est pas conçu ni n'est destiné à être
utilisé dans des applications à risque, notamment dans des applications pouvant causer un risque de dommages corporels. Si vous utilisez ce logiciel ou ce matériel dans le cadre
d'applications dangereuses, il est de votre responsabilité de prendre toutes les mesures de secours, de sauvegarde, de redondance et autres mesures nécessaires à son utilisation dans
des conditions optimales de sécurité. Oracle Corporation et ses affiliés déclinent toute responsabilité quant aux dommages causés par l'utilisation de ce logiciel ou matériel pour des
applications dangereuses.
Oracle et Java sont des marques déposées d'Oracle Corporation et/ou de ses affiliés. Tout autre nom mentionné peut correspondre à des marques appartenant à d'autres propriétaires
qu'Oracle.
Intel et Intel Xeon sont des marques ou des marques déposées d'Intel Corporation. Toutes les marques SPARC sont utilisées sous licence et sont des marques ou des marques
déposées de SPARC International, Inc. AMD, Opteron, le logo AMD et le logo AMD Opteron sont des marques ou des marques déposées d'Advanced Micro Devices. UNIX est une
marque déposée de The Open Group.
Ce logiciel ou matériel et la documentation qui l'accompagne peuvent fournir des informations ou des liens donnant accès à des contenus, des produits et des services émanant de
tiers. Oracle Corporation et ses affiliés déclinent toute responsabilité ou garantie expresse quant aux contenus, produits ou services émanant de tiers, sauf mention contraire stipulée
dans un contrat entre vous et Oracle. En aucun cas, Oracle Corporation et ses affiliés ne sauraient être tenus pour responsables des pertes subies, des coûts occasionnés ou des
dommages causés par l'accès à des contenus, produits ou services tiers, ou à leur utilisation, sauf mention contraire stipulée dans un contrat entre vous et Oracle.
Accès aux services de support Oracle
Les clients Oracle qui ont souscrit un contrat de support ont accès au support électronique via My Oracle Support. Pour plus d'informations, visitez le site http://www.oracle.com/
pls/topic/lookup?ctx=acc&id=info ou le site http://www.oracle.com/pls/topic/lookup?ctx=acc&id=trs si vous êtes malentendant.
Page 5
Contents
Using This Documentation ............. ................ ................ ................ ................ ... 13
Index ............. ................ ................ ................ ................ ................ ................ ... 307
12Oracle Server X7-2 Service Manual • October 2017
Page 13
Using This Documentation
■
Overview – Describes how to troubleshoot and maintain the Oracle Server X7-2.
■
Audience – Technicians, system administrators, authorized service providers, and trained
hardware service personnel who have been instructed on the hazards within the equipment
and are qualified to remove and replace hardware.
■
Required knowledge – Advanced experience troubleshooting and replacing hardware.
Product Documentation Library
Documentation and resources for this product and related products are available at http://www.
oracle.com/goto/x7-2/docs.
Feedback
Provide feedback about this documentation at http://www.oracle.com/goto/docfeedback.
Using This Documentation13
Page 14
14Oracle Server X7-2 Service Manual • October 2017
Page 15
About the Oracle Server X7-2
These sections describe the controls, connectors, status indicators, system components, and
replaceable components of the server:
■
“Product Description” on page 15
■
“About Controls and Connectors” on page 16
■
“About System Components” on page 19
Note - Always update the server with the latest firmware, drivers, and other hardware-related
software by downloading the latest software release package when you first receive the server,
and for every new software release. For information about the software release packages and
how to download the software, refer to “Getting Firmware and Software” in Oracle Servers
X7-2 and X7-2L Installation Guide.
Product Description
The Oracle Server X7-2 is an enterprise-class, one rack unit (1U) server. It supports the
following components:
■
Up to two Intel processors. Processors with the following capabilities are supported:
Up to 12 DDR4 DIMMs per processor for a maximum of 24 RDIMMs (16GB and 32GB)
and LRDIMMs (64GB) with speeds up to 2667 MT/sec are supported in dual-processor
systems.
■
Four PCIe Gen3 slots in dual-processor systems: one slot, slot 4, is dedicated to the internal
HBA. PCIe slot 1 is nonfunctional in single-processor systems.
■
Storage drive configurations can comprise both hard disk drives (HDDs) or solid state disk
drives (SSDs). Configurations include:
About the Oracle Server X7-215
Page 16
About Controls and Connectors
■
Up to eight 2.5-inch hot-pluggable SAS3 HDDs/SSDs, or up to eight 2.5-inch NVMExpress (NVMe) SSDs
■
Two internal SATA M.2 flash SSDs
■
Two hot-pluggable, redundant 1200W power supplies.
■
An on-board Oracle Integrated Lights Out Manager (Oracle ILOM) service processor (SP)
based on the ASPEED Pilot 4 chip.
About Controls and Connectors
The following sections describe the controls, indicators, connectors, and drives located on the
front and back panels.
■
“Front Panel Controls and Indicators” on page 16
■
“Back Panel Components and Cable Connections” on page 18
Related Information
■
“About System Components” on page 19
■
“Illustrated Parts Breakdown” on page 19
Front Panel Controls and Indicators
The following figure shows the controls, status indicators (LEDs), connectors, and drives on the
front panel of the Oracle Server X7-2.
16Oracle Server X7-2 Service Manual • October 2017
Page 17
About Controls and Connectors
Call OutDescription
1Product Serial Number (PSN) label and Radio Frequency Identification (RFID) tag
2Locate Button/LED: white
3Fault-Service Required LED: amber
4System OK LED: green
5On/Standby button
6Fault-Service Required LED: Top: Fan Module (amber)
7Fault-Service Required LED: Rear: Power Supply (amber)
8Fault-Service Required LED: Overtemp Icon: System Over Temperature Warning (amber)
9SP OK LED: green
10DO NOT SERVICE LED: white
11Storage drive 0 HDD/SSD/NVMe
12Storage drive 1 HDD/SSD/NVMe
13Storage drive 2 HDD/SSD/NVMe
14Storage drive 3 HDD/SSD/NVMe
15Storage drive 4 HDD/SSD/NVMe
16Storage drive 5 HDD/SSD/NVMe
17Storage drive 6 HDD/SSD/NVMe
18Storage drive 7 HDD/SSD/NVMe
About the Oracle Server X7-217
Page 18
About Controls and Connectors
Related Information
■
“Back Panel Components and Cable Connections” on page 18
■
“About System Components” on page 19
Back Panel Components and Cable Connections
The following figure shows the Oracle Server X7-2 back panel and the location of power
supplies, status indicators (LEDs), connectors, and PCIe slots.
Call OutDescription
1Power Supply (PS) 0
2Power Supply (PS) 0 status indicators: Fault-Service Required LED: amber; AC OK LED:
3Power Supply (PS) 1
4Power Supply (PS) 1 status indicators: Fault-Service Required LED: amber; AC OK LED:
5System status indicators: Locate Button/LED: white; Fault-Service Required LED: amber;
6PCIe card slot 1 (Nonfunctional in single-processor systems)
7PCIe card slot 2
8PCIe card slots 3 and 4
9Oracle Integrated Lights Out Manager (ILOM) service processor (SP) network management
10Network (NET) 10/100/1000BASE-T RJ-45 Gigabit Ethernet (GbE) port: NET 0
11Network (NET) 10GBASE-T RJ-45 GbE port: NET 1
12Network (NET) 10GBASE-T RJ-45 GbE port: NET 2
18Oracle Server X7-2 Service Manual • October 2017
green
green
System OK LED: green
(NET MGT) RJ-45 10/100/1000BASE-T port
Page 19
Call OutDescription
13Network (NET) 10/25GbE enhanced small form-factor pluggable (SFP+) port: NET 1
14Network (NET) 10/25GbE SFP+ port: NET 2
15USB 3.0 connector
16Serial management (SER MGT) RJ-45 serial port
Note - For more information on back panel components and connections, see “Back Panel
Connector Locations” on page 40. See also “Back Panel Pinhole Switches” on page 45.
Related Information
■
“Front Panel Controls and Indicators” on page 16
■
“About System Components” on page 19
About System Components
About System Components
These sections describe the components of the server:
■
“Illustrated Parts Breakdown” on page 19
■
“Customer-Replaceable Units” on page 21
■
“Field-Replaceable Units” on page 22
Illustrated Parts Breakdown
The following figure identifies the major components of the server.
About the Oracle Server X7-219
Page 20
About System Components
Callout Description
1Eight storage drives
2Front LED indicator module
3Disk backplane
4Super capacitor (Energy storage module for the Oracle Storage 12 Gb SAS PCIe RAID HBA, Internal card
in PCIe slot 4.)
5Fan modules
6Motherboard
7Processors and heatsinks (Single-processor systems contain only a single processor in socket P0; socket P1
contains a cover to protect processor socket pins.)
8Battery
9(Optional) Internal USB drive
10Air baffle
11Top cover
12Internal M.2 flash SSDs
13(Optional) PCIe card in slot 3
14Oracle Storage 12 Gb/s SAS PCIe RAID HBA card in PCIe slot 4
15(Optional) PCIe card in slot 2
16(Optional) PCIe card in slot 1
20Oracle Server X7-2 Service Manual • October 2017
Page 21
About System Components
Callout Description
Note - PCIe slot 1 is nonfunctional in single-processor systems.
17PCIe risers (3)
18DIMMs (Only twelve DIMMs are supported in single-processor systems and the DIMMs must be installed
in P0 DIMM sockets.)
19Power supplies
20Server chassis
Customer-Replaceable Units
The following table lists the customer-replaceable units (CRUs) in the server and directs you to
the replacement instructions.
CRUDescriptionReplacement Instructions
BatteryLithium coin-cell battery that powers the CMOS BIOS
DIMMsAdd or replace memory modules in the system.“Servicing the DIMMs
Storage drivesSupports SAS hard-disk drives (HDDs), solid-state
Internal USB flash
drive
Internal M.2 flash
SSDs
Fan modulesFour fan modules for cooling the motherboard
PCIe cardsOptional add-on PCIe cards that can expand the
PCIe risersHouses and connects the PCIe cards.“Servicing PCIe Risers
Power suppliesTwo fully redundant AC power supplies.“Servicing Power Supplies
and real-time clock.
drives (SSDs), or NVMe SSDs.
Optional support for one internal USB flash drive.“Servicing an Internal USB Flash
Supports up to two internal SATA M.2 flash SSDs.“Servicing the Internal M.2 Flash
assembly and installed components.
functionality of the server.
Note - The Oracle Storage 12 Gb SAS PCIe RAID
HBA, Internal card is a field replaceable unit (FRU)
and should only be serviced by authorized Oracle
Service personnel.
“Servicing the Battery
(CRU)” on page 133
(CRU)” on page 97
“Servicing Storage Drives
(CRU)” on page 65
Drive (CRU)” on page 93
SSDs (CRU)” on page 127
“Servicing Fan Modules
(CRU)” on page 81
“Servicing PCIe Cards
(CRU)” on page 121
(CRU)” on page 111
(CRU)” on page 87
Related Information
■
“Field-Replaceable Units” on page 22
About the Oracle Server X7-221
Page 22
About System Components
■
“Illustrated Parts Breakdown” on page 19
Field-Replaceable Units
The following table lists the field-replaceable units (FRUs) in the server and directs you to the
replacement instructions.
FRUDescriptionReplacement Instructions
Processor and
heatsink
Oracle Storage 12
Gb SAS PCIe RAID
HBA, Internal card
Disk backplaneServes as the interface between storage drives and the
Front LED indicator
module
Motherboard
assembly
SAS cablesProvide signals and backup power between the disk
NVMe cablesProvide signals between the disk backplane and the
Carries out the instructions of the system.“Servicing Processors
Located in internal PCIe slot 4, the Oracle Storage
12 Gb SAS PCIe RAID HBA, Internal card manages
SAS storage drives.
internal host bus adapter (HBA) card or Oracle PCIe
NVMe signals from the motherboard.
Contains the push-button circuitry and LEDs that are
displayed on the front panel.
Provides connectors for the fans, DIMMs, processors,
PCIe risers, internal USB port, and power supplies.
backplane and the super capacitor and the Oracle
Storage 12 Gb SAS PCIe RAID HBA, Internal card.
motherboard NVMe connectors.
(FRU)” on page 137
“Servicing the Internal HBA
Card and HBA Super Capacitor
(FRU)” on page 169
“Servicing the Disk Backplane
(FRU)” on page 151
“Servicing the Front Indicator
Module (FRU)” on page 157
“Servicing the Motherboard
(FRU)” on page 199
“Servicing the Internal HBA
Cables” on page 179
“Servicing the NVMe
Cables” on page 185
Related Information
■
“Customer-Replaceable Units” on page 21
■
“Illustrated Parts Breakdown” on page 19
22Oracle Server X7-2 Service Manual • October 2017
Page 23
Troubleshooting and Diagnostics
This section includes information about troubleshooting hardware component faults for the
Oracle Server X7-2. It contains the following topics:
■
“Troubleshooting Server Component Hardware Faults” on page 23
■
“Troubleshooting With Diagnostic Tools” on page 37
■
“Attaching Devices to the Server” on page 39
■
“Getting Help” on page 46
■
“Auto Service Requests” on page 48
For more information about server troubleshooting and diagnostics, refer to the Oracle x86Servers Diagnostics and Troubleshooting Guide for Servers With Oracle ILOM 4.0.x at http://
www.oracle.com/goto/x86admindiag/docs.
Troubleshooting Server Component Hardware Faults
This section contains maintenance-related information and procedures that you can use to
troubleshoot and repair server hardware issues. The following topics are covered.
■
“Troubleshooting Server Hardware Faults” on page 23
■
“Troubleshooting and Diagnostic Information” on page 27
■
“Troubleshooting Using the Server Front and Back Panel Status Indicators” on page 28
■
“Troubleshooting System Cooling Issues” on page 34
■
“Troubleshooting Power Issues” on page 35
Troubleshooting Server Hardware Faults
When a server hardware fault event occurs, the system lights the Fault-Service Required LED
and captures the event in the system event log (SEL). If you set up notifications through Oracle
ILOM, you also receive an alert through the notification method you chose. When you become
aware of a hardware fault, address it immediately.
Troubleshooting and Diagnostics23
Page 24
Troubleshooting Server Component Hardware Faults
To investigate a hardware fault, see the following:
■
“Basic Troubleshooting Process” on page 24
■
“Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface” on page 25
Basic Troubleshooting Process
Use the following process to address a hardware fault (for the step-by-step procedure, see
“Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface” on page 25).
1. Identify the server subsystem containing the fault.
You can use Oracle ILOM to identify the failed component.
2. Review the Oracle Server X7-2 Product Notes.
The product notes contain up-to-date information about the server, including hardwarerelated issues.
3. Prepare the server for service using Oracle ILOM.
If you determined that the hardware fault requires service (physical access to the server),
use Oracle ILOM to take the server offline, activate the Locate button/LED, and if
necessary, power off the server.
4. Prepare the service work space.
Before servicing the server, prepare the work space, ensuring Electrostatic Discharge Safety
(ESD) protection for the server and components.
See “Electrostatic Discharge Safety” on page 50.
5. Service components.
To service the components, see the removal, installation, and replacement procedures in this
document.
Note - A component designated as a field-replaceable unit (FRU) must be replaced by
Oracle Service personnel. Contact Oracle Service.
6. Clear the fault in Oracle ILOM.
Depending on the component, you might need to clear the fault in Oracle ILOM. Generally,
components that have a FRU ID clear the fault automatically.
Related Information:
■
“Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface” on page 25
24Oracle Server X7-2 Service Manual • October 2017
Page 25
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Troubleshoot Hardware Faults Using the Oracle ILOM Web
Interface
Note - The screens shown in this procedure might differ from those for your server.
This procedure uses the basic troubleshooting steps described in “Basic Troubleshooting
Process” on page 24.
Use this procedure to troubleshoot hardware faults using the Oracle ILOM web interface and, if
necessary, prepare the server for service.
Note - This procedure provides one basic approach to troubleshooting hardware faults. It uses
the Oracle ILOM web interface. However, the procedure can be performed using the Oracle
ILOM command-line interface (CLI). For more information about the Oracle ILOM web
interface and CLI, refer to the Oracle ILOM documentation.
Before starting this procedure, obtain the latest version of the Oracle Server X7-2 ProductNotes.
1.
Log in to the server SP Oracle ILOM web interface.
Open a browser and direct it using the IP address of the server SP. Enter a user name (with
administrator privileges) and password at the Login screen. The Summary Information page
appears.
The Status section of the Summary Information page provides information about the server
subsystems, including:
■
Processors
■
Memory
■
Power
■
Cooling
■
Storage
■
Networking
2.
In the Status section of the Oracle ILOM Summary Information page, identify the
server subsystem that requires service.
Troubleshooting and Diagnostics25
Page 26
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
In the following example, the Status page shows that the Processor subsystem requires service,
which indicates that a hardware component in the subsystem is in a fault state.
3.
To identify the component, click the Processors in the Status section.
The Oracle ILOM Processors page appears.
The following example shows the Processors subsystem section, which indicates that CPU 0
has a fault.
4.
To get more information, click the Open Problems link.
26Oracle Server X7-2 Service Manual • October 2017
Page 27
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
The Open Problems page provides detailed information, such as the time the event occurred,
the component and subsystem name, and a description of the issue. It also includes a link to an
Oracle Knowledge Base article.
Tip - The System Log provides a chronological list of all the system events and faults that
occurred since the log was last reset and includes additional information, such as severity levels
and error counts. The System Log also includes information on the devices not reported in the
Status section. To access the System Log, in the left panel, click System Log.
In this example, the hardware fault with CPU 0 requires local/physical access to the server.
5.
Before going to the server, review the Oracle Server X7-2 Product Notes for
information related to the issue or the component.
The Product Notes contain up-to-date information about the server, including hardware-related
issues.
6.
To prepare the server for service, see “Preparing for Service” on page 49.
After servicing the component, you might need to clear the fault in Oracle ILOM. For more
information, refer to the service procedure for the component.
7.
Service the component.
Troubleshooting and Diagnostic Information
The following list displays diagnostic and troubleshooting-related procedures and references
that can assist you with resolving server issues.
■
Oracle x86 Servers Diagnostics and Troubleshooting Guide for Servers With Oracle ILOM
4.0.x
■
Oracle X7 Series Servers Administration Guide
■
“Troubleshooting Using the Server Front and Back Panel Status Indicators” on page 28
■
“Managing Server Hardware Faults Through the Oracle ILOM Fault Management
Shell” on page 37
Troubleshooting and Diagnostics27
Page 28
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Troubleshooting Using the Server Front and Back
Panel Status Indicators
These sections describe the status indicators (LEDs) located on the front and back of the server,
including those found on components and ports. This section includes the following topics:
■
“Server Boot Process and Normal Operating State Indicators” on page 28
■
“Server System-Level Status Indicators” on page 29
■
“Server Fan Status Indicators” on page 30
■
“Storage Drive Status Indicators” on page 31
■
“Power Supply Status Indicators” on page 31
■
“Network Management Port Status Indicators” on page 32
■
“Ethernet Ports Status Indicators” on page 32
■
“Motherboard Status Indicators” on page 33
Related Information
■
“Front Panel Controls and Indicators” on page 16
■
“Back Panel Components and Cable Connections” on page 18
Server Boot Process and Normal Operating State Indicators
A normal server boot process involves two indicators, the service processor SP OK LED
indicator and the System OK LED indicator.
When AC power is connected to the server, the server boots into standby power mode:
1. The SP OK LED blinks slowly (0.5 seconds on, 0.5 seconds off) while the SP is starting,
and the System OK LED remains off until the SP is ready.
2. After a few minutes, the main System OK LED slowly flashes the standby blink pattern
(0.1 seconds on, 2.9 seconds off), indicating that the SP (and Oracle ILOM) is ready for
use. In Standby power mode, the server is not initialized or fully powered on at this point.
When powering on the server (either by the On/Standby button or Oracle ILOM), the server
boots to full power mode:
1. The System OK LED blinks slowly (0.5 seconds on, 0.5 seconds off), and the SP OK LED
remains lit (no blinking).
2. When the server successfully boots, the System OK LED remains lit. When the System OK
LED and the SP OK LED indicators remain lit, the server is in Main power mode.
28Oracle Server X7-2 Service Manual • October 2017
Page 29
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Note - The green System OK LED indicator and the green SP OK indicator remain lit (no
blinking) when the server is in a normal operating state.
Server System-Level Status Indicators
There are seven system-level status indicators (LEDs), some of which are located on both the
server front panel and the back panel. The following table describes these indicators.
Status Indicator
Name
Locate Button/
LED
Fault-Service
Required
System OKGreenIndicates the operational state of the chassis.
IconColorState and Meaning
WhiteIndicates the location of the server.
■ OFF – Server is operating normally.
■ FAST BLINK – Use Oracle ILOM to activate this
LED indicator to enable you to locate a particular
system quickly and easily.
■ Pressing the Locate button toggles the LED indicator
fast blink on or off.
AmberIndicates the fault state of the server.
■ OFF – The server is operating normally.
■ STEADY ON – A fault is present on the server. This
LED indicator lights whenever a fault indicator lights
for a replaceable component on the server.
Note - When this LED indicator is lit, a system console
message might appear that includes a recommended
service action.
■ OFF – AC power is not present or the Oracle ILOM
boot is not complete.
■ STANDBY BLINK – Standby power is on, but the
chassis power is off and the Oracle ILOM SP is
running.
■ SLOW BLINK – Startup sequence was initiated on
the host. This pattern begins soon after you power
on the server. This status indicates either: power-on
self-test (POST) code checkpoint tests are running
on the server host system, or the host is transitioning
from the powered-on state to the standby state on
shutdown.
■ STEADY ON – The server is powered on, and all
host POST code checkpoint tests are complete. The
server is in one of the following states: the server
Troubleshooting and Diagnostics29
Page 30
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Status Indicator
Name
SP OKGreenIndicates the state of the service processor.
Top FanAmberIndicates that one or more of the internal fan modules
Back Power
Supply Fault
System Over
Temperature
Warning
DO NOT
SERVICE
IconColorState and Meaning
host is booting the operating system (OS), or the
server host is running the OS.
■ OFF – Service processor (SP) is not running.
■ SLOW BLINK – SP is booting.
■ STEADY ON – SP is fully operational.
failed.
■ OFF – Indicates steady state; no service is required.
■ STEADY ON – Indicates service required.
AmberIndicates that one of the server power supplies failed.
■ OFF – Indicates steady state; no service is required.
■ STEADY ON – Indicates service required; service
the power supply.
AmberIndicates a warning for an overtemperature condition.
■ OFF – Normal operation; no service is required.
■ STEADY ON – The system is experiencing an
overtemperature warning condition.
Note - This is a warning indication, not a fatal
overtemperature. Failure to correct this might result in
the system overheating and shutting down unexpectedly.
WhiteIndicates that the system is not ready to service.
■ OFF – Normal operation.
■ STEADY ON – The system is not ready for service.
Server Fan Status Indicators
Each fan module has one status indicator (LED). The LEDs are located on the chassis midwall adjacent to and aligned with the fan modules and are visible when the server top cover is
removed.
30Oracle Server X7-2 Service Manual • October 2017
Note - The DO NOT SERVICE indicator is application
specific. This indicator is only illuminated on demand by
the Host application.
Page 31
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Status Indicator
Name
Fan StatusAmber■ Off – The fan module is correctly installed and
IconColorState and Meaning
operating within specification.
■ Amber – The fan module is faulty. The front TOP
FAN LED and the front and back panel Fault-Service
Required LEDs are also lit if the system detects a fan
module fault.
Storage Drive Status Indicators
There are three status indicators (LEDs) on each drive.
Status Indicator
Name
OK/ActivityGreen■ OFF – Power is off or installed drive is not
Fault-Service
Required
OK to RemoveBlue■ STEADY ON – The storage drive can be removed
IconColorState and Meaning
recognized by the system.
■ STEADY ON – The drive is engaged and is
receiving power.
■ RANDOM BLINKING – There is disk activity.
Status indicator LED blinks on and off to indicate
activity.
Amber■ OFF – The storage drive is operating normally.
■ STEADY ON – The system detected a fault with the
storage drive.
safely during a hot-plug operation.
■ OFF – The storage drive is not prepared for removal.
Power Supply Status Indicators
There are two status indicators (LEDs) on each power supply. These indicators are visible from
the back of the server.
Status Indicator
Name
AC OK/ DC OKGreen■ OFF – No AC power is present.
IconColorState and Meaning
■ SLOW BLINK – Normal operation. Input power
is within specification. DC output voltage is not
enabled.
Troubleshooting and Diagnostics31
Page 32
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Status Indicator
Name
Fault-Service
Required
IconColorState and Meaning
■ STEADY ON – Normal operation. Input AC power
and DC output voltage are within specification.
Amber■ OFF – Normal operation. No service action is
required.
■ STEADY ON – The power supply (PS) detected a PS
fan failure, PS overtemperature, PS over current, or
PS over or under voltage.
Network Management Port Status Indicators
The server has one 10/100/1000BASE-T Ethernet management domain interface, labeled NET
MGT. There are two status indicators (LEDs) on this port. These indicators are visible from the
back of the server.
Status Indicator
Name
ActivityTop rightGreen■ ON – Link up.
Link speedTop leftGreen■ ON – 1000BASE-T link.
LocationColorState and Meaning
■ OFF – No link or down link.
■ FLASHING – Packet activity.
■ OFF – 10/100BASE-T link.
Ethernet Ports Status Indicators
The server has five Gigabit Ethernet (GbE) ports:
■
One 10/100/1000BASE-T RJ-45 GbE port, labeled NET 0
■
Two 10GBASE-T RJ-45 GbE ports, labeled NET 1 and NET 2
■
Two 10/25GbE enhanced small form-factor pluggable (SFP+) ports, labeled NET 1 and
NET 2
SFP+ ports support up to 10GbE when using SFP+ transceivers, and up to 25GbE when
using SFP28 transceivers.
Note - The 10/25GbE SFP+ (NET 1 and NET 2) ports are the default network ports for the
server. When using an SFP+ port, network connectivity is disabled on the 10GBASE-T RJ-45
GbE (NET 1 and NET 2) ports.
32Oracle Server X7-2 Service Manual • October 2017
Page 33
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
There are two status indicators (LEDs) on each port. These indicators are visible from the rear
of the server.
Status Indicator
Name
RJ-45 10GbE Ports
ActivityTop leftGreen■ ON – Link up.
Link speedTop rightBi-colored:
SFP+ 10/25GbE Ports
ActivityTopGreen■ OFF– No activity.
Link SpeedBottomBi-colored:
LocationColorState and Meaning
■ OFF– No activity.
■ FLASHING – Packet activity.
Amber/Green
Amber/Green
■ OFF – 100BASE-T link (if link up).
■ Green ON – 1000BBASE-T link.
■ FLASHING – Packet activity.
■ OFF – No activity.
■ Amber ON – 10GbE link.
■ Green ON – 25GbE link.
Motherboard Status Indicators
The motherboard contains the following status indicators (LEDs).
Status IndicatorDescription
DIMM Fault Status
Indicators
Processor Fault
Status Indicators
Fault Remind Status
Indicator
■ Each of the 24 DIMM sockets on the motherboard has an amber fault status indicator
(LED) associated with it.
■ If Oracle ILOM determines that a DIMM is faulty, pressing the Fault Remind button on
the motherboard I/O card signals the service processor to light the fault LED associated
with the failed DIMM.
■ For more information on DIMM fault status indicators and the location of the Fault
Remind button, see “Servicing the DIMMs (CRU)” on page 97.
■ The motherboard includes a fault status indicator (LED) adjacent to each of the two
processor sockets.
■ These LEDs indicate when a processor fails.
■ Pressing the Fault Remind button on the motherboard I/O card signals the service
processor to light the fault status indicators associated with the failed processors.
■ For more information on processor fault status indicators and the location of the Fault
Remind button, see “Servicing Processors (FRU)” on page 137.
■ This status indicator (LED) is located next to the Fault Remind button and is powered
from the super capacitor that powers the fault LEDs on the motherboard.
■ This LED lights to indicate that the fault remind circuitry is working properly in
cases where no components failed and, as a result, none of the component fault LEDs
illuminate.
Troubleshooting and Diagnostics33
Page 34
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Status IndicatorDescription
■ For more information on the fault remind status indicator and the location of the Fault
Remind button, see “Using the Server Fault Remind Button” on page 104.
STBY PWRGD
Status Indicator
■ This green status indicator (LED) is labeled STBY PWRGD and is located on the
motherboard near the back of the server.
■ This LED lights to inform a service technician that the motherboard is receiving Standby
power from at least one of the power supplies.
■ This LED is provided to help prevent service actions on the server internal components
while the AC power cords are installed and power is being supplied to the server.
Troubleshooting System Cooling Issues
Maintaining the proper internal operating temperature of the server is crucial to the health of the
server. To prevent server shutdown and damage to components, address over temperature and
hardware-related issues as soon as they occur. If your server has a temperature-related fault, use
the information in the following table to troubleshoot the issue.
Cooling IssueDescriptionActionPrevention
External
Ambient
Temperature
Too High
Airflow
Blockage
Cooling Areas
Compromised
The server fans pull cool air
into the server from its external
environment. If the ambient
temperature is too high, the
internal temperature of the
server and its components
increases. This can cause poor
performance and component
failure.
The server cooling system
uses fans to pull cool air in
from the server front intake
vents and exhaust warm air out
the server back panel vents.
If the front or back vents are
blocked, the airflow through
the server is disrupted and
the cooling system fails to
function properly causing the
server internal temperature to
rise.
The air baffle, component filler
panels, and server top cover
maintain and direct the flow
of cool air through the server.
These server components must
be in place for the server to
function as a sealed system.
Verify the ambient temperature
of the server space against the
environmental specifications
for the server. If the
temperature is not within
the required operating
range, remedy the situation
immediately.
Inspect the server front
and back panel vents for
blockage from dust or debris.
Additionally, inspect the
server interior for improperly
installed components or cables
that can block the flow of air
through the server.
Inspect the server interior to
ensure that the air baffle is
properly installed. Ensure
that all external-facing
slots (storage drive, PCIe)
are occupied with either a
component or a component
Periodically verify the ambient
temperature of the server space
to ensure that it is within the
required range, especially if
you made any changes to the
server space (for example,
added additional servers). The
temperature must be consistent
and stable.
Periodically inspect and clean
the server vents using an ESD
certified vacuum cleaner.
Ensure that all components,
such as cards, cables, fans,
air baffles and dividers are
properly installed. Never
operate the server without the
top cover installed.
When servicing the server,
ensure that the air baffle is
installed correctly and that
the server has no unoccupied
external-facing slots. Never
operate the server without the
top cover installed.
34Oracle Server X7-2 Service Manual • October 2017
Page 35
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Cooling IssueDescriptionActionPrevention
Hardware
Component
Failure
If these components are
not installed correctly, the
airflow inside the server can
become chaotic and nondirectional, which can cause
server components to overheat
and fail.
■ Components, such as
power supplies and fan
modules, are an integral
part of the server cooling
system. When one of these
components fails, the
server internal temperature
can rise. This rise in
temperature can cause
other components to enter
into an over-temperature
state. Additionally, some
components, such as
processors, might overheat
when they are failing,
which can also generate an
over-temperature event.
■ To reduce the risk
related to component
failure, power supplies
and fan modules are
installed in pairs to
provide redundancy.
Redundancy ensures
that if one component in
the pair fails, the other
functioning component can
continue to maintain the
subsystem. For example,
power supplies serve a dual
function; they provide both
power and airflow. If one
power supply fails, the
other functioning power
supply can maintain both
the power and the cooling
subsystems.
filler panel. Ensure that the
server top cover is in place and
sits flat and snug on top of the
server.
Investigate the cause of the
overtemperature event, and
replace failed components
immediately. For hardware
troubleshooting information,
see “Troubleshooting
Server Hardware
Faults” on page 23.
Component redundancy
is provided to allow for
component failure in critical
subsystems, such as the
cooling subsystem. However,
once a component in a
redundant system fails,
the redundancy no longer
exists, and the risk for server
shutdown and component
failures increases. Therefore,
it is important to maintain
redundant systems and
replace failed components
immediately.
Troubleshooting Power Issues
If your server does not power on, use the information in the following table to troubleshoot the
issue.
Troubleshooting and Diagnostics35
Page 36
Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
Power IssueDescriptionActionPrevention
AC Power
Connection
The AC power cords are the
direct connection between the
server power supplies and the
power sources. The server
power supplies need separate
stable AC circuits. Insufficient
voltage levels or fluctuations
in power can cause server
power problems. The power
supplies are designed to
operate at a particular voltage
and within an acceptable range
of voltage fluctuations (refer to
“Electrical Requirements” in
Oracle Servers X7-2 and X72L Installation Guide).
Power
Supplies
(PSUs)
The server power supply
units (PSUs) provide the
necessary server voltages from
the AC power outlets. If the
power supplies are inoperable,
unplugged, or disengaged from
the internal connectors, the
server cannot power on.
Note - Use the Velcro straps
on the back of the server
to secure the power cord
connectors to the back of
the power supplies. The
Velcro retaining straps
minimize the risk of accidental
disconnection.
Top CoverThe server top cover maintains
the air pressures inside the
server, prevents accidental
exposure to hazardous
voltages, and protects internal
components from physical and
environmental damage.
Verify that both AC power
cords are connected to the
server. Verify that the correct
power is present at the outlets
and monitor the power to
verify that it is within the
acceptable range. You can
verify proper connection and
operation by verifying the
power supply (PS) indicator
panels, which are located at
the back of the server on the
power supplies. Lit green
AC OK indicators show a
properly functioning power
supply. An amber AC OK
indicator indicates that the AC
power to the power supply is
insufficient.
Verify that the AC cables
are connected to both power
supplies. Verify that the power
supplies are operational (the
PSU indicator panel must have
a lit green AC OK indicator).
Ensure that the power supply
is properly installed. A power
supply that is not fully engaged
with its internal connector does
not have power applied and
does not have a lit green AC
OK indicator.
Do not operate the server
without the top cover installed
unless you are hot-plugging
a fan module, and then
ensure that you complete
the operation and replace
the cover within 20 seconds.
For instructions for properly
replacing fan modules, see
Use the AC power cord Velcro
retaining clips and position
the cords to minimize the risk
of accidental disconnection.
Ensure that the AC circuits that
supply power to the server are
stable and not overburdened.
When a power supply fails,
replace it immediately. To
ensure redundancy, the server
has two power supplies. This
redundant configuration
prevents server downtime,
or an unexpected shutdown,
due to a failed power supply.
The redundancy allows the
server to continue to operate
if one of the power supplies
fails. However, when a
server is being powered by
a single power supply, the
redundancy no longer exists,
and the risk for downtime
or an unexpected shutdown
increases. When installing a
power supply, ensure that it is
fully seated and engaged with
its connector inside the drive
bay. A properly installed power
supply has a lit green AC OK
indicator.
Be careful to avoid bending
or otherwise warping the top
cover.
36Oracle Server X7-2 Service Manual • October 2017
Page 37
Troubleshooting With Diagnostic Tools
Power IssueDescriptionActionPrevention
“Servicing Fan Modules
(CRU)” on page 81. For
instructions for installing
the server top cover, see
“Install the Server Top
Cover” on page 210.
Managing Server Hardware Faults Through the
Oracle ILOM Fault Management Shell
The Oracle ILOM Fault Management Shell enables Oracle Service personnel to view and
manage fault activity on managed servers and other types of devices.
For more information about how to use the Oracle ILOM Fault Management Shell, see the
Oracle ILOM User's Guide for System Monitoring and Diagnostics Firmware Release 4.0.x in
the Oracle Integrated Lights Out Manager (ILOM) 4.0 Documentation Library at http://www.
oracle.com/goto/ilom/docs.
Caution - The purpose of the Oracle ILOM Fault Management Shell is to help Oracle Service
personnel diagnose system problems. Customers must not launch this shell or run fault
management commands in the shell unless requested to do so by Oracle Service.
Troubleshooting With Diagnostic Tools
The server and its accompanying software and firmware contain diagnostic tools and features
that can help you isolate component problems, monitor the status of a functioning system,
and exercise one or more subsystem to disclose more subtle or intermittent hardware-related
problems.
Each diagnostic tool has its own specific strength and application. Review the tools listed in this
section and determine which tool might be best to use for your situation. After you determine
the tool to use, you can access it locally, while at the server, or remotely.
■
“Diagnostic Tools” on page 38
■
“Diagnostic Tool Documentation” on page 39
Troubleshooting and Diagnostics37
Page 38
Troubleshooting With Diagnostic Tools
Diagnostic Tools
The selection of diagnostic tools available for your server range in complexity from a
comprehensive validation test suite (Oracle VTS) to a chronological event log (Oracle ILOM
System Log). The selection of diagnostic tools also includes standalone software packages,
firmware-based tests, and hardware-based LED indicators.
The following table summarizes the diagnostic tools that you can use when troubleshooting or
monitoring your server.
Diagnostic ToolTypeWhat It DoesAccessibilityRemote Capability
Oracle ILOMSP firmwareMonitors environmental
Hardware-based
LED indicators
Power-On SelfTest (POST)
UEFI
Diagnostics
HWdiagSP firmwareThe HWdiag utility checks
Oracle Solaris
commands
Oracle Linux
commands
condition and component
functionality sensors,
generates alerts, performs
fault isolation, and provides
remote access.
Hardware and
SP firmware
Host firmwareTests core components of
SP firmwareUEFI diagnostics can test
Operating
system software
Operating
system software
Indicates status of overall
system and particular
components.
system: CPUs, memory,
and motherboard I/O bridge
integrated circuits.
and detect problems on all
processors, memory, disk
drives, and network ports.
the status of a system and
its components. You run the
HWdiag utility within the
Oracle ILOM Diag shell.
Displays various kinds of
system information.
Displays various kinds of
system information.
Can function on
either Standby power
mode or Main power
mode and is not OS
dependent.
Available when
system power is
available.
Runs on startup.
Available when the
operating system is
not running.
You can use either
the Oracle ILOM
web interface or
the commandline interface
(CLI) to run UEFI
diagnostics.
Can function on
Standby power and
when operating
system is not
running.
Requires operating
system.
Requires operating
system.
Designed for remote
and local access.
Local, but sensor
and indicators are
accessible from
Oracle ILOM
web interface or
command-line
interface (CLI).
Local, but can be
accessed through
Oracle ILOM
Remote System
Console Plus.
Remote access
through Oracle
ILOM Remote
System Console
Plus.
Local, but remote
serial access is
possible if the
SP serial port is
connected to a
network-accessible
terminal server.
Local, and over
network.
Local, and over
network.
38Oracle Server X7-2 Service Manual • October 2017
Page 39
Attaching Devices to the Server
Diagnostic ToolTypeWhat It DoesAccessibilityRemote Capability
Oracle VTSDiagnostic
tool standalone
software
Exercises and stresses the
system, running tests in
parallel.
Requires the Solaris
operating system.
Install Oracle VTS
software separately.
View and control
over network.
Diagnostic Tool Documentation
The following table identifies where you can find more information about diagnostic tools.
Diagnostic ToolDocumentationLocation
Oracle ILOMOracle Integrated Lights Out
Manager 4.0 Documentation Library
UEFI Diagnostics or HWdiagOracle x86 Servers Diagnostics and
Troubleshooting Guide for Servers
With Oracle ILOM 4.0.x
System indicators and sensorsThis document“Troubleshooting Using the Server
Oracle VTSOracle VTS software and
documentation
http://www.oracle.com/goto/
ilom/docs
http://www.oracle.com/goto/
x86admindiag/docs
Front and Back Panel Status
Indicators” on page 28
http://docs.oracle.com/cd/
E19719-01/index.html
Attaching Devices to the Server
The following sections contain procedural information for attaching devices to the server so you
can access diagnostic tools when troubleshooting and servicing the server:
■
“Attach Devices to the Server” on page 40
■
“Back Panel Connector Locations” on page 40
■
“Configuring Serial Port Sharing” on page 42
■
“Server Operating System Names for the NVMe Storage Drives” on page 44
■
“Ethernet Port Boot Order and Device Naming” on page 44
■
“Back Panel Pinhole Switches” on page 45
Troubleshooting and Diagnostics39
Page 40
Attach Devices to the Server
Attach Devices to the Server
This procedure explains how to connect devices to the server (remotely and locally), so that you
can interact with the service processor (SP) and the server console.
1.
Connect an Ethernet cable to the Gigabit Ethernet (NET) connector as needed for
OS support.
See “Back Panel Connector Locations” on page 40.
2.
To connect to the service processor's Oracle ILOM over the network, connect an
Ethernet cable to the Ethernet port labeled NET MGT.
See “Back Panel Connector Locations” on page 40.
3.
To access the Oracle ILOM command-line interface (CLI) locally, connect a serial
null modem cable to the RJ-45 serial port labeled SER MGT.
To access the system console, connect the RJ-45 cable to a terminal or terminal emulator, log on
to Oracle ILOM, and type start /HOST/console.
See “Back Panel Connector Locations” on page 40.
Back Panel Connector Locations
The following illustration shows and describes the locations of the back panel connectors. Use
this information to set up the server, so that you can access diagnostic tools and manage the
server during service.
40Oracle Server X7-2 Service Manual • October 2017
Page 41
Attach Devices to the Server
No. Cable Port or Expansion SlotDescription
1Power supply 0 input power
The server has two power supply connectors, one for each power
supply.
Power supply 1 input power
Do not attach power cables to the power supplies until you finish
connecting the data cables to the server. The server goes into
Standby power mode, and the Oracle ILOM service processor
initializes when the AC power cables are connected to the power
source. System messages might be lost after 60 seconds if the server
is not connected to a terminal, PC, or workstation.
Note - Oracle ILOM signals a fault on any installed power supply
that is not connected to an AC power source, since it might indicate
a loss of redundancy.
2Network management port (NET MGT)The service processor NET MGT port is the optional connection
to the Oracle ILOM service processor. The NET MGT port is
configured by default to use Dynamic Host Configuration Protocol
(DHCP). The service processor NET MGT port uses an RJ-45 cable
for a 10/100/1000BASE-T connection.
3Ethernet ports:The five Gigabit Ethernet (GbE) ports enable you to connect the
system to the network.
Troubleshooting and Diagnostics41
Page 42
Attach Devices to the Server
No. Cable Port or Expansion SlotDescription
■ NET 0, 10/100/1000BASE-T RJ-45
■ NET 1 and NET 2, 10GBASE-T RJ-
■ NET 1 and NET 2, 10/25GbE SFP+
4USB portThe USB port supports hot-plugging. You can connect and
5Serial management port (SER MGT)The service processor SER MGT port uses an RJ-45 cable and
Configuring Serial Port Sharing
GbE port
45 GbE ports
ports
Note - SFP+ ports support up
to 10GbE when using SFP+
transceivers, and up to 25GbE when
using SFP28 transceivers.
Note - The 10/25GbE SFP+ ports are the default network ports
for the server. When using an SFP+ port, network connectivity is
disabled on the 10GBASE-T RJ-45 GbE ports.
disconnect a USB cable or a peripheral device while the server is
running without affecting system operations.
terminal (or emulator) to provide access to the Oracle ILOM
command-line interface (CLI). Using Oracle ILOM, you can
configure it to connect to the system console.
Note - This port does not support network connections.
By default, the SP console (SER MGT) port sends serial port output from the server. Using
Oracle ILOM, you can specify that the host console (COM1) be assigned as owner of the server
serial port output. This feature is useful for Windows kernel debugging, as it enables you to
view non-ASCII character traffic from the host console.
Set up the network on the SP before attempting to change the serial port owner to the host
server. If the network is not set up, and you switch the serial port owner to the host server, you
cannot connect using the CLI or web interface to change the serial port owner back to the SP.
To return the serial port owner setting to the SP, restore access to the serial port on the server.
For details, refer to the Oracle Integrated Lights Out Manager (ILOM) 4.0 Documentation
Library at http://www.oracle.com/goto/ilom/docs.
You can assign serial port output using either the Oracle ILOM web interface or the commandline interface (CLI). For instructions, see the following sections:
■
“Assign Serial Port Output Using the Oracle ILOM CLI” on page 43
■
“Assign Serial Port Output Using the Oracle ILOM Web Interface” on page 43
42Oracle Server X7-2 Service Manual • October 2017
Page 43
Assign Serial Port Output Using the Oracle ILOM CLI
Assign Serial Port Output Using the Oracle ILOM CLI
1.
Open an SSH session, and at the command line, log in to the SP Oracle ILOM
CLI.
Log in as a user with root or administrator privileges. For example:
ssh root@ipaddress
Where ipaddress is the IP address of the server SP.
For more information, see “Accessing Oracle ILOM” in the Oracle X7 Series Servers
Administration Guide.
The Oracle ILOM CLI prompt (->) appears.
2.
To set the serial port owner, type:
-> set /SP/serial/portsharing owner=host
Note - The serial port sharing value, by default, is owner=SP.
3.
Connect a serial host to the server.
Assign Serial Port Output Using the Oracle ILOM Web
Interface
1.
Log in to the SP Oracle ILOM web interface.
Open a web browser and direct it using the IP address of the server SP. Log in as root or a user
with administrator privileges. See “Accessing Oracle ILOM” in the Oracle X7 Series ServersAdministration Guide.
The Summary Information page appears.
2.
In the Oracle ILOM web interface, select ILOM Administration → Connectivity
from the navigation menu on the left side of the screen.
3.
Select the Serial Port tab.
The Serial Port Settings page appears.
Note - The serial port sharing setting, by default, is Service Processor.
4.
In the Serial Port Settings page, select HostServer as the serial port owner.
Troubleshooting and Diagnostics43
Page 44
Assign Serial Port Output Using the Oracle ILOM Web Interface
5.
Click Save for the changes to take effect.
6.
Connect a serial host to the server.
Server Operating System Names for the NVMe
Storage Drives
If NVMe storage drives are installed in the server front panel, they are labeled NVMe0 through
NVMe7. The server operating systems assign these storage drives different names. For the
corresponding names assigned by the operating systems, see the following table.
Note - The drive names provided in the table assume that NVMe cabling between the
motherboard NVMe connectors and the disk backplane is correct.
Storage Drive LabelsNames Assigned by the Server
NVMe0PCIe Slot 100
NVMe1PCIe Slot 101
NVMe2PCIe Slot 102
NVMe3PCIe Slot 103
NVMe4PCIe Slot 104
NVMe5PCIe Slot 105
NVMe6PCIe Slot 106
NVMe7PCIe Slot 107
Operating Systems
Ethernet Port Boot Order and Device Naming
This section contains information about the boot order and device naming for the Gigabit
Ethernet ports on the back panel of the server. For location information, see “Back Panel
Connector Locations” on page 40.
Ethernet Port Boot Order
The order in which the BIOS detects the Ethernet ports during server boot is listed below.
44Oracle Server X7-2 Service Manual • October 2017
Page 45
Assign Serial Port Output Using the Oracle ILOM Web Interface
Note - You can change the boot priority using the Boot Device Priority screen available in the
Boot menu of the BIOS Setup Utility.
1. NET 0
2. NET 1
3. NET 2
Ethernet Port Device Naming
The device naming for the Ethernet interface is reported differently by different interfaces
and operating systems. The following table shows the BIOS (physical) and operating system
(logical) naming convention for the interface. This naming convention might vary depending on
the conventions of your operating system and which devices are installed in the server.
Note - Naming used by the interfaces might vary from that listed below depending on which
devices are installed in the system.
PortSolarisLinux 6Linux 7Windows (example default
Net 0igb0eth0enp1Ethernet
Net 1igb1eth1enp2Ethernet 2
Net 2igb2eth2enp3Ethernet 3
Note - For Windows, port names like Ethernet, Ethernet 2, and Ethernet 3 are used by default.
names, see note below)
However, actual port naming is based on the order of enumeration, typically during operating
system installation. Additionally, Windows allows you to rename the ports to meet applicationspecific needs.
Back Panel Pinhole Switches
This section shows the location of the back panel pinhole switches.
Troubleshooting and Diagnostics45
Page 46
Getting Help
CalloutIconDescription
1Non-maskable interrupt
2Host Warm Reset
(NMI)
†
3SP Reset
†
Oracle Service use only.
Getting Help
The following sections describe how to get additional help to resolve server-related problems.
■
“Contacting Support” on page 46
■
“Locating the Chassis Serial Number” on page 47
Contacting Support
If the troubleshooting procedures in this chapter fail to solve your problem, use the following
table to collect information that you might need to communicate to support personnel.
46Oracle Server X7-2 Service Manual • October 2017
Page 47
Getting Help
System Configuration
Information Needed
Service contract number
System model
Operating environment
System serial number
Peripherals attached to the system
Email address and phone number
for you and a secondary contact
Street address where the system is
located
Superuser password
Summary of the problem and
the work being done when the
problem occurred
Other Useful Information
IP address
Server name (system host name)
Network or internet domain name
Proxy server configuration
Related Information
Your Information
■
“Locating the Chassis Serial Number” on page 47
Locating the Chassis Serial Number
You might need your server serial number when you ask for service on your system. Record
this number for future use. Use one of the following resources or methods to locate your server
serial number.
■
The serial number is located on the Radio-frequency Identification (RFID) label on the
bottom left side of the front panel bezel, below the general status LEDs.
For illustrations of the server front panel, see “About Controls and Connectors” on page 16.
■
The serial number is recorded on a label that is attached to the top-front surface of the
system.
■
The serial number is recorded on the yellow Customer Information Sheet (CIS) that is
attached to your server packaging.
Troubleshooting and Diagnostics47
Page 48
Auto Service Requests
■
Using Oracle ILOM:
■
From the command-line interface (CLI), type the command: show /System
■
From the web interface, view the serial number on the System Information screen.
Auto Service Requests
Oracle Auto Service Requests (ASR) is a feature available to customers having Oracle Premier
Support and is provided to those customers at no additional cost. Oracle ASR is the fastest
way to restore system availability if a hardware fault occurs. Oracle ASR software is secure
and customer installable, with the software and documentation downloadable from My Oracle
Support at https://support.oracle.com. When you log in to My Oracle Support, refer to the
"Oracle Auto Service Request" knowledge article document (ID 1185493.1) for instructions on
downloading the Oracle ASR software.
When a hardware fault is detected, Oracle ASR opens a service request with Oracle and
transfers electronic fault telemetry data to help expedite the diagnostic process. Oracle
diagnostic capabilities then analyze the telemetry data for known issues and delivers immediate
corrective actions. For security, the electronic diagnostic data sent to Oracle includes only what
is needed to solve the problem. The software does not use any incoming Internet connections
and does not include any remote access mechanisms.
For more information about Oracle ASR, go to: https://www.oracle.com/support/premier/
auto-service-request.html.
48Oracle Server X7-2 Service Manual • October 2017
Page 49
Preparing for Service
These sections describe safety considerations and provide prerequisite procedures and
information to replace components within the server:
■
“Safety Precautions” on page 49
■
“Safety Symbols” on page 50
■
“Electrostatic Discharge Safety” on page 50
■
“FRU Key Identity Properties (KIP) Automated Update” on page 51
■
“Required Tools” on page 52
■
“Preparing the Server for Component Replacement” on page 52
Related Information
■
“Returning the Server to Operation” on page 209
Safety Precautions
For your protection, observe the following safety precautions when setting up your equipment:
■
Follow all standard cautions, warnings, and instructions marked on the equipment and
described in the Oracle Server X7-2 Safety and Compliance Guide and in the printed
Important Safety Information for Oracle's Hardware Systems.
■
Ensure that the voltage and frequency of your power source match the voltage and
frequency inscribed on the equipment's electrical rating label.
■
Follow the electrostatic discharge safety practices as described in “Electrostatic Discharge
Safety” on page 50.
■
Disconnect both power supply cords (if necessary) before servicing components.
Related Information
■
“Safety Symbols” on page 50
Preparing for Service49
Page 50
Safety Symbols
■
“Electrostatic Discharge Safety” on page 50
Safety Symbols
The following symbols might appear in this book. Note their meanings.
Caution - Risk of personal injury or equipment damage. To avoid personal injury or
equipment damage, follow the instructions.
Caution - Hot surface. Avoid contact. Surfaces are hot and might cause personal injury if
touched.
Caution - Hazardous voltages are present. To reduce the risk of electric shock and danger to
personal health, follow the instructions.
Related Information
■
“Safety Precautions” on page 49
■
“Electrostatic Discharge Safety” on page 50
Electrostatic Discharge Safety
Devices that are sensitive to electrostatic discharge (ESD), such as the motherboard, PCIe cards,
drives, processors, and memory cards require special handling.
Caution - The boards and drives contain electronic components that are extremely sensitive to
static electricity. Ordinary amounts of static electricity from clothing or the work environment
can destroy components. Do not touch the components along their connector edges.
Do the following when handling ESD-sensitive components:
■
Use an antistatic wrist strap.
Wear an antistatic wrist strap when handling components such as drive assemblies, boards,
or cards. When servicing or removing server components, disconnect the power cords from
50Oracle Server X7-2 Service Manual • October 2017
the server and then attach an antistatic strap to your wrist and then to a metal area on the
chassis. Following this practice equalizes the electrical potentials between you and the
server.
Note - An antistatic wrist strap is not included in the Ship Kit for the server. However,
antistatic wrist straps are included with optional components.
■
Use an antistatic mat.
Place ESD-sensitive components such as the motherboard, DIMMs (memory modules),
and other printed circuit boards on an antistatic mat. The following items can be used as an
antistatic mat:
■
An antistatic bag used to wrap an Oracle replacement part
■
An Oracle ESD mat (orderable item)
■
A disposable ESD mat (shipped with some replacement parts or optional system
components)
Related Information
■
“Safety Precautions” on page 49
■
“Safety Symbols” on page 50
■
“Preparing the Server for Component Replacement” on page 52
Oracle ILOM includes a key identity properties (KIP) auto-update feature that ensures product
information that is used for service entitlement and warranty coverage is accurately maintained
by the server at all times, including during hardware replacement activities.
The KIP includes the server product name, product part number (PPN), and product serial
number (PSN). The KIP is stored in the FRUID (field-replaceable unit identifiers) container of
the three server FRUs that are designated quorum members.
The quorum members include:
■
Disk backplane (DBP), designated as a primary quorum member.
■
Motherboard (MB), designated as a backup quorum member.
Preparing for Service51
Page 52
Required Tools
■
Power supply (PS), designated as a backup quorum member.
When a server FRU that contains the KIP is removed and a replacement component is installed,
the KIP of the replacement component is programmed by Oracle ILOM to contain the same
KIP as the other two components.
Note - Only one of the quorum members can be replaced at a time. Automated updates can only
be completed when two of the three quorum members contain matching key identity properties.
Related Information
■
“Servicing Power Supplies (CRU)” on page 87
■
“Servicing the Disk Backplane (FRU)” on page 151
■
“Servicing the Motherboard (FRU)” on page 199
Required Tools
The server can be serviced with the following tools:
■
Antistatic wrist strap
■
Antistatic mat
■
No. 2 Phillips screwdriver
■
Torx (6 lobe) T10, T15, T25, and T30 screwdrivers
■
12.0 in-lbs/ft (inch-pounds/foot) torque driver
Related Information
■
“Preparing for Service” on page 49
Preparing the Server for Component Replacement
Before you can remove and install components that are inside the server, you must perform the
procedures in some or all of the following tasks:
52Oracle Server X7-2 Service Manual • October 2017
Page 53
Preparing the Server for Component Replacement
Note - When replacing the storage drives or power supplies, not all of these procedures are
necessary. See the replacement tasks for those components for more information.
■
“Powering Down the Server” on page 53
■
“Disconnect Cables From the Server” on page 58
■
“Extend the Server to the Maintenance Position” on page 59
■
“Remove the Server From the Rack” on page 61
■
“Take Antistatic Measures” on page 61
■
“Remove the Server Top Cover” on page 62
Related Information
■
“Returning the Server to Operation” on page 209
Powering Down the Server
To determine how you want to power off the server, view the options in the following table.
Note - Some server components, such as storage drives, power supplies, and fan modules are
hot-pluggable. Components that are hot-pluggable do not require the server to be powered down
when performing service procedures.
DescriptionLink
Power down the server gracefully to save all data and to
prevent data from being corrupted. Performing a graceful
shutdown ensures that the system is ready for restart.
If the server is not responding, or you must shut down
the server quickly, perform an immediate shutdown.
■ “Power Down Server Gracefully Using the Oracle
ILOM CLI” on page 54
■ “Power Down Server Gracefully Using the Oracle
ILOM Web Interface” on page 55
■ “Power Down Server Gracefully Using the On/
Standby Button” on page 56
■ “Use the On/Standby Button for Immediate
Shutdown” on page 56
■ “Use the Oracle ILOM CLI for Immediate
Shutdown” on page 57
■ “Use the Oracle ILOM Web Interface for Immediate
Shutdown” on page 58
Related Information
■
“Power On the Server” on page 214
Preparing for Service53
Page 54
Power Down Server Gracefully Using the Oracle ILOM CLI
Power Down Server Gracefully Using the Oracle ILOM CLI
Performing a graceful shutdown ensures that all of your data is saved and the system is ready
for restart.
1.
Log in to the server as superuser or equivalent.
Depending on the nature of the problem, you might want to view the system status or the log
files or run diagnostics before you shut down the system. For more information, see the Oracle
ILOM 4.0 Documentation Library at http://www.oracle.com/goto/ilom/docs.
2.
Notify affected users that the server will be powered down.
3.
Save any open files, and quit all running applications.
Refer to your application documentation for specific information on these processes.
4.
Log in to the Oracle ILOM command-line interface (CLI) using an Administrator
account.
Refer to “Accessing Oracle ILOM” in Oracle Servers X7-2 and X7-2L Installation Guide.
5.
At the Oracle ILOM prompt, shut down the operating system:
-> stop /System
Note - The stop / System command does not guarantee a system shutdown. This command
is only a request to the host operating system (OS) to perform a system shutdown. To ensure a
system shutdown, see “Use the Oracle ILOM CLI for Immediate Shutdown” on page 57.
If the system is running the Oracle Solaris OS, refer to the Oracle Solaris system administration
documentation for additional information.
6.
Disconnect the power cords and data cables from the server.
See “Disconnect Cables From the Server” on page 58.
Caution - When you power down the server using Oracle ILOM, the server enters Standby
power mode. Power is still directed to the service processor remote management subsystem and
power supply fans. To completely power off the server, you must disconnect the power cords
from the power supplies.
Related Information
■
“Power Down Server Gracefully Using the Oracle ILOM Web Interface” on page 55
54Oracle Server X7-2 Service Manual • October 2017
Page 55
Power Down Server Gracefully Using the Oracle ILOM Web Interface
■
“Power Down Server Gracefully Using the On/Standby Button” on page 56
■
“Use the On/Standby Button for Immediate Shutdown” on page 56
■
“Power On the Server” on page 214
Power Down Server Gracefully Using the Oracle ILOM Web
Interface
1.
Log in to the server as superuser or equivalent.
Depending on the nature of the problem, you might want to view the system status or the log
files or run diagnostics before you shut down the system. For more information, see the Oracle
ILOM 4.0 Documentation Library at http://www.oracle.com/goto/ilom/docs.
2.
Notify affected users that the server will be powered down.
3.
Save any open files and quit all running applications.
Refer to your application documentation for specific information about these processes.
4.
Log in to the Oracle ILOM web interface using an Administrator account.
Refer to “Accessing Oracle ILOM” in Oracle Servers X7-2 and X7-2L Installation Guide.
The Oracle ILOM web interface System Information → Summary Information page appears.
5.
In the left pane, click Host Management → Power Control, and select Graceful
Shutdown and Power Off from the Action list.
6.
Click Save and then click OK.
The host server performs an orderly power shutdown.
7.
Disconnect the power cords and data cables from the server.
See “Disconnect Cables From the Server” on page 58.
Caution - When you power down the server using Oracle ILOM, the server enters Standby
power mode. Power is still directed to the service processor and power supply fans. To
completely power off the server, you must disconnect the power cords from the power supplies.
Related Information
■
“Power Down Server Gracefully Using the Oracle ILOM CLI” on page 54
■
“Power Down Server Gracefully Using the On/Standby Button” on page 56
Preparing for Service55
Page 56
Power Down Server Gracefully Using the On/Standby Button
■
“Use the On/Standby Button for Immediate Shutdown” on page 56
■
“Power On the Server” on page 214
Power Down Server Gracefully Using the On/Standby Button
1.
Press and quickly release the On/Standby button on the front panel.
This action causes ACPI-enabled operating systems to perform an orderly shutdown of the
operating system. Servers not running ACPI-enabled operating systems shut down to Standby
power mode immediately.
When main power is off, the System OK LED on the front panel will begin flashing,
indicating that the server is in Standby power mode. See “Server System-Level Status
Indicators” on page 29.
2.
Disconnect the power cords and data cables from the server.
See “Disconnect Cables From the Server” on page 58.
Caution - When you power down the server using the On/Standby button, the server enters
Standby power mode. Power is still directed to the service processor and power supply fans. To
completely power off the server, you must disconnect the power cords from the power supplies.
Related Information
■
“Server System-Level Status Indicators” on page 29
■
“Power Down Server Gracefully Using the Oracle ILOM CLI” on page 54
■
“Power Down Server Gracefully Using the Oracle ILOM Web Interface” on page 55
■
“Use the On/Standby Button for Immediate Shutdown” on page 56
■
“Power On the Server” on page 214
Use the On/Standby Button for Immediate Shutdown
Caution - An immediate power down might corrupt system data, therefore, only use this
procedure to power down the server after attempting the graceful power down procedure.
1.
Press and hold the On/Standby button for four seconds to force the Main power
off and to enter Standby power mode.
When Main power is off, the System OK LED on the front panel begins flashing, indicating that
the server is in Standby power mode. See “Server System-Level Status Indicators” on page 29.
56Oracle Server X7-2 Service Manual • October 2017
Page 57
Use the Oracle ILOM CLI for Immediate Shutdown
2.
Disconnect the power and data cables from the server.
See “Disconnect Cables From the Server” on page 58.
Caution - When you press the On/Standby button to enter Standby power mode, power is still
directed to the service processor and power supply fans. To completely power off the server,
you must disconnect the power cords from the power supplies.
Related Information
■
“Server System-Level Status Indicators” on page 29
■
“Power Down Server Gracefully Using the Oracle ILOM CLI” on page 54
■
“Power Down Server Gracefully Using the Oracle ILOM Web Interface” on page 55
■
“Power Down Server Gracefully Using the On/Standby Button” on page 56
■
“Use the Oracle ILOM CLI for Immediate Shutdown” on page 57
■
“Use the Oracle ILOM Web Interface for Immediate Shutdown” on page 58
■
“Power On the Server” on page 214
Use the Oracle ILOM CLI for Immediate Shutdown
Caution - An immediate power down might corrupt system data, therefore, only use this
procedure to power down the server after attempting the graceful power down procedure.
1.
Log in to the Oracle ILOM command-line interface (CLI) using an Administrator
account.
Oracle ILOM displays the default command prompt (->), indicating that you have successfully
logged in to Oracle ILOM.
2.
From the CLI prompt, type the following command:
-> stop -f /System
The server powers down immediately.
3.
Disconnect the power and data cables from the server.
See “Disconnect Cables From the Server” on page 58.
Caution -When the server is powered down, it enters Standby power mode In standby power
mode, power is still directed to the service processor and power supply fans. To completely
power off the server, you must disconnect the power cords from the power supplies.
Preparing for Service57
Page 58
Use the Oracle ILOM Web Interface for Immediate Shutdown
Related Information
■
“Use the On/Standby Button for Immediate Shutdown” on page 56
■
“Use the Oracle ILOM Web Interface for Immediate Shutdown” on page 58
Use the Oracle ILOM Web Interface for Immediate Shutdown
Caution - An immediate power down might corrupt system data, so only use this procedure to
power down the server after attempting the graceful power down procedure.
1.
Log in to the Oracle ILOM web interface using an Administrator account.
The Oracle ILOM web interface System Information page appears.
2.
In the left pane, click Host Management → Power Control, and select Immediate
Power Off in the Select Action list.
3.
Click Save, then click OK.
The server powers down immediately.
4.
Disconnect the power and data cables from the server.
See “Disconnect Cables From the Server” on page 58.
Caution -When the server is powered down, it enters Standby power mode In standby power
mode, power is still directed to the service processor and power supply fans. To completely
power off the server, you must disconnect the power cords from the power supplies.
Related Information
■
“Use the On/Standby Button for Immediate Shutdown” on page 56
■
“Power Down Server Gracefully Using the Oracle ILOM CLI” on page 54
■
“Power On the Server” on page 214
Disconnect Cables From the Server
Caution -The system supplies Standby power to the circuit boards when the power cords are
connected even when the system is powered off.
1.
Label all cables connected to the server.
58Oracle Server X7-2 Service Manual • October 2017
Page 59
Extend the Server to the Maintenance Position
2.
Power down the server.
See “Powering Down the Server” on page 53.
3.
Disconnect the power cords from the rear of the server.
4.
Disconnect all data cables from the rear of the server.
Related Information
■
“Back Panel Components and Cable Connections” on page 18
■
“Powering Down the Server” on page 53
■
“Remove the Server From the Rack” on page 61
■
“Reconnect Data Cables and Power Cords” on page 214
Extend the Server to the Maintenance Position
The following components can be serviced with the server in the maintenance position:
■
Storage drives
■
Fan modules
■
Power supplies
■
PCIe risers
■
PCIe cards and cables
■
SAS cables
■
NVMe cables
■
DIMMs
■
Internal USB flash drive
■
M.2 flash SSDs
■
System battery
■
Processors
■
Disk backplane
■
Front LED indicator module
■
Motherboard assembly
If the server is installed in a rack with extendable slide-rails, use this procedure to extend the
server to the maintenance position.
1.
To prevent the rack from tipping forward when the server is extended, extend all
rack anti-tilt mechanisms.
Preparing for Service59
Page 60
Extend the Server to the Maintenance Position
Refer to “Stabilize the Rack” in Oracle Servers X7-2 and X7-2L Installation Guide.
2.
Verify that no cables will be damaged or will interfere when the server is
extended.
Although the cable management arm (CMA) that is supplied with the server is hinged to
accommodate extending the server, you should ensure that all cables and cords are capable of
extending.
3.
From the front of the server, open and hold the left and right release latch covers
in the open position [1].
When in an open position, the release latch covers engage the slide-rail release latches [1].
Caution - Deploy any rack anti-tilt mechanism before releasing the slide-rail release latches.
4.
While the release latch covers are in the open position, slowly pull the server
forward until the slide-rails latch into a locked position [2].
The server is now in the extended maintenance position.
Related Information
■
“Disconnect Cables From the Server” on page 58
60Oracle Server X7-2 Service Manual • October 2017
Page 61
Remove the Server From the Rack
■
“Remove the Server From the Rack” on page 61
■
“Reinstall the Server Into the Rack” on page 211
Remove the Server From the Rack
Caution - The server weighs approximately 35.0 lbs (15.88 kg). Two people are required to
dismount and carry the chassis.
1.
Disconnect all the cables and power cords from the server.
2.
Remove the cable management arm (CMA).
Refer to “Remove the Cable Management Arm” in Oracle Servers X7-2 and X7-2L Installation
Guide.
3.
Extend the server to the maintenance position.
See “Extend the Server to the Maintenance Position” on page 59.
4.
From the front of the server, pull the green slide-rail release tabs toward the front
of the server, and pull the server out of the rack until it is free of the rack rails.
A slide-rail release tab is located on each slide-rail.
Note - To pull the green slide-rail release tab, place your finger in the center of the tab, not on
the end, and apply pressure as you pull the tab toward the front of the server.
5.
Set the server on a sturdy work surface.
Related Information
■
“Disconnect Cables From the Server” on page 58
■
“Take Antistatic Measures” on page 61
■
“Extend the Server to the Maintenance Position” on page 59
■
“Reinstall the Server Into the Rack” on page 211
Take Antistatic Measures
1.
Prepare an antistatic surface on which to set parts during removal and
installation.
Preparing for Service61
Page 62
Remove the Server Top Cover
Place electrostatic discharge (ESD)-sensitive components such as the printed circuit boards on
an antistatic mat. The following items can be used as an antistatic mat:
■
An antistatic bag used to wrap a replacement part
■
An Oracle ESD mat (orderable item)
■
A disposable ESD mat (shipped with some replacement parts or optional system
components)
2.
Attach an antistatic wrist strap.
When servicing or removing server components, attach an antistatic strap to your wrist and then
to a metal area on the chassis. Then disconnect the power cords from the server.
Note - An antistatic wrist strap is not included in the Ship Kit for the server. However, antistatic
wrist straps might be included with options and components.
Related Information
■
“Remove Antistatic Measures” on page 211
Remove the Server Top Cover
Servicing some components requires that the top cover be removed.
1.
Ensure that the AC power cords are disconnected from the server power
supplies.
Note - Fan modules are hot-pluggable and do not require AC power cables to be disconnected
from the server power supplies.
62Oracle Server X7-2 Service Manual • October 2017
Page 63
Remove the Server Top Cover
2.
If the release button latch is in the locked position, use a Torx T10 screwdriver to
turn the release button latch clockwise to the unlocked position.
3.
Unlatch the top cover [1].
Lift up on the release button on top of the server cover. Lifting the release button causes the
server cover to slide toward the rear of the chassis for easy removal.
4.
Lift up and remove the top cover [2].
Preparing for Service63
Page 64
Remove the Server Top Cover
Related Information
■
“Take Antistatic Measures” on page 61
■
“Install the Server Top Cover” on page 210
64Oracle Server X7-2 Service Manual • October 2017
Page 65
Servicing Storage Drives (CRU)
This section describes how to service storage drives. Storage drives are customer-replaceable
units (CRUs) that do not require you to power off the server. For more information about CRUs,
see “Illustrated Parts Breakdown” on page 19 and “Customer-Replaceable Units” on page 21.
■
“Storage Drives Hot-Plug Conditions” on page 65
■
“Storage Drive Failure and RAID” on page 66
■
“Storage Drive Locations and Numbering” on page 66
■
“Removing and Replacing a Storage Drive” on page 67
■
“Removing and Replacing an NVMe Storage Drive Using Oracle Solaris” on page 71
■
“Removing and Replacing an NVMe Storage Drive Using Oracle Linux” on page 74
■
“Removing and Replacing an NVMe Storage Drive Using Microsoft Windows
Server” on page 78
Related Information
■
“Remove and Install Filler Panels” on page 209
Storage Drives Hot-Plug Conditions
The hard-disk drives (HDDs), solid-state drives (SSDs), or NVMe SSD drives that are installed
in the server are in most cases hot-pluggable. This capability depends on how the drives are
configured and whether the drive is an NVMe device. To hot-plug a drive you must be able to
take the drive offline before you can remove it. When you take the drive offline, you prevent
any application from accessing the drive and remove the logical software links to the drive. For
an NVMe storage drive, you must not only take the drive offline, but also power down the drive
slot.
The following conditions inhibit the ability to perform hot-plugging of a drive:
■
The drive provides the operating system, and the operating system is not mirrored on
another drive.
Servicing Storage Drives (CRU)65
Page 66
Storage Drive Failure and RAID
■
The drive cannot be logically isolated from the online operations of the server.
■
The operating system does not support hot plug for the drive.
If any of these disk drive conditions are true, then you must shut down the system before you
replace the drive. See “Powering Down the Server” on page 53.
Note - Replacing a storage drive does not require removing the server from a rack.
Related Information
■
“Storage Drive Status Indicators” on page 31
■
“Removing and Replacing a Storage Drive” on page 67
■
“Removing and Replacing an NVMe Storage Drive Using Oracle Solaris” on page 71
■
“Removing and Replacing an NVMe Storage Drive Using Oracle Linux” on page 74
■
“Removing and Replacing an NVMe Storage Drive Using Microsoft Windows
Server” on page 78
Storage Drive Failure and RAID
A single storage drive failure does not cause a data failure if the storage drives are configured
as a mirrored RAID 1 volume (optional). The storage drive can be removed, and when a new
storage drive is inserted, the contents are automatically rebuilt from the rest of the array with no
need to reconfigure the RAID parameters. If the replaced storage drive was configured as a hotspare, the new storage drive is automatically configured as a new hot-spare.
For instructions for configuring RAID on the server, refer to “Configuring Storage Drives for
Operating System Installation” in Oracle Servers X7-2 and X7-2L Installation Guide.
Storage Drive Locations and Numbering
The following illustration shows the location of the drives and the internal system software
designations for the drives. For information on storage drive status indicators (LEDs), see
“Storage Drive Status Indicators” on page 31.
66Oracle Server X7-2 Service Manual • October 2017
Page 67
CalloutDescription
1Storage drive 0 HDD/SSD/NVMe
2Storage drive 1 HDD/SSD/NVMe
3Storage drive 2 HDD/SSD/NVMe
4Storage drive 3 HDD/SSD/NVMe
5Storage drive 4 HDD/SSD/NVMe
6Storage drive 5 HDD/SSD/NVMe
7Storage drive 6 HDD/SSD/NVMe
8Storage drive 7 HDD/SSD/NVMe
Removing and Replacing a Storage Drive
Note - The NVMe storage drives are labeled NVMe0 through NVMe7; however, operating
systems installed on the server assign different names to the NVMe storage drives. For the
corresponding names assigned to the NVMe storage drives by server operating systems, see
“Server Operating System Names for the NVMe Storage Drives” on page 44.
Removing and Replacing a Storage Drive
The following sections describe how to remove and replace an HDD or SSD storage drive.
■
“Remove a Storage Drive” on page 67
■
“Install a Storage Drive” on page 70
Remove a Storage Drive
1.
Prepare the system for the drive removal.
Servicing Storage Drives (CRU)67
Page 68
Remove a Storage Drive
See “Take Antistatic Measures” on page 61.
2.
Identify the location of the drive that you want to remove.
For storage drive locations, see “Storage Drive Locations and Numbering” on page 66.
3.
Remove the storage drive.
■If you are removing an HDD or SSD storage drive, type the operating system
Note - The blue OK to Remove status indicator LED on the drive may or may not light, as
support for this varies depending on the operating system in use. For information on storage
drive status indicators (LEDs), see “Storage Drive Status Indicators” on page 31.
■If you are removing an NVMe storage drive, see the procedures for your
commands required to stop the server from using the drive; otherwise,
proceed to the next step.
The exact commands required depend on the configuration of your drives. Unmount file
systems or issue RAID commands as needed.
operating system.
Note - NVMe storage drives are supported only on servers that are running Oracle Solaris,
Oracle Linux, Oracle VM, or Microsoft Windows Server. Servers that are running Red Hat
Enterprise Linux do not support NVMe drives.
■
“Removing and Replacing an NVMe Storage Drive Using Oracle
Solaris” on page 71
■
“Removing and Replacing an NVMe Storage Drive Using Oracle
Linux” on page 74
■
“Removing and Replacing an NVMe Storage Drive Using Microsoft Windows
Server” on page 78
68Oracle Server X7-2 Service Manual • October 2017
Page 69
Remove a Storage Drive
4.
On the drive that you plan to remove, push the latch release button to open the
drive latch [1].
Caution - The latch is not an ejector. Do not open the latch too far to the right. Doing so can
damage the latch.
5.
Grasp the latch and pull the drive out of the drive slot [2].
6.
Consider your next steps:
■
If you are replacing the drive, continue to “Install a Storage Drive” on page 70.
■
If you are not replacing the drive, install a filler panel in the empty drive slot to maintain
proper airflow and perform administrative tasks to configure the server to operate without
the drive.
For information on how to install a storage drive filler panel, see “Remove and Install Filler
Panels” on page 209.
Related Information
■
“Storage Drive Status Indicators” on page 31
■
“Storage Drives Hot-Plug Conditions” on page 65
■
“Storage Drive Failure and RAID” on page 66
■
“Install a Storage Drive” on page 70
Servicing Storage Drives (CRU)69
Page 70
Install a Storage Drive
1.
2.
3.
4.
Install a Storage Drive
Remove the replacement drive from its packaging and place the drive on an
antistatic mat.
If necessary, remove the drive filler panel.
Align the replacement drive with the drive slot.
The drive is physically addressed according to the slot in which it is installed. It is important to
install a replacement drive in the same slot as the drive that was removed.
Slide the drive into the slot until the drive is fully seated [1].
5.
Close the drive latch to lock the drive in place [2].
6.
Perform administrative tasks to reconfigure the drive.
The procedures that you perform at this point depend on how your data is configured. You
might need to partition the drive, create file systems, load data from backups, or have the drive
updated from a RAID configuration.
Related Information
■
“Storage Drive Status Indicators” on page 31
■
“Storage Drives Hot-Plug Conditions” on page 65
■
“Storage Drive Failure and RAID” on page 66
■
“Remove a Storage Drive” on page 67
70Oracle Server X7-2 Service Manual • October 2017
Page 71
Removing and Replacing an NVMe Storage Drive Using Oracle Solaris
Removing and Replacing an NVMe Storage Drive Using
Oracle Solaris
The following sections describe how to remove and replace an NVMe storage drive on a server
that is running the Oracle Solaris operating system.
■
“Unmount an NVMe Storage Drive” on page 71
■
“Remove an NVMe Storage Drive” on page 72
■
“Verify Removal of an NVMe Storage Drive” on page 72
■
“Install an NVMe Storage Drive” on page 72
■
“Power On an NVMe Storage Drive and Attach a Device Driver” on page 73
Unmount an NVMe Storage Drive
1.
Log in to Oracle Solaris that is running on the server.
2.
To find the NVMe drive slot number, type:
# hotplug list –lc
This command produces output similar to the following for each of the NVMe drives installed
in the server:
# hotplug list –lc
Connection State Description Path
Verify that the OK to Remove indicator (LED) on the NVMe storage drive is lit.
Servicing Storage Drives (CRU)71
Page 72
Remove an NVMe Storage Drive
Remove an NVMe Storage Drive
Perform this procedure to physically remove an NVMe storage drive from the server.
1.
Identify the physical location of the NVMe drive that you want to remove.
For storage drive locations, see “Storage Drive Locations and Numbering” on page 66.
2.
Verify that the OK to Remove indicator (LED) on the NVMe storage drive is lit.
3.
On the drive that you plan to remove, push the latch release button to open the
drive latch.
4.
Grasp the latch and pull the drive out of the drive slot.
See “Remove a Storage Drive” on page 67.
5.
Consider your next steps:
■
If you are replacing the drive, continue to “Install an NVMe Storage Drive” on page 72.
■
If you are not replacing the drive, install a filler panel in the empty drive slot to maintain
proper airflow and perform administrative tasks to configure the server to operate without
the drive.
For information on how to install a storage drive filler panel, see “Remove and Install Filler
Panels” on page 209.
Verify Removal of an NVMe Storage Drive
To verify that the NVMe drive has been removed, type:
# hotplug list –lc
The following output appears for the NVMe drive that you removed:
# hotplug list –lc
Connection State Description Path
Perform this procedure to physically install an NVMe storage drive into the server.
72Oracle Server X7-2 Service Manual • October 2017
Page 73
Power On an NVMe Storage Drive and Attach a Device Driver
Note - After you physically remove an NVMe storage drive from the server, wait at least 10
seconds before installing a replacement drive.
1.
Remove the replacement drive from its packaging and place the drive on an
antistatic mat.
2.
If necessary, remove the drive filler panel.
3.
Align the replacement drive with the drive slot.
The drive is physically addressed according to the slot in which it is installed. It is important to
install a replacement drive in the same slot as the drive that was removed.
4.
Slide the drive into the slot until the drive is fully seated.
See “Install a Storage Drive” on page 70.
5.
Close the drive latch to lock the drive in place.
See “Install a Storage Drive” on page 70.
Power On an NVMe Storage Drive and Attach a
Device Driver
1.
Type: # hotplug enable Slot100
2.
Type: # hotplug list –lc
The following status is displayed for the NVMe storage drive.
# hotplug list –lc
Connection State Description Path
In the above listing the PCIe addresses are highlighted in bold text.
b.
To obtain the slot number (APIC ID) for the bus address, type the following
command to list all of the PCIe slot numbers and the corresponding bus
addresses:
74Oracle Server X7-2 Service Manual • October 2017
Page 75
Unmount an NVMe Storage Drive
# egrep –H ‘.*’ /sys/bus/pci/slots/*/address
This commands returns output similar to the following:
In the above output, the bus addresses for the corresponding NVMe instances are
highlighted in bold text.
Note - In the above output, notice that the instance names for the NVMe drives do not
correspond to the NVMe drive labels on the front of the server, that is, pci/slots/12/address:
0000:b2:00 corresponds to instance nvme0; however, on the front of the server, this drive is
labeled NVMe2. For a table that shows the relationship between the pci/slot# and the NVMe
storage drive label on the front of the server, see “Server Operating System Names for the
NVMe Storage Drives” on page 44.
c.
Disconnect all users of the NVMe drive and back up the NVMe drive data as
needed.
i.
Use the umount command to unmount any file systems that are mounted
on the device.
ii.
Remove the device from any multiple device (md) and Logical Volume
Manager (LVM) volume using it.
If the device is a member of an LVM Volume group, then it might be necessary
to move data off the device using the pvmove command, then use the vgreduce
command to remove the physical volume, and (optionally) pvremove to remove the
LVM metadata from the disk.
iii.
If the device uses multipathing, run multipath -l and note all the paths
to the device. Then, remove the multipathed device using the multipath
-f device command.
iv.
Run the blockdev --flushbufs device command to flush any outstanding
I/O to all paths to the device.
3.
To prepare the NVMe drive for removal, that is, to detach the NVMe device driver
and power off the NVMe drive slot, type:
Servicing Storage Drives (CRU)75
Page 76
Remove an NVMe Storage Drive
# echo 0 > /sys/bus/pci/slots/$slot/power
Where $slot is the slot number obtained in step Step 2b above.
4.
Verify that the OK to Remove indicator (LED) on the NVMe drive is lit.
Remove an NVMe Storage Drive
Perform this procedure to physically remove an NVMe storage drive from the server.
1.
Identify the location of the NVMe drive that you want to remove.
For storage drive locations, see “Storage Drive Locations and Numbering” on page 66.
2.
Verify that the OK to Remove indicator (LED) on the NVMe drive is lit.
3.
On the NVMe drive you plan to remove, push the latch release button to open the
drive latch.
4.
Grasp the latch and pull the drive out of the drive slot.
“Remove a Storage Drive” on page 67.
5.
Consider your next steps:
■
If you are replacing the drive, continue to “Verify Removal of an NVMe Storage
Drive” on page 76.
■
If you are not replacing the drive, install a filler panel in the empty drive slot to maintain
proper airflow and perform administrative tasks to configure the server to operate without
the drive.
For information on how to install a storage drive filler panel, see “Remove and Install Filler
Panels” on page 209.
Verify Removal of an NVMe Storage Drive
1.
To check the NVMe drive enumeration to verify that the NVMe drive has been
removed, type:
# lspci -nnd :0953
2.
View the command output and verify that the entry for the slot number that was
disabled no longer appears.
76Oracle Server X7-2 Service Manual • October 2017
Page 77
Install an NVMe Storage Drive
This commands returns output similar to the following:
Perform this procedure to physically install an NVMe storage drive into the server.
After you physically remove an NVMe drive from the server, wait at least 10 seconds before
installing a replacement drive.
1.
Remove the replacement drive from its packaging and place the drive on an
antistatic mat.
2.
If necessary, remove the drive filler panel.
3.
Align the replacement drive with the drive slot.
The drive is physically addressed according to the slot in which it is installed. It is important to
install a replacement drive in the same slot as the drive that was removed.
4.
Slide the drive into the slot until the drive is fully seated.
See “Install a Storage Drive” on page 70.
5.
Close the drive latch to lock the drive in place [2].
See “Install a Storage Drive” on page 70.
Power On an NVMe Storage Drive and Attach a
Device Driver
To power on the slot and attach the device driver, type:
# echo 1 /sys/bus/pci/slots/$slot/power
Where $slot is the slot number for the NVMe storage drive.
Servicing Storage Drives (CRU)77
Page 78
Verify Operation of an NVMe Storage Drive
Verify Operation of an NVMe Storage Drive
To verify that an NVMe drive is operating properly, do one of the following:
■
Check the /var/log/messages log file.
■
Type: # ls -l /dev/nvme*
Related Information
■
“Storage Drive Status Indicators” on page 31
■
“Storage Drives Hot-Plug Conditions” on page 65
■
“Storage Drive Failure and RAID” on page 66
■
“Storage Drive Locations and Numbering” on page 66
Removing and Replacing an NVMe Storage Drive Using
Microsoft Windows Server
The following sections describe how to remove and replace an NVMe storage drive on a server
that is running the Microsoft Windows Server operating system.
Note - NVMe storage drive hot plug is not supported for an Oracle Server X7-2 running
Microsoft Windows Server. The system must be powered down before removing and replacing
an NVMe storage drive.
■
“Remove an NVMe Storage Drive” on page 78
■
“Install an NVMe Storage Drive” on page 79
Remove an NVMe Storage Drive
Perform this procedure to physically remove an NVMe storage drive from the server.
1.
Power down the server that contains the storage drive to be removed.
See “Powering Down the Server” on page 53.
2.
On the NVMe drive you plan to remove, push the latch release button to open the
drive latch.
78Oracle Server X7-2 Service Manual • October 2017
Page 79
Install an NVMe Storage Drive
See “Remove a Storage Drive” on page 67.
3.
Grasp the latch and pull the drive out of the drive slot.
4.
Consider your next steps:
■
If you are replacing the drive, continue to “Install an NVMe Storage Drive” on page 79.
■
If you are not replacing the drive, install a filler panel in the empty drive slot to maintain
proper airflow and perform administrative tasks to configure the server to operate without
the drive.
For information on how to install a storage drive filler panel, see “Remove and Install Filler
Panels” on page 209.
Install an NVMe Storage Drive
Perform this procedure to physically install an NVMe storage drive into the server.
1.
Remove the replacement drive from its packaging and place the drive on an
antistatic mat.
2.
If necessary, remove the drive filler panel.
3.
Align the replacement drive with the drive slot.
The drive is physically addressed according to the slot in which it is installed. It is important to
install a replacement drive in the same slot as the drive that was removed.
4.
Slide the drive into the slot until the drive is fully seated.
See “Install a Storage Drive” on page 70.
5.
Close the drive latch to lock the drive in place.
6.
Power on the server.
See “Power On the Server” on page 214.
Verify that the power supply AC OK LED is lit.
Related Information
■
“Storage Drive Status Indicators” on page 31
■
“Storage Drives Hot-Plug Conditions” on page 65
Servicing Storage Drives (CRU)79
Page 80
Install an NVMe Storage Drive
■
“Storage Drive Failure and RAID” on page 66
■
“Storage Drive Locations and Numbering” on page 66
80Oracle Server X7-2 Service Manual • October 2017
Page 81
Servicing Fan Modules (CRU)
This section describes how to service fan modules. Fan modules are customer-replaceable units
(CRUs) that do not require you to power off the server. For more information about CRUs, see
“Illustrated Parts Breakdown” on page 19 and “Customer-Replaceable Units” on page 21.
The fan modules are located in the front of the server. See the following procedures:
■
“Remove a Fan Module” on page 81
■
“Install a Fan Module” on page 84
Related Information
■
“Remove the Server Top Cover” on page 62
Remove a Fan Module
You do not have to power off the server to service the fan modules.
You should not begin this procedure unless you have a replacement fan and you are ready to
install it right away.
Caution - When removing and replacing a fan module in the Oracle Server X7-2, you must
complete the entire procedure within 20 seconds to maintain adequate cooling within the
system. Remove and replace only one fan module at a time. Ensure that you have obtained the
replacement fan module and that is it is ready for installation before starting the replacement
procedure. Each fan module contains two fans, with two fan motors per fan. The four fan
motors provide separate tachometer signals so that the fan module reports four tach signals to
Oracle ILOM. Even if only one fan motor is faulted within the fan module, the Oracle ILOM
service processor detects that four fan motors have failed to spin while the fan module is
removed. If the fan module is not replaced within 20 seconds of removal, Oracle ILOM will
take the protective action to shut down the system to prevent thermal damage to the system.
Servicing Fan Modules (CRU)81
Page 82
Remove a Fan Module
1.
Extend the server to the maintenance position.
See “Extend the Server to the Maintenance Position” on page 59.
2.
To access the fan modules, remove the server top cover.
See “Remove the Server Top Cover” on page 62.
3.
Identify the faulty fan module.
Each fan module has a fan status indicator (LED) that is located next to the module. If the LED
is amber, the fan has failed. The location of the fan status LED is shown in the following figure.
LED Color and State Meaning
Amber – The fan module is faulty. The front Top Fan LED and the front and back panel Fault-Service Required
LEDs are also lit if the system detects a fan module fault.
4.
Remove the faulty fan module.
a.
Using your thumb and forefinger, grasp the fan module in the finger
recesses located in the plastic between the fans.
b.
Lift the fan module straight up and out of the chassis.
82Oracle Server X7-2 Service Manual • October 2017
Page 83
Remove a Fan Module
Caution - When removing a fan module, do not rock it back and forth. Rocking a fan module
can cause damage to the motherboard connectors.
5.
Set the fan module aside.
Caution - Do not service any other components in the fan compartment unless the system is
shut down and the power cords are removed.
6.
Consider your next step:
■
If you removed the fan assembly as part of another procedure, return to that procedure.
■
Otherwise, continue to “Install a Fan Module” on page 84.
Related Information
■
“Install a Fan Module” on page 84
Servicing Fan Modules (CRU)83
Page 84
Install a Fan Module
Install a Fan Module
1.
Remove the replacement fan module from its packaging and place it on an
antistatic mat.
2.
With the server top cover removed, position the replacement fan module into the
server.
The fan modules are keyed to ensure that they are installed in the correct orientation.
3.
Press down on the fan module and apply firm pressure to fully seat the fan
module.
4.
Verify that the fan module status indicator (LED) that aligns with the replaced fan
module is extinguished.
5.
Install the server top cover.
See “Install the Server Top Cover” on page 210.
6.
Verify that the Top Fan Fault LED on the front of the server and the Fault-Service
Required LEDs on the front and back of the server are extinguished.
See “Troubleshooting Using the Server Front and Back Panel Status Indicators” on page 28 for
more information about identifying and interpreting system indicators.
84Oracle Server X7-2 Service Manual • October 2017
Page 85
7.
Consider your next step:
■
If you installed the fan module as part of another procedure, return to that procedure.
■
Otherwise, return the server to operation. See “Returning the Server to
Operation” on page 209.
Related Information
■
“Remove a Fan Module” on page 81
Install a Fan Module
Servicing Fan Modules (CRU)85
Page 86
86Oracle Server X7-2 Service Manual • October 2017
Page 87
Servicing Power Supplies (CRU)
This section describes how to service power supplies. Power supplies are customer-replaceable
units (CRUs) that do not require you to power off the server. For more information about CRUs,
see “Illustrated Parts Breakdown” on page 19 and “Customer-Replaceable Units” on page 21.
The power supplies are located at the back of the server. See the following procedures:
■
“Power Supply Overview” on page 87
■
“Remove a Power Supply” on page 88
■
“Install a Power Supply” on page 90
Power Supply Overview
The server's redundant power supplies support concurrent maintenance, which enables you to
remove and replace a power supply without shutting down the server, provided that the other
power supply is online and working.
The server supports model A266 (1200 watt) power supplies. The A266 power supply unit
(PSU) provides conversion from the AC lines to the system, accepting ranges from 100-240
volts AC (VAC). These PSUs are designed to be hot-pluggable, and in most cases will provide
fully redundant “1+1” power, allowing the system to suffer the loss of a PSU or an AC feed
with no loss to system availability.
In maximally configured systems, it is possible that the worst-case power consumption of the
system could exceed the capacity of a single PSU. The PSUs provide an over-subscription
mode, which allows the system to operate with fault-tolerance even with modest excursions
beyond the rated capacity of a single PSU. This over-subscription support is accomplished
using hardware signaling between the PSU and motherboard circuitry, which can force the
system to throttle processor (CPU) and memory power in the event that a PSU is lost. The
resulting power savings will be enough to allow the system to continue to run (in a lowerperformance state) until the power problem is resolved.
The following status indicators (LEDs) are lit when a power supply fault is detected:
Servicing Power Supplies (CRU)87
Page 88
Remove a Power Supply
■
Front and back Fault-Service Required LEDs
■
Amber Fault-Service Required LED on the faulty power supply
■
Rear Power Supply Fault-Service Required LED located on the server front panel
For more information, see “Power Supply Status Indicators” on page 31.
If a power supply fails and you do not have a replacement available, leave the failed power
supply installed to ensure proper airflow in the server.
Related Information
■
“Troubleshooting Using the Server Front and Back Panel Status Indicators” on page 28
Remove a Power Supply
1.
Identify which power supply requires replacement.
CalloutDescription
1Power supply 0
2Power supply 1
88Oracle Server X7-2 Service Manual • October 2017
Page 89
Remove a Power Supply
A lit amber Fault-Service Required LED on a power supply indicates that a failure was
detected. You can also use the Oracle ILOM showfaulty command at the Oracle ILOM
command line prompt (->) to identify a power supply failure.
Alternatively, to list all known faults in the server, log in to the Oracle Solaris OS and issue
the fmadm faulty command, or log in to the Oracle ILOM service processor from the Oracle
ILOM Fault Management Shell and issue the fmadm faulty command. For more information
about how to use the Oracle ILOM Fault Management Shell and supported commands, refer to
the Oracle ILOM User's Guide for System Monitoring and Diagnostics Firmware Release 4.0.x
in the Oracle Integrated Lights Out Manager (ILOM) 4.0 Documentation Library at http://
www.oracle.com/goto/ilom/docs.
Note - The fans of a failed power supply might still be spinning when the system is powered on.
You can remove a power supply while the fans are still spinning.
2.
Gain access to the rear of the server where the faulty power supply is located.
3.
If the cable management arm (CMA) is installed, disconnect both CMA left-side
connectors and move the CMA out of the way.
Refer to “Remove the Cable Management Arm” in Oracle Servers X7-2 and X7-2L Installation
Guide.
Caution - When disconnecting the CMA left-side connectors, use your arm to support the
CMA so that it does not hang down under its own weight and stress the right-side connectors;
otherwise, the CMA might be damaged. You must continue to support the CMA until you have
reconnected both of the left-side connectors.
4.
Disconnect the power cord from the faulty power supply.
Servicing Power Supplies (CRU)89
Page 90
Install a Power Supply
5.
6.
Grasp the power supply handle and push the power supply latch to the left [1].
Pull the power supply out of the chassis [2].
Caution - Whenever you remove a power supply, you should replace it with another power
supply; otherwise, the server might overheat due to improper airflow.
7.
Continue to “Install a Power Supply” on page 90.
Related Information
■
“Back Panel Components and Cable Connections” on page 18
■
“Power Supply Status Indicators” on page 31
■
“Install a Power Supply” on page 90
Install a Power Supply
Caution - Always replace a faulty power supply with a power supply of the same type (model).
1.
Remove the replacement power supply from its packaging and place it on an
antistatic mat.
2.
Align the replacement power supply with the empty power supply slot [1].
90Oracle Server X7-2 Service Manual • October 2017
Page 91
3.
Slide the power supply into the bay until it is fully seated [2].
You will hear an audible click when the power supply fully seats.
4.
Reconnect the power cord to the power supply.
Install a Power Supply
5.
Verify that the amber Fault-Service Required LED on the replaced power supply
and the Fault-Service Required LEDs on the front and back panels of the server
are not lit.
Note - After you have replaced Power Supply 0, you must reset the Oracle ILOM service
processor (SP) to propagate the key identity properties (KIP) data to the new power supply. For
instructions on resetting the SP, refer to the Oracle Integrated Lights Out Manager (ILOM) 4.0Configuration and Maintenance Guide Firmware Release 4.0.x at http://www.oracle.com/
goto/ilom/docs. Power Supply 1 does not contain KIP data, and therefore does not require an
SP reset after replacement.
6.
If you disconnected the two CMA left-side connectors, reconnect the
connectors.
Refer to “Install the Cable Management Arm (Optional)” in Oracle Servers X7-2 and X7-2L
Installation Guide.
Related Information
■
“Back Panel Components and Cable Connections” on page 18
■
“Power Supply Status Indicators” on page 31
Servicing Power Supplies (CRU)91
Page 92
Install a Power Supply
■
“Remove a Power Supply” on page 88
92Oracle Server X7-2 Service Manual • October 2017
Page 93
Servicing an Internal USB Flash Drive (CRU)
This section describes how to service a USB flash drive. The USB flash drive is a customerreplaceable unit (CRU) that requires you to power off the server. For more information
about CRUs, see “Illustrated Parts Breakdown” on page 19 and “Customer-Replaceable
Units” on page 21.
Caution - These procedures require that you handle components that are sensitive to static
discharge. This sensitivity can cause the component to fail. To avoid damage, ensure that you
follow antistatic practices as described in “Electrostatic Discharge Safety” on page 50.
Caution - Ensure that all power is removed from the server before removing or installing a
flash drive or damage to the drive might occur. You must disconnect all power cables from the
system before performing these procedures.
This section covers the following procedures:
■
“Remove an Internal USB Flash Drive” on page 93
■
“Install an Internal USB Flash Drive” on page 94
Related Information
■
“USB Ports” on page 221
Remove an Internal USB Flash Drive
The server can be equipped with one internal USB flash drive.
1.
Prepare the server for service.
a.
Power off the server, and disconnect the power cords from the power
supplies.
See “Powering Down the Server” on page 53.
Servicing an Internal USB Flash Drive (CRU)93
Page 94
Install an Internal USB Flash Drive
b.
Extend the server to the maintenance position.
See “Extend the Server to the Maintenance Position” on page 59.
c.
Attach an antistatic wrist strap to your wrist and then to a metal area on the
chassis.
See “Take Antistatic Measures” on page 61.
d.
Remove the server top cover.
See “Remove the Server Top Cover” on page 62.
2.
Grasp the USB flash drive and pull it out of the internal USB port.
Related Information
■
“Remove an Internal USB Flash Drive” on page 93
Install an Internal USB Flash Drive
1.
Unpack the replacement USB flash drive.
94Oracle Server X7-2 Service Manual • October 2017
Page 95
2.
Insert the USB flash drive into the internal USB port.
3.
Return the server to operation.
a.
Install the server top cover.
See “Install the Server Top Cover” on page 210.
b.
Return the server to the normal rack position.
See “Return the Server to the Normal Rack Position” on page 212.
c.
Reconnect the power cords to the server power supplies and reconnect the
data cables.
See “Reconnect Data Cables and Power Cords” on page 214.
d.
Power on the server.
See “Power On the Server” on page 214.
Verify that the power supply AC OK LED is lit.
Install an Internal USB Flash Drive
Related Information
■
“Install an Internal USB Flash Drive” on page 94
Servicing an Internal USB Flash Drive (CRU)95
Page 96
96Oracle Server X7-2 Service Manual • October 2017
Page 97
Servicing the DIMMs (CRU)
This section describes how to service memory modules (DIMMs). DIMMs are customerreplaceable units (CRUs) that require you to power off the server. For more information
about CRUs, see “Illustrated Parts Breakdown” on page 19 and “Customer-Replaceable
Units” on page 21.
The Oracle Server X7-2 supports a variety of DDR4 DIMM configurations that can include
quad-rank (QR) and dual-rank (DR) DDR4 DIMMs.
Caution - These procedures require that you handle components that are sensitive to
electrostatic discharge. This sensitivity can cause the components to fail. To avoid
damage, ensure that you follow antistatic practices as described in “Electrostatic Discharge
Safety” on page 50.
Caution - Ensure that all power is removed from the server before removing or installing
DIMMs, or damage to the DIMMs might occur. You must disconnect all power cables from the
system before performing these procedures.
The following topics and procedures provide information to assist you when replacing a DIMM
or upgrading DIMMs:
■
“DIMM and Processor Physical Layout” on page 98
■
“DIMM Population Scenarios” on page 99
■
“DIMM Population Rules” on page 99
■
“Populating DIMMs for Optimal System Performance” on page 100
■
“Populating DIMMs in Single-Processor Systems for Optimal System
Performance” on page 100
■
“Populating DIMMs in Dual-Processor Systems for Optimal System
Performance” on page 102
■
“DIMM Operating Speeds” on page 103
■
“DIMM Rank Classification Labels” on page 103
■
“Inconsistencies Between DIMM Fault Indicators and the BIOS Isolation of Faulty
DIMMs” on page 104
Servicing the DIMMs (CRU)97
Page 98
DIMM and Processor Physical Layout
■
“Using the Server Fault Remind Button” on page 104
■
“Identify and Remove the Failed DIMM” on page 105
■
“Install a DIMM” on page 107
Related Information
■
“Servicing Processors (FRU)” on page 137
DIMM and Processor Physical Layout
The physical layout of the DIMMs and processor(s) is shown in the following figure. When
viewing the server from the front, processor 0 (P0) is on the left.
Note - In single-processor systems, the DIMM sockets associated with processor 1 (P1) are
nonfunctional and should not be populated with DIMMs.
Related Information
■
“DIMM Population Scenarios” on page 99
98Oracle Server X7-2 Service Manual • October 2017
Page 99
■
“DIMM Population Rules” on page 99
■
“Populating DIMMs for Optimal System Performance” on page 100
■
“DIMM Operating Speeds” on page 103
■
“DIMM Rank Classification Labels” on page 103
■
“Inconsistencies Between DIMM Fault Indicators and the BIOS Isolation of Faulty
DIMMs” on page 104
DIMM Population Scenarios
There are two scenarios in which you are required to populate DIMMs:
■
A DIMM fails and needs to be replaced.
In this scenario, you can use the Fault Remind button to determine the failed DIMM, then
remove the failed DIMM and replace it. To ensure that system performance is maintained,
you must replace the failed DIMM with a DIMM of the same size (in gigabytes) and type
(quad-rank or dual-rank). In this scenario, you should not change the DIMM configuration.
■
You have purchased new DIMMs and you want to use them to upgrade the server's memory.
In this scenario, you should adhere to DIMM population rules and follow the recommended
DIMM population order for optimal system performance.
DIMM Population Scenarios
Related Information
■
“DIMM Population Rules” on page 99
■
“Populating DIMMs for Optimal System Performance” on page 100
■
“Using the Server Fault Remind Button” on page 104
DIMM Population Rules
The population rules for adding DIMMs to the server are as follows:
■
The server supports 64-GB quad-rank (QR) Load-Reduced DIMMs (LRDIMMs) and 32GB dual-rank (DR) and 16-GB DR Registered DIMMs (RDIMMs).
■
Do not mix 64-GB LRDIMMs and 16-GB or 32-GB RDIMMs in the same server. If the
server has 64-GB LRDIMMs installed, you cannot install 16-GB or 32-GB RDIMMs and
vice versa.
Servicing the DIMMs (CRU)99
Page 100
Populating DIMMs for Optimal System Performance
■
Within a memory channel, DIMMs must be populated in the black sockets first, and then in
the white sockets.
■
If you are installing a mix of RDIMM sizes (for example, 16-GB and 32-GB), install the
larger size RDIMMs first, then the smaller RDIMMs, until all of the RDIMMs are installed.
■
The server operates properly with a minimum of one DIMM installed per processor.
Populating DIMMs for Optimal System Performance
Optimal performance is generally achieved by populating the DIMMs so that the memory is
symmetrical, or balanced. Symmetry is achieved by adhering to the following guidelines:
■
In single-processor systems, populate DIMMs of the same size in multiples of six.
■
In dual-processor systems, populate DIMMs of the same size in multiples of twelve.
■
Populate the DIMM sockets in the order described in the following sections.
The following sections provide an example of how to populate the DIMM sockets to achieve
optimal system performance.
Note - Not all possible configurations are shown here.
■
“Populating DIMMs in Single-Processor Systems for Optimal System
Performance” on page 100
■
“Populating DIMMs in Dual-Processor Systems for Optimal System
Performance” on page 102
Populating DIMMs in Single-Processor Systems for Optimal
System Performance
In single-processor systems, install DIMMs only into DIMM sockets associated with processor
0 (P0). Starting with socket P0 D7, first fill the black sockets, and then fill the white sockets, as
shown in the following figure.
100Oracle Server X7-2 Service Manual • October 2017
Loading...
+ hidden pages
You need points to download manuals.
1 point = 1 manual.
You can buy points or you can get point for every manual you upload.