Oracle MiniCluster S7-2 Service Manual

Page 1

Oracle® MiniCluster S7-2 Service Manual

Part No: E76798-02
March 2017
Page 2
Page 3
Oracle MiniCluster S7-2 Service Manual
Part No: E76798-02
Copyright © 2016, 2017, Oracle and/or its affiliates. All rights reserved.
The information contained herein is subject to change without notice and is not warranted to be error-free. If you find any errors, please report them to us in writing.
If this is software or related documentation that is delivered to the U.S. Government or anyone licensing it on behalf of the U.S. Government, then the following notice is applicable:
U.S. GOVERNMENT END USERS: Oracle programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation, delivered to U.S. Government end users are "commercial computer software" pursuant to the applicable Federal Acquisition Regulation and agency-specific supplemental regulations. As such, use, duplication, disclosure, modification, and adaptation of the programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation, shall be subject to license terms and license restrictions applicable to the programs. No other rights are granted to the U.S. Government.
This software or hardware is developed for general use in a variety of information management applications. It is not developed or intended for use in any inherently dangerous applications, including applications that may create a risk of personal injury. If you use this software or hardware in dangerous applications, then you shall be responsible to take all appropriate fail-safe, backup, redundancy, and other measures to ensure its safe use. Oracle Corporation and its affiliates disclaim any liability for any damages caused by use of this software or hardware in dangerous applications.
Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners.
Intel and Intel Xeon are trademarks or registered trademarks of Intel Corporation. All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. AMD, Opteron, the AMD logo, and the AMD Opteron logo are trademarks or registered trademarks of Advanced Micro Devices. UNIX is a registered trademark of The Open Group.
This software or hardware and documentation may provide access to or information about content, products, and services from third parties. Oracle Corporation and its affiliates are not responsible for and expressly disclaim all warranties of any kind with respect to third-party content, products, and services unless otherwise set forth in an applicable agreement between you and Oracle. Oracle Corporation and its affiliates will not be responsible for any loss, costs, or damages incurred due to your access to or use of third-party content, products, or services, except as set forth in an applicable agreement between you and Oracle.
Access to Oracle Support
Oracle customers that have purchased support have access to electronic support through My Oracle Support. For information, visit http://www.oracle.com/pls/topic/lookup?
ctx=acc&id=info or visit http://www.oracle.com/pls/topic/lookup?ctx=acc&id=trs if you are hearing impaired.
Page 4
Référence: E76798-02
Copyright © 2016, 2017, Oracle et/ou ses affiliés. Tous droits réservés.
Ce logiciel et la documentation qui l'accompagne sont protégés par les lois sur la propriété intellectuelle. Ils sont concédés sous licence et soumis à des restrictions d'utilisation et de divulgation. Sauf stipulation expresse de votre contrat de licence ou de la loi, vous ne pouvez pas copier, reproduire, traduire, diffuser, modifier, accorder de licence, transmettre, distribuer, exposer, exécuter, publier ou afficher le logiciel, même partiellement, sous quelque forme et par quelque procédé que ce soit. Par ailleurs, il est interdit de procéder à toute ingénierie inverse du logiciel, de le désassembler ou de le décompiler, excepté à des fins d'interopérabilité avec des logiciels tiers ou tel que prescrit par la loi.
Les informations fournies dans ce document sont susceptibles de modification sans préavis. Par ailleurs, Oracle Corporation ne garantit pas qu'elles soient exemptes d'erreurs et vous invite, le cas échéant, à lui en faire part par écrit.
Si ce logiciel, ou la documentation qui l'accompagne, est livré sous licence au Gouvernement des Etats-Unis, ou à quiconque qui aurait souscrit la licence de ce logiciel pour le compte du Gouvernement des Etats-Unis, la notice suivante s'applique :
U.S. GOVERNMENT END USERS: Oracle programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation, delivered to U.S. Government end users are "commercial computer software" pursuant to the applicable Federal Acquisition Regulation and agency-specific supplemental regulations. As such, use, duplication, disclosure, modification, and adaptation of the programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation, shall be subject to license terms and license restrictions applicable to the programs. No other rights are granted to the U.S. Government.
Ce logiciel ou matériel a été développé pour un usage général dans le cadre d'applications de gestion des informations. Ce logiciel ou matériel n'est pas conçu ni n'est destiné à être utilisé dans des applications à risque, notamment dans des applications pouvant causer un risque de dommages corporels. Si vous utilisez ce logiciel ou ce matériel dans le cadre d'applications dangereuses, il est de votre responsabilité de prendre toutes les mesures de secours, de sauvegarde, de redondance et autres mesures nécessaires à son utilisation dans des conditions optimales de sécurité. Oracle Corporation et ses affiliés déclinent toute responsabilité quant aux dommages causés par l'utilisation de ce logiciel ou matériel pour des applications dangereuses.
Oracle et Java sont des marques déposées d'Oracle Corporation et/ou de ses affiliés. Tout autre nom mentionné peut correspondre à des marques appartenant à d'autres propriétaires qu'Oracle.
Intel et Intel Xeon sont des marques ou des marques déposées d'Intel Corporation. Toutes les marques SPARC sont utilisées sous licence et sont des marques ou des marques déposées de SPARC International, Inc. AMD, Opteron, le logo AMD et le logo AMD Opteron sont des marques ou des marques déposées d'Advanced Micro Devices. UNIX est une marque déposée de The Open Group.
Ce logiciel ou matériel et la documentation qui l'accompagne peuvent fournir des informations ou des liens donnant accès à des contenus, des produits et des services émanant de tiers. Oracle Corporation et ses affiliés déclinent toute responsabilité ou garantie expresse quant aux contenus, produits ou services émanant de tiers, sauf mention contraire stipulée dans un contrat entre vous et Oracle. En aucun cas, Oracle Corporation et ses affiliés ne sauraient être tenus pour responsables des pertes subies, des coûts occasionnés ou des dommages causés par l'accès à des contenus, produits ou services tiers, ou à leur utilisation, sauf mention contraire stipulée dans un contrat entre vous et Oracle.
Accès aux services de support Oracle
Les clients Oracle qui ont souscrit un contrat de support ont accès au support électronique via My Oracle Support. Pour plus d'informations, visitez le site http://www.oracle.com/
pls/topic/lookup?ctx=acc&id=info ou le site http://www.oracle.com/pls/topic/lookup?ctx=acc&id=trs si vous êtes malentendant.
Page 5

Contents

Using This Documentation ................................... ..............................................  7
Product Documentation Library . ..................................................................... ...  7
Feedback ................................... ....................................................................  7
Overview of System Status Indicators and LEDs .................. .............................  9
Compute Node Status Indicators .................................. ......................................  9
Storage Array Front Panel and Drive Indicators ....................... .................... .........  9
Storage Array Back Panel Indicators ............................................ ..................... 11
Preparing MiniCluster for Service ..... ................................................................  13
Detecting and Managing Faults .................................. ......................................  13
Preparing MiniCluster For Service ............................................... ....................  14
Component Service Categories . .................... ...................................................  14
Servicing CRUs That Do Not Require System Power Off ..................................  17
Compute Node Hot-Swappable Components ..................................... .................  17
Storage Array Hot-Swappable Components .......................... .................... ..........  18
Replacement Procedures .................................................................. .......  18
Servicing CRUs That Require Compute Node Power Off .............................. ....  31
Servicing Compute Node CRUs ..... .................... ..............................................  31
Servicing FRUs ............................... ..................................................................  33
Servicing Compute Node FRUs ..... ..................................................................  33
Returning MiniCluster to Operation ....... ...........................................................  35
Returning Compute Nodes to Operation ............................................................  35
5
Page 6
Contents
Powering On MiniCluster ......................................................... ......................  35
▼ Power On Procedure . ..................................................................... ...  36
Index ................................... ..................................................................... ..........  37
6 Oracle MiniCluster S7-2 Service Manual • March 2017
Page 7

Using This Documentation

Overview – Describes how to service Oracle's MiniCluster S7-2 hardware
Audience – System administrators and authorized service providers
Required knowledge – Advanced experience installing enterprise servers

Product Documentation Library

Documentation and resources for this product and related products are available at http://
docs.oracle.com/cd/E69469_01/.

Feedback

Provide feedback about this documentation at http://www.oracle.com/goto/docfeedback.
Using This Documentation 7
Page 8
8 Oracle MiniCluster S7-2 Service Manual • March 2017
Page 9

Overview of System Status Indicators and LEDs

This chapter describes system status indicators and LEDs.
“Compute Node Status Indicators” on page 9
“Storage Array Front Panel and Drive Indicators” on page 9
“Storage Array Back Panel Indicators” on page 11

Compute Node Status Indicators

These topics describe MiniCluster compute node status indicators (LEDs).
Note - The links below take you to topics outside of the MiniCluster library to the Oracle
SPARC S7-2 documentation library. Use your browser's Back button to return to the MiniCluster library.
“Front Panel Components” in SPARC S7-2 Server Service Manual
“Rear Panel Components” in SPARC S7-2 Server Service Manual

Storage Array Front Panel and Drive Indicators

The following figure shows the controls and indicators on the storage array front panel and drives.
Overview of System Status Indicators and LEDs 9
Page 10
Storage Array Front Panel and Drive Indicators
Callout Indicator/LED: Color State Meaning
1 Locate: white ■ Off – Normal operation.
■ Fast blink – This LED blinks to help locate the system. Operators can turn this indicator on and off remotely using Oracle ILOM.
■ Pressing this button toggles the fast blink on or off.
2 Service Required: amber ■ Off – Normal operation.
■ Steady on – A fault is present in the storage array.
3 OK: green ■ On – The storage array is powered on and operating normally.
■ Off – The storage array is unplugged or turned off.
4 Power Supply fault indicator:
amber
■ On – Indicates a fault with a power supply. View the LEDs at the back of the storage array to determine which power supply has the fault.
■ Off – Normal operation.
5 SAS I/O Module fault indicator:
amber
Service Required – Lights amber to indicate a fault with an I/O module. View the LEDs at the back of the storage array to determine which module has the fault.
6 Ready-to-Remove: blue ■ On – The storage drive can be safely removed during a hot-plug
operation.
■ Off – The storage drive has not been prepared for removal.
7 Drive Service Required: amber ■ Off – Normal operation.
10 Oracle MiniCluster S7-2 Service Manual • March 2017
Page 11
Callout Indicator/LED: Color State Meaning
■ Steady on – A fault is present in the disk drive.
8 Power/Activity: green ■ On – The drive is installed and recognized by the system but
there is no drive activity.
■ Blink – The LED blinks randomly in response to data being written to or read from the drive.
■ Off – The drive is powered off or the installed drive is not recognized by the system.

Storage Array Back Panel Indicators

The following figure shows the controls and indicators on the storage array back panel and drives.
Storage Array Back Panel Indicators
Callout Indicator/LED: Color State Meaning
1 AC power fault indicator: amber ■ On – Indicates an AC power fault.
■ Off – Normal operation.
Overview of System Status Indicators and LEDs 11
Page 12
Storage Array Back Panel Indicators
Callout Indicator/LED: Color State Meaning
2 Fan fail indicator: amber ■ On – The fan is faulty.
3 Power supply status indicator:
green
4 DC power fail indicator: amber ■ On – The power supply is faulty.
5 Reserved This port is reserved for Oracle Service.
6 Network port (reserved) This port is reserved for Oracle Service.
7 Locate: white ■ Off – Normal operation.
8 Service Required: amber ■ Off – Normal operation.
9 OK: green ■ On – The storage array is powered on and operating normally.
10 SAS I/O Module Ready-to-
Remove: blue
11 SAS I/O Module fault indicator:
amber
12 SAS I/O Module OK: green ■ On – The I/O module is powered on and operating normally.
13 Host SAS port fault indicator:
amber
14 Host SAS port activity indicator:
green
■ Off – Normal operation.
■ On – The power supply is operating normally.
■ Off – The power supply is unplugged or turned off.
■ Off – Normal operation.
■ Fast blink – This LED blinks to help locate the system. Operators can turn this indicator on and off remotely using Oracle ILOM.
■ Pressing this button toggles the fast blink on or off.
■ Steady on – A fault is present in the storage array.
■ Off – The storage array is unplugged or turned off.
■ On – The I/O module can be safely removed during a hot-plug operation.
■ Off – The I/O module has not been prepared for removal.
Service Required – Lights amber to indicate a fault with an I/O module.
■ Off – The I/O module is unplugged or turned off.
Service Required – Lights amber to indicate a fault with the port or connection.
Lights green to indicate host SAS link activity.
12 Oracle MiniCluster S7-2 Service Manual • March 2017
Page 13

Preparing MiniCluster for Service

This chapter describes how to prepare MiniCluster for servicing.
“Detecting and Managing Faults” on page 13
“Preparing MiniCluster For Service” on page 14
“Component Service Categories” on page 14

Detecting and Managing Faults

Note - The links in the following list take you to instructions outside of the MiniCluster library
to the SPARC S7-2 documentation libraries. Use your browser's Back button to return to the MiniCluster library.
“Checking for Faults” in SPARC S7-2 Server Service Manual
“Log In to Oracle ILOM (Service)” in SPARC S7-2 Server Service Manual
“Identify Faulted Components” in SPARC S7-2 Server Service Manual
“Performing Advanced Troubleshooting” in SPARC S7-2 Server Service Manual
“Check the Message Buffer” in SPARC S7-2 Server Service Manual
“View Log Files (Oracle Solaris)” in SPARC S7-2 Server Service Manual
“View Log Files (Oracle ILOM)” in SPARC S7-2 Server Service Manual
“POST Overview” in SPARC S7-2 Server Service Manual
“Configure POST” in SPARC S7-2 Server Service Manual
“Oracle ILOM Properties That Affect POST Behavior” in SPARC S7-2 Server Service
Manual
“Clear a Fault Manually” in SPARC S7-2 Server Service Manual
Preparing MiniCluster for Service 13
Page 14

Preparing MiniCluster For Service

Preparing MiniCluster For Service
Most service procedures are performed on a single appliance component. Service procedures that require a single compute node to be powered off can be performed without shutting down the entire MiniCluster. Before powering off any component for service, refer to the Getting Started Guide or Administration Guide for any special instructions or commands that must be performed on the MiniCluster to prepare it for service.
Note - The links in the following list take you to instructions outside of the MiniCluster library
to the SPARC S7-2 documentation libraries. Use your browser's Back button to return to the MiniCluster library.
“Safety Information” in SPARC S7-2 Server Service Manual
“Service Tools” in SPARC S7-2 Server Service Manual
“Find the Server Serial Number” in SPARC S7-2 Server Service Manual
“Locate the Server” in SPARC S7-2 Server Service Manual
“Prepare to Power Off the Server” in SPARC S7-2 Server Service Manual
“Power Off the Server” in SPARC S7-2 Server Service Manual
“Disconnect Cables From the Server” in SPARC S7-2 Server Service Manual
“Prevent ESD Damage” in SPARC S7-2 Server Service Manual
“Extend the Server to the Service Position” in SPARC S7-2 Server Service Manual
“Remove the Server From the Rack” in SPARC S7-2 Server Service Manual
“Open the Server Fan Door” in SPARC S7-2 Server Service Manual
“Remove the Top Cover” in SPARC S7-2 Server Service Manual

Component Service Categories

The server components and assemblies that can be replaced in the field fall into three categories:
Hot-service, replaceable by customer
Cold-service, replaceable by customer
Cold-service, replaceable by authorized service personnel
Cold service procedures require that you shut the server down and unplug the power cables that connect the power supplies to the power source.
14 Oracle MiniCluster S7-2 Service Manual • March 2017
Page 15
Component Service Categories
Although hot service procedures can be performed while the server is running, you should usually bring it to standby mode as the first step in the replacement procedure. See “Preparing
MiniCluster For Service” on page 14 for instructions.
This table identifies the components in each category.
Component Service Category
Hot-service, replaceable by customer
Fan modules, compute node “Servicing Fan Modules” in SPARC S7-2 Server Service Manual
Power supplies, compute
HDD or SSD, storage array “Replace a Storage Array HDD or SSD (CRU) ” on page 18
Power supplies, storage
I/O module, storage array “Replace a Storage Array I/O Module (CRU)” on page 28
Cold-service, replaceable by customer
Battery, compute node “Servicing the Battery” in SPARC S7-2 Server Service Manual
PCIe cards, compute node “Servicing PCIe Cards” in SPARC S7-2 Server Service Manual
Front indicator module,
eUSB drive, compute node “Servicing the eUSB Drive” in SPARC S7-2 Server Service Manual
Cold-service, replaceable by authorized service personnel
Drive backplane, compute
Motherboard, compute node “Servicing the Motherboard” in SPARC S7-2 Server Service Manual
Component Service Information
Drives, compute node “Servicing Drives” in SPARC S7-2 Server Service Manual
node
array
DIMMs, compute node “Servicing DIMMs” in SPARC S7-2 Server Service Manual
compute node
Cables, compute node “Servicing Cables” in SPARC S7-2 Server Service Manual
node
“Servicing Power Supplies” in SPARC S7-2 Server Service Manual
“Replace a Storage Array Power Supply (CRU)” on page 24
“Servicing the Front Indicator Module” in SPARC S7-2 Server Service
Manual
“Servicing the Drive Backplane” in SPARC S7-2 Server Service Manual
Preparing MiniCluster for Service 15
Page 16
16 Oracle MiniCluster S7-2 Service Manual • March 2017
Page 17

Servicing CRUs That Do Not Require System Power Off

This chapter describes how to remove and replace the hot-swappable components in MiniCluster. Hot-swappable means that these components can be replaced with the system powered on.
Note - Hot-swappable components can also be replaced with the system powered off. If you
choose this method, first power off the system as described in “Preparing MiniCluster for
Service” on page 13.
The following topics are covered:
“Compute Node Hot-Swappable Components” on page 17
“Storage Array Hot-Swappable Components” on page 18

Compute Node Hot-Swappable Components

This section describes replacing hot-swappable components for the compute node.
Hot-swappable components are those that can be removed and installed while the system and the compute nodes are running without requiring any administrative tasks. These are designated as Customer Replaceable Units (CRUs).
Refer to the following table for hot-swap component instructions for your appliance.
Note - The links in the following table take you to instructions outside of the MiniCluster
library to the SPARC S7-2 documentation library. Use your browser's Back button to return to the MiniCluster library.
“Servicing Drives” in SPARC S7-2 Server Service Manual
Servicing CRUs That Do Not Require System Power Off 17
Page 18

Storage Array Hot-Swappable Components

“Servicing Fan Modules” in SPARC S7-2 Server Service Manual
“Servicing Power Supplies” in SPARC S7-2 Server Service Manual
Storage Array Hot-Swappable Components
This section describes replacing hot-swappable components for the storage array.
Hot-swappable components are those that can be removed and installed while the system and the compute nodes are running without requiring any administrative tasks. These are designated as Customer Replaceable Units (CRUs).
Caution - Data loss. Do not remove power from both power supplies of the storage array or
storage expansion shelf (if equipped) while the database is running. If you must power off the storage array or expansion storage array, stop the database first.
Note - After a physical insertion or removal action of a storage array component (such as a disk
or SAS cable), you should wait before performing the next action to allow the system software to fully process the event. This could take up to 3 minutes to complete.

Replacement Procedures

These storage array components can be replaced while the storage array is powered-on and running:
“Replace a Storage Array HDD or SSD (CRU) ” on page 18
“Replace a Storage Array Power Supply (CRU)” on page 24
“Replace a Storage Array I/O Module (CRU)” on page 28
Replace a Storage Array HDD or SSD (CRU)
Use these procedures to replace a storage array HDD or SSD.
“Remove a Storage Array HDD or SSD (CRU)” on page 19
“Install a Storage Array HDD or SSD (CRU)” on page 22
18 Oracle MiniCluster S7-2 Service Manual • March 2017
Page 19
Remove a Storage Array HDD or SSD (CRU)
Remove a Storage Array HDD or SSD (CRU)
1.
Enter the appropriate commands through the CLI to prepare the drive for removal from the storage array.
Use this procedure to logically remove a storage array drive from the system before you physically remove the drive.
The length of time that it takes to complete this procedure before you can physically remove the drive depends on the type of drive you are removing:
SSD – The detach operation completes quickly and the drive can be removed immediately.
HDD – The detach operation takes several minutes to complete. Do not remove the drive before the detach operation competes.
a.
Log into the mcmu CLI as a primary admin such as mcinstall.
% ssh mcinstall@minicluster_node_name_or_IPaddress
where minicluster_node_name_or_IPaddress is the name of the first node on MiniCluster, or the IP address of the first node.
For example:
% ssh mcinstall@mc4-n1
b.
When prompted, enter the password.
For more information about running mcmu CLI commands, refer to the Oracle MiniCluster S7-2 Installation Guide or the Oracle MiniCluster S7-2 Administration Guide.
c.
Identify the drive name.
You might be able to get the drive name from the error message or log file that led to the plan to replace the drive.
You can use a command such as diskinfo. Make note of the long drive name (such as c0t5000CCA23B0BF34Cd0) for use with this procedure. Also make note of the shorter name (such as HDD8) for use with the followup task when you reattach the replaced drive.
Also note that the storage array drives are identified by a JBODARRAY string. For example:
% diskinfo D:devchassis-path c:occupant-compdev
---------------------------------- --------------------­/dev/chassis/SYS/HDD0/disk c0t5000CCA02D1EE2A8d0 /dev/chassis/SYS/HDD1/disk c0t5000CCA02D1E7AACd0
Servicing CRUs That Do Not Require System Power Off 19
Page 20
Remove a Storage Array HDD or SSD (CRU)
/dev/chassis/SYS/HDD2/disk c0t5000CCA02D1EDCECd0 /dev/chassis/SYS/HDD3/disk c0t5000CCA02D1ED360d0 /dev/chassis/SYS/HDD4/disk c0t5000CCA02D1EE6D8d0 /dev/chassis/SYS/HDD5/disk c0t5000CCA02D1EE6CCd0 /dev/chassis/SYS/HDD6 ­/dev/chassis/SYS/HDD7 ­/dev/chassis/SYS/MB/EUSB-DISK/disk c1t0d0 /dev/chassis/JBODARRAY1/HDD0/disk c0t5000CCA25497267Cd0 /dev/chassis/JBODARRAY1/HDD1/disk c0t5000CCA2549732B8d0 /dev/chassis/JBODARRAY1/HDD2/disk c0t5000CCA254974F28d0 /dev/chassis/JBODARRAY1/HDD3/disk c0t5000CCA254965A78d0 /dev/chassis/JBODARRAY1/HDD4/disk c0t5000CCA254978510d0 /dev/chassis/JBODARRAY1/HDD5/disk c0t5000CCA254964E3Cd0 /dev/chassis/JBODARRAY1/HDD6/disk c0t5000CCA0536CA5E4d0 /dev/chassis/JBODARRAY1/HDD7/disk c0t5000CCA0536CA7B0d0 /dev/chassis/JBODARRAY1/HDD8/disk c0t5000CCA23B0BF34Cd0 /dev/chassis/JBODARRAY1/HDD9/disk c0t5000CCA0536CB828d0 /dev/chassis/JBODARRAY1/HDD10/disk c0t5000CCA0536CB308d0 /dev/chassis/JBODARRAY1/HDD11/disk c0t5000CCA0536CAF2Cd0 /dev/chassis/JBODARRAY1/HDD12/disk c0t5000CCA0536CABE4d0 /dev/chassis/JBODARRAY1/HDD13/disk c0t5000CCA0536CB684d0 /dev/chassis/JBODARRAY1/HDD14/disk c0t5000CCA0536CA870d0 /dev/chassis/JBODARRAY1/HDD15/disk c0t5000CCA0536CAB88d0 /dev/chassis/JBODARRAY1/HDD16/disk c0t5000CCA0536CA754d0 /dev/chassis/JBODARRAY1/HDD17/disk c0t5000CCA0536CAD10d0 /dev/chassis/JBODARRAY1/HDD18/disk c0t5000CCA0536CAEF8d0 /dev/chassis/JBODARRAY1/HDD19/disk c0t5000CCA0536CA83Cd0 /dev/chassis/JBODARRAY1/HDD20/disk c0t5000CCA04EB272E8d0 /dev/chassis/JBODARRAY1/HDD21/disk c0t5000CCA04EB27234d0 /dev/chassis/JBODARRAY1/HDD22/disk c0t5000CCA04EB27428d0 /dev/chassis/JBODARRAY1/HDD23/disk c0t5000CCA04EB272A0d0
d.
Use this syntax to detach the disk prior to removing the disk:
% mcmu diskutil -d diskname
where diskname is the disk identifier of the drive you plan to remove.
For example:
% mcmu diskutil -d c0t5000CCA23B0BF34Cd0 . <output omitted> . INFO:MCMU.controllers.common.pexpect_util:su to user oracle successfully. [INFO ] logged into the zone f18-vm1-mc5qt-n1 as oracle [INFO ] disk alias found to be RECO_0003 [INFO ] dropping disk c0t5000CCA23B0BF34Cd0s1
20 Oracle MiniCluster S7-2 Service Manual • March 2017
Page 21
Remove a Storage Array HDD or SSD (CRU)
[INFO ] ['', 'Diskgroup altered.', ''] [INFO ] ASM rebalance complete in diskgroup RECO in zonegroup f18 [INFO ] disk alias found to be DATA_0003 [INFO ] dropping disk c0t5000CCA23B0BF34Cd0s4 [INFO ] ['', 'Diskgroup altered.', ''] [INFO ] ASM still initializing, please retry in a few minutes to check the progress by running 'mcmu diskutil -p'
[INFO ] Disk should NOT be detached until rebalance is complete.
[INFO ] disk alias found to be SYSTEM_0003 [INFO ] dropping disk c0t5000CCA23B0BF34Cd0s5 [INFO ] ['', 'Diskgroup altered.', ''] [INFO ] ASM rebalance in progress in the zonegroup f18. Estimated time of completion is 61.
[INFO ] Disk should NOT be detached until rebalance is complete. [INFO ] exiting surplus [INFO ] Partitions to be dropped: [1] [INFO ] Removing disk from zones... [INFO ] Modifying zones: [u'f18-vm1-mc5qt-n1'] on node: mc5qt-n1 . <output omitted> . [INFO ] Running.. exit [INFO ] Disk {disk} being detached from Minicluster.
e.
Check the status of the detach operation.
If you want to check the status, but the detach operation is still running, login to the mcmu CLI in another window.
Type:
% mcmu diskutil -p
[INFO ] Checking asm rebalance progress . <output omitted> . [INFO ] ASM Rebalance successfully complete. Attempting to remove disk from zones. . <output omitted> . [INFO ] Disk successfully removed from Minicluster.
Servicing CRUs That Do Not Require System Power Off 21
Page 22
Install a Storage Array HDD or SSD (CRU)
f.
When the ASM rebalance is complete, you can remove the drive.
2.
Locate the failed drive on the front of the chassis.
Caution - Equipment damage. Do not remove a drive unless you have an immediate
replacement.
3.
Remove the drive.
Press the button to release the drive lever [1], rotate the lever open and, grasping the drive body, slide the drive out of the chassis [2].
4.
Install a replacement drive, if necessary.
See “Install a Storage Array HDD or SSD (CRU)” on page 22.
Install a Storage Array HDD or SSD (CRU)
1.
Install the replacement drive.
22 Oracle MiniCluster S7-2 Service Manual • March 2017
Page 23
Install a Storage Array HDD or SSD (CRU)
Open the drive lever on the replacement drive and slide the replacement drive as far into the drive slot as possible and then close the lever to lock it into place. Check that the drive is locked in place and flush with the other drives.
2.
Log into the mcmu CLI as a primary admin such as mcinstall.
See Step 1a and Step 1b in “Remove a Storage Array HDD or SSD (CRU)” on page 19 for those instructions.
3.
Identify the full disk name of the replaced drive.
You can use a command such as diskinfo.
In this example, HDD8 was replaced, and diskinfo shows that the full drive name for HDD8 is c0t5000CCA0536CA710d0.
Also note that the storage array drives are identified by a JBODARRAY string.
% diskinfo D:devchassis-path c:occupant-compdev
---------------------------------- --------------------­/dev/chassis/SYS/HDD0/disk c0t5000CCA02D1EE2A8d0 /dev/chassis/SYS/HDD1/disk c0t5000CCA02D1E7AACd0 /dev/chassis/SYS/HDD2/disk c0t5000CCA02D1EDCECd0 /dev/chassis/SYS/HDD3/disk c0t5000CCA02D1ED360d0 /dev/chassis/SYS/HDD4/disk c0t5000CCA02D1EE6D8d0 /dev/chassis/SYS/HDD5/disk c0t5000CCA02D1EE6CCd0 /dev/chassis/SYS/HDD6 ­/dev/chassis/SYS/HDD7 ­/dev/chassis/SYS/MB/EUSB-DISK/disk c1t0d0
Servicing CRUs That Do Not Require System Power Off 23
Page 24
Replace a Storage Array Power Supply (CRU)
/dev/chassis/JBODARRAY1/HDD0/disk c0t5000CCA25497267Cd0 /dev/chassis/JBODARRAY1/HDD1/disk c0t5000CCA2549732B8d0 /dev/chassis/JBODARRAY1/HDD2/disk c0t5000CCA254974F28d0 /dev/chassis/JBODARRAY1/HDD3/disk c0t5000CCA254965A78d0 /dev/chassis/JBODARRAY1/HDD4/disk c0t5000CCA254978510d0 /dev/chassis/JBODARRAY1/HDD5/disk c0t5000CCA254964E3Cd0 /dev/chassis/JBODARRAY1/HDD6/disk c0t5000CCA0536CA5E4d0 /dev/chassis/JBODARRAY1/HDD7/disk c0t5000CCA0536CA7B0d0 /dev/chassis/JBODARRAY1/HDD8/disk c0t5000CCA0536CA710d0 /dev/chassis/JBODARRAY1/HDD9/disk c0t5000CCA0536CB828d0 /dev/chassis/JBODARRAY1/HDD10/disk c0t5000CCA0536CB308d0 /dev/chassis/JBODARRAY1/HDD11/disk c0t5000CCA0536CAF2Cd0 /dev/chassis/JBODARRAY1/HDD12/disk c0t5000CCA0536CABE4d0 /dev/chassis/JBODARRAY1/HDD13/disk c0t5000CCA0536CB684d0 /dev/chassis/JBODARRAY1/HDD14/disk c0t5000CCA0536CA870d0 /dev/chassis/JBODARRAY1/HDD15/disk c0t5000CCA0536CAB88d0 /dev/chassis/JBODARRAY1/HDD16/disk c0t5000CCA0536CA754d0 /dev/chassis/JBODARRAY1/HDD17/disk c0t5000CCA0536CAD10d0 /dev/chassis/JBODARRAY1/HDD18/disk c0t5000CCA0536CAEF8d0 /dev/chassis/JBODARRAY1/HDD19/disk c0t5000CCA0536CA83Cd0 /dev/chassis/JBODARRAY1/HDD20/disk c0t5000CCA04EB272E8d0 /dev/chassis/JBODARRAY1/HDD21/disk c0t5000CCA04EB27234d0 /dev/chassis/JBODARRAY1/HDD22/disk c0t5000CCA04EB27428d0 /dev/chassis/JBODARRAY1/HDD23/disk c0t5000CCA04EB272A0d0
4.
Use this syntax to attach a disk:
% mcmu diskutil -a diskname
where diskname is the name of the disk that you want to attach.
For example:
% mcmu diskutil -a c0t5000CCA0536CA710d0
Replace a Storage Array Power Supply (CRU)
The storage array and the storage expansion shelf each have two power supplies. Indicators (LEDs) on the power supplies indicate their operation state.
The power supplies are redundant and one can be replaced while the appliance is running.
24 Oracle MiniCluster S7-2 Service Manual • March 2017
Page 25
Replace a Storage Array Power Supply (CRU)
Servicing CRUs That Do Not Require System Power Off 25
Page 26
Replace a Storage Array Power Supply (CRU)
Callout Description
1 AC power fail indicator
2 Fan fail indicator
3 Power supply status indicator
4 DC power fail indicator
5 Release lever
6 AC connector
7 On/Off switch
If any of the indicators are amber, or if the power supply status indicator does not light, replace the power supply.
1.
Ensure the power supply on/off switch is in the "O" off position.
2.
Disconnect the power cord tie strap from the power cord, and unplug the power cord from the power supply.
3.
Remove the power supply unit.
26 Oracle MiniCluster S7-2 Service Manual • March 2017
Page 27
Replace a Storage Array Power Supply (CRU)
Grasp the power supply handle, push the release button [1], and slide the power supply out of the chassis [2].
4.
Open the lever/ejection arms on the replacement power supply.
5.
Slide the replacement power supply module into the chassis.
Be careful to not bend the pins.
6.
Close the lever/ejection arms.
Push the lever fully closed until you hear or feel a click.
Servicing CRUs That Do Not Require System Power Off 27
Page 28
Replace a Storage Array I/O Module (CRU)
Replace a Storage Array I/O Module (CRU)
Storage array SAS I/O modules are fully redundant and can be replaced while the appliance is running.
1.
Disconnect the cables from the I/O module.
Be sure to mark the cables so you can return them to their proper connectors.
Callout Description
1 I/O module locking levers
2 I/O module 1 (top)
3 I/O module 0 (bottom)
2.
Remove the I/O module.
28 Oracle MiniCluster S7-2 Service Manual • March 2017
Page 29
Replace a Storage Array I/O Module (CRU)
Using your thumb and forefinger, pinch the release buttons [1] together to release the locking lever [2]. Grasp the lever and remove the I/O module from the chassis [3] .
Caution - Equipment damage. Be careful not to damage the connector pins at the back of the
I/O module.
3.
Open the lever in the new I/O module if it is not already open.
4.
Slide the new I/O module into the disk shelf and close the locking lever.
5.
Reconnect the cables.
6.
Verify that the new I/O module is working correctly.
After approximately 60 seconds, the power LED should be solid green and the Fault/Locate LED should be off.
For each port that has a cable connected, all four activity LEDs should be solid green.
Servicing CRUs That Do Not Require System Power Off 29
Page 30
30 Oracle MiniCluster S7-2 Service Manual • March 2017
Page 31

Servicing CRUs That Require Compute Node Power Off

These sections describe how to service customer-replaceable units (CRUs) for MiniCluster that require you to power off the compute node. All of these CRUs are located on the compute nodes.
Note - The servicing of hot-swap components such as the HDDs, SSDs, power supplies, and
fans is described in “Servicing CRUs That Do Not Require System Power Off” on page 17.

Servicing Compute Node CRUs

This section describes how to service customer-replaceable units (CRUs) for the compute nodes that require you to power off the server.
Note - All of the CRUs on the storage array and the storage expansion shelf can be replaced
while the power is on. See “Storage Array Hot-Swappable Components” on page 18 for storage array component replacement procedures.
Note - The links in the following list take you to instructions outside of the MiniCluster library
to the SPARC S7-2 documentation library. Use your browser's Back button to return to the MiniCluster library.
“Servicing DIMMs” in SPARC S7-2 Server Service Manual
“Servicing the Battery” in SPARC S7-2 Server Service Manual
“Servicing PCIe Cards” in SPARC S7-2 Server Service Manual
“Servicing the Front Indicator Module” in SPARC S7-2 Server Service Manual
“Servicing the eUSB Drive” in SPARC S7-2 Server Service Manual
Servicing CRUs That Require Compute Node Power Off 31
Page 32
32 Oracle MiniCluster S7-2 Service Manual • March 2017
Page 33

Servicing FRUs

The following topics describe how to service field-replaceable units (FRUs) for MiniCluster. You must power down the system and disconnect the AC power cords from the server before servicing any of these components.
Only Oracle authorized service personnel should service FRU components.

Servicing Compute Node FRUs

This section describes how to service compute node FRUs. Only Oracle authorized service personnel should service FRU components.
Note - Most of the links in the following table take you to instructions outside of the
MiniCluster library to the SPARC S7-2 documentation library. Use your browser's Back button to return to the MiniCluster library.
“Servicing Cables” in SPARC S7-2 Server Service Manual
“Servicing the Drive Backplane” in SPARC S7-2 Server Service Manual
“Servicing the Motherboard” in SPARC S7-2 Server Service Manual
Servicing FRUs 33
Page 34
34 Oracle MiniCluster S7-2 Service Manual • March 2017
Page 35

Returning MiniCluster to Operation

This chapter describes how to return MiniCluster to operation after you have performed service procedures. It includes:
“Returning Compute Nodes to Operation” on page 35
“Powering On MiniCluster” on page 35

Returning Compute Nodes to Operation

This section describes returning compute nodes to operation after servicing.
Note - The links in the following list take you to instructions outside of the MiniCluster library
to the SPARC S7-2 documentation library. Use your browser's Back button to return to the MiniCluster library.
“Replace the Top Cover” in SPARC S7-2 Server Service Manual
“Return the Server to the Normal Operating Position” in SPARC S7-2 Server Service
Manual
“Reconnect Data Cables and Power Cords” in SPARC S7-2 Server Service Manual
“Power On the Server” in SPARC S7-2 Server Service Manual

Powering On MiniCluster

This section describes how to restore power after servicing MiniCluster.
Before you begin, return the compute nodes to operational status. Go to “Returning Compute
Nodes to Operation” on page 35.
When powering on the system, each compute node is powered on separately.
Returning MiniCluster to Operation 35
Page 36

Power On Procedure

1.
2.
3.
4.
Power On Procedure
Perform the following procedure to power on the MiniCluster.
If you powered off and disconnected the power cords from a storage array (and storage expansion shelf if one is present), reconnect the power cords now.
Power on the storage array (and storage expansion shelf if one is present) by switching the power supply On/Off button on each power supply to the On position.
The disks begin to initialize and the power supply LEDs light green. This can take several minutes.
Note - Do not power on the compute nodes until the disks on the storage array and storage
expansion shelf have initialized.
If you powered off a compute node, reconnect any unplugged power cords now.
Power on a compute node on using one of the following methods:
Press the recessed Power button on the compute node front panel.
Log in to the Oracle ILOM web interface, click Host Management > Power Control, and select Power On from the Select Action list box.
Log in to the Oracle ILOM command-line interface (CLI), and type the following command at the Oracle ILOM CLI prompt.
-> start /SYS
When the server is powered on and the power-on self-test (POST) code checkpoint tests have completed, the green Power/OK status indicator on the compute node front panel lights and remains lit.
Note - If you powered off both compute nodes prior to servicing, repeat steps 3 and 4 to power
on the second compute node.
5.
Refer to the Oracle MiniCluster S7-2 Installation Guide and Oracle MiniCluster S7-2 Administration Guide for any instructions or commands required to return the appliance to service.
36 Oracle MiniCluster S7-2 Service Manual • March 2017
Page 37

Index

C
compute node
cold-swappable components, 31 hot-swappable components, 17 servicing
battery, 31 cables, 33 DIMMs, 31 drive backplane, 33 drives, 17 eUSB drive, 31 fan modules, 17 front indicator module, 31 motherboard, 33 PCIe cards, 31 power supplies, 17
status indicators, 9
CRUs
not requiring system power off (hot-swap), 17 requiring system power off (cold-swap), 31
F
faults, detecting and managing, 13 FRUs, servicing, 33
R
returning system to operation, 35
S
service categories, 14 status indicators
compute node, 9 storage array
back panel, 11 front panel, 9
storage array
hot-swappable components, 18 servicing
drives, 18 I/O module, 28 power supplies, 24
status indicators
back panel, 11 front panel, 9
P
powering on system, 36 preparing system for service, 13
37
Page 38
38 Oracle MiniCluster S7-2 Service Manual • March 2017
Loading...