Sun Oracle Sun Server X4-2 Service Manual

Page 1
Sun Server X4-2
Service Manual
Part No.: E38041-07 May 2014
Page 2
Copyright © 2013, 2014, Oracleand/or itsaffiliates. Allrights reserved. This softwareand related documentationare provided under a license agreement containingrestrictions on use and disclosure and areprotected by
intellectual propertylaws. Exceptas expressly permittedin yourlicense agreementor allowedby law,you maynot use,copy, reproduce, translate, broadcast, modify, license,transmit, distribute,exhibit, perform,publish, ordisplay anypart, inany form,or byany means.Reverse engineering, disassembly, or decompilation of this software, unlessrequired by law for interoperability, is prohibited.
The informationcontained hereinis subjectto changewithout noticeand isnot warranted to be error-free.If youfind anyerrors, please report them to us in writing.
If thisis softwareor related softwaredocumentation thatis delivered to the U.S.Government oranyone licensingit onbehalf ofthe U.S.Government, the following noticeis applicable:
U.S. GOVERNMENTEND USERS.Oracle programs,including anyoperating system,integrated software, anyprograms installedon thehardware, and/or documentation,delivered toU.S. Governmentend usersare "commercial computer software" pursuantto theapplicable FederalAcquisition Regulation andagency-specific supplementalregulations. Assuch, use,duplication, disclosure, modification,and adaptationof theprograms, including any operatingsystem, integratedsoftware, anyprograms installed on the hardware,and/or documentation,shall besubject tolicense termsand license restrictions applicableto theprograms. No other rights are granted to the U.S. Government.
This software or hardware is developed for general use ina varietyof information management applications. It is not developed orintended foruse inany inherently dangerous applications, including applicationsthat maycreate arisk ofpersonal injury. Ifyou usethis software or hardware indangerous applications, thenyou shallbe responsibleto takeall appropriate fail-safe,backup, redundancy, andother measures to ensure itssafe use.Oracle Corporation andits affiliatesdisclaim anyliability forany damagescaused byuse of this software or hardware indangerous applications.
Oracle andJava areregistered trademarks of Oracle and/or its affiliates.Other namesmay betrademarks oftheir respective owners. Intel andIntel Xeonare trademarksor registered trademarksof IntelCorporation. AllSPARC trademarks areused underlicense andare trademarks or
registered trademarks of SPARCInternational, Inc. AMD, Opteron, theAMD logo,and theAMD Opteron logo are trademarksor registered trademarksof Advanced MicroDevices. UNIXis aregistered trademark of The Open Group.
This software or hardware and documentation may provide access to or information on content, products, and services from third parties. Oracle Corporation and its affiliates are not responsible for and expressly disclaim all warranties of any kind with respect to third-party content, products, and services. Oracle Corporation and its affiliates will not be responsible for any loss, costs, or damages incurred due to your access to or use of third-party content, products, or services.
Copyright ©2013, 2014,Oracle et/ouses affiliés.Tous droits réservés. Ce logicielet ladocumentation quil’accompagne sontprotégés parles lois sur lapropriété intellectuelle. Ils sont concédés sous licence et soumis à des
restrictions d’utilisationet dedivulgation. Saufdisposition devotre contrat de licence ou de la loi, vous ne pouvez pas copier, reproduire, traduire, diffuser, modifier, breveter, transmettre,distribuer,exposer, exécuter, publier ouafficher le logiciel, même partiellement, sous quelque forme et par quelque procédéque cesoit. Parailleurs, ilest interdit deprocéder àtoute ingénierieinverse dulogiciel, dele désassembler ou de le décompiler, excepté à des finsd’interopérabilité avecdes logicielstiers outel queprescrit par la loi.
Les informationsfournies dansce documentsont susceptiblesde modificationsans préavis.Par ailleurs,Oracle Corporationne garantitpas qu’elles soient exemptesd’erreurs etvous invite,le caséchéant, àlui enfaire part par écrit.
Si celogiciel, oula documentationqui l’accompagne,est concédésous licenceau Gouvernementdes Etats-Unis,ou àtoute entitéqui délivre la licencede ce logicielou l’utilisepour lecompte duGouvernement desEtats-Unis, lanotice suivantes’applique :
U.S. GOVERNMENTEND USERS.Oracle programs,including anyoperating system,integrated software, anyprograms installedon thehardware, and/or documentation,delivered toU.S. Governmentend usersare "commercial computer software" pursuantto theapplicable FederalAcquisition Regulation andagency-specific supplementalregulations. Assuch, use,duplication, disclosure, modification,and adaptationof theprograms, including any operatingsystem, integratedsoftware, anyprograms installed on the hardware,and/or documentation,shall besubject tolicense termsand license restrictions applicableto theprograms. No other rights are granted to the U.S. Government.
Ce logicielou matériela étédéveloppé pourun usagegénéral dansle cadre d’applications de gestion des informations. Ce logiciel ou matériel n’est pas conçu nin’est destinéà êtreutilisé dansdes applicationsà risque,notamment dans des applications pouvant causer des dommages corporels. Si vous utilisez celogiciel oumatériel dansle cadred’applications dangereuses, ilest devotre responsabilité deprendre toutes les mesures de secours, de sauvegarde, deredondance et autres mesures nécessaires à son utilisation dans des conditions optimales de sécurité. Oracle Corporation et ses affiliés déclinent touteresponsabilité quantaux dommagescausés parl’utilisation dece logicielou matériel pour ce type d’applications.
Oracle etJava sontdes marquesdéposées d’OracleCorporation et/oude sesaffiliés.Tout autre nom mentionné peut correspondre à des marques appartenant àd’autres propriétaires qu’Oracle.
Intel etIntel Xeonsont desmarques oudes marques déposéesd’Intel Corporation.Toutes les marques SPARC sont utilisées sous licence et sont des marques oudes marques déposées de SPARC International, Inc. AMD, Opteron, le logo AMD et le logo AMD Opteron sontdes marques ou des marques déposées d’AdvancedMicro Devices.UNIX estune marque déposéed’The OpenGroup.
Ce logicielou matérielet ladocumentation quil’accompagne peuventfournir desinformations oudes liensdonnant accèsà descontenus, desproduits et des servicesémanant detiers. OracleCorporation etses affiliésdéclinent toute responsabilité ou garantie expresse quant aux contenus,produits ou services émanantde tiers.En aucuncas, OracleCorporation etses affiliésne sauraient être tenus pour responsables des pertes subies, des coûts occasionnés oudes dommagescausés parl’accès àdes contenus,produits ou services tiers, ou à leur utilisation.
Please
Recycle
Page 3
Contents
Using This Documentation xi
About the Sun Server X4-2 1
Product Description 1
About Controls and Connectors 2
Front Panel Controls and Indicators on a System With Eight 2.5-inch
Drives 3
Front Panel Controls and Indicators on a System With Four 2.5-inch Drives
and DVD Drive 4
Server Back Panel View 5
About Server and Component Status Indicators 6
Server General Status Indicators 7
Server Fan Status Indicators 9
Storage Drive Status Indicators 10
Power Supply Status Indicators 11
Network Management Port Status Indicators 11
Ethernet Ports Status Indicators 12
Motherboard Status Indicators 13
DDR3 DIMM Fault Status Indicators 13
Processor Fault Status Indicators 14
Fault Remind Status Indicator 14
STBY PWRGD Status Indicator 14
About System Components 15
iii
Page 4
Illustrated Parts Breakdown 15
Customer-Replaceable Units 17
Field-Replaceable Units 18
Battery Module 19
Troubleshooting the Server 21
Service Troubleshooting Task List 21
Diagnostic Tools 22
Gather Service Information 24
Locate the Server Serial Number 25
Inspecting the System 25
Troubleshoot Power Problems 26
Inspect the Server Externally 26
Inspect Internal Server Components 26
Preparing for Service 29
Safety Precautions 29
FRU TLI Auto-Update 30
Safety Symbols 30
Electrostatic Discharge Safety 31
Required Tools 32
Preparing the Server for Component Replacement 33
Powering Down the Server 33
Power Down Server Gracefully Using the Oracle ILOM CLI 34
Power Down Server Gracefully Using the Oracle ILOM Web
Interface 35
Power Down Server Gracefully Using the Power Button 36
Use the Power Button for Immediate Shutdown 36
Use the Oracle ILOM CLI for Immediate Shutdown 37
Use the Oracle ILOM Web Interface for Immediate Shutdown 38
iv Sun Server X4-2 Service Manual • May 2014
Page 5
Disconnect Cables From the Server 39
Extend the Server to the Maintenance Position 39
Remove the Server From the Rack 41
Take Antistatic Measures 42
Open the Server Fan Door 43
Remove the Server Top Cover 43
Servicing CRUs That Do Not Require Server Power-Off 45
Servicing Storage Drives (CRU) 45
Storage Drives Hot-Plug Conditions 46
HDD or SSD Failure and RAID 46
Storage Drive Status Indicators 47
Remove a Storage Drive 47
Install a Storage Drive 49
Servicing Fan Modules (CRU) 50
Remove a Fan Module 51
Install a Fan Module 54
Servicing Power Supplies (CRU) 55
Power Supply Status Indicators 56
Remove a Power Supply 57
Install a Power Supply 59
Servicing CRUs That Require Server Power-Off 61
CRU Locations 61
Servicing the DIMMs (CRU) 62
DIMM and Processor Physical Layout 64
DIMM Population Example for Optimal System Performance 65
DIMM Population Order for Single-Processor Systems 66
DIMM Population Order for Dual-Processor Systems 66
Contents v
Page 6
DIMM Population Rules 67
DIMM Rank Classification Labels 69
Inconsistencies Between DIMM Fault Indicators and the BIOS Isolation of
Faulty DIMMs 70
Using the Fault Remind Button 70
Identify and Remove a Faulty DIMM 71
Install a DIMM 73
Servicing PCIe Risers (CRU) 74
PCIe Riser Location and Differences 75
Remove a PCIe Riser From PCIe Slot 1 or 2 76
Install a PCIe Riser Into PCIe Slot 1 or 2 78
Remove a PCIe Riser From PCIe Slots 3 and 4 80
Install a PCIe Riser Into PCIe Slots 3 and 4 82
Servicing PCIe Cards (CRU) 84
PCIe Slot Characteristics 85
Remove a PCIe Card From PCIe Slot 1 or 2 86
Install a PCIe Card in PCIe Slot 1 or 2 87
Remove a PCIe Card From PCIe Slot 3 87
Install a PCIe Card in PCIe Slot 3 88
Remove the Internal HBA Card From PCIe Slot 4 89
Install the Internal HBA Card in PCIe Slot 4 90
Servicing the DVD Drive (CRU) 91
Remove the DVD Drive 91
Install the DVD Drive 93
Servicing the Internal USB Flash Drives (CRU) 94
Servicing the Oracle System Assistant USB Flash Drive 94
Remove an Internal USB Flash Drive 95
Install an Internal USB Flash Drive 95
Servicing the Battery (CRU) 96
vi Sun Server X4-2 Service Manual • May 2014
Page 7
Remove the Battery 97
Install the Battery 98
Servicing FRUs 99
FRU Locations 99
Servicing Processors (FRU) 100
Selecting the Correct Processor Removal/Replacement Tool 102
Remove a Processor 106
Install a Processor 112
Servicing the Disk Backplane (FRU) 117
Disk Backplane Configurations 118
Remove the Disk Backplane 119
Install the Disk Backplane 122
Servicing the Front Indicator Module (FRU) 125
Remove the Front Indicator Module 125
Install the Front Indicator Module 127
Servicing the Motherboard (FRU) 127
Remove the Motherboard 128
Install the Motherboard 135
Servicing the SAS Cables (FRUs) 138
Remove Storage Drive SAS Cables 139
Install Storage Drive SAS Cables 141
Returning the Server to Operation 143
Removing and Installing Server Filler Panels 143
Remove and Install Filler Panels 144
Install the Server Top Cover 145
Remove Antistatic Measures 146
Reinstall the Server Chassis Into the Rack 146
Contents vii
Page 8
Return the Server to the Normal Rack Position 148
Reconnect Data Cables and Power Cords 149
Power On the Server 150
Identifying the Server Ports 153
Gigabit Ethernet Ports 153
Network Management Port 154
Serial Management Port 155
Video Connector 157
USB Ports 158
Setting Up BIOS Configuration Parameters 161
Managing the BIOS Configuration 161
Accessing the BIOS Setup Utility 162
BIOS Setup Utility Menus 162
BIOS Key Mappings 163
Access BIOS Setup Utility Menus 164
Navigate BIOS Setup Utility Menus 165
Using Legacy or UEFI BIOS 167
Selecting Legacy BIOS or UEFI BIOS Boot Mode 167
Switching Between Legacy BIOS and UEFI BIOS 168
UEFI BIOS Boot Mode Advantages 169
Configuration Utilities for Add-In Cards 169
Using BIOS for Resource Allocation 170
Legacy Option ROM Allocation 170
I/O Resource Allocation 171
Common BIOS Setup Utility Tasks 172
Verify BIOS Factory Default Settings 172
Select Legacy BIOS or UEFI BIOS Boot Mode 173
viii Sun Server X4-2 Service Manual • May 2014
Page 9
Select the Boot Device 175
Configure iSCSI Virtual Drives 176
Enable or Disable Oracle System Assistant 184
Configure TPM Support 186
Configure SP Network Settings 189
Configure Option ROM Settings 192
Configure I/O Resource Allocation 193
Exit BIOS Setup Utility 194
BIOS Setup Utility Menu Options 197
BIOS Main Menu Selections 198
BIOS Advanced Menu Selections 203
BIOS IO Menu Selections 213
BIOS Boot Menu Selections 217
UEFI Driver Control Menu Selections 221
BIOS Save & Exit Menu Selections 225
Monitoring Components and Identifying SNMP Messages 227
Monitoring Component Health and Faults Using Oracle ILOM 227
Monitoring System Components 228
System Chassis Components 229
Cooling Unit Components 231
Disk Backplane Components 232
Memory Device Components 232
Power Unit Components 233
Processor Components 235
System Board Components 235
System Firmware Components 237
Hard Disk Drive Components 238
Contents ix
Page 10
Identifying SNMP Trap Messages 239
Generic Host Events 240
Environmental Events 240
Hard Disk Drive Events 242
Power Events 243
Fan Events 246
Memory Events 248
Entity Presence Events 253
Physical Presence Events 254
Index 257
x Sun Server X4-2 Service Manual • May 2014
Page 11
Using This Documentation
This service manual explains how to remove and replace parts in the Sun Server X4-2, and how to troubleshoot and maintain the system.
This document is intended for system administrators, network administrators, and service technicians who have an understanding of server systems.
This section describes how to get the latest software and firmware, documentation and feedback, and support and accessibility information.
“Getting the Latest Software and Firmware” on page xi
“About This Documentation” on page xii
“Related Documentation” on page xii
“Feedback” on page xii
“Access to Oracle Support” on page xiii
Getting the Latest Software and Firmware
Firmware, drivers and other hardware-related software for each Oracle x86 server, server module (blade), and blade chassis are updated periodically.
You can obtain the latest software in one of three ways:
Oracle System Assistant – This is a factory-installed option for Oracle x86 servers.
It has all the tools and drivers you need and is built into the server.
My Oracle Support: http://support.oracle.com
Physical media request
xi
Page 12
For more information, see the Sun Server X4-2 Installation Guide, “Getting Server
Firmware and Software Updates” on page 31.
About This Documentation
This documentation set is available in both PDF and HTML formats. The information is presented in topic-based organization (similar to online help) and therefore does not include chapters, appendices, or section numbering.
A PDF document that includes all information on a particular topic (such as hardware installation or product notes) can be generated by clicking the PDF button in the upper left corner of an HTML page.
Related Documentation
Documentation Link
All Oracle documentation http://www.oracle.com/documentation
Sun Server X4-2 http://www.oracle.com/goto/X4-2/docs
Oracle X4 Series Servers Administration Guide
Oracle Integrated Lights Out Manager (ILOM) 3.1
Oracle Hardware Management Pack 2.2
http://www.oracle.com/goto/x86AdminDiag/docs
http://www.oracle.com/goto/ILOM/docs
http://www.oracle.com/goto/OHMP/docs
Feedback
You can provide feedback on this documentation at:
http://www.oracle.com/goto/docfeedback
xii Sun Server X4-2 Service Manual • May 2014
Page 13
Access to Oracle Support
Oracle customers have access to electronic support through My Oracle Support. For information, visit http://www.oracle.com/pls/topic/lookup?ctx=acc&id=
info or visit http://www.oracle.com/pls/topic/lookup?ctx=acc&id=trs
if you are hearing impaired.
Using This Documentation xiii
Page 14
xiv Sun Server X4-2 Service Manual • May 2014
Page 15
About the Sun Server X4-2
These sections describe the controls, connectors, status indicators, system components, and replaceable components of the server.
Description Links
Review the product description. “Product Description” on page 1
Learn about server controls and connectors.
Learn about server and component status indicators.
Learn about system components. “About System Components” on page 15
“About Controls and Connectors” on page 2
“About Server and Component Status Indicators” on page 6
Product Description
The Sun Server X4-2 is an enterprise-class, one rack unit (1U) server. It supports the following components:
Up to two Intel processors. Processors with the following capabilities are
supported:
2.7 GHz, 12-core, 130W
3.0 GHz, 10-core, 130W
2.6 GHz, 8-core, 95W
2.6 GHz, 6-core, 80W
2.5 GHz, 4-core, 80W
Up to 8 DIMMs per processor for a maximum of 16 DDR3 DIMMs and a
maximum of 512 GB of memory on dual-processor systems. DIMM sizes of 8 GB, 16 GB, and 32 GB are supported.
1
Page 16
Note – A maximum of eight DIMMs for a maximum of 256 GB are supported in
single-processor systems.
Four PCIe Gen3 slots in dual-processor systems: three external slots and one
internal. PCIe slot 1, which is an external slot, is nonfunctional in single-processor systems.
Storage drive configurations can include both hard disk drives (HDDs) or solid
state drives (SSDs). Supported storage drive configurations include:
Eight 2.5-inch hot-pluggable SAS HDDs/SATA SSDs
Four 2.5-inch hot-pluggable SAS HDDs/SATA SSDs with DVD
Two hot-pluggable, redundant power supplies.
An on-board Oracle Integrated Lights Out Manager (Oracle ILOM) service
processor (SP) based on the AST2300 chip.
The Oracle System Assistant server setup tool, which is embedded on a
preinstalled USB flash drive.
About Controls and Connectors
The following sections describe the controls, indicators, connectors, and drives located on the front and rear panels.
“Front Panel Controls and Indicators on a System With Eight 2.5-inch Drives” on
page 3
“Front Panel Controls and Indicators on a System With Four 2.5-inch Drives and
DVD Drive” on page 4
“Server Back Panel View” on page 5
Related Information
“About Server and Component Status Indicators” on page 6
“About System Components” on page 15
“Illustrated Parts Breakdown” on page 15
2 Sun Server X4-2 Service Manual • May 2014
Page 17
Front Panel Controls and Indicators on a System With Eight 2.5-inch Drives
The following figure shows the controls, status indicators (LEDs), connectors, and drives on the front panel of a Sun Server X4-2 configured with eight 2.5-inch storage drives.
FIGURE: Front Panel View of a Server With Eight 2.5-inch Drives
Figure Legend
1 Product Serial Number (PSN) label and
Radio Frequency Identification (RFID) tag
2 Locator LED/Locator button: white 11 Storage drive 0
3 USB 2.0 connectors (2) 12 Storage drive 1
4 Service Required LED: amber 13 Storage drive 2
5 Power/OK LED: green 14 Storage drive 3
6 Power button 15 Storage drive 4
7 SP OK LED: green 16 Storage drive 5
8 Top Fan Fault LED: amber 17 Storage drive 6
9 Rear Power Supply (PS) Fault LED: amber 18 Storage drive 7 (In Oracle Engineered
10 System Over Temperature Warning LED:
amber
Systems, storage drive 7 might be populated with a remote battery module for the host bus adapter (HBA) card.)
About the Sun Server X4-2 3
Page 18
Related Information
“Front Panel Controls and Indicators on a System With Four 2.5-inch Drives and
DVD Drive” on page 4
“Server Back Panel View” on page 5
Front Panel Controls and Indicators on a System With Four 2.5-inch Drives and DVD Drive
The following figure shows the controls, status indicators (LEDs), connectors, and drives on the front panel of a Sun Server X4-2 configured with four 2.5-inch storage drives and a SATA DVD drive.
FIGURE: Front Panel View of a Server With Four 2.5-inch Drives and SATA DVD Drive
Figure Legend
1 Product Serial Number (PSN) label and
Radio Frequency Identification (RFID) tag
2 Locator LED/Locator button: white 10 System Over Temperature Warning LED:
3 USB 2.0 connectors (2) 11 Storage drive 0 (optional)
4 Service Required LED: amber 12 Storage drive 1
5 Power/OK LED: green 13 Storage drive 2
4 Sun Server X4-2 Service Manual • May 2014
9 Rear Power Supply Fault LED: amber
amber
Page 19
Figure Legend (Continued)
6 Power button 14 Storage drive 3
7 SP OK LED: green 15 SATA DVD drive
8 Top Fan Fault LED: amber 16 Not applicable
Related Information
“Front Panel Controls and Indicators on a System With Eight 2.5-inch Drives” on
page 3
“Server Back Panel View” on page 5
Server Back Panel View
The following figure shows the Sun Server X4-2 back panel and the location of power supplies, status indicators (LEDs), connectors, and PCIe slots.
FIGURE: Server Back Panel View
Figure Legend
1 Power Supply (PS) 0 9 Oracle Integrated Lights Out Manager (ILOM)
2 Power Supply (PS) 0 status indicators:
Service Required LED: amber AC OK LED: green
3 Power Supply (PS) 1 11 Network (NET) 100/1000/10000 port: NET 3
service processor (SP) network management 10/100BASE-T port (NET MGT)
10 Serial management (SER MGT)/RJ-45 serial
port
(Nonfunctional in single-processor systems.)
About the Sun Server X4-2 5
Page 20
Figure Legend (Continued)
4 Power Supply (PS) 1 status indicators:
Service Required LED: amber, AC OK LED: green
5 System status indicators:
Locator LED: white, Service Required LED: amber, Power/OK LED: green
6 PCIe card slot 1 (Nonfunctional in
single-processor systems.)
7 PCIe card slot 2 15 USB 2.0 connectors (2)
8 PCIe card slots 3 and 4 (Slot 4 is the primary
host bus adapter (HBA) card slot. This slot is internal and is not visible from the rear of the server. The server supports a maximum of one HBA card for controlling and managing the server storage drives.)
12 Network (NET) 100/1000/10000 port: NET 2
(Nonfunctional in single-processor systems.)
13 Network (NET) 100/1000/10000 port: NET 1
14 Network (NET) 100/1000/10000 port: NET 0
16 DB-15 video connector
Note – All of the PCIe slots comply with the PCI Express 3.0 specification and can
accommodate 25 watt PCIe3 cards.
Related Information
“Front Panel Controls and Indicators on a System With Eight 2.5-inch Drives” on
page 3
“Front Panel Controls and Indicators on a System With Four 2.5-inch Drives and
DVD Drive” on page 4
“About System Components” on page 15
About Server and Component Status Indicators
These sections describe the status indicators (LEDs) located on the front and rear of the server, including those found on components and ports.
“Server General Status Indicators” on page 7
“Server Fan Status Indicators” on page 9
“Storage Drive Status Indicators” on page 10
6 Sun Server X4-2 Service Manual • May 2014
Page 21
“Power Supply Status Indicators” on page 11
“Network Management Port Status Indicators” on page 11
“Ethernet Ports Status Indicators” on page 12
“Motherboard Status Indicators” on page 13
Related Information
“About Controls and Connectors” on page 2
“Service Troubleshooting Task List” on page 21
Server General Status Indicators
There are seven, system-level status indicators (LEDs), some of which are located on both the server front panel and the server back panel.
About the Sun Server X4-2 7
Page 22
TABLE: Server General Status Indicators
Status Indicator Name Icon/Label Colors State Meaning
Locator LED and button
White • OFF – Server is operating normally.
• FAST BLINK – Use Oracle ILOM to activate this LED to enable you to locate a particular system quickly and easily.
• Pressing the Locate button will toggle the LED fast blink on or off.
Service Required
Amber • OFF – The server is operating normally.
• STEADY ON – A fault is present on server. This LED lights whenever a fault indicator lights for a replaceable component on the server.
Note - The lighting of this indicator is always accompanied by a system console message that includes a recommended service action.
Power/OK Green Indicates the operational state of the chassis. This indicator can be in the
following states:
• OFF – AC power is not present or the Oracle ILOM boot is not complete.
• STEADY BLINK – Standby power is on, but the chassis power is off and the Oracle ILOM SP is running.
• SLOW BLINK – Startup sequence has been initiated on the host. This pattern should begin soon after you power on the server. This status indicates either: (1) POST code checkpoint tests are running on the server host system, or (2) the host is transitioning from the powered-on state to the standby state on shutdown.
• STEADY ON – The server is powered on, and all host POST code checkpoint tests are complete. The server is in one of the following states: 1) the server host is booting the operating system (OS), 2) the server host is running the OS.
SP OK Green • OFF – Service processor (SP) is not running.
• SLOW BLINK – SP is booting.
• STEADY ON – SP is fully operational.
8 Sun Server X4-2 Service Manual • May 2014
Page 23
TABLE: Server General Status Indicators (Continued)
Status Indicator Name Icon/Label Colors State Meaning
Top Fan TOP FAN Amber Indicates that one (or more) of the internal fan modules has failed.
• OFF – Indicates steady state; no service is required.
• STEADY ON – Indicates service required; service the fan modules,
Rear Power Supply Fault
System Over Temperature Warning
REAR PS Amber Indicates that one of the server power supplies has failed.
• OFF – Indicates steady state; no service is required.
• STEADY ON – Indicates service required; service the power supply.
Amber • OFF – Normal operation; no service is required.
• STEADY ON – The system is experiencing an overtemperature warning condition.
Note - This is a warning indication, not a fatal overtemperature. Failure to correct this might result in the system overheating and shutting down unexpectedly.
Related Information
“Front Panel Controls and Indicators on a System With Eight 2.5-inch Drives” on
page 3
“Front Panel Controls and Indicators on a System With Four 2.5-inch Drives and
DVD Drive” on page 4
“Server Back Panel View” on page 5
Server Fan Status Indicators
Each fan module has one bicolored status indicator (LED). The LEDs are located on the chassis mid-wall adjacent to and aligned with the fan modules and are visible when the top cover fan door is open.
TABLE: Server Fan Status Indicators
Status Indicator Name Icon Color State Meaning
Fan Status None Bicolored:
Amber/Green
• Amber – The fan module is faulty. The front TOP FAN LED and the front and rear panel Service Required LEDs are also lit if the system detects a fan module fault.
• Green – The fan module is correctly installed and operating within specification.
About the Sun Server X4-2 9
Page 24
Related Information
“Front Panel Controls and Indicators on a System With Eight 2.5-inch Drives” on
page 3
“Front Panel Controls and Indicators on a System With Four 2.5-inch Drives and
DVD Drive” on page 4
“Server Back Panel View” on page 5
“Servicing Fan Modules (CRU)” on page 50
Storage Drive Status Indicators
There are three status indicators (LEDs) on each drive.
TABLE: Server Front Storage Drive Indicators
Status Indicator Name Icon Color State Meaning
OK to Remove
Blue • STEADY ON – The storage drive can be removed safely during a
hot-plug operation.
• OFF – The storage drive has not been prepared for removal.
Service Required
Amber • OFF – The storage drive is operating normally.
• STEADY ON – The system has detected a fault with the storage drive.
OK/Activity Green • OFF – Power is off or the installed drive is not recognized by the system.
• STEADY ON – The drive is engaged and is receiving power.
• STEADY BLINK – There is disk activity. The LED blinks on and off to show activity.
Related Information
“Front Panel Controls and Indicators on a System With Eight 2.5-inch Drives” on
page 3
“Front Panel Controls and Indicators on a System With Four 2.5-inch Drives and
DVD Drive” on page 4
“Server Back Panel View” on page 5
“Servicing Storage Drives (CRU)” on page 45
10 Sun Server X4-2 Service Manual • May 2014
Page 25
Power Supply Status Indicators
There are two status indicators (LEDs) on each power supply. These indicators are visible from the rear of the server.
TABLE: Server Power Supply Indicators
Status Indicator Name Icon Color State Meaning
AC OK/DC OK
Service Required
Green • OFF – No AC power is present.
• SLOW BLINK – Normal operation. Input power is within specification. DC output voltage is not enabled.
• STEADY ON – Normal operation. Input AC power and DC output voltage are within specification.
Amber • OFF – Normal operation. No service action is
required.
• STEADY ON – The power supply (PS) has detected a PS fan failure, PS overtemperature, PS over current, or PS over or under voltage.
Related Information
“Front Panel Controls and Indicators on a System With Eight 2.5-inch Drives” on
page 3
“Front Panel Controls and Indicators on a System With Four 2.5-inch Drives and
DVD Drive” on page 4
“Server Back Panel View” on page 5
“Servicing Power Supplies (CRU)” on page 55
Network Management Port Status Indicators
The server has one 10/100BASE-T Ethernet management domain interface, labeled NET MGT. There are two status indicators (LEDs) on this port. These indicators are visible from the rear of the server.
About the Sun Server X4-2 11
Page 26
TABLE: Network Management Port Status Indicators
Status Indicator Name Location Color State and Meaning
Link speed Top left Bicolored:
Amber/Green
• Amber ON – 10BASE-T link.
• Green ON – 100BASE-T link.
• OFF – No link or link down.
• FLASHING – No function.
Activity Top right Green • ON – No function.
• OFF – No activity.
• FLASHING – Packet activity.
Related Information
“Network Management Port” on page 154
“Front Panel Controls and Indicators on a System With Eight 2.5-inch Drives” on
page 3
“Front Panel Controls and Indicators on a System With Four 2.5-inch Drives and
DVD Drive” on page 4
“Server Back Panel View” on page 5
“Servicing Power Supplies (CRU)” on page 55
Ethernet Ports Status Indicators
The server has four Ethernet ports (NET 3, NET 2, NET 1, and NET 0). There are two status indicators on each port. These indicators are visible from the rear of the server.
Note – Ethernet ports NET 2 and NET 3 are nonfunctional in single-processor
systems.
12 Sun Server X4-2 Service Manual • May 2014
Page 27
TABLE: Gigabit Ethernet Port Status Indicators
Status Indicator Name Location Color State and Meaning
Activity Top left Green • ON – No function.
• OFF– No activity.
• FLASHING – Packet activity.
Link speed Top right Bicolored:
Amber/Green
• Amber ON— 100BASE-T link.
• Green ON– 1000/10GBBASE-T link.
• OFF – No link or link down.
• FLASHING – No function.
Related Information
“Gigabit Ethernet Ports” on page 153
“Front Panel Controls and Indicators on a System With Eight 2.5-inch Drives” on
page 3
“Front Panel Controls and Indicators on a System With Four 2.5-inch Drives and
DVD Drive” on page 4
“Server Back Panel View” on page 5
“Servicing Power Supplies (CRU)” on page 55
Motherboard Status Indicators
The motherboard and modules that are installed on the motherboard contain several status indicators (LEDs), which are described in the following sections:
“DDR3 DIMM Fault Status Indicators” on page 13
“Processor Fault Status Indicators” on page 14
“Fault Remind Status Indicator” on page 14
“STBY PWRGD Status Indicator” on page 14
DDR3 DIMM Fault Status Indicators
Each of the 16 DDR3 DIMM sockets on the motherboard has an amber fault status indicator (LED) associated with it. If Oracle ILOM determines that a DIMM is faulty, pressing the Fault Remind button on the motherboard signals the service processor to light the fault LEDs associated with the faulted DIMMs.
About the Sun Server X4-2 13
Page 28
Processor Fault Status Indicators
The motherboard includes a fault status indicator (LED) adjacent to each of the two processor sockets. These LEDs indicate when a processor is faulty. For example, if on reboot the BIOS detects that there are uncorrectable processor errors recorded in the machine check architecture (MCA) registers left over from the previous boot, then the BIOS and Oracle ILOM work together to record and diagnose these errors. If it is determined that a processor is faulty, pressing the Fault Remind button on the motherboard signals the service processor to light the fault LED associated with the faulted processor.
Fault Remind Status Indicator
This status indicator (LED) is located next to the Fault Remind button and is powered from the super capacitor that powers the fault LEDs on the motherboard. This LED lights to indicate that the fault remind circuitry is working properly in cases where no components have failed and, as a result, none of the component fault LEDs illuminate.
STBY PWRGD Status Indicator
The service instructions for all server internal components require that all AC power be removed from the power supplies prior to removing the server top cover. This green status indicator (LED) is labeled STBY PWRGD and is located on the rear of the server near PCIe slot 2.
Note – If a PCIe card is installed in PCIe slot 2, this LED might not be visible.
The STBY PWRGD LED lights to inform the service technician that the motherboard is receiving standby power from at least one of the power supplies. This LED is provided to help prevent service actions on the server internal components while the AC power cords are installed and power is being supplied to the server.
Related Information
“Front Panel Controls and Indicators on a System With Eight 2.5-inch Drives” on
page 3
“Front Panel Controls and Indicators on a System With Four 2.5-inch Drives and
DVD Drive” on page 4
“Server Back Panel View” on page 5
“Illustrated Parts Breakdown” on page 15
14 Sun Server X4-2 Service Manual • May 2014
Page 29
About System Components
These sections describe the components of the server:
“Illustrated Parts Breakdown” on page 15
“Customer-Replaceable Units” on page 17
“Field-Replaceable Units” on page 18
“Battery Module” on page 19
Related Information
“Servicing CRUs That Do Not Require Server Power-Off” on page 45
“Servicing CRUs That Require Server Power-Off” on page 61
“Servicing FRUs” on page 99
Illustrated Parts Breakdown
The following figure identifies the major components of the server.
About the Sun Server X4-2 15
Page 30
FIGURE: Server Illustrated Parts Breakdown
Figure Legend
1 Server chassis for eight 2.5-inch storage
drives
2 Server chassis for four 2.5-inch storage
drives and DVD drive
3 SATA DVD drive 12 PCIe cards and the internal HBA card
16 Sun Server X4-2 Service Manual • May 2014
10 DIMMs
Note: A maximum of eight DIMMs are
supported in single-processor systems and the DIMMs must be installed in sockets associated with processor 0 (P0). No DIMM filler panels are required in vacant DIMM sockets associated with processors 0 (P0) or 1 (P1).
11 Battery
Note: PCIe slot 1 is nonfunctional in
single-processor systems.
Page 31
Figure Legend (Continued)
4 Four and eight 2.5-inch storage drive
configurations (In Oracle Engineered Systems, storage drive 7 in eight-drive systems might be populated with a remote battery module for the HBA card.)
5 Front indicator module for server with four or
eight 2.5-inch storage drives
6 Disk backplanes 15 USB flash drives
7 Fan modules 16 Processor heatsinks
8 Chassis mid-wall 17 Motherboard
9 Processors
Note: There is only one processor in
single-processor systems and that processor is located in processor socket 0 (P0).
13 To p c o ve r
14 PCIe riser
Note: In single-processor systems, neither a
heatsink nor a processor socket filler are installed in the processor socket 1 (P1). To protect the delicate processor socket pins, the cover that comes on the motherboard from manufacturing is left in place.
18 Power supplies
Customer-Replaceable Units
The following table lists the customer-replaceable units (CRUs) in the server and directs you to the replacement instructions.
CRU Description Replacement Instructions
Battery Lithium coin-cell battery that powers the
CMOS BIOS and real-time clock.
DIMMs Add or replace memory in the system. “Servicing the DIMMs (CRU)” on page 62
Storage drives Supports SAS hard-disk drives (HDDs) or
SATA solid-state drives (SSDs) and a DVD drive.
• Four-drive configuration contains four
2.5-inch storage drives and a DVD drive
• Eight-drive configuration contains eight
2.5-inch storage drives
Note - In Oracle Engineered Systems, storage drive 7 in eight-drive systems might be populated with a remote battery module for the HBA card. The battery module is not a customer-replaceable unit.
“Servicing the Battery (CRU)” on page 96
“Servicing Storage Drives (CRU)” on page 45
About the Sun Server X4-2 17
Page 32
CRU Description Replacement Instructions
DVD drive DVD drive on configurations with four
“Servicing the DVD Drive (CRU)” on page 91
2.5-inch storage drives.
Fan modules Contains four fan modules for cooling the
“Servicing Fan Modules (CRU)” on page 50
motherboard assembly and installed components.
PCIe cards Optional add-on cards that can expand the
“Servicing PCIe Cards (CRU)” on page 84
functionality of the server.
PCIe risers Houses and connects the PCIe cards. “Servicing PCIe Risers (CRU)” on page 74
Power supplies Two fully redundant AC power supplies. “Servicing Power Supplies (CRU)” on page 55
Related Information
“Field-Replaceable Units” on page 18
“Illustrated Parts Breakdown” on page 15
“Servicing CRUs That Do Not Require Server Power-Off” on page 45
“Servicing CRUs That Require Server Power-Off” on page 61
Field-Replaceable Units
The following table lists the field-replaceable units (FRUs) in the server and directs you to the replacement instructions.
FRU Description Replacement Instructions
Processor Carries out the instructions of the system. “Servicing Processors (FRU)” on page 100
Disk backplane Serves as the interface between storage
drives and the host bus adapter (HBA) card.
Front indicator module (FIM)
Motherboard assembly
Contains the front panel controls, indicators, and USB ports.
Provides connectors for the fans, DIMMs, processor(s), PCIe risers, and internal USB ports, and power supplies.
Internal HBA SAS controller
Serve to connect the disk backplane to the the internal host bus adapter (HBA) card.
cables
“Servicing the Disk Backplane (FRU)” on
page 117
“Servicing the Front Indicator Module (FRU)”
on page 125
“Servicing the Motherboard (FRU)” on
page 127
“Servicing the SAS Cables (FRUs)” on
page 138
18 Sun Server X4-2 Service Manual • May 2014
Page 33
Related Information
“Customer-Replaceable Units” on page 17
“Illustrated Parts Breakdown” on page 15
“Servicing FRUs” on page 99
Battery Module
In Oracle Engineered Systems, storage drive 7 might be populated with a remote battery module for the host bus adapter (HBA) card.
Caution – The battery module is not a customer-replaceable unit (CRU) and should
not be removed or replaced by customers. The battery module should be removed or replaced only by Oracle field service personnel.
The battery module is hot-pluggable and provides the backup power subsystem for the Sun Storage 6 Gb SAS PCIe RAID Internal Host Bus Adapter (HBA): SG-SAS6-R-INT-Z. It enables Oracle field service personnel to replace the battery at the end of its service life without requiring the server to be powered off.
Related Information
“Servicing Storage Drives (CRU)” on page 45
About the Sun Server X4-2 19
Page 34
20 Sun Server X4-2 Service Manual • May 2014
Page 35
Troubleshooting the Server
These sections introduce the diagnostic tools and strategies available to help you diagnose problems with the system.
Description Links
Review the tasks used to locate a specific problem with the system.
Understand the diagnostic system indicators, utilities, and commands available.
Gather information about the system to help a service engineer or technician work on your system.
Locate the server serial number. “Locate the Server Serial Number” on page 25
Inspect the system methodically to locate a faulty component or components.
“Service Troubleshooting Task List” on page 21
“Diagnostic Tools” on page 22
“Gather Service Information” on page 24
“Inspecting the System” on page 25
Related Information
“About the Sun Server X4-2” on page 1
“Preparing for Service” on page 29
“Returning the Server to Operation” on page 143
Service Troubleshooting Task List
Use the list in following table as a sequence for troubleshooting the server.
21
Page 36
TABLE: Troubleshooting Task List
No. Description Section or Document
1 Gather initial service information. “Gather Service Information” on page 24
2 Investigate power-on problems. “Troubleshoot Power Problems” on page 26
3 Perform external visual inspection
and internal visual inspection.
“Inspect the Server Externally” on page 26
“Inspect Internal Server Components” on
page 26
4 View service processor logs and
sensor information.
Oracle Integrated Lights Out Manager (ILOM)
3.1 Documentation Library at:
http://www.oracle.com/goto/ILOM/docs
5 Run diagnostics. Oracle x86 Servers Diagnostics, Applications, and
Utilities Guide for Servers With Oracle ILOM 3.1
at:
http://www.oracle.com/goto/x86AdminD
iag/docs
Related Information
“About the Sun Server X4-2” on page 1
“Diagnostic Tools” on page 22
Diagnostic Tools
There are a variety of diagnostic tools, commands, and indicators you can use to monitor and troubleshoot the server:
LEDs – These indicators provide a quick visual notification of the status of the
server and of some of the CRUs and FRUs.
Oracle ILOM firmware – This firmware is located on the service processor and
provides a comprehensive service portal using a command-line interface (CLI) and browser user interface (BUI) for
lights-out management capabilities (such as remote power-on, power-off, etc.)
monitoring the health of the environmental subsystem (such as power, fans,
temperature, cover interlock, etc.),
and fault management and automated diagnosis capabilities during server
initialization (such as QuickPath Interconnect code and Memory Reference code) and server runtime
22 Sun Server X4-2 Service Manual • May 2014
Page 37
Pc-Check – Accessed through Oracle ILOM, the DOS-based Pc-Check utility tests
motherboard components such as processor, memory, and I/O, as well as ports and slots. If enabled through Oracle ILOM, this utility will run each time the system powers on. For information about Pc-Check, refer to the Oracle X86 Servers Diagnostics, Applications, and Utilities Guide for Servers With Oracle ILOM 3.1 at
http://www.oracle.com/goto/x86AdminDiag/docs.
SNMP– Simple Network Management Protocol (SNMP) traps are generated by
the SNMP agents that are installed on the SNMP devices being managed by Oracle ILOM. Oracle ILOM receives the SNMP traps and converts them into SNMP event messages that appear in the event log.
POST – Power-on self-test (POST) performs diagnostics on system components
upon system power-on and resets to ensure the integrity of those components. POST messages are displayed and logged in the BIOS event logs. POST works with Oracle ILOM to take faulty components offline, if needed.
Oracle Solaris OS Diagnostic Tools
Oracle Solaris OS Predictive Self-Healing (PSH) – The PSH technology
provides automated diagnosis of error events encountered with the processor, memory subsystem, and integrated I/O subsystem during runtime. The ability of PSH to off-line faulty processors and retire memory pages during runtime enhances system availability and prevents future interruptions. The Oracle Solaris OS PSH technology in conjunction with ILOM and BIOS provide extensive fault management architecture for placing processors offline and disabling DIMMs.
Log files and console messages – These items provide the standard Oracle
Solaris OS log files and investigative commands that can be accessed and displayed on the device of your choice.
Oracle VTS software – This application exercises the system, provides
hardware validation, and discloses possible faulty components with recommendations for repair.
The LEDs, Oracle ILOM, Oracle Solaris OS PSH, and many of the log files and console messages are integrated. For example, Oracle Solaris software will display a detected fault, log it, pass information to Oracle ILOM where it will be logged, and, depending on the fault, might cause one or more LEDs to light.
Related Information
“About the Sun Server X4-2” on page 1
Oracle Solaris OS documentation set at:
Oracle Solaris 10 1/13 information library at:
http://docs.oracle.com/cd/E26505_01/index.html
Oracle Solaris 11.1 information library at:
http://docs.oracle.com/cd/E26502_01/index.html
Troubleshooting the Server 23
Page 38
Oracle Integrated Lights Out Manager (ILOM) 3.1 Documentation Library at:
http://www.oracle.com/goto/ILOM/docs
Oracle x86 Servers Diagnostics, Applications, and Utilities Guide for Servers With Oracle
ILOM 3.1 at:
http://www.oracle.com/goto/x86AdminDiag/docs
Oracle VTS documentation set at:
http://docs.oracle.com/cd/E19719-01/index.html
Gather Service Information
The first step in determining the cause of a problem with the server is to gather information for the service call paperwork or the onsite personnel. Follow these general guidelines when you begin troubleshooting.
1. Collect information about the following items:
Events that occurred prior to the failure
Whether any hardware or software was modified or installed
Whether the server was recently installed or moved
How long the server exhibited symptoms
The duration or frequency of the problem
2. Document the server settings before you make any changes.
If possible, make one change at a time in order to isolate potential problems. In this way, you can maintain a controlled environment and reduce the scope of troubleshooting.
3. Note the results of any change that you make. Include any errors or informational messages.
4. Check for potential device conflicts before you add a new device.
5. Check for version dependencies, especially with third-party software.
Related Information
“Diagnostic Tools” on page 22
“Locate the Server Serial Number” on page 25
24 Sun Server X4-2 Service Manual • May 2014
Page 39
Locate the Server Serial Number
To locate the server serial number, do one of the following:
From the front of the server, look at the left side of the front panel to locate the
server serial number. The serial number is located on the Radio-frequency identification (RFID) tag on the front panel, next to the general status indicators. For illustrations of the server front panel, see “About Controls and
Connectors” on page 2.
Refer to the service label that is attached to the top cover of the server. This
label includes the serial number.
Locate the yellow Customer Information Sheet (CIS) attached to your server
packaging. This sheet includes the serial number.
From the Oracle ILOM command-line interface (CLI), enter the show /System
command or go to the System Information Summary page in the Oracle ILOM web browser interface.
Related Information
“Diagnostic Tools” on page 22
“Gather Service Information” on page 24
Inspecting the System
Controls that have been improperly set and cables that are loose or improperly connected are common causes of problems with hardware components. Follow these procedures to locate common problems with the system.
“Troubleshoot Power Problems” on page 26
“Inspect the Server Externally” on page 26
“Inspect Internal Server Components” on page 26
Related Information
“About the Sun Server X4-2” on page 1
“Preparing for Service” on page 29
Troubleshooting the Server 25
Page 40
Troubleshoot Power Problems
1. If the server is powered off, power on the server.
See “Power On the Server” on page 150.
If the server powers on, go to “Inspect the Server Externally” on page 26.
If the server does not power on, go to Step 2.
2. Check that power cords are attached firmly to the server power supplies and to the power source.
Related Information
“Power On the Server” on page 150
“Inspect the Server Externally” on page 26
“Servicing Power Supplies (CRU)” on page 55
Inspect the Server Externally
1. Inspect the external status indicators (LEDs), which can indicate component malfunctions.
For the LED locations and descriptions of their behavior, see “About Server and
Component Status Indicators” on page 6.
2. Verify that nothing in the server environment is blocking airflow or making a contact that could short out power.
3. If the problem is not evident, continue with the next section, “Inspect Internal
Server Components” on page 26.
Related Information
“About the Sun Server X4-2” on page 1
“Inspect Internal Server Components” on page 26
Inspect Internal Server Components
1. Power down the server from main power mode to standby power mode.
Graceful power down – Notify users and gracefully power down the system.
Immediate power down – Power down the system quickly.
See “Powering Down the Server” on page 33 for instructions.
26 Sun Server X4-2 Service Manual • May 2014
Page 41
2. Disconnect the AC power cables from the server, extend the server to the maintenance position, and remove the server top cover.
See “Preparing the Server for Component Replacement” on page 33.
3. Inspect the internal status indicators (LEDs), which can indicate component malfunction.
For the LED locations and descriptions of their behavior, see “Motherboard Status
Indicators” on page 13.
To light these LEDs, press and hold down the Fault Remind button, which is located on the motherboard. For more information on the Fault Remind button, see “Using the Fault Remind Button” on page 70.
4. Verify that there are no loose or improperly seated components.
5. Verify that all cables inside the system are firmly and correctly attached to their appropriate connectors.
6. Verify that any components that you ordered separately and were not installed at the factory are qualified and supported.
Check with your customer representative for information about which PCIe cards and DIMMs are supported.
7. Check that the installed DIMMs comply with the supported DIMM population rules and configurations.
For more information, see “DIMM Population Rules” on page 67.
8. Return the server to operation.
See “Returning the Server to Operation” on page 143.
9. Press and release the Power button on the server front panel.
When the main power is applied to the server, the Power/OK indicator next to the Power button blinks slowly until the OS is ready. When the OS is ready, the Power/OK indicator remains lit. For more information about the indicator, see
“Server General Status Indicators” on page 7.
10. If the problem with the server is not evident, then log in to either the Oracle ILOM Fault Management Shell or Oracle Solaris service portal and use the fault management command fmadm faulty to list any faults that might be present on the server.
For instructions, see the Oracle Integrated Lights Out Manager (ILOM) 3.1 Documentation Library at http://www.oracle.com/goto/ILOM/docs.
Related Information
“Preparing for Service” on page 29
“About Server and Component Status Indicators” on page 6
Troubleshooting the Server 27
Page 42
“Returning the Server to Operation” on page 143
28 Sun Server X4-2 Service Manual • May 2014
Page 43
Preparing for Service
These sections describe safety considerations and provide prerequisite procedures and information to replace components within the server.
Description Links
Understand the safety precautions, understand the server FRU top-level indicator, understand the safety symbols, and take ESD precautions prior to removing or installing parts in the server.
Assemble the required tools. “Required Tools” on page 32
Before working with components within the server, power down the server and prepare for servicing.
“Safety Precautions” on page 29
“FRU TLI Auto-Update” on page 30
“Safety Symbols” on page 30
“Electrostatic Discharge Safety” on page 31
“Preparing the Server for Component Replacement” on page 33
Related Information
“Returning the Server to Operation” on page 143
Safety Precautions
For your protection, observe the following safety precautions when setting up your equipment:
Follow all standard cautions, warnings, and instructions marked on the
equipment and described in the online Sun Server X4-2 Safety and Compliance Guide and in the printed Important Safety Information for Oracle’s Hardware Systems.
Ensure that the voltage and frequency of your power source match the voltage
and frequency inscribed on the equipment’s electrical rating label.
Follow the electrostatic discharge safety practices as described in “Electrostatic
Discharge Safety” on page 31.
29
Page 44
Disconnect both power supply cords before servicing components.
Related Information
“Safety Symbols” on page 30
“Electrostatic Discharge Safety” on page 31
FRU TLI Auto-Update
Oracle ILOM includes a top-level indicator (TLI) auto-update feature that ensures that the TLI stored in the server field-replaceable units (FRUs) is always correct. The TLI, which is unique to each server, is used to track the server service entitlement and warranty coverage. When a server requires service, the server TLI is used to verify that the warranty on the server has not expired.
The TLI is stored in the FRUID (field-replaceable unit identifiers) of these components: power supply 0 (PSU 0), motherboard (MB), and disk backplane (DBP).
The TLI components stored in each component FRUID include:
Product name
PPN (product part number)
PSN (product serial number)
When a server FRU that contains the TLI is removed and a replacement module installed, the TLI of the replacement module is programmed by Oracle ILOM to contain the same TLI as the other two modules.
Safety Symbols
The following symbols might appear in this book. Note their meanings.
Caution – There is a risk of personal injury or equipment damage. To avoid
personal injury or equipment damage, follow the instructions.
30 Sun Server X4-2 Service Manual • May 2014
Page 45
Caution – The surface of the component is hot. Avoid contact. Surfaces are hot and
might cause personal injury if touched.
Caution – Hazardous voltages are present. To reduce the risk of electric shock and
danger to personal health, follow the instructions.
Related Information
“Safety Precautions” on page 29
“Electrostatic Discharge Safety” on page 31
Electrostatic Discharge Safety
Devices that are sensitive to electrostatic discharge (ESD), such as the motherboard, PCIe cards, drives, processors, and memory cards require special handling.
Caution – The boards and drives contain electronic components that are extremely
sensitive to static electricity. Ordinary amounts of static electricity from clothing or the work environment can destroy components. Do not touch the components along their connector edges.
Do the following when handling ESD-sensitive components:
Use an antistatic wrist strap.
Wear an antistatic wrist strap and use an antistatic mat when handling components such as drive assemblies, boards, or cards. When servicing or removing server components, attach an antistatic strap to your wrist and then to a metal area on the chassis. Then disconnect the power cords from the server. Following this practice equalizes the electrical potentials between you and the server.
Note – An antistatic wrist strap is not included in the Ship Kit for the server.
However, antistatic wrist straps are included with options and components.
Use an antistatic mat.
Preparing for Service 31
Page 46
Place ESD-sensitive components such as the motherboard, memory DIMMs, and other printed circuit boards on an antistatic mat. The following items can be used as an antistatic mat:
An antistatic bag used to wrap an Oracle replacement part
An Oracle ESD mat (orderable item)
A disposable ESD mat (shipped with some replacement parts or optional
system components)
Related Information
“Safety Precautions” on page 29
“Safety Symbols” on page 30
“Preparing the Server for Component Replacement” on page 33
“Returning the Server to Operation” on page 143
Required Tools
The server can be serviced with the following tools:
Antistatic wrist strap
Antistatic mat
No. 2 Phillips screwdriver
Related Information
“Preparing for Service” on page 29
“Servicing CRUs That Do Not Require Server Power-Off” on page 45
“Servicing CRUs That Require Server Power-Off” on page 61
“Servicing FRUs” on page 99
32 Sun Server X4-2 Service Manual • May 2014
Page 47
Preparing the Server for Component Replacement
Before you can remove and install components that are inside the server, you must perform the procedures in the following tasks:
Note – When replacing the storage drives or power supplies, not all of these
procedures are necessary. See the replacement tasks for those components for more information.
“Powering Down the Server” on page 33
“Disconnect Cables From the Server” on page 39
“Extend the Server to the Maintenance Position” on page 39
“Remove the Server From the Rack” on page 41
“Take Antistatic Measures” on page 42
“Remove the Server Top Cover” on page 43
Related Information
“Returning the Server to Operation” on page 143
Powering Down the Server
To determine how you want to power off the server, view the options in the following table:
Description Link
Power down the server gracefully to save all data and to prevent data from being corrupted.
Performing a graceful shutdown ensures that the system is ready for restart.
“Power Down Server Gracefully Using
the Oracle ILOM CLI” on page 34
“Power Down Server Gracefully Using
the Oracle ILOM Web Interface” on page 35
“Power Down Server Gracefully Using
the Power Button” on page 36
Preparing for Service 33
Page 48
Description Link
If the server is not responding, or you must shut down the server quickly, perform an immediate shutdown.
“Use the Power Button for Immediate
Shutdown” on page 36
“Use the Oracle ILOM CLI for
Immediate Shutdown” on page 37
“Use the Oracle ILOM Web Interface
for Immediate Shutdown” on page 38
Related Information
“Power On the Server” on page 150
Power Down Server Gracefully Using the Oracle ILOM
CLI
Performing a graceful shut down ensures that all of your data is saved and the system is ready for restart.
1. Log in to the server as superuser or equivalent.
Depending on the nature of the problem, you might want to view the system status or the log files or run diagnostics before you shut down the system. For more information, see the Oracle ILOM 3.1 Documentation Library at
http://www.oracle.com/goto/ILOM/docs
2. Notify affected users that the server will be powered down.
3. Save any open files, and quit all running applications.
Refer to your application documentation for specific information on these processes.
4. Log in to the Oracle ILOM command-line interface (CLI) using an Administrator account.
For instructions, see the Sun Server X4-2 Installation Guide, “Log In to Oracle ILOM
Remotely Using the CLI” on page 91.
5. At the Oracle ILOM prompt, shut down the operating system:
-> stop /System
If the system is running the Oracle Solaris OS, refer to the Oracle Solaris system administration documentation for additional information.
6. Disconnect the power cords and data cables from the server.
See “Disconnect Cables From the Server” on page 39.
34 Sun Server X4-2 Service Manual • May 2014
Page 49
Related Information
“Power Down Server Gracefully Using the Oracle ILOM Web Interface” on
page 35
“Power Down Server Gracefully Using the Power Button” on page 36
“Use the Power Button for Immediate Shutdown” on page 36
“Power On the Server” on page 150
Power Down Server Gracefully Using the Oracle ILOM
Web Interface
1. Log in to the server as superuser or equivalent.
Depending on the nature of the problem, you might want to view the system status or the log files or run diagnostics before you shut down the system. For more information, see the Oracle ILOM 3.1 Documentation Library at
http://www.oracle.com/goto/ILOM/docs.
2. Notify affected users that the server will be powered down.
3. Save any open files and quit all running applications.
Refer to your application documentation for specific information about these processes.
4. Log in to the Oracle ILOM web interface using an Administrator account.
For instructions, see the Sun Server X4-2 Installation Guide, “Log In to Oracle ILOM
Remotely Using the Web Interface” on page 89.
The Oracle ILOM web interface System Information > Summary page appears.
5. In the left pane, click Host Management > Power Control, and select Graceful Shutdown and Power Off from the Action list.
6. Click Save and then click OK.
The host server performs an orderly power shutdown.
7. Disconnect the power cords an data cables from the server.
See “Disconnect Cables From the Server” on page 39.
Related Information
“Power Down Server Gracefully Using the Oracle ILOM CLI” on page 34
“Power Down Server Gracefully Using the Power Button” on page 36
“Use the Power Button for Immediate Shutdown” on page 36
“Power On the Server” on page 150
Preparing for Service 35
Page 50
Power Down Server Gracefully Using the Power Button
Note – You can use the Power button on the front of the server to power down the
server gracefully.
1. Press and quickly release the Power button on the front panel.
This action causes ACPI-enabled operating systems to perform an orderly shutdown of the operating system. Servers not running ACPI-enabled operating systems shut down to standby power mode immediately.
When main power is off, the Power/OK LED on the front panel will begin flashing, indicating that the server is in standby power mode. See “Server General
Status Indicators” on page 7.
2. Disconnect the power cords and data cables from the server.
See “Disconnect Cables From the Server” on page 39.
Caution – When you press the Power button to enter standby power mode, power
is still directed to the service processor remote management subsystem and power supply fans. To completely power off the server, you must disconnect the power cords from the power supplies.
Related Information
“About Controls and Connectors” on page 2
“Power Down Server Gracefully Using the Oracle ILOM CLI” on page 34
“Power Down Server Gracefully Using the Oracle ILOM Web Interface” on
page 35
“Use the Power Button for Immediate Shutdown” on page 36
“Power On the Server” on page 150
Use the Power Button for Immediate Shutdown
Caution – You might corrupt your system data during an immediate power down,
so only use this procedure to power down the server after attempting the graceful power down procedure.
36 Sun Server X4-2 Service Manual • May 2014
Page 51
1. Press and hold the Power button for four seconds to force the main power off and to enter standby power mode.
When main power is off, the Power/OK LED on the front panel will begin flashing, indicating that the server is in standby power mode. See “Server General
Status Indicators” on page 7.
2. Disconnect the power and data cables from the server.
See “Disconnect Cables From the Server” on page 39.
Caution – When you press the Power button to enter standby power mode, power
is still directed to the service processor remote management subsystem and power supply fans. To completely power off the server, you must disconnect the power cords from the power supplies.
Related Information
“About Controls and Connectors” on page 2
“Power Down Server Gracefully Using the Oracle ILOM CLI” on page 34
“Power Down Server Gracefully Using the Oracle ILOM Web Interface” on
page 35
“Power Down Server Gracefully Using the Power Button” on page 36
“Power On the Server” on page 150
“Use the Oracle ILOM CLI for Immediate Shutdown” on page 37
“Use the Oracle ILOM Web Interface for Immediate Shutdown” on page 38
Use the Oracle ILOM CLI for Immediate Shutdown
Caution – You might corrupt your system data during an immediate power down,
so only use this procedure to power down the server after attempting the graceful power down procedure.
1. Log in to the Oracle ILOM command-line interface (CLI) using an Administrator account.
Oracle ILOM displays the default command prompt (->), indicating that you have successfully logged in to Oracle ILOM.
2. From the CLI prompt, type the following command:
-> stop -f /System
The server powers down immediately.
Preparing for Service 37
Page 52
3. Disconnect the power and data cables from the server.
See “Disconnect Cables From the Server” on page 39.
Caution – When the server is powered down, it enters standby power mode. In
standby power mode, power is still directed to the service processor remote management subsystem and power supply fans. To completely power off the server, you must disconnect the power cords from the power supplies.
Related Information
“Use the Power Button for Immediate Shutdown” on page 36
“Use the Oracle ILOM Web Interface for Immediate Shutdown” on page 38
Use the Oracle ILOM Web Interface for Immediate
Shutdown
Caution – You might corrupt your system data during an immediate power down,
so only use this procedure to power down the server after attempting the graceful power down procedure.
1. Log in to the Oracle ILOM web interface using an Administrator account.
The Oracle ILOM web interface System Information page appears.
2. In the left pane, click Host Management > Power Control, and select Immediate Power Off in the Actions list.
3. Click Save, and then click OK.
The server powers down immediately.
4. Disconnect the power and data cables from the server.
See “Disconnect Cables From the Server” on page 39.
Caution – When the server is powered down, it enters standby power mode. In
standby power mode, power is still directed to the service processor remote management subsystem and power supply fans. To completely power off the server, you must disconnect the power cords from the power supplies.
Related Information
“Use the Power Button for Immediate Shutdown” on page 36
“Power Down Server Gracefully Using the Oracle ILOM CLI” on page 34
38 Sun Server X4-2 Service Manual • May 2014
Page 53
Disconnect Cables From the Server
Caution – The system supplies standby power to the circuit boards when the power
cords are connected even when the system is powered off.
1. Label all cables connected to the server.
2. Disconnect the power cords from the rear of the server.
3. Disconnect all data cables from the rear of the server.
Related Information
“Server Back Panel View” on page 5
“Powering Down the Server” on page 33
“Remove the Server From the Rack” on page 41
“Reconnect Data Cables and Power Cords” on page 149
Extend the Server to the Maintenance Position
The following components can be serviced with the server in the maintenance position:
Storage drives
Fan modules
Power supplies
DVD drive module
PCIe risers
PCIe cards
DDR3 DIMMs
Internal USB flash drives
Motherboard battery
Processors
Disk backplane
FIM (front indicator module)
Motherboard
SAS cables
If the server is installed in a rack with extendable slide-rails, use this procedure to extend the server to the maintenance position.
Preparing for Service 39
Page 54
1. To prevent the rack from tipping forward when the server is extended, extend all rack anti-tilt devices.
Caution – To reduce the risk of personal injury, stabilize the expansion rack cabinet
and extend all anti-tilt devices before extending the server from the rack.
For instructions for stabilizing the rack, see the Sun Server X4-2 Installation Guide,
“Stabilize the Rack for Installation” on page 43.
2. Verify that no cables will be damaged or will interfere when the server is extended.
Although the cable management arm (CMA) that is supplied with the server is hinged to accommodate extending the server, you should ensure that all cables and cords are capable of extending.
3. To release the slide-rails locks, pull down on the flip-down handles, which are located on the sides of the server front panel (see FIGURE: Slide-Rail Release
Latches on page 41 [callout 1]).
Note – The slide-rail locks are located behind the flip-down handles on the front of
the server. The slide-rail locks are released when the flip-down handles are pulled down.
The server is now in the extended maintenance position.
40 Sun Server X4-2 Service Manual • May 2014
Page 55
FIGURE: Slide-Rail Release Latches
Figure Legend
1 Slide-rail lock
2 Slide-rail release tab
4. Slowly pull the server out of the rack until the slide-rails latch.
Related Information
“Disconnect Cables From the Server” on page 39
“Remove the Server From the Rack” on page 41
“Reinstall the Server Chassis Into the Rack” on page 146
Remove the Server From the Rack
Caution – The server weighs approximately 18.0 kg (40.0 lbs). Two people are
required to dismount and carry the chassis.
1. Disconnect all the cables and power cords from the server.
2. Remove the cable management arm (CMA).
For instructions for removing the second-generation CMA, see the Sun Server
X4-2 Installation Guide, “Remove the Second-Generation Cable Management Arm” on page 64.
Preparing for Service 41
Page 56
For instructions for removing the first-generation CMA, see the Sun Server X4-2
Installation Guide, “Install the First-Generation Cable Management Arm” on page 69 and perform the installation steps in reverse order.
3. Extend the server to the maintenance position.
For instructions, see “Extend the Server to the Maintenance Position” on page 39.
4. From the front of the server, pull the green slide-rail release tabs toward the front of the server, and pull the server out of the rack until it is free of the rack rails.
A slide-rail release tab is located on each slide-rail (see FIGURE: Slide-Rail Release
Latches on page 41 [callout 2]).
Note – To pull the green slide-rail release tab, place your finger in the center of the
tab, not on the end, and apply pressure as you pull the tab toward the front of the server.
5. Set the server on a sturdy work surface.
Related Information
“Disconnect Cables From the Server” on page 39
“Take Antistatic Measures” on page 42
“Extend the Server to the Maintenance Position” on page 39
“Reinstall the Server Chassis Into the Rack” on page 146
Take Antistatic Measures
1. Prepare an antistatic surface on which to set parts during removal and installation.
Place electrostatic discharge (ESD)-sensitive components such as the printed circuit boards on an antistatic mat. The following items can be used as an antistatic mat:
An antistatic bag used to wrap a replacement part
An Oracle ESD mat (orderable item)
A disposable ESD mat (shipped with some replacement parts or optional
system components)
2. Attach an antistatic wrist strap.
When servicing or removing server components, attach an antistatic strap to your wrist and then to a metal area on the chassis. Then disconnect the power cords from the server.
42 Sun Server X4-2 Service Manual • May 2014
Page 57
Note – An antistatic wrist strap is not included in the Ship Kit for the server.
However, antistatic wrist straps are still included with options and components.
Related Information
“Remove Antistatic Measures” on page 146
Open the Server Fan Door
Servicing the server fan modules and other components located in the front of the server, such as the front indication module, the DVD drive, and disk backplanes, requires that the fan door be opened. It is also easier to remove the server top cover if you open the fan door first.
To open the server fan door, slide the fan door latches to the front of the server
and swing the door up to the open position.
FIGURE: Opening the Server Fan Door
Related Information
“Take Antistatic Measures” on page 42
“Remove the Server Top Cover” on page 43
Remove the Server Top Cover
Servicing most of the server components requires that the top cover be removed.
Preparing for Service 43
Page 58
Caution – If the top cover is removed without first removing the AC power cords,
the server host shuts down immediately and an event is logged to indicate that the chassis top cover has been removed.
1. Ensure that the AC power cords are disconnected from the server power supplies.
2. To open the server fan door, slide the fan door latches toward the front of the server and swing the door up to the open position.
3. To open the server top cover, press and hold down the top cover release buttons and use the recessed areas to slide the top cover toward the rear of the server about 0.5 inches (12.7 mm) [1].
FIGURE: Removing the Server Top Cover
4. Lift the cover off the chassis and set it aside [2].
Related Information
“Take Antistatic Measures” on page 42
“Install the Server Top Cover” on page 145
44 Sun Server X4-2 Service Manual • May 2014
Page 59
Servicing CRUs That Do Not Require Server Power-Off
The following sections describe how to service customer-replaceable units (CRUs) that do not require you to power off the server.
Description Links
Service the storage drives. “Servicing Storage Drives (CRU)” on page 45
Service the fan modules. “Servicing Fan Modules (CRU)” on page 50
Service the power supplies. “Servicing Power Supplies (CRU)” on page 55
Related Information
“Servicing CRUs That Require Server Power-Off” on page 61
“Servicing FRUs” on page 99
Servicing Storage Drives (CRU)
These sections describe how to remove and install storage drives:
In addition to the service procedures included here, you can see animations that show how to remove and install the storage drives.
To play the Sun Server X4-2 Service Animations go to:
http://docs.oracle.com/cd/E36975_01/html/E49470/index.html
See the following procedures:
45
Page 60
“Storage Drives Hot-Plug Conditions” on page 46
“HDD or SSD Failure and RAID” on page 46
“Storage Drive Status Indicators” on page 47
“Remove a Storage Drive” on page 47
“Install a Storage Drive” on page 49
Related Information
“Removing and Installing Server Filler Panels” on page 143
Storage Drives Hot-Plug Conditions
The solid-state drives (SSDs) or hard-disk drives (HDDs) in the server are hot-pluggable, but this capability depends on how the drives are configured. To hot-plug a drive you must be able to take the drive offline before you can remove it. When you take the drive offline, you prevent any application from accessing the drive and remove the logical software links to the drive.
The following situations inhibit the ability to perform hot-plugging of a drive:
The drive provides the operating system, and the operating system is not mirrored
on another drive.
The drive cannot be logically isolated from the online operations of the server.
If either of the above drive restrictions is true, then you must shut down the system before you replace the drive. See “Powering Down the Server” on page 33.
Note – Replacing a drive does not require removing the server from a rack.
Related Information
“Remove a Storage Drive” on page 47
“Install a Storage Drive” on page 49
HDD or SSD Failure and RAID
A single storage drive failure does not cause a data failure if the storage drives are configured as a mirrored RAID 1 volume (optional). The storage drive can be removed, and when a new storage drive is inserted, the contents are automatically
46 Sun Server X4-2 Service Manual • May 2014
Page 61
rebuilt from the rest of the array with no need to reconfigure the RAID parameters. If the replaced storage drive was configured as a hot-spare, the new storage drive is automatically configured as a new hot-spare.
For instructions for configuring RAID on the server, refer to the Sun Server X4-2
Installation Guide, “Configuring Server Drives for OS Installation” on page 113.
Storage Drive Status Indicators
The following figure and table describe the storage drive status indicators (LEDs).
FIGURE: Storage Drive Status Indicators
TABLE: Storage Drive Status Indicators
Legend LED Color State Meaning
1 OK to Remove Blue • STEADY ON – The storage drive can be removed safely
during a hot-plug operation.
• OFF – The storage drive has not been prepared for removal.
2 Service Required Amber • OFF – The storage drive is operating normally.
• STEADY ON – The system has detected a fault with the storage drive.
3 OK/Activity Green • OFF – Power is off or the installed drive is not recognized by
the system.
• STEADY ON – The drive is engaged and is receiving power.
• STEADY BLINK – There is disk activity. The LED blinks on and off to show activity.
Remove a Storage Drive
1. Prepare the system for the drive removal.
See “Take Antistatic Measures” on page 42.
Servicing CRUs That Do Not Require Server Power-Off 47
Page 62
2. Identify the location of the drive that you want to remove.
The following illustrations show the location of the drives and the internal system software designations for the drives.
FIGURE: Drive Location and Numbering on a Server With Four 2.5-inch Drives and a
DVD Drive
DVD
HDD0 HDD1 HDD2 HDD3
FIGURE: Drive Location and Numbering on a Server With Eight 2.5-inch Drives
HDD4 HDD5 HDD6 HDD7
HDD0 HDD1 HDD2 HDD3
Caution – In Oracle Engineered Systems, storage drive 7 (HDD7) on an eight-drive
system might be populated with a remote battery module for the host bus adapter (HBA). The battery module is not a customer-replaceable unit (CRU) and should not be removed or replaced by customers. The battery module should be removed or replaced only by Oracle field service personnel. For more information, refer to
“Battery Module” on page 19.
3. Type the operating system commands required to stop the server from using the drive.
Note – The blue OK to Remove LED on the storage drive might not light, as support
for this varies depending on the operating system in use.
The exact commands required depend on the configuration of your drives. Unmount file systems or issue RAID commands as needed.
48 Sun Server X4-2 Service Manual • May 2014
Page 63
4. On the drive you plan to remove, push the latch release button to open the drive latch.
FIGURE: Locating the Hard Disk Drive Release Button and Latch
Caution – The latch is not an ejector. Do not open the latch too far to the right.
Doing so can damage the latch.
5. Grasp the latch and pull the drive out of the drive slot.
6. Consider your next steps:
If you are replacing the drive, continue to “Install a Storage Drive” on page 49.
If you are not replacing the drive, install a filler panel in the empty drive slot to
maintain proper airflow and perform administrative tasks to configure the server to operate without the drive.
Related Information
“Storage Drive Status Indicators” on page 47
“Storage Drives Hot-Plug Conditions” on page 46
“HDD or SSD Failure and RAID” on page 46
“Install a Storage Drive” on page 49
Install a Storage Drive
1. Remove the replacement drive from its packaging and place the drive on an antistatic mat.
Servicing CRUs That Do Not Require Server Power-Off 49
Page 64
2. If necessary, remove the drive filler panel.
3. Align the replacement drive with the drive slot.
The drive is physically addressed according to the slot in which it is installed. It is important to install a replacement drive in the same slot as the drive that was removed.
4. Slide the drive into the slot until the drive is fully seated [1].
5. Close the drive latch to lock the drive in place [2].
6. Perform administrative tasks to reconfigure the drive.
The procedures that you perform at this point depend on how your data is configured. You might need to partition the drive, create file systems, load data from backups, or have the drive updated from a RAID configuration.
Related Information
“Storage Drive Status Indicators” on page 47
“Storage Drives Hot-Plug Conditions” on page 46
“HDD or SSD Failure and RAID” on page 46
“Remove a Storage Drive” on page 47
Servicing Fan Modules (CRU)
The fan modules are located in the front of the server.
50 Sun Server X4-2 Service Manual • May 2014
Page 65
In addition to the service procedures included here, you can see animations that show how to remove and install the fans.
To play the Sun Server X4-2 Service Animations go to:
http://docs.oracle.com/cd/E36975_01/html/E49470/index.html
See the following procedures:
“Remove a Fan Module” on page 51
“Install a Fan Module” on page 54
Related Information
“Open the Server Fan Door” on page 43
Remove a Fan Module
You do not have to power off the server to service the fan modules.
You should not begin this procedure unless you have a replacement fan and you are ready to install it right away.
1. Extend the server to the maintenance position.
See “Extend the Server to the Maintenance Position” on page 39.
2. To access the fan modules, open the server fan door.
See “Open the Server Fan Door” on page 43.
Caution – Close the server fan door within 60 seconds to maintain adequate airflow
to properly cool the server. Leaving the door open for more than 60 seconds, while the server is running, might cause the server to overheat and shut down.
3. Identify the faulty fan module.
Each fan module has a status indicator (LED). If the LED is lit amber, the fan has failed. The LEDs are located on the chassis mid-wall adjacent to and aligned with the fan modules, as shown in the following figure:
Servicing CRUs That Do Not Require Server Power-Off 51
Page 66
FIGURE: Server Fan Status Indicator
Status Indicator Name Color State Meaning
Fan Status Bicolored:
Amber/Green
4. Remove the faulty fan module.
a. Using your thumb and forefinger, grasp the fan module in the finger recesses
located in the plastic between the fans.
b. Lift the fan module straight up and out of the chassis.
52 Sun Server X4-2 Service Manual • May 2014
• Amber – The fan module is faulty. The front TOP FAN LED and the front and rear panel Service Required LEDs are also lit if the system detects a fan module fault.
• Green – The fan module is correctly installed and operating within specification.
Page 67
Caution – When removing a fan module, do not rock it back and forth. Rocking the
fans module can cause damage to the motherboard connectors.
FIGURE: Removing a Fan Module
5. Set the fan module aside.
Caution – Do not service any other components in the fan compartment unless the
system is shut down and the power cords are removed.
6. Consider your next step:
If you removed the fan assembly as part of another procedure, return to that
procedure.
Otherwise, continue to “Install a Fan Module” on page 54.
Related Information
“Install a Fan Module” on page 54
Servicing CRUs That Do Not Require Server Power-Off 53
Page 68
Install a Fan Module
1. Remove the replacement fan module from its packaging and place it on an antistatic mat.
2. With the fan door open, position the replacement fan module into the server.
The fan modules are keyed to ensure that they are installed in the correct orientation.
FIGURE: Installing a Fan Module
3. Press down on the fan module and apply firm pressure to fully seat the fan module.
4. Verify that the fan status LED that aligns with the replaced fan module is lit green.
5. Close the fan door.
6. Verify that the Top Fan Fault LED on the front of the server and the Service Required LEDs on the front and rear of the server are extinguished.
See “About Server and Component Status Indicators” on page 6 for more information about identifying and interpreting system indicators.
7. Consider your next step:
If you installed the fan module as part of another procedure, return to that
procedure.
54 Sun Server X4-2 Service Manual • May 2014
Page 69
Otherwise, return the server to operation.
See “Returning the Server to Operation” on page 143.
Related Information
“Remove a Fan Module” on page 51
Servicing Power Supplies (CRU)
The redundant power supplies on the server support concurrent maintenance, which enables you to remove and replace a power supply without shutting down the server, provided that the other power supply is online and working.
The server supports model A256 (600 watt) power supplies. The A256 power supply unit (PSU) provides conversion from the AC lines to the system, accepting ranges from 100-240 volts AC (VAC). These PSUs are designed to be hot-swappable, and in most cases will provide fully redundant ”1+1” power, allowing the system to suffer the loss of a PSU or an AC feed with no loss to system availability.
In maximally configured systems, it is possible that the worst-case power consumption of the system could exceed the capacity of a single PSU. The PSUs provide an over-subscription mode, which allows the system to operate with fault-tolerance even with modest excursions beyond the rated capacity of a single PSU. This over-subscription support is accomplished using hardware signaling between the PSU and motherboard circuitry, which can force the system to throttle maximally CPU and memory power in the event that a PSU is lost. The resulting power savings will be enough to allow the system to continue to run (in a lower-performance state) until the power problem is resolved.
The following indicators are lit when a power supply fault is detected:
Front and rear Service Required LEDs
Amber Service Required LED on the faulty power supply
Rear Power Supply Fault LED located on the server front panel (see “Front Panel
Controls and Indicators on a System With Eight 2.5-inch Drives” on page 3)
If a power supply fails and you do not have a replacement available, leave the failed power supply installed to ensure proper airflow in the server.
In addition to the service procedures included here, you can see animations that show how to remove and install the power supplies.
Servicing CRUs That Do Not Require Server Power-Off 55
Page 70
To play the Sun Server X4-2 Service Animations go to:
http://docs.oracle.com/cd/E36975_01/html/E49470/index.html
See the following sections for more information:
“Power Supply Status Indicators” on page 56
“Remove a Power Supply” on page 57
“Install a Power Supply” on page 59
Related Information
“About Server and Component Status Indicators” on page 6
“Powering Down the Server” on page 33
Power Supply Status Indicators
Each power supply contains two status indicators (LEDs) on the back panel. The following figure and table provide a description of the power supply status indicators.
FIGURE: Power Supply Status Indicators
56 Sun Server X4-2 Service Manual • May 2014
Page 71
TABLE: Server Power Supply Indicators
Status Indicator
Legend
Name Icon Color State Meaning
1 AC OK/DC
OK
2 Service
Required
Green • OFF – No AC power is present.
• SLOW BLINK – Normal operating. Input power is within specification. DC output voltage is not enabled.
• STEADY ON – Normal operation. Input AC power and DC output voltage are within specification.
Amber • OFF – Normal operation; no service required.
• STEADY ON – The power supply (PS) has detected a PS fan failure, PS overtemperature, PS over current, or PS over/under voltage.
Remove a Power Supply
1. Identify which power supply requires replacement.
A lit amber Service Required LED on a power supply indicates that a failure was detected. You can also use the Oracle ILOM show faulty command at the Oracle ILOM prompt (->) to identify a power supply failure.
Alternatively, to list all known faults in the server, log into the Oracle Solaris OS and issue the fmadm faulty command, or log into the Oracle ILOM service processor from the Oracle ILOM Fault Management Shell and issue the fmadm faulty command.
Note – The fans of a failed power supply might still be spinning when the system is
powered on. You can remove a power supply while the fans are still spinning.
2. Gain access to the rear of the server where the faulty power supply is located.
3. If the cable management arm (CMA) is installed, disconnect both CMA left-side connectors and move the CMA out of the way.
For instructions for disconnecting the CMA left-side connectors, see the Sun Server
X4-2 Installation Guide, “Remove the Second-Generation Cable Management Arm” on page 64 or Sun Server X4-2 Installation Guide, “Install the First-Generation Cable Management Arm” on page 69 as applicable.
Servicing CRUs That Do Not Require Server Power-Off 57
Page 72
Caution – When disconnecting the CMA left-side connectors, be sure to use your
arm to support the CMA so that it does not hang down under its own weight and stress the right-side connectors; otherwise, the CMA might be damaged. You must continue to support the CMA until you have reconnected both of the left-side connectors.
4. Disconnect the power cord from the faulty power supply.
5. Grasp the power supply handle and push the power supply latch to the left [1].
FIGURE: Removing a Power Supply
Figure Legend
1 Power Supply 0 (PS0)
2 Power Supply 1 (PS1)
6. Pull the power supply out of the chassis [2].
Caution – Whenever you remove a power supply, you should replace it with
another power supply; otherwise, the server might overheat due to improper airflow.
7. Continue to “Install a Power Supply” on page 59.
Related Information
“Server Back Panel View” on page 5
“Power Supply Status Indicators” on page 56
“Install a Power Supply” on page 59
58 Sun Server X4-2 Service Manual • May 2014
Page 73
Install a Power Supply
Caution – Always replace a faulty power supply with a power supply of the same
type.
1. Remove the replacement power supply from its packaging and place it on an antistatic mat.
2. Align the replacement power supply with the empty power supply slot [1].
3. Slide the power supply into the bay until it is fully seated [2].
You will hear an audible click when the power supply fully seats.
4. Reconnect the power cord to the power supply.
5. Verify that the amber LED on the replaced power supply and the Service Required LEDs are not lit on the front and rear panels.
Note – After you have replaced Power Supply 0, you must reset the Oracle ILOM
service processor (SP) to propagate the FRU top-level indicator (TLI) data to the new power supply. For instructions on resetting the SP, see the Oracle Integrated Lights Out Manager (ILOM) 3.1 Configuration and Maintenance Guide at
http://www.oracle.com/goto/ILOM/docs. Power Supply 1 does not contain
FRU TLI data, and therefore does not require an SP reset after replacement.
6. If you disconnected the two CMA left-side connectors, reconnect the connectors.
For instructions for reconnecting the CMA left-side connectors, see Sun Server
X4-2 Installation Guide, “Install the Second-Generation Cable Management Arm” on page 52 or Sun Server X4-2 Installation Guide, “Install the First-Generation Cable Management Arm” on page 69 as applicable.
Servicing CRUs That Do Not Require Server Power-Off 59
Page 74
Related Information
“Server Back Panel View” on page 5
“Power Supply Status Indicators” on page 56
“Remove a Power Supply” on page 57
60 Sun Server X4-2 Service Manual • May 2014
Page 75
Servicing CRUs That Require Server Power-Off
These sections describe how to service customer-replaceable units (CRUs) that require you to power off the server.
Description Use
Learn about the locations of the CRUs. “CRU Locations” on page 61
Service the DIMMs. “Servicing the DIMMs (CRU)” on page 62
Service the PCIe Risers. “Servicing PCIe Risers (CRU)” on page 74
Service the PCIe cards. “Servicing PCIe Cards (CRU)” on page 84
Service the DVD drive. “Servicing the DVD Drive (CRU)” on page 91
Service the internal USB flash drives. “Servicing the Internal USB Flash Drives
(CRU)” on page 94
Service the battery. “Servicing the Battery (CRU)” on page 96
Related Information
“Servicing CRUs That Do Not Require Server Power-Off” on page 45
“Servicing FRUs” on page 99
CRU Locations
The CRU locations are shown in the following figure.
61
Page 76
FIGURE: CRU Locations
Figure Legend
1 DVD drive (CRU) 5 Internal HBA card (CRU) (installs in slot 4 on
2 DIMMS (CRUs)
Note: A maximum of eight DIMMs are
supported in single-processor systems and the DIMMs must be installed in DIMM sockets associated with the P0 processor socket.
3 Battery (CRU) 7 PCIe riser for slot 1 (CRU) (nonfunctional in
4 PCIe riser for slots 3 and 4 (CRU) 8 USB flash drives (CRU)
Servicing the DIMMs (CRU)
This section describes how to remove and install DIMMs.
62 Sun Server X4-2 Service Manual • May 2014
PCIe riser for slots 3 and 4)
6 PCIe card (CRU) (installs in slot 3 on PCIe
riser for slots 3 and 4)
single-processor systems)
Page 77
In addition to the service procedures included here, you can see animations that show how to remove and install DIMMs.
To play the Sun Server X4-2 Service Animations go to:
http://docs.oracle.com/cd/E36975_01/html/E49470/index.html
The Sun Server X4-2 supports a variety of DDR3 DIMM configurations that can include quad-rank (QR), dual-rank (DR), and single-rank (SR) DDR3 DIMMs.
Note – While single-rank DIMMs can be installed in the Sun Server X4-2, Oracle
does not offer any single-rank DIMMs for purchase, either as factory-installed options or as separately orderable options.
Caution – These procedures require that you handle components that are sensitive
to electrostatic discharge. This sensitivity can cause the components to fail. To avoid damage, ensure that you follow antistatic practices as described in “Take Antistatic
Measures” on page 42.
Caution – Ensure that all power is removed from the server before removing or
installing DDR3 DIMMs, or damage to the DDR3 DIMMs might occur. You must disconnect all power cables from the system before performing these procedures.
The following sections provide information to assist you when replacing or upgrading a DIMM:
“DIMM and Processor Physical Layout” on page 64
“DIMM Population Example for Optimal System Performance” on page 65
“DIMM Population Rules” on page 67
“DIMM Rank Classification Labels” on page 69
“Inconsistencies Between DIMM Fault Indicators and the BIOS Isolation of Faulty
DIMMs” on page 70
“Using the Fault Remind Button” on page 70
“Identify and Remove a Faulty DIMM” on page 71
“Install a DIMM” on page 73
Servicing CRUs That Require Server Power-Off 63
Page 78
Related Information
“Servicing CRUs That Require Server Power-Off” on page 61
“Servicing FRUs” on page 99
DIMM and Processor Physical Layout
The physical layout of the DIMMs and processor(s) is shown in the following figure. When viewing the server from the front, processor 0 (P0) is on the left.
FIGURE: DIMM and Processor Physical Layout
Note – In single-processor systems, the DIMM sockets associated with the processor
1 (P1) socket are nonfunctional and should not be populated with DIMMs or DIMM filler panels.
64 Sun Server X4-2 Service Manual • May 2014
Page 79
Related Information
“DIMM Population Example for Optimal System Performance” on page 65
“DIMM Population Rules” on page 67
“DIMM Rank Classification Labels” on page 69
“Inconsistencies Between DIMM Fault Indicators and the BIOS Isolation of Faulty
DIMMs” on page 70
DIMM Population Example for Optimal System Performance
This section provides an example of how to populate the DIMM sockets to achieve optimal system performance.
Note – Not all possible configurations are shown here.
The following figure shows the order in which the DIMM sockets should be populated in dual-processor systems. In single-processor systems, you should follow the same order except that you should not install DIMMs in the DIMM sockets associated with processor 1 (P1) socket, because the P1 DIMM sockets are nonfunctional.
FIGURE: DIMM Population Order
For more information on populating the DIMMs, see the following sections:
Servicing CRUs That Require Server Power-Off 65
Page 80
“DIMM Population Order for Single-Processor Systems” on page 66
“DIMM Population Order for Dual-Processor Systems” on page 66
DIMM Population Order for Single-Processor Systems
In single-processor systems, DIMMs should only be installed into DIMM sockets associated with processor 0 (P0) starting with P0 D0, first filling the blue sockets and then the white. This DIMM population order is illustrated in FIGURE: DIMM
Population Order on page 65.
The following table describes the order in which the DIMM sockets should be populated in single-processor systems. The figure callouts in column two of the table refer to the callouts in FIGURE: DIMM Population Order on page 65.
TABLE: DIMM Population Order for Single-Processor Systems
Population Order Figure Callouts DIMMs Sockets for Processor 0 (P0)
Fill the blue sockets first
First 1 D0
Second 3 D2
Third 5 D5
Fourth 7 D7
Then fill the white sockets
Fifth 9 D1
Sixth 11 D3
Seventh 13 D4
Eighth 15 D6
DIMM Population Order for Dual-Processor Systems
In dual-processor systems, DIMMs should be installed into DIMM sockets starting with P0 D0 and then alternating between sockets associated with processor 0 (P0) and matching sockets for processor 1 (P1), first filling the blue sockets and then the white. This DIMM population order is illustrated in FIGURE: DIMM Population
Order on page 65.
The following table describes the order in which the DIMM sockets should be populated in dual-processor systems. The figure callouts in columns two and four of the table refer to callouts in FIGURE: DIMM Population Order on page 65.
66 Sun Server X4-2 Service Manual • May 2014
Page 81
TABLE: DIMM Population Order for Dual-Processor Systems
Population Order Figure Callouts
DIMM Sockets for Processor 0 (P0) Figure Callouts
DIMM Sockets for Processor 1 (P1)
Fill the blue sockets first
First 1 Fill D0 first 2 Then D0
Second 3 Fill D2 first 4 Then D2
Third 5 Fill D5 first 6 Then D5
Fourth 7 Fill D7 first 8 Then D7
Then fill the white sockets
Fifth 9 Fill D1 first 10 Then D1
Sixth 11 Fill D3 first 12 Then D3
Seventh 13 Fill D4 first 14 Then D4
Eighth 15 Fill D6 first 16 Then D6
DIMM Population Rules
The DIMM population rules for the server are as follows:
In dual-processor systems, all DIMM sockets can have either a DIMM or DIMM
filler panel installed.
Note – In dual-processor systems, DIMM filler panels are not required in sockets
that do not have DIMMs installed.
In single-processor systems, all DIMM sockets associated with processor 0 (P0)
socket can have either a DIMM or DIMM filler panel installed. DIMM sockets associated with processor 1 (P1) socket must remain empty, that is, neither DIMMs nor DIMM filler panel should be installed.
Note – In single-processor systems, DIMM filler panels are not required in sockets
that do not have DIMMs installed.
A single DIMM configuration is supported.
Do not populate DIMM sockets next to an empty processor socket. Each processor
contains a separate memory controller.
Servicing CRUs That Require Server Power-Off 67
Page 82
Each processor has four channels and can support two DIMMs per channel for a
maximum of eight quad-rank (QR) Load Reduced DIMMs (LRDIMMs), dual-rank (DR) DIMMs, or single-rank (SR) DIMMs.
The server supports 32-GB QR LRDIMMs in addition to 8-GB and 16-GB
Registered DIMMs (RDIMMs).
You cannot mix LRDIMMs and RDIMMs in the same server. If you choose to
install 32-GB QR LRDIMMs in the server, you cannot install any 8-GB or 16-GB single-rank or dual-rank RDIMMs.
If a you are installing a mix of RDIMM sizes (for example, 8-GB and 16-GB), install
the larger RDIMMs first, then the smaller RDIMMs, until all of the RDIMMs are installed.
If you are installing QR LRDIMMs within a channel, populate the blue sockets
prior to populating the white sockets.
Note – While the processor used in the server supports four modes of
operation—Independent Channel Mode, Lockstep Channel Mode, Mirrored Channel Mode, and, Device Tagging Mode—the server supports the Independent Channel Mode only.
Note – Each processor (P0, P1) has eight associated DIMM sockets, numbered D0,
D1, D2, D3, D4, D5, D6, and D7.
DIMMs should be installed into DIMM sockets starting with P0 D0 and
alternating between sockets associate with P0 and matching sockets for P1, first filling the blue sockets and then the white. For an example of a configuration that follows this rule, see “DIMM Population Example for Optimal System
Performance” on page 65.
For maximum performance, apply the following rules:
The best performance is ensured by preserving symmetry. For example, add
four of the same kind of DIMMs, one per memory channel, and if the server has two processors, ensure that both processors have the same size of DIMMs installed in the same manner.
For optimal performance, the memory installation should be identical across
both sockets. Populate each socket with QR, DR, or SR DIMMs in sets of four, one per memory channel.
All memory installed in the system will operate at the same speed. DIMM
operational speed, or frequency, is limited to the lowest speed determined by each of the following factors:
Maximum memory frequency supported by processor
Maximum memory frequency supported by the installed DIMMs
Memory configuration within a channel
68 Sun Server X4-2 Service Manual • May 2014
Page 83
The table below lists the memory speed limitations associated with all the possible combinations of DIMMs within an individual memory channel.
Blue Socket White Socket Speed
Quad-rank LRDIMM Empty 1600 MT/s
Dual-rank DIMM Empty 1600 MT/s
Single-rank DIMM Empty 1600 MT/s
Quad-rank LRDIMM Quad-rank LRDIMM 1600 MT/s
Dual-rank DIMM Dual-rank DIMM 1600 MT/s
Dual-rank DIMM Single-rank DIMM 1600 MT/s
Single-rank DIMM Single-rank DIMM 1600 MT/s
DIMM Rank Classification Labels
DIMMs come in a variety of ranks: single, dual, or quad. Each DIMM is shipped with a label identifying its rank classification. The following table identifies the label corresponding to each DIMM rank classification:
TABLE: DIMM Rank Classifications Labels
Rank Classification Label
Quad-rank LRDIMM 4Rx4
Dual-rank DIMM 2Rx4
Single-rank DIMM 1Rx4
Note – While single-rank DIMMs can be installed in the Sun Server X4-2, Oracle
does not offer any single-rank DIMMs for purchase, either as factory-installed options or as separately orderable options.
Servicing CRUs That Require Server Power-Off 69
Page 84
Inconsistencies Between DIMM Fault Indicators and the BIOS Isolation of Faulty DIMMs
When a single DIMM is marked as faulty by Oracle ILOM (for example, fault.memory.intel.dimm.training-failed is listed in the SP Event Log), BIOS might disable the entire memory channel that contains the faulty DIMM as failing, that is, up to two DIMMs. As a result, the memory available to the operating system is reduced. However, when the Fault Remind button is pressed, only the fault status indicator (LED) associated with the faulty DIMM lights. The fault LED for the other DIMM in the memory channel remains off. Therefore, you can correctly identify the faulty DIMM.
When the faulty DIMM is replaced, the memory available to the operating system returns to normal. If the DIMM fault message in Oracle ILOM is not cleared under Open Problems, you must manually clear the fault in Oracle ILOM. For instructions for manually clearing a DIMM fault, see the procedure "Clear Faults for Undetected Replaced or Repaired Hardware Components" in the Oracle Integrated Lights Out Manager (ILOM) 3.1 User’s Guide in the Oracle Integrated Lights Out Manager (ILOM)
3.1 Documentation Library at http://www.oracle.com/goto/ILOM/docs.
Using the Fault Remind Button
When the Fault Remind button is pressed, an LED located next to the Fault Remind button lights green to indicate that there is sufficient voltage present in the fault remind circuit to light any fault LEDs that were lit due to a failure. If this LED fails to light when you press the Fault Remind button, it is likely that the capacitor powering the fault remind circuit has lost its charge. This can happen if the Fault Remind button is pressed for a long time with fault LEDs lit or if power has been removed from the server for more than 15 minutes.
The following figure shows the location of the Fault Remind button.
70 Sun Server X4-2 Service Manual • May 2014
Page 85
FIGURE: Fault Remind Button Location
Identify and Remove a Faulty DIMM
1. Prepare the server for service.
a. Power off the server, and disconnect the power cords from the power
supplies.
See “Powering Down the Server” on page 33
b. Extend the server to the maintenance position.
See “Extend the Server to the Maintenance Position” on page 39.
c. Attach an antistatic wrist strap to your wrist and then to a metal area on the
chassis.
See “Take Antistatic Measures” on page 42.
d. Remove the server top cover.
See “Remove the Server Top Cover” on page 43.
Servicing CRUs That Require Server Power-Off 71
Page 86
2. Identify and note the location of the faulty DDR3 DIMM by pressing the Fault
Remind button on the motherboard (see “Using the Fault Remind Button” on
page 70).
Faulty DDR3 DIMMs are identified with a corresponding amber LED on the motherboard.
If the DIMM fault LED is off, the DIMM is operating properly.
If the DIMM fault LED is on (amber), then the DIMM is faulty and should be
replaced.
3. To remove the faulty DIMM do the following:
a. Rotate both DIMM socket ejectors outward as far as they will go.
The DIMM is partially ejected from the socket (See FIGURE: DIMM Socket
Release and Alignment on page 73).
b. Carefully lift the DIMM straight up to remove it from the socket.
4. Replace each faulty DIMM with either another DIMM of the same rank size (quad rank, dual rank, or single rank) or leave the socket empty.
Note – DIMM filler panels are optional and not required.
For DIMM replacement instructions, see “Install a DIMM” on page 73.
72 Sun Server X4-2 Service Manual • May 2014
Page 87
FIGURE: DIMM Socket Release and Alignment
Figure Legend
1 DIMM connector socket
2 DIMM connector key
3 DIMM ejector lever
Install a DIMM
1. Unpack the replacement DDR3 DIMM and place it on an antistatic mat.
2. Ensure that the replacement DDR3 DIMM matches the size of the DIMM it is replacing.
You must not replace a dual-rank DIMM with a quad-rank DIMM and vice versa. If you violate this rule, the performance of the server might be adversely affected. For DIMM socket population rules, see “DIMM Population Rules” on page 67.
3. Install a DIMM.
a. Ensure that the ejector tabs are in the open position.
b. Align the notch in the replacement DIMM with the connector key in the
connector socket.
The notch ensures that the DIMM is oriented correctly.
Servicing CRUs That Require Server Power-Off 73
Page 88
c. Push the DDR3 DIMM into the connector until the ejector tabs lock the
DIMM in place.
If the DIMM does not easily seat into the connector socket, verify that the notch in the DIMM is aligned with the connector key in the connector socket. If the notch is not aligned, damage to the DIMM might occur.
4. Repeat Step 3 until all replacement DDR3 DIMMs are installed.
5. Return the server to operation.
a. Install the server top cover.
See “Install the Server Top Cover” on page 145.
b. Close the fan door.
c. Return the server to the normal rack position.
See “Return the Server to the Normal Rack Position” on page 148.
d. Reconnect the power cords to the server power supplies and reconnect the
data cables.
See “Reconnect Data Cables and Power Cords” on page 149.
e. Power on the server.
See “Power On the Server” on page 150.
Verify that the Power/OK status indicator is steady on.
6. (Optional) Use Oracle ILOM to clear server DDR3 DIMM faults.
DDR3 DIMM faults are automatically cleared after a new memory DIMM has been installed. If you need to manually clear DDR3 DIMM faults, refer to the Oracle Integrated Lights Out Manager (ILOM) 3.1 Documentation Library at
http://www.oracle.com/goto/ILOM/docs.
Servicing PCIe Risers (CRU)
This section describes how to remove and install the PCIe risers.
In addition to the service procedures included here, you can see animations that show how to remove and install the PCIe risers.
74 Sun Server X4-2 Service Manual • May 2014
Page 89
To play the Sun Server X4-2 Service Animations go to:
http://docs.oracle.com/cd/E36975_01/html/E49470/index.html
PCIe cards in all slots are installed on vertical risers. You must remove the relevant riser to remove and replace a PCIe card. You must remove all three PCIe risers when replacing the motherboard.
Caution – These procedures require that you handle components that are sensitive
to electrostatic discharge. This sensitivity can cause the components to fail. To avoid damage, ensure that you follow antistatic practices as described in “Take Antistatic
Measures” on page 42.
Caution – Ensure that all power is removed from the server before removing or
installing PCIe risers, or damage to the PCIe cards installed in the risers might occur. You must disconnect all power cables from the system before performing these procedures.
The following sections provide information to assist you in servicing PCIe risers:
“PCIe Riser Location and Differences” on page 75
“Remove a PCIe Riser From PCIe Slot 1 or 2” on page 76
“Install a PCIe Riser Into PCIe Slot 1 or 2” on page 78
“Remove a PCIe Riser From PCIe Slots 3 and 4” on page 80
“Install a PCIe Riser Into PCIe Slots 3 and 4” on page 82
Related Information
“Servicing PCIe Cards (CRU)” on page 84
PCIe Riser Location and Differences
The PCIe riser that installs in PCIe slots 3 and 4 is different than the risers in PCIe slots 1 and 2. The riser for slots 3 and 4 supports two cards: a standard PCIe card and an internal HBA card. Do not attempt to install the PCIe riser for PCIe slots 3 and 4 in slots 1 or 2, and vice versa.
Servicing CRUs That Require Server Power-Off 75
Page 90
FIGURE: PCIe Riser Locations
Figure Legend
1 PCIe riser and installed PCIe card in slot 1 (This slot is nonfunctional in single-processor systems.)
2 PCIe riser and installed PCIe card in slot 2
3 PCIe riser and installed cards (2) in slots 3 and 4 (this riser contains the internal HBA card.)
Note: This riser is different from the risers in slots 1 and 2.
Related Information
“PCIe Slot Characteristics” on page 85
Remove a PCIe Riser From PCIe Slot 1 or 2
Note – PCIe slot 1 is nonfunctional in single-processor systems.
76 Sun Server X4-2 Service Manual • May 2014
Page 91
1. Prepare the server for service.
a. Power off the server and disconnect the power cords from the power
supplies.
See “Powering Down the Server” on page 33.
b. Extend the server to the maintenance position.
See “Extend the Server to the Maintenance Position” on page 39.
c. Attach an antistatic wrist strap to your wrist and then to a metal area on the
chassis.
See “Take Antistatic Measures” on page 42.
d. Remove the server top cover.
See “Remove the Server Top Cover” on page 43.
2. Select the PCIe riser that you want to remove.
See “PCIe Slot Characteristics” on page 85.
3. If a PCIe card is installed in the riser, disconnect any external or internal cables from the card.
4. Lift the green-tabbed latch on the rear of the server chassis next to the applicable PCIe slot to release the rear bracket on the PCIe card [1].
Servicing CRUs That Require Server Power-Off 77
Page 92
Note – If the riser does not have a PCIe card installed, then lift the latch to release
the PCIe slot filler panel.
5. Lift the green-tabbed riser release lever on the PCIe riser with one hand and use your other hand to remove the riser from the motherboard connector [2, 3].
6. If there is a PCIe card installed in the riser, place the riser on an antistatic mat and note the slot in which the PCIe riser was originally installed; otherwise, set the PCIe riser aside.
Related Information
“Install a PCIe Riser Into PCIe Slot 1 or 2” on page 78
Install a PCIe Riser Into PCIe Slot 1 or 2
Note – PCIe slot 1 is nonfunctional in single-processor systems.
Note – The PCIe riser in slots 3 and 4 is different than the PCIe risers in slots 1 and
2. Do not attempt to install a PCIe riser from slot 1 or 2 into PCIe slots 3 and 4 and
vice versa.
1. Determine into which slot you will install the PCIe riser.
2. Retrieve the PCIe riser and any PCIe cards attached to it.
3. Raise the green-tabbed release lever on the PCIe riser to the open (up) position [1].
4. Gently press the riser into the motherboard connector until it is seated, and then press the green-tabbed release lever on the PCIe riser to the closed (down) position [2].
78 Sun Server X4-2 Service Manual • May 2014
Page 93
5. If there is a PCIe card installed in the riser, reconnect any external or internal cables to the card.
6. Close the green-tabbed latch on the rear of the server chassis next to the applicable PCIe slot to secure the rear bracket on the PCIe card to the server chassis [3].
Note – If the riser does not have a PCIe card installed, install a PCIe slot filler panel
and close the green-tabbed latch to secure the filler panel.
7. Return the server to operation.
a. Install the server top cover.
See “Install the Server Top Cover” on page 145.
b. Close the fan door.
c. Return the server to the normal rack position.
See “Return the Server to the Normal Rack Position” on page 148.
d. Reconnect the power cords to the server power supplies and reconnect the
data cables.
See “Reconnect Data Cables and Power Cords” on page 149.
Servicing CRUs That Require Server Power-Off 79
Page 94
e. Power on the server.
See “Power On the Server” on page 150.
Verify that the Power/OK status indicator is steady on.
8. Use Oracle ILOM to clear the server PCIe riser fault.
If the PCIe riser fault message in Oracle ILOM is not cleared under Open Problems, you must manually clear the fault in Oracle ILOM. For instructions for manually clearing a PCIe riser fault, see the procedure "Clear Faults for Undetected Replaced or Repaired Hardware Components" in the Oracle Integrated Lights Out Manager (ILOM) 3.1 User’s Guide in the Oracle Integrated Lights Out Manager (ILOM) 3.1 Documentation Library at
http://www.oracle.com/goto/ILOM/docs.
Related Information
“Remove a PCIe Riser From PCIe Slot 1 or 2” on page 76
Remove a PCIe Riser From PCIe Slots 3 and 4
1. Prepare the server for service.
a. Power off the server and disconnect the power cords from the power
supplies.
See “Powering Down the Server” on page 33
b. Extend the server to the maintenance position.
See “Extend the Server to the Maintenance Position” on page 39.
c. Attach an antistatic wrist strap to your wrist and then to a metal area on the
chassis.
See “Take Antistatic Measures” on page 42.
d. Remove the server top cover.
See “Remove the Server Top Cover” on page 43.
2. If there is a PCIe card installed in the riser, disconnect any external or internal cables.
Note – Do not disconnect the SAS cable from the internal host bus adapter card until
after you have removed the riser from the server.
3. Open the green-tabbed latch located on the rear of the server chassis next to PCIe slot 3 to release the rear bracket on the PCIe card [1].
80 Sun Server X4-2 Service Manual • May 2014
Page 95
Note – If the riser does not have a PCIe card installed in slot 3, then lift the latch to
release the PCIe slot 3 filler panel.
4. To release the riser from the motherboard connector, lift the green-tabbed lease lever on the PCIe riser to the open position [2].
5. Slide the plastic PCIe card retainer, which is mounted on the side of the chassis, toward the front of the server to release the card(s) installed in the riser [3].
6. Grasp the riser with both hands and remove it from the server [4].
7. Disconnect the SAS storage drive (HDD) cables from the internal HBA card installed in PCIe slot 4 [5].
Note – If the server has four or fewer storage drives, the SAS cable for HDDs 0-3
will be the only one installed.
8. Place the riser on an antistatic mat.
Related Information
“Install a PCIe Riser Into PCIe Slots 3 and 4” on page 82
Servicing CRUs That Require Server Power-Off 81
Page 96
Install a PCIe Riser Into PCIe Slots 3 and 4
Note – The PCIe riser in slots 3 and 4 is different than the PCIe risers in slots 1 and
2. Do not attempt to install a PCIe riser from slot 1 or 2 into PCIe slots 3 and 4 and vice versa.
1. Retrieve the PCIe riser and any PCIe cards attached to it.
2. Reconnect the SAS cable(s) to the internal host bus adapter card [1].
Be sure to connect the SAS cable for storage drives 0 through 3 (HDDs 0-3) to the connector that is farthest from the riser in which the HBA card is installed; otherwise, the server will incorrectly identify the storage drives when it is powered on.
3. Raise the green-tabbed release lever on the PCIe riser to the open (up) position, and then gently press the riser into the motherboard connector until it is seated [2].
82 Sun Server X4-2 Service Manual • May 2014
Page 97
4. Ensure that the rear bracket on the internal HBA card in PCIe slot 4 is connected to the slot in the server chassis side wall.
If the bracket is not connected, remove the riser and reposition it so that the rear bracket connects to the side wall, and then gently press the riser into the motherboard connector.
5. Slide the plastic PCIe card retainer, which is mounted on the side of the chassis, toward the back of the server to secure the card(s) installed in the riser [3].
6. Press the green-tabbed release lever on the PCIe riser to the closed (down) position [4].
7. To secure the rear bracket on the PCIe card to the server, close the green-tabbed latch on the rear of the server chassis [5].
Note – If the riser does not have a PCIe card installed in slot 3, install a PCIe slot
filler panel and close the green-tabbed latch to secure the PCIe slot filler panel.
8. If there is a PCIe card installed in slot 3 of the riser, reconnect any external or internal cables to the card.
9. Return the server to operation.
a. Install the server top cover.
See “Install the Server Top Cover” on page 145.
b. Close the fan door.
c. Return the server to the normal rack position.
See “Return the Server to the Normal Rack Position” on page 148.
d. Reconnect the power cord to the server power supply and the data cables.
See “Reconnect Data Cables and Power Cords” on page 149.
e. Power on the server.
See “Power On the Server” on page 150.
Verify that the Power/OK status indicator is steady on.
10. Use Oracle ILOM to clear the server PCIe riser fault.
If the PCIe riser fault message in Oracle ILOM is not cleared under Open Problems, you must manually clear the fault in Oracle ILOM. For instructions for manually clearing a PCIe riser fault, see the procedure "Clear Faults for Undetected Replaced or Repaired Hardware Components" in the Oracle Integrated Lights Out Manager (ILOM) 3.1 User’s Guide in the Oracle Integrated Lights Out Manager (ILOM) 3.1 Documentation Library at
http://www.oracle.com/goto/ILOM/docs.
Servicing CRUs That Require Server Power-Off 83
Page 98
Related Information
“Remove a PCIe Riser From PCIe Slots 3 and 4” on page 80
Servicing PCIe Cards (CRU)
These sections describe how to remove and install PCIe cards. Refer to your PCIe card documentation for complete software and cabling information about your card.
In addition to the service procedures included here, you can see animations that show how to remove and install the PCIe cards.
To play the Sun Server X4-2 Service Animations go to:
http://docs.oracle.com/cd/E36975_01/html/E49470/index.html
Caution – These procedures require that you handle components that are sensitive
to electrostatic discharge. This sensitivity can cause the components to fail. To avoid damage, ensure that you follow antistatic practices as described in “Take Antistatic
Measures” on page 42.
Caution – Ensure that all power is removed from the server before removing or
installing PCIe cards. You must disconnect all power cables from the system before performing these procedures.
Note – For a complete list of supported PCIe cards, see the Sun Server X4-2 Product
Notes at http://www.oracle.com/goto/X4-2/docs.
The following sections provide information to assist you when servicing PCIe cards:
“PCIe Slot Characteristics” on page 85
“Remove a PCIe Card From PCIe Slot 1 or 2” on page 86
“Install a PCIe Card in PCIe Slot 1 or 2” on page 87
“Remove a PCIe Card From PCIe Slot 3” on page 87
“Install a PCIe Card in PCIe Slot 3” on page 88
“Remove the Internal HBA Card From PCIe Slot 4” on page 89
84 Sun Server X4-2 Service Manual • May 2014
Page 99
“Install the Internal HBA Card in PCIe Slot 4” on page 90
Related Information
“Servicing PCIe Risers (CRU)” on page 74
PCIe Slot Characteristics
There are three external and one internal PCIe slots available in the Sun Server X4-2. The external slots support optional, standard PCIe cards and are numbered 1, 2, and 3 from left to right when you view the server from the rear. The internal slot, which installs on the riser in PCIe slots 3 and 4, supports a required internal SAS controller HBA card. For a view of the rear panel that shows PCIe slot numbering, see “Server
Back Panel View” on page 5.
Note – PCIe slot 1 is nonfunctional in single-processor systems.
Note – All of the PCIe slots comply with the PCI Express 3.0 specification and can
accommodate 25 watt PCIe cards.
The following table lists the characteristics and requirements of the PCIe slots.
Slot Number
1 Low-profile cards
2 Low-profile cards
3 and 4 Low-profile cards
Supported PCIe Card Types
only
only
only
Supported PCIe Specifications
PCIe 1.0, PCIe 2.0, PCIe 3.0 x16 mechanical/x16 electrical
PCIe 1.0, PCIe 2.0, PCIe 3.0 x16 mechanical/x8 electrical
PCIe 1.0, PCIe 2.0, PCIe 3.0 x8 mechanical/x8 electrical
Slot Connector Width/PCI Express Lanes
Related Information
“Server Back Panel View” on page 5
Servicing CRUs That Require Server Power-Off 85
Page 100
Remove a PCIe Card From PCIe Slot 1 or 2
Note – PCIe slot 1 is nonfunctional in single-processor systems.
1. Remove the PCIe riser from the server.
For instructions, see “Remove a PCIe Riser From PCIe Slot 1 or 2” on page 76.
2. Remove the PCIe card from the PCIe riser.
a. Hold the riser in one hand and use your other hand to carefully pull the PCIe
card connector out of the riser.
b. Disconnect the rear bracket that is attached to the PCIe card from the rear of
the PCIe riser.
3. Place the PCIe card on an antistatic mat.
Caution – Whenever you remove a PCIe card, you should replace it with another
PCIe card or a PCIe filler panel. Installing PCIe filler panels in vacant PCIe slots helps reduce the level of electromagnetic interference (EMI) emitted by the server. For instructions for installing a PCIe filler panel, see “Removing and Installing Server
Filler Panels” on page 143.
86 Sun Server X4-2 Service Manual • May 2014
Loading...