HP Apollo sx40 Maintenance And Service Manual

HPE Apollo sx40 Server
Maintenance and Service Guide
Abstract:
This document provides an overview of the HPE Apollo sx40 server hardware, together with the information and procedures used to maintain and service it.
Part Number: P05957-002
Published: April 2019
Edition: 2
Copyright 2019 Hewlett Packard Enterprise Development LP
Notices
The information contained herein is subject to change without notice. The only warranties for Hewlett Packard Enterprise products and services are set forth in the express warranty statements accompanying such products and services. Nothing herein should be construed as constituting an additional warranty. Hewlett Packard Enterprise shall not be liable for technical or editorial errors or omissions contained herein.
Confidential computer software. Valid license from Hewlett Packard Enterprise required for possession, use, or copying. Consistent with FAR 12.211 and 12.212, Commercial Computer Software, Computer Software Documentation, and Technical Data for Commercial Items are licensed to the U.S. Government under vendor's standard commercial license.
Links to third-party websites take you outside the Hewlett Packard Enterprise website. Hewlett Packard Enterprise has no control over and is not responsible for information outside the Hewlett Packard Enterprise website.
Acknowledgments
Intel® and Intel Xeon® are trademarks of Intel Corporation in the U.S. and other countries.
Microsoft® and Windows® are either registered trademarks or trademarks of Microsoft Corporation in the United States and/or other countries.
Adobe® and Acrobat® are trademarks of Adobe Systems Incorporated.
Java® and Oracle® are registered trademarks of Oracle and/or its affiliates.
UNIX® is a registered trademark of The Open Group.
Record of Revision
Version  Date  Description
-001  February 2019  Original printing
-002  April 2019  Added PCIe accelerator part numbers and updated GPU module replacement procedure

Contents

1. Hardware overview
1.1 Introduction
1.2 Front view
1.3 Rear view
1.4 Top view
1.5 Block diagram
1.5.1 Motherboard
1.6 Processors
1.7 Memory DIMMs
1.8 GPUs
1.9 Power
1.10 Cooling
2. Customer self repair
3. Parts catalog
3.1 Server spare parts
3.2 Options spare parts
4. Part replacement procedures
4.1 Safety precautions
4.2 Teardown video
4.3 Accessing internal components
4.3.1 Removing the front top cover
4.3.2 Removing the center brace
4.3.3 Removing the rear top cover
4.4 Part replacement procedures
4.4.1 Bridge board replacement
4.4.2 Control panel replacement
4.4.3 Disk drive replacement (external drive)
4.4.4 Disk drive replacement (internal drive)
4.4.5 Fan replacement
4.4.6 Fan control board replacement
4.4.7 GPU interface board replacement
4.4.8 GPU module replacement
4.4.9 I/O port module (third PCIe riser) replacement
4.4.10 Memory DIMM replacement
4.4.11 Power supply replacement
4.4.12 Processor replacement
4.4.13 Motherboard replacement
4.4.14 PCIe card replacement
4.4.15 PCIe riser replacement
4.4.16 SATA interface board replacement
5. Troubleshooting
5.1 No power
5.2 No video
5.3 BIOS beep codes
5.4 System boot failure
5.5 Memory errors
5.6 Memory is missing after POST
5.7 System loses its setup configuration
6. Firmware
6.1 About firmware
6.2 Flashing the BIOS
6.2.1 Using the UEFI shell to flash the BIOS
6.2.2 Using the SUMTool to flash the BIOS
6.2.3 Using the Web GUI to flash the BIOS
6.3 Flashing the BMC
6.3.1 Using a Linux command to flash the BMC
6.3.2 Using the SUMTool to flash the BMC
6.3.3 Using the Web GUI to flash the BMC
7. Specifications
7.1 Physical specifications
7.2 Environmental specifications
8. Websites
9. Support and other resources
9.1 Accessing Hewlett Packard Enterprise Support
9.1.1 Information to collect
9.2 Accessing updates
9.3 Customer self repair
9.4 Remote support
9.5 Warranty information
9.6 Regulatory information
9.7 Documentation feedback
Chapter 1

1. Hardware overview

This chapter provides an overview of the hardware used in an HPE Apollo sx40 server.

1.1 Introduction

The HPE Apollo sx40 server is a 1U chassis with two Intel processors that supports four NVIDIA Pascal or Volta SXM2 GPUs.
Figure 1-1 HPE Apollo sx40 server

1.2 Front view

1. HPE Apollo sx40 chassis (1U)
2. Two SFF hot-swap SATA drive bays
3. Internal fans
4. Unit Identification (UID) LED/button
5. Power button
6. Status LED
Figure 1-2 Front view
Figure 1-3 shows the control panel in more detail.
Figure 1-3 Control panel
Table 1-1 Control panel features
Item  Feature  Description
1  Information LED  Provides system status:
   Continuously on and red: An overheat condition has occurred. (This might be caused by cable congestion.)
   Blinking red: Fan failure; check for an inoperative fan.
   Solid blue: Local UID has been activated. Use this function to locate the server in a rackmount environment.
   Blinking blue: Remote UID is on. Use this function to identify the server from a remote location.
2  NIC2 LED  Indicates network activity on the LAN2 port when flashing.
3  NIC1 LED  Indicates network activity on the LAN1 port when flashing.
4  HDD LED  Indicates activity on the hard drive when flashing.
5  Power LED  Indicates power is being supplied to the system power supply units. This LED should normally be illuminated when the system is operating.
6  UID LED  The unit identification (UID) button turns on or off the blue light function of the Information LED and the blue LED on the rear of the chassis. These are used to locate the server in large racks and server banks.
7  Power button  The main power button is used to apply or remove power from the power supply to the server. Turning off system power with this button removes the main power but maintains standby power. To perform many maintenance tasks, you must also unplug the system before servicing.
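For readers who record front-panel observations in a script or a service log, the Information LED states from Table 1-1 can be written down as a small lookup. The following is only an illustrative sketch: the names `INFO_LED_MEANING` and `describe_info_led` are hypothetical and are not part of any HPE tool, and the LED must still be read by eye.

```python
# Hypothetical helper: maps an observed (color, pattern) pair for the
# Information LED to the meaning given in Table 1-1.
INFO_LED_MEANING = {
    ("red", "solid"): "Overheat condition (might be caused by cable congestion)",
    ("red", "blinking"): "Fan failure; check for an inoperative fan",
    ("blue", "solid"): "Local UID has been activated",
    ("blue", "blinking"): "Remote UID is on",
}


def describe_info_led(color: str, pattern: str) -> str:
    """Return the Table 1-1 meaning for an observed LED color and pattern."""
    key = (color.strip().lower(), pattern.strip().lower())
    # Any combination not listed in Table 1-1 is reported as such,
    # rather than guessed at.
    return INFO_LED_MEANING.get(key, "State not listed in Table 1-1")


if __name__ == "__main__":
    print(describe_info_led("Red", "blinking"))
```

A combination that Table 1-1 does not list returns a fixed "not listed" message so that a log entry never implies a diagnosis the table does not support.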

1.3 Rear view

1. Two 2000W Titanium level power supplies
2. Embedded 1 Gb NIC 1
3. Embedded 1 Gb NIC 2
4. Two half height PCIe Gen3 x16 slots
5. Two USB 3.0 connectors
6. Dedicated IPMI LAN port
7. Two full height PCIe Gen3 x16 slots
Figure 1-4 Rear view

1.4 Top view

1. Processor 1
2. DIMMs for processor 1
3. NVIDIA SXM2 GPUs
4. Processor 2
5. DIMMs for processor 2
Figure 1-5 Top view

1.5 Block diagram

Figure 1-6 shows the block diagram.
[Figure labels: two processors (socket IDs 0 and 1) linked by UPI (10.4G/11.2G); DDR4 2133/2666 DIMM channels; PCH connected over DMI3, with SATA (6.0 Gb/s), USB 2.0, and USB 3.0; AST2500 BMC with boot flash; PCIe x16 Gen3 slots 1 through 5 on riser cards; rear I/O riser card; M.2; TPM header; BIOS on SPI.]
Figure 1-6 Block diagram

1.5.1 Motherboard

[Figure 1-7 labels: X11DGQ REV 1.00 motherboard; DIMM slots P1-DIMMA1 through P1-DIMMF1 and P2-DIMMA1 through P2-DIMMF1; PCIe 3.0 x16 slots 1 and 2 (CPU2), 3 and 4 (CPU1), and 5 (CPU1/CPU2, x16/x16); I-SATA0 through I-SATA3; M.2; USB2/3 (3.0); plus the jumpers, connectors, and LEDs described in Figure 1-8.]
The motherboard uses the Intel C621 PCH chipset.
Figure 1-7 shows the motherboard layout and components.
Figure 1-7 Motherboard layout
Figure 1-8 describes the motherboard jumpers, connectors, and LEDs.
Jumper  Description  Default Setting
JBT1  Clear CMOS
JPME2  ME Manufacture Mode  Pins […] (Normal)
JVRM1/JVRM2  I2C Bus for VRM  Pins […] (BMC, Normal)
JWD1  Watch Dog Timer Enable  Pins […] (Reset)

Connector  Description
Battery (BT1)  Onboard CMOS battery
FAN8/FAN9  CPU/system cooling fan headers
FAN_CTRL  Fan control header
FAN_PWR1  Power connector for front fans
JF1  Front control panel header
JL1  Chassis intrusion header
JP2  CPLD programming header
HDD_PWR1/HDD_PWR2  Power connectors for HDD devices
JPWR[…]~[…]  […]V […]-pin GPU power supply connectors
JRK1  RAID key header
JPSU1~2  Power supply input
JSDCARD1  Micro SD card slot
JSTBY1  Standby power header
JTPM1  TPM (Trusted Platform Module) port header
JTEMP1  Front control panel temperature header
JUSB3  USB header
I-SATA0~3  Intel SATA connectors from the Intel PCH SATA controller
I-SGPIO1  Serial Link General Purpose I/O header

LED  Description  State  Status
LEDM1  BMC Heartbeat LED  Green  BMC Normal
LE[…]  Onboard Power LED  Green  Power On
LE[…]  Power Status LED  Green/Red  Power On/Standby
Figure 1-8 Motherboard connectors, jumpers, and LEDs

1.6 Processors

The system supports two Intel Xeon processors from the following processor families:
Intel Xeon Bronze 3100 series
Intel Xeon Silver 4100 series
Intel Xeon Gold 5100 and 6100 series
Intel Xeon Platinum 8100 series
See Chapter 3, “Parts catalog,” for a listing of specific processors that are supported.
Figure 1-9 Intel processors

1.7 Memory DIMMs

The system includes 12 DIMM slots supporting 2666 MHz DDR4 memory. Figure 1-10 shows the DIMM slot numbering.
Figure 1-10 DIMM slot numbering
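As a cross-check when inventorying memory, the 12 slot labels can be generated rather than typed by hand. The sketch below assumes the naming pattern printed on the motherboard layout in Figure 1-7 (P1-DIMMA1 through P1-DIMMF1 and P2-DIMMA1 through P2-DIMMF1); the function name `dimm_slot_labels` is hypothetical, and the population order is not implied by the list order.

```python
# Hypothetical helper: enumerate DIMM slot labels following the pattern
# P<processor>-DIMM<channel letter>1 seen on the motherboard layout.
from string import ascii_uppercase


def dimm_slot_labels(processors: int = 2, channels_per_processor: int = 6) -> list[str]:
    """Return labels such as 'P1-DIMMA1' for every processor and channel."""
    labels = []
    for cpu in range(1, processors + 1):
        for letter in ascii_uppercase[:channels_per_processor]:
            labels.append(f"P{cpu}-DIMM{letter}1")
    return labels


if __name__ == "__main__":
    slots = dimm_slot_labels()
    print(len(slots), slots)
```

With the default arguments the list has 12 entries, matching the slot count stated in this section.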

1.8 GPUs

The system supports up to four Pascal or Volta SXM2 GPUs running over NVLink connections. The SXM2 GPUs attach to the GPU interface board (SXM2 add-on module) in the front section of the chassis.
Figure 1-11 SXM2 GPUs
The following SXM2 GPU accelerators are supported:
• Pascal GPUs:
  NVIDIA Tesla P100 16GB SXM2 GPU accelerator
• Volta GPUs:
  NVIDIA Tesla V100 16GB SXM2 GPU accelerator
  NVIDIA Tesla V100 32GB SXM2 GPU accelerator
Figure 1-12 shows the GPU block diagram.
Figure 1-12 GPU block diagram

1.9 Power

Two hot-swappable 2000-W power supplies supply power to the system. The power supplies have an 80 Plus Titanium level rating.
Note: The HPE Apollo sx40 server is only considered to be N+1 in the 200-240V range; the 100-127V range requires both power supplies to be operating.
Figure 1-13 Power supplies
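The redundancy note above can be expressed as a simple check, which may help when planning rack power. This is a minimal sketch under stated assumptions: the function name `power_redundancy` and its return strings are hypothetical, and the "operating without redundancy" result for a single supply at 200-240V is an interpretation of what N+1 means, not a statement taken from this guide.

```python
# Hypothetical helper: classify the power state from the input voltage
# and the number of power supplies that are operating (0, 1, or 2).
def power_redundancy(input_voltage: float, supplies_operating: int) -> str:
    if supplies_operating not in (0, 1, 2):
        raise ValueError("the system has two power supplies")
    if supplies_operating == 0:
        return "no power"
    if 200 <= input_voltage <= 240:
        # N+1 applies only in this range, per the note.
        if supplies_operating == 2:
            return "N+1 redundant"
        return "operating without redundancy"  # interpretation of N+1
    if 100 <= input_voltage <= 127:
        # Both supplies are required in this range, so there is no spare.
        if supplies_operating == 2:
            return "operating without redundancy"
        return "requirement not met: both supplies must be operating"
    return "voltage outside the ranges given in the note"


if __name__ == "__main__":
    print(power_redundancy(208, 2))
    print(power_redundancy(120, 1))
```

Voltages outside the two documented ranges are reported as such instead of being mapped to the nearest range.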

1.10 Cooling

There are seven fans located at the front of the system. Airflow is from the front to the back of the chassis.
Figure 1-14 Fans
Chapter 2

2. Customer self repair

Hewlett Packard Enterprise products are designed with many Customer Self Repair (CSR) parts to minimize repair time and allow for greater flexibility in performing defective parts replacement. If during the diagnosis period Hewlett Packard Enterprise (or Hewlett Packard Enterprise service providers or service partners) identifies that the repair can be accomplished by the use of a CSR part, Hewlett Packard Enterprise will ship that part directly to you for replacement. There are two categories of CSR parts:
Mandatory—Parts for which customer self repair is mandatory. If you request Hewlett Packard Enterprise to replace these parts, you will be charged for the travel and labor costs of this service.
Optional—Parts for which customer self repair is optional. These parts are also designed for customer self repair. If, however, you require that Hewlett Packard Enterprise replace them for you, there may or may not be additional charges, depending on the type of warranty service designated for your product.
Note: Some Hewlett Packard Enterprise parts are not designed for customer self repair. In order to satisfy the customer warranty, Hewlett Packard Enterprise requires that an authorized service provider replace the part. These parts are identified as "No" in the Illustrated Parts Catalog.
Based on availability and where geography permits, CSR parts will be shipped for next business day delivery. Same day or four-hour delivery may be offered at an additional charge where geography permits. If assistance is required, you can call the Hewlett Packard Enterprise Support Center and a technician will help you over the telephone. Hewlett Packard Enterprise specifies in the materials shipped with a replacement CSR part whether a defective part must be returned to Hewlett Packard Enterprise. In cases where it is required to return the defective part to Hewlett Packard Enterprise, you must ship the defective part back to Hewlett Packard Enterprise within a defined period of time, normally five (5) business days. The defective part must be returned with the associated documentation in the provided shipping material. Failure to return the defective part may result in Hewlett Packard Enterprise billing you for the replacement. With a customer self repair, Hewlett Packard Enterprise will pay all shipping and part return costs and determine the courier/carrier to be used.
For more information about the Hewlett Packard Enterprise CSR program, contact your local service provider. For the North American program, go to the Hewlett Packard Enterprise CSR website.