ERserver
pSeries 655
User’s Guide
SA38-0617-03
ER s e r v e r
pSeries 655
User’s Guide
SA38-0617-03
A
©
Fourth Edition (February 2004)
Before using this information and the product it supports, read the information in “Safety Notices” on page ix,
Appendix A, “Environmental Notices,” on page 123, and Appendix B, “Notices,” on page 125.
reader’s comment form is provided at the back of this publication. If the form has been removed, address
comments to Information Development, Department H6DS-905-6C006, 11501 Burnet Road, Austin, Texas
78758-3493. To send comments electronically, use this commercial internet address: aix6kpub@austin.ibm.com. Any
information that you supply may be used without incurring any obligation to you.
Copyright International Business Machines Corporation, 2002, 2004 All rights reserved.
Note to U.S. Government Users -- Documentation related to restricted rights -- Use, duplication or disclosure is
subject to restrictions set forth is GSA ADP Schedule Contract with IBM Corp.
Contents
Safety Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .ix
Rack Safety Instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .ix
Electrical and Mechanical Safety . . . . . . . . . . . . . . . . . . . . . . . . . . .ix
Laser Safety Information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xi
Laser Compliance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xi
Data Integrity and Verification . . . . . . . . . . . . . . . . . . . . . . . . . . . xiii
About This Book . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xv
ISO 9000 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xv
Highlighting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xv
Accessing Information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xv
References to AIX Operating System . . . . . . . . . . . . . . . . . . . . . . . . .xv
Related Publications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xv
Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xvi
Chapter 1. Reference Materials . . . . . . . . . . . . . . . . . . . . . . . . . . .1
Documentation Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .3
Chapter 2. Introducing the pSeries 655 . . . . . . . . . . . . . . . . . . . . . . . .7
Processor Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .8
Partitioned System Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . .8
Partition Profiles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .9
System Profiles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .9
Types of Partitions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .9
System Attention LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .11
System Attention LED . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .12
Accessing System Log Error Information . . . . . . . . . . . . . . . . . . . . . . .12
PCI Adapters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .12
Resetting the System Attention LED . . . . . . . . . . . . . . . . . . . . . . . .13
Chapter 3. Using the Hardware Management Console for pSeries . . . . . . . . . . . . .15
Hardware Management Console (HMC) Overview and Setup . . . . . . . . . . . . . . . .15
System Power-On Methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . .15
Powering On the Processor Subsystem Using the HMC . . . . . . . . . . . . . . . . . .15
Powering Off the Processor Subsystem Using the HMC . . . . . . . . . . . . . . . . . .15
Graphics Console Support . . . . . . . . . . . . . . . . . . . . . . . . . . . . .16
Understanding the Power-On Self-Test (POST) . . . . . . . . . . . . . . . . . . . . .16
POST Indicators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .16
POST Keys . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .17
Chapter 4. Using the Service Processor . . . . . . . . . . . . . . . . . . . . . . .19
Service Processor Menus . . . . . . . . . . . . . . . . . . . . . . . . . . . . .19
Accessing the Service Processor Menus . . . . . . . . . . . . . . . . . . . . . . .19
Saving and Restoring Service Processor Settings . . . . . . . . . . . . . . . . . . .19
Menu Inactivity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .20
General User Menu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .20
Privileged User Menus . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .22
Main Menu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .22
Service Processor Setup Menu . . . . . . . . . . . . . . . . . . . . . . . . . .23
Passwords . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .24
System Power Control Menu . . . . . . . . . . . . . . . . . . . . . . . . . . .27
System Information Menu . . . . . . . . . . . . . . . . . . . . . . . . . . . .30
iii
iv
Language Selection Menu . . . . . . . . . . . . . . . . . . . . . . . . . . . .38
Call-In/Call-Out Setup Menu . . . . . . . . . . . . . . . . . . . . . . . . . . .39
Service Processor Parameters in Service Mode (Full System Partition) . . . . . . . . . . . . .39
Service Processor Reboot/Restart Recovery . . . . . . . . . . . . . . . . . . . . . .39
Boot (IPL) Speed . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .39
Failure During Boot Process . . . . . . . . . . . . . . . . . . . . . . . . . . .39
Failure During Normal System Operation . . . . . . . . . . . . . . . . . . . . . . .39
Service Processor Reboot/Restart Policy Controls . . . . . . . . . . . . . . . . . . .40
Updating System Firmware and Microcode . . . . . . . . . . . . . . . . . . . . . . .42
General Information on Processor Subsystem Firmware Updates . . . . . . . . . . . . . .42
Determining the Level of Firmware on the Processor Subsystem . . . . . . . . . . . . . .42
Processor Subsystem Firmware Update Using a Locally Available Image . . . . . . . . . . .43
Updating System Firmware from the AIX Service Aids . . . . . . . . . . . . . . . . . .44
Updating System Firmware from the AIX Command Line . . . . . . . . . . . . . . . . .44
Frame (Power Subsystem) Firmware Update . . . . . . . . . . . . . . . . . . . . .44
Integrated SCSI Controller Microcode Update . . . . . . . . . . . . . . . . . . . . .45
Integrated Ethernet Microcode Update . . . . . . . . . . . . . . . . . . . . . . . .45
Installing Corrective Service on the Frame . . . . . . . . . . . . . . . . . . . . . .45
Reconfiguration Procedure for SNI Adapters . . . . . . . . . . . . . . . . . . . . .46
Configuring and Deconfiguring Processors or Memory . . . . . . . . . . . . . . . . . . .46
Run-Time CPU Deconfiguration (CPU Gard) . . . . . . . . . . . . . . . . . . . . .47
Service Processor System Monitoring - Surveillance . . . . . . . . . . . . . . . . . . .47
System Firmware Surveillance . . . . . . . . . . . . . . . . . . . . . . . . . . .47
Operating System Surveillance . . . . . . . . . . . . . . . . . . . . . . . . . .47
Service Processor Error Logs . . . . . . . . . . . . . . . . . . . . . . . . . . . .48
LCD Progress Indicator Log . . . . . . . . . . . . . . . . . . . . . . . . . . . .49
Resetting the Service Processor . . . . . . . . . . . . . . . . . . . . . . . . . . .50
Service Processor Operational Phases . . . . . . . . . . . . . . . . . . . . . . . .50
Pre-Standby Phase . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .50
Standby Phase . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .50
Bring-Up Phase . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .51
Runtime Phase . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .51
Clearing L3 Gard Records . . . . . . . . . . . . . . . . . . . . . . . . . . . .52
Chapter 5. Using System Management Services . . . . . . . . . . . . . . . . . . . .53
Select Language . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .54
Change Password Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . .54
Set Privileged-Access Password . . . . . . . . . . . . . . . . . . . . . . . . . .54
View Error Log . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .55
Setup Remote IPL (Initial Program Load) . . . . . . . . . . . . . . . . . . . . . . . .55
Change SCSI Settings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .58
Select Console . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .58
Select Boot Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .58
Select Boot Devices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .61
Display Current Settings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .64
Restore Default Settings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .64
Multiboot Startup . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .64
Exiting System Management Services . . . . . . . . . . . . . . . . . . . . . . . . .64
Chapter 6. Using the Online and Standalone Diagnostics . . . . . . . . . . . . . . . .65
Online and Standalone Diagnostics Operating Considerations . . . . . . . . . . . . . . . .65
Identifying the Terminal Type to the Diagnostics . . . . . . . . . . . . . . . . . . . .65
Undefined Terminal Types . . . . . . . . . . . . . . . . . . . . . . . . . . . .66
Resetting the Terminal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .66
Running Online Diagnostics . . . . . . . . . . . . . . . . . . . . . . . . . . . .66
Online Diagnostics Modes of Operation . . . . . . . . . . . . . . . . . . . . . . . .66
Eserver pSeries 655 User’s Guide
Service Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .66
Concurrent Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .67
Maintenance Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .67
Running Online Diagnostics in Service Mode . . . . . . . . . . . . . . . . . . . . . .68
Running the Online Diagnostics in Concurrent Mode . . . . . . . . . . . . . . . . . . .68
Running the Online Diagnostics in Maintenance Mode . . . . . . . . . . . . . . . . . . .68
Standalone Diagnostic Operation . . . . . . . . . . . . . . . . . . . . . . . . . . .69
Partitioned System Considerations for Standalone Diagnostics . . . . . . . . . . . . . . .69
Running Standalone Diagnostics from a Network Installation Management (NIM) Server . . . . .69
NIM Server Configuration . . . . . . . . . . . . . . . . . . . . . . . . . . . .69
Client Configuration and Booting Standalone Diagnostics from the NIM Server . . . . . . . . .70
Chapter 7. Introducing Tasks and Service Aids . . . . . . . . . . . . . . . . . . . .73
Tasks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .73
Add Resource to Resource List . . . . . . . . . . . . . . . . . . . . . . . . . . .75
AIX Shell Prompt . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .75
Analyze Adapter Internal Log . . . . . . . . . . . . . . . . . . . . . . . . . . . .75
Backup and Restore Media . . . . . . . . . . . . . . . . . . . . . . . . . . . . .75
Certify Media . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .76
Change Hardware Vital Product Data . . . . . . . . . . . . . . . . . . . . . . . . .77
Configure Dials and LPF Keys . . . . . . . . . . . . . . . . . . . . . . . . . . . .77
Configure ISA Adapter . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .78
Configure Reboot Policy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .78
Configure Remote Maintenance Policy . . . . . . . . . . . . . . . . . . . . . . . . .79
Configure Ring Indicate Power-On Policy . . . . . . . . . . . . . . . . . . . . . . . .80
Configure Scan Dump Policy . . . . . . . . . . . . . . . . . . . . . . . . . . . .81
Configure Surveillance Policy . . . . . . . . . . . . . . . . . . . . . . . . . . . .81
Create Customized Configuration Diskette . . . . . . . . . . . . . . . . . . . . . . .82
Delete Resource from Resource List . . . . . . . . . . . . . . . . . . . . . . . . .82
Disk Maintenance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .82
Disk to Disk Copy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .82
Display/Alter Sector . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .83
Display Configuration and Resource List . . . . . . . . . . . . . . . . . . . . . . . .83
Display Firmware Device Node Information . . . . . . . . . . . . . . . . . . . . . . .83
Display Hardware Error Report . . . . . . . . . . . . . . . . . . . . . . . . . . .83
Display Hardware Errors for Any Resource . . . . . . . . . . . . . . . . . . . . . .83
Display Hardware Errors for PCI-X SCSI RAID Adapters . . . . . . . . . . . . . . . . .84
Display Hardware Errors for PCI-X SCSI Adapters . . . . . . . . . . . . . . . . . . .84
Display Hardware Vital Product Data . . . . . . . . . . . . . . . . . . . . . . . . .84
Display Machine Check Error Log . . . . . . . . . . . . . . . . . . . . . . . . . .84
Display Microcode Level . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .84
Display MultiPath I/O (MPIO) Device Configuration . . . . . . . . . . . . . . . . . . . .84
Display or Change Bootlist . . . . . . . . . . . . . . . . . . . . . . . . . . . . .85
Display or Change Diagnostic Run-Time Options . . . . . . . . . . . . . . . . . . . . .85
Display Previous Diagnostic Results . . . . . . . . . . . . . . . . . . . . . . . . .86
Display Resource Attributes . . . . . . . . . . . . . . . . . . . . . . . . . . . . .87
Display Service Hints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .87
Display Software Product Data . . . . . . . . . . . . . . . . . . . . . . . . . . .87
Display System Environmental Sensors . . . . . . . . . . . . . . . . . . . . . . . .87
Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .88
Display Test Patterns . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .88
Display USB Devices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .88
Download Microcode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .89
Download Microcode to PCI SCSI RAID Adapter . . . . . . . . . . . . . . . . . . . .89
Download Microcode to a PCI-X Dual Channel Adapter . . . . . . . . . . . . . . . . .89
Download Microcode to Disk Drive Attached to a PCI SCSI RAID Adapter . . . . . . . . . .89
Contents
v
vi
Download Microcode to a Fiber Channel Adapter . . . . . . . . . . . . . . . . . . . .90
Download Microcode to DVD-RAM Attached to a PCI SCSI Adapter . . . . . . . . . . . . .90
Download Microcode to Disk Attached to PCI SCSI Adapter . . . . . . . . . . . . . . . .90
Download Microcode to Other Devices . . . . . . . . . . . . . . . . . . . . . . .90
Fault Indicators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .91
Fibre Channel RAID Service Aids . . . . . . . . . . . . . . . . . . . . . . . . . .91
Flash SK-NET FDDI Firmware . . . . . . . . . . . . . . . . . . . . . . . . . . . .92
Format Media . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .92
Format and/or Erase Hard File Attached to a Non-RAID and PCI-X SCSI Adapter . . . . . . . .92
Hardfile Attached to PCI SCSI RAID Adapter . . . . . . . . . . . . . . . . . . . . .93
Optical Media . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .94
Diskette Format . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .94
Gather System Information . . . . . . . . . . . . . . . . . . . . . . . . . . . . .94
Generic Microcode Download . . . . . . . . . . . . . . . . . . . . . . . . . . . .94
Hot-Plug Task . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .94
PCI Hot-Plug Manager . . . . . . . . . . . . . . . . . . . . . . . . . . . . .95
SCSI Hot Swap Manager . . . . . . . . . . . . . . . . . . . . . . . . . . . .96
SCSI and SCSI RAID Hot-Plug Manager . . . . . . . . . . . . . . . . . . . . . . .97
RAID Hot-Plug Devices . . . . . . . . . . . . . . . . . . . . . . . . . . . . .98
Identify Indicators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .99
Identify and Remove Resource Task . . . . . . . . . . . . . . . . . . . . . . . . .99
Identify and System Attention Indicators . . . . . . . . . . . . . . . . . . . . . . . . 100
Local Area Network Analyzer . . . . . . . . . . . . . . . . . . . . . . . . . . . . 100
Log Repair Action . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101
PCI RAID Physical Disk Identify . . . . . . . . . . . . . . . . . . . . . . . . . . . 101
PCI SCSI Disk Array Manager . . . . . . . . . . . . . . . . . . . . . . . . . . . 101
Process Supplemental Media . . . . . . . . . . . . . . . . . . . . . . . . . . . 101
RAID Array Manager . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101
PCI SCSI Disk Array Manager . . . . . . . . . . . . . . . . . . . . . . . . . . 102
PCI-X SCSI Disk Array Manager . . . . . . . . . . . . . . . . . . . . . . . . . 102
Run Diagnostics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102
Run Error Log Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102
Run Exercisers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103
Exerciser Commands (CMD) . . . . . . . . . . . . . . . . . . . . . . . . . . . 103
Abbreviations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104
Memory Exerciser . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104
Tape Exerciser . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104
Diskette Exerciser . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104
CD-ROM Exerciser . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104
Floating Point Exerciser . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105
Save or Restore Hardware Management Policies . . . . . . . . . . . . . . . . . . . . 105
SCSI Bus Analyzer . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105
SCSI RAID Physical Disk Status and Vital Product Data . . . . . . . . . . . . . . . . . . 106
SCSD Tape Drive Service Aid . . . . . . . . . . . . . . . . . . . . . . . . . . . 106
Spare Sector Availability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107
SSA Service Aid . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107
System Fault Indicator . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107
System Identify Indicator . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107
Update Disk-Based Diagnostics . . . . . . . . . . . . . . . . . . . . . . . . . . . 107
Update System or Service Processor Flash . . . . . . . . . . . . . . . . . . . . . . 108
7135 RAIDiant Array Service Aid . . . . . . . . . . . . . . . . . . . . . . . . . . 109
Command Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109
7318 Serial Communications Network Server Service Aid . . . . . . . . . . . . . . . . .110
Chapter 8. Verifying Hardware Operation . . . . . . . . . . . . . . . . . . . . . . 111
Considerations Before Running This Procedure . . . . . . . . . . . . . . . . . . . . . 111
Eserver pSeries 655 User’s Guide
. . . . . . . . . . . . . . . . . . . . . . . . . .
Loading the Online Diagnostics in Service Mode . . . . . . . . . . . . . . . . . . . . . 111
Running Standalone Diagnostics from a Network Installation Management (NIM) Server . . . . . .112
NIM Server Configuration . . . . . . . . . . . . . . . . . . . . . . . . . . . .112
Client Configuration and Booting Standalone Diagnostics from the NIM Server . . . . . . . .112
Default Boot List and Service Mode Boot List . . . . . . . . . . . . . . . . . . . . . .114
Chapter 9. Hardware Problem Determination . . . . . . . . . . . . . . . . . . . . .115
Problem Determination Using the Standalone or Online Diagnostics . . . . . . . . . . . . .115
Problem Determination When Unable to Load Diagnostics . . . . . . . . . . . . . . . . . 120
Appendix A. Environmental Notices . . . . . . . . . . . . . . . . . . . . . . . . 123
Product Recycling and Disposal . . . . . . . . . . . . . . . . . . . . . . . . . . . 123
Environmental Design . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 123
Acoustical Noise Emissions
(1, 2)
. 124
Appendix B. Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 125
Appendix C. Removing and Replacing PCI Adapters . . . . . . . . . . . . . . . . . . 127
Removing a PCI Adapter Cassette . . . . . . . . . . . . . . . . . . . . . . . . . . 128
Replacing a PCI Adapter Cassette . . . . . . . . . . . . . . . . . . . . . . . . . 131
Removing a Non-Hot-Pluggable PCI Adapter . . . . . . . . . . . . . . . . . . . . . 133
Replacing a Non-Hot-Pluggable PCI Adapter . . . . . . . . . . . . . . . . . . . . . 133
Hot-Pluggable PCI Adapter . . . . . . . . . . . . . . . . . . . . . . . . . . . 134
PCI Hot-Plug Manager Access . . . . . . . . . . . . . . . . . . . . . . . . . . 137
PCI Adapter or Blank Filler Removal from a Snap-Assembly-Type Cassette . . . . . . . . . 140
Replacing an Adapter in a PCI Adapter Cassette . . . . . . . . . . . . . . . . . . . . 152
Short Adapter or Blank Filler Installation . . . . . . . . . . . . . . . . . . . . . . . . 156
Long Adapter Installation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 173
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 191
Contents
vii
viii
Eserver pSeries 655 User’s Guide
A
v ix A
v x v x v xi v
v
v Do
v Do
v
v
v An
Safety Notices
danger notice indicates the presence of a hazard that has the potential of causing death or serious
personal injury. Danger notices appear on the following pages:
caution notice indicates the presence of a hazard that has the potential of causing moderate or minor
personal injury. Caution notices appear on the following pages:
140
152
For a translation of the safety notices contained in this book, see the System Unit Safety Information,
order number SA23-2652.
Rack Safety Instructions
not install this unit in a rack where the internal rack ambient temperatures will exceed 35 degrees C.
not install this unit in a rack where the air flow is compromised. Any side, front or back of the unit
used for air flow through the unit must not be in direct contact with the rack.
Care should be taken to ensure that a hazardous condition is not created due to uneven mechanical
loading when installing this unit in a rack. If the rack has a stabilizer it must be firmly attached before
installing or removing this unit.
Consideration should be given to the connection of the equipment to the supply circuit so that
overloading of circuits does not compromise the supply wiring or overcurrent protection. To provide the
correct power connection to the rack, refer to the rating labels located on the equipment in the rack to
determine the total power requirement for the supply circuit.
electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of the
system or the devices that attach to the system. It is the responsibility of the customer to ensure that
the outlet is correctly wired and grounded to prevent an electrical shock.
Electrical and Mechanical Safety
Observe the following safety instructions any time you are connecting or disconnecting devices attached to
the system.
DANGER
electrical outlet that is not correctly wired could place hazardous voltage on metal parts of
An
the system or the devices that attach to the system. It is the responsibility of the customer to
ensure that the outlet is correctly wired and grounded to prevent an electrical shock.
Use one hand, when possible, to connect or disconnect signal cables to prevent a possible
shock from touching two surfaces with different electrical potentials.
During an electrical storm, do not connect cables for display stations, printers, telephones, or
station protectors for communications lines.
D06
ix
x
CAUTION:
This product is equipped with a four-wire (three-phase and ground) power cable for the user’s
safety. Use this power cable with a properly grounded electrical outlet to avoid electrical shock.
C27
DANGER
To prevent electrical shock hazard, disconnect all power cables from the electrical outlet before
relocating the system.
D01
CAUTION:
Metal edges might be sharp.
C38
Eserver pSeries 655 User’s Guide
1
of an
Laser Safety Information
CAUTION:
This product may contain a CD-ROM, DVD-ROM, or laser module on a PCI card, which are class 1
laser products.
C30
Laser Compliance
All lasers are certified in the U.S. to conform to the requirements of DHHS 21 CFR Subchapter J for class
laser products. Outside the U.S., they are certified to be in compliance with the IEC 825 (first edition
1984) as a class 1 laser product. Consult the label on each part for laser certification numbers and
approval information.
CAUTION:
All IBM laser modules are designed so that there is never any human access to laser radiation
above a class 1 level during normal operation, user maintenance, or prescribed service conditions.
Data processing environments can contain equipment transmitting on system links with laser
modules that operate at greater than class 1 power levels. For this reason, never look into the end
optical fiber cable or open receptacle. Only trained service personnel should perform the
inspection or repair of optical fiber cable assemblies and receptacles.
C26
C25,
Preface
xi
xii
Eserver pSeries 655 User’s Guide
or
Data Integrity and Verification
IBM computer systems contain mechanisms designed to reduce the possibility of undetected data corruption
loss. This risk, however, cannot be eliminated. Users who experience unplanned outages, system failures,
power fluctuations or outages, or component failures must verify the accuracy of operations performed and
data saved or transmitted by the system at or near the time of the outage or failure. In addition, users must
establish procedures to ensure that there is independent data verification before relying on such data in
sensitive or critical operations. Users should periodically check the IBM support websites for updated
information and fixes applicable to the system and related software.
xiii
xiv
Eserver pSeries 655 User’s Guide
v To
v To
v
v
v
About This Book
This book provides information on how to use the server, use diagnostics, use service aids, and verify
server operation. This book also provides information to help you solve some of the simpler problems that
might occur.
ISO 9000
ISO 9000 registered quality systems were used in the development and manufacturing of this product.
Highlighting
The following highlighting conventions are used in this book:
Bold
Identifies commands, subroutines, keywords, files, structures, directories, and other items
whose names are predefined by the system. Also identifies graphical objects such as buttons,
labels, and icons that the user selects.
Italics
Monospace
Identifies parameters whose actual names or values are to be supplied by the user.
Identifies examples of specific data values, examples of text similar to what you might see
displayed, examples of portions of program code similar to what you might write as a
programmer, messages from the system, or information you should actually type.
Accessing Information
Documentation for the IBM Eserver pSeries is available online. Visit the IBM Eserver pSeries
Information Center at http://publib16.boulder.ibm.com/pseries/en_US/infocenter/base.
access the pSeries publications, click Hardware documentation.
view information about the accessibility features of Eserver pSeries hardware and the AIX operating
system, click AIX and pSeries accessibility.
References to AIX Operating System
This document may contain references to the AIX operating system. If you are using another operating
system, consult the appropriate documentation for that operating system.
This document may describe hardware features and functions. While the hardware supports them, the
realization of these features and functions depends upon support from the operating system. AIX provides
this support. If you are using another operating system, consult the appropriate documentation for that
operating system regarding support for those features and functions.
Related Publications
The following publications provide related information:
The System Unit Safety Information, order number SA23-2652, contains translations of safety
information used throughout this book.
The Site and Hardware Planning Information, order number SA38-0508, contains information to help
you plan your installation.
The Eserver pSeries 655 Service Guide, order number SA38-0618, contains reference information,
maintenance analysis procedures (MAPs), error codes, removal and replacement procedures, and a
parts catalog.
xv
to
v
v
v
v
v
v
v
v
v
v
v
v
v
The Eserver pSeries 655 Installation Guide, order number SA38-0616, contains information about how
set up and cable the server, install additional processors and subsystems, and verify server
operation.
The IBM Hardware Management Console for pSeries Installation and Operations Guide, order number
SA38-0590, provides information to system administrators about how to install and use a Hardware
Management Console (HMC) to manage a system.
The IBM Hardware Management Console for pSeries Maintenance Guide, order number SA38-0603,
provides information about how to service a Hardware Management Console (HMC).
The RS/6000 and Eserver Diagnostic Information for Multiple Bus Systems, order number SA38-0509,
contains diagnostic information, service request numbers (SRNs), and failing function codes (FFCs).
The RS/6000 and Eserver Adapters, Devices and Cable Information for Multiple Bus Systems, order
number SA38-0516, contains information about adapters, devices, and cables for your server. This
manual is intended to supplement the service information found in the RS/6000 and Eserver
Diagnostic Information for Multiple Bus Systems .
The RS/6000 and Eserver PCI Adapter Placement Reference, order number SA38-0538, contains
information regarding slot restrictions for adapters that can be used in this system.
The AIX Installation Guide and Reference, order number SC23-4389, describes how to install the
operating system, to use a network server, and to install the operating system and run diagnostics on
systems connected to a network.
The AIX Installation in a Partitioned Environment, order number SC23-4384 , provides information about
installing AIX in a partitioned environment.
Trademarks
The following terms are trademarks of International Business Machines Corporation in the United States,
other countries, or both:
AIX
Eserver
IBM
pSeries
RS/6000
company, product, and service names may be trademarks or service marks of others.
Other
xvi
Eserver pSeries 655 User’s Guide
v
v
Chapter 1. Reference Materials
Note: This document may contain references to the AIX operating system. If you are using another
operating system, consult the appropriate documentation for that operating system.
This document may describe hardware features and functions. While the hardware supports them,
the implementation of these features and functions depends on support from the operating system.
AIX provides this support. If you are using another operating system, consult the appropriate
documentation for that operating system regarding support for those features and functions.
This chapter helps you get started with installing and configuring the Eserver pSeries environment. The
following information is included in the chapter:
Eserver pSeries Roadmap
Documentation Overview - Brief description of the printed and softcopy documentation shipped including
targeted audience
Eserver pSeries Roadmap helps you locate marketing, service, and customer task information. The
The
roadmap guides you through the tasks and the publications that document those tasks.
1
2
Marketing and Customer Tasks
Begin
Managed
by HMC
?
Yes
No
Site and Hardware Planning Information
Planning for Partitioned-System Operations
Hardware Management Console
Installation and Operations Guide
Planning
Planning
Hardware
Installation
Planning for
Partitioned-System Operations
AIX Installation in a
Partitioned Environment
AIX Installation Guide
and Reference
Operating System Installation:
Getting Started
Installing/Configuring
the Operating System
Site and Hardware Planning Information
Installer Tasks
Hardware Installation Guide
Hardware Management Console
Installation and Operations Guide
Customer Tasks
Configuring
Partitions
Installing/Configuring
the Operating System
Installing/Configuring
Applications
AIX Installation in a Partitioned Environment
Application Documentation
AIX Documentation Library
Yes
Hardware
Installation
Is System
Using
Partitions
?
No
Configuring Full
System Partition
Using the System
The publications listed in this section are available online. To access the online books, visit our IBM
Eserver pSeries Information Center at http://publib16.boulder.ibm.com/pseries/en_US/infocenter/base.
Eserver pSeries 655 User’s Guide
Hardware User's Guide
AIX Documentation Library
Application Documentation
Documentation Overview
This section provides descriptions and target audience information for the Eserver pSeries and AIX 5L
documentation libraries. Some of the documentation may only be available in softcopy form. Based on the
documentation content, the books are divided into the following categories: Planning, Installing and
Configuring , and Using the System .
Table 1. Planning
Documentation Title
Site and Hardware Planning
Information
Planning for Partitioned-System
Operations
Hardware Management
Console for pSeries Installation
and Operations Guide
Description
Contains information to help plan for site
preparation tasks, such as floor-planning,
electrical needs, air conditioning, and other
site-planning considerations.
Describes planning considerations for
partitioned systems, including information on
dynamic partitioning and Capacity Upgrade on
Demand.
Provides information on how to install,
configure, and use a Hardware Management
Console (HMC). Logical partition (LPAR) tasks,
such as configuring and managing partitions on
multiple host servers, are included.
Audience
Marketing, system
administrators
System
administrators
System
administrators
Type
softcopy
printed and
softcopy
printed and
softcopy
Chapter 1. Reference Materials
3
4
Table 2. Installing and Configuring
Documentation Title
Hardware Installation Guide
Planning for Partitioned-System
Operations
Hardware Management
Console for pSeries Installation
and Operations Guide
AIX Installation in a Partitioned
Environment
AIX Operating System
Installation: Getting Started
AIX 5L Installation Guide and
Reference
PCI Adapter Placement
Reference
AIX 5L Release Notes
AIX 5L Documentation CD
Description
Provides information on how to install system
hardware, cable the system, and verify
operations.
Describes planning considerations for
partitioned systems, including information on
dynamic partitioning and Capacity Upgrade on
Demand.
Provides information on how to install,
configure, and use a Hardware Management
Console (HMC). Logical partition (LPAR) tasks,
such as configuring and managing partitions on
multiple host servers, are included.
Provides information on how to install the AIX
operating system in an LPAR environment.
Provides information on how to install and
configure the AIX operating system on a
standalone system using a CD-ROM device.
Provides information on installing the AIX 5L
operating system on standalone systems, as
well as on client systems using the Network
Installation Management (NIM) interface.
Outlines system-specific PCI adapter slot
placement and adapter support configurations.
Provides late-breaking information for a
specific AIX release.
AIX documentation library (system
management guides, user guides, application
programmer guides, commands and files
references, AIX man pages, and so on).
Audience
Type
System installer printed and
softcopy
System
administrators
System
administrators
System
administrators
System
administrators
System
administrators
System
printed and
softcopy
printed and
softcopy
printed and
softcopy
printed and
softcopy
printed and
softcopy
softcopy
administrators,
service personnel
System
administrators
System
printed and
softcopy
softcopy
administrators
Eserver pSeries 655 User’s Guide
Table 3. Using the System
Documentation Title
Hardware Management
Console for pSeries Installation
and Operations Guide
Description
Provides information on how to install, configure,
and use a Hardware Management Console
(HMC). Logical partition (LPAR) tasks, such as
configuring and managing partitions on multiple
host servers, are included.
Hardware User’s Guide
Provides using, problem determination, and
service processor information.
Diagnostic Information for
Multiple Bus Systems
Combines operating instructions for hardware
diagnostic programs with common MAPs and
SRNs (Service Request Numbers).
PCI Adapter Placement
Reference
Hardware Management
Console for pSeries
Maintenance Guide
Adapters, Devices, and Cable
Information for Multiple Bus
Systems
Outlines system-specific PCI adapter slot
placement and adapter support configurations.
Contains MAPs, removal and replacement, error
code, and parts information to help diagnose and
repair the system.
Provides information about adapters, devices,
and cables that are attached to or used within the
system.
System Unit Safety Information Contains the English version of safety notices, as
well as translations of those safety notices into
other languages.
AIX 5L Documentation CD
AIX documentation library (system management
guides, user guides, application programmer
guides, commands and files references, AIX man
pages, and so on).
Audience
System
administrators
System
administrators
Type
printed and
softcopy
printed and
softcopy
Service personnel printed and
softcopy
System
softcopy
administrators,
service personnel
Service personnel printed and
softcopy
System
administrators
System
administrators,
printed and
softcopy
printed and
softcopy
service personnel
System
softcopy
administrators
Chapter 1. Reference Materials
5
6
Eserver pSeries 655 User’s Guide
v
–
–
–
–
–
-
-
-
-
–
–
–
–
–
32
v
v
up to 16
Chapter 2. Introducing the pSeries 655
The pSeries 655 is a shared multiprocessor server.
The processor subsystem can be configured (or partitioned) as multiple separate systems. This
configuration is known as a logically partitioned system. The processor subsystem is described in
“Processor Subsystem” on page 8.
Other options for a rack configuration include the 7040 Model 61D I/O Subsystem and 7040 Model W42
Integrated Battery Feature (IBF) for the power subsystem.
The following components comprise configurations of the pSeries 655 system:
Bulk Power Assembly (BPA). The BPA is the main power control unit for the pSeries 655 system. This
redundant bulk power assembly distributes power at 350 V to each drawer where conversion is made to
the required chip level. The BPA contains the following components:
Bulk Power Regulator (BPR)
Bulk Power Controller (BPC)
Bulk Power Distributor (BPD)
Bulk Power Jumper (BPJ) - optional
Processor Subsystem . The processor subsystem is a 4 EIA-unit-high drawer. The processor
v
subsystem contains:
One multichip module (MCM) processor operating at one of the following speeds:
1.1 GHz (8-way)
1.3 GHz (4-way)
1.5 GHz (8-way)
1.7 GHz (4-way
Four L3 cache modules
–
Two externalized RIO or RIO-2 ports
One pair of internal RIO or RIO-2 ports servicing three PCI-X slots, two 10/100 Ethernet adapters, a
dual SCSI adapter, and a service processor card
Fan assembly
Two 3.5-inch SCSI DASD bays
Four memory cards with a minimum of 4 GB and a maximum of 64 GB of memory
The processor subsystem contains the Distributed Converter Assembly (DCA) used in the conversion of
350 V bulk power to the supply voltages required by the various internal components.
The minimum memory required to operate this system is 4 GB, and the maximum amount of memory is
GB.
Integrated Battery Feature (IBF). The IBF is a 2 EIA-unit-high drawer that can be added to your
system. The IBF is optional and provides backup electric power in case of a power outage. You can
install up to six IBFs in the rack configuration. The total number of IBFs that can be installed depends
upon the number of processor subsystems and I/O subsystems installed in the rack.
I/O Subsystem . Each I/O subsystem is a 4 EIA-unit-high subsystem containing up to two I/O boards,
disk drives, four DASD backplanes, a midplane card, four cooling fans, and two power supplies
(which are independent of the bulk power assembly). A rack can have up to five I/O subsystems with
each drawer having 20 PCI card slots, and more than 500 GB of storage.
Note: If your system configuration contains IBFs, the rack drawer space may be limited for I/O
subsystems.
7
1
2
3
8
v
Hardware Management Console (HMC). The HMC consists of a display, independent processor,
keyboard, and mouse. One HMC is standard for all systems. An additional HMC is optional. Two HMCs
can attach to one processor subsystem, or two HMCs can jointly manage up to 16 processor
subsystems in up to four racks with the use of 8-port asynchronous adapters and 128-port
asynchronous adapters.
For more information about the use of logical partitioned systems, see the “Partitioned System
Overview.”
Processor Subsystem
The pSeries 655 Model 651 (processor subsystem) is a processor node installed in a frame-mounted
cage. The equipment rack holds a maximum of 16 processor subsystems.
Rear View
2
3
1
Front View
Frame Cage (shown with two processor subsystems and front cover removed)
First pSeries 655 Processor Subsystem
Second pSeries 655 Processor Subsystem
The Hardware Management Console for pSeries (HMC) is used to manage the resources in the system.
The system can be configured as a full system partition, which means that all resources of the system are
used as a single system.
The system can also be configured into multiple (or logical) partitioned systems. With a logically partitioned
system, system resources can be divided into a number of systems, each running in its own partition.
Numerous configurations of pSeries 655 systems can be managed from one HMC. A second HMC can be
used for redundancy.
Partitioned System Overview
Partitioning enables users to configure a single computer into several independent systems. Each of these
systems, called logical partitions, is capable of running applications in its own independent environment.
This independent environment contains its own operating system, its own set of system processors, its
own set of system memory, and its own I/O adapters.
Eserver pSeries 655 User’s Guide
A
A
A
The HMC allows you to perform many hardware management tasks for your managed system, including
configuring logical partitions. You can choose to operate your managed system as a single server (called
full system partition ), or you can choose to run multiple partitions.
Partition Profiles
profile defines a configuration setup for a managed system or partition. The HMC allows you to create
multiple profiles for each managed system or partition. You can then use the profiles you created to start a
managed system or partition in a particular configuration.
partition does not actually own any resources until it is activated; resource specifications are stored
within partition profiles. The same partition can operate using different resources at different times,
depending on the profile you activate.
When you activate a partition, you enable the system to create a partition using the set of resources in a
profile created for that partition. For example, a logical partition profile might indicate to the managed
system that its partition requires three processors, 2 gigabytes of memory, and I/O slots 6, 11, and 12
when activated.
You can have more than one profile for a partition. However, you can only activate a partition with one
profile at a time.
When you create a partition profile, the HMC shows you all the resources available on your system. The
HMC does not, however, verify if another partition profile is currently using a portion of these resources.
For example, the HMC might show eight processors on your system, but does not notify you that other
partitions are using six of them. You can create two partition profiles, each using a majority of system
resources. If you attempt to activate both of these partitions at the same time, the second partition in the
activation list fails.
System Profiles
Using the HMC, you can create and activate often-used collections of predefined partition profiles. A
collection of predefined partition profiles is called a system profile. The system profile is an ordered list of
partitions and the profile that is to be activated for each partition. The first profile in the list is activated
first, followed by the second profile in the list, followed by the third, and so on.
The system profile helps you change the managed systems from one complete set of partition
configurations to another. For example, a customer may want to switch from using 8 partitions to using
only four, every day. To do this, the system administrator deactivates the 8 partitions and activates a
different system profile, one specifying four partitions.
When you create a group of affinity partitions, the HMC automatically creates a system profile that
includes all of the affinity partitions that you created.
Types of Partitions
The HMC allows you to use two types of partitions: logical partitions and the full system partition.
Logical Partitions
Logical partitions are user-defined system resource divisions. Users determine the number of processors,
memory, and I/O that a logical partition can have when active.
Full System Partition
special partition called the full system partition assigns all of your managed system’s resources to one
large partition. The full system partition is similar to the traditional, non-partitioned method of operating a
system. Because all resources are assigned to this partition, no other partitions can be started when the
full system partition is running. Likewise, the full system partition cannot be started while other partitions
are running.
Chapter 2. Introducing the pSeries 655
9
10
The HMC allows you to easily switch from the full system partition to logical partitions. The actual setup of
the operating system in a partition may require some careful planning to ensure no conflicts exist between
the two environments.
For more detail about partitions, see the IBM Hardware Management Console for pSeries Installation and
Operations Guide .
Eserver pSeries 655 User’s Guide
On
1
11
21
2
12
22
3
13
23 SP
4
14 L3
24
5
15 L3
25
6
16 L3
26
7
17 L3
27
8
18
28
9
19
10
20
System Attention LEDs
the processor subsystem, there are two system attention LEDs, one on the DCA and the other on the
rear of the system. For the specific locations of these LEDs, refer to the following illustration.
Cable Identify
Port 0
Port 1
6
4
5
3
10
7
8
2
1
12 13
9
26
27
28
Link Status
Enet 0
Enet 1
System Attention
SP Adapter FRU Indicator
Service Processor FRU Indicator
Processor Planar FRU Indicator
11
23
24
25
Hardfile
Carrier
Hot Plug Slot ID
Hot Plug Slot Power
FRU Indicator
Bank 2
Bank 1
14
18
16
19
Memory Card 1
Memory Card 2
Memory Card 3
Memory Card 4
15
17
20
21
22
Activity Identify
Hardfile 1
DASD Cage
FRU Identify
DASD Backplane
DCA
Power In
Power Out
System Attention
DCA FRU Identify
Fan FRU Identify
Activity Identify
Hardfile 2
Light Pipes to
make LED visible
with File plugged
Ethernet Port 1
Ethernet Port 0
RIO (or RIO-2) Port 0
(A0)
RIO (or RIO-2) Port 1
(A1)
RIO (or RIO-2) Adapter
Memory Card Slot 1
Memory Card Slot 2
Memory Card Slot 3
Memory Card Slot 4
Memory Controller 0
The two LEDs are tied together, so they will always be in the same state, either both on or both off. They
are referred to as the system attention LEDs. The system attention LED is turned on when an entry is
Memory Controller 1
Memory Controller 3
Memory Controller 2
Cache Module 0
Cache Module 1
Cache Module 3
Cache Module 2
MCM Module
(processor)
RIO/PCI-X Bridge
SCSI Controller
DASD Ribbon Cable Connector
Service Processor/VPD Card Connector
Adapter Connector
PCI-X Bridge (PH2)
PCI-X Bridge (PH0)
PCI Adapter Slot 1
PCI Adapter Slot 2
PCI Adapter Slot 3
Chapter 2. Introducing the pSeries 655
11
If a
v
v
v As an
of
If
1.
2.
If
12
made in the service processor error log that gets transmitted to the system-level error logs (the AIX error
log and the service action event log in the Service Focal Point application). When the attention light comes
on, examine these error logs to see if user intervention is required.
hardware problem is indicated, call service support.
System Attention LED
The system attention LED lights and stays on when an event occurs that needs either customer
intervention or IBM service. The system attention LED lights when an entry is made in the service
processor error log. The error entry is transmitted to the following:
System-level error logs
AIX error log
entry in the service action event log in the Service Focal Point application; for example, the loss
surveillance from the HMC to a logical partition.
Accessing System Log Error Information
When an error is detected by the system, information about the error is stored in system error logs. The
error logs are accessed from the HMC that is used to manage the system.
the system attention LED lights, use the HMC to access the error logs by doing the following:
Open the Service Action Event Log in the Service Focal Point application on the HMC.
Check the open service events using the procedure described in ″Working with Serviceable Events″ in
the IBM Hardware Management Console for pSeries Installation and Operations Guide.
Note:
For information on these Service Focal Point settings, see ″Setting Up Surveillance and Enabling
Surveillance Notifications″ in the IBM Hardware Management Console for pSeries Installation and
Operations Guide .
Events requiring customer intervention are marked Call Home Candidate? NO. For each of these events,
examine the description in the serviceable event error details. If actions are listed in the description,
perform those actions. If the error indicates a loss of surveillance between the HMC and a partition, check
the status of the partition, the network, and the cabling between the HMC and the partition. Correct any
problems found. If a surveillance problem is still indicated, call service support.
the service action event is labeled Call Home Candidate? YES, an error code is supplied to identify the
problem. If the system is configured to automatically call home on error, the request for service is placed.
Note: The system attention LED can be reset by following the procedure described in “Resetting the
System Attention LED” on page 13.
PCI Adapters
For complete information about removing and replacing PCI adapters, see Appendix C, “Removing and
Replacing PCI Adapters,” on page 127.
Eserver pSeries 655 User’s Guide