Before using this information and the product it supports, read the information in “Safety
Notices” on page ix, Appendix A, “Environmental Notices” on page 169, and Appendix B,
“Notices” on page 171.
International Business Machines Corporation 2000. All rights reserved.
Note to U.S. Government Users Restricted Rights--Use, duplication or disclosure restricted by
GSA ADP Schedule Contract with IBM Corp.
Reader's Comments — We'd Like to Hear From You ............... 207
viiiRS/6000 Enterprise Server Model H80 Series User's Guide
Safety Notices
A
danger
death or serious personal injury.
ix
x
A
caution
moderate or minor personal injury.
x
xi
18
notice indicates the presence of a hazard that has the potential of causing
notice indicates the presence of a hazard that has the potential of causing
Danger
Caution
notices appear on the following pages:
notices appear on the following pages:
For a translation of the safety notices contained in this book, see the
Safety Information
Electrical Safety
Observe the following safety instructions any time you are connecting or
disconnecting devices attached to the workstation.
DANGER
An electrical outlet that is not correctly wired could place hazardous
voltage on metal parts of the system or the devices that attach to the
system. It is the responsibility of the customer to ensure that the outlet
is correctly wired and grounded to prevent an electrical shock.
Before installing or removing signal cables, ensure that the power
cables for the system unit and all attached devices are unplugged.
When adding or removing any additional devices to or from the system,
ensure that the power cables for those devices are unplugged before
the signal cables are connected. If possible, disconnect all power
cables from the existing system before you add a device.
Use one hand, when possible, to connect or disconnect signal cables
to prevent a possible shock from touching two surfaces with different
electrical potentials.
, order number SA23-2652.
System Unit
During an electrical storm, do not connect cables for display stations,
printers, telephones, or station protectors for communication lines.
Preface ix
CAUTION:
This product is equipped with a three–wire power cable and plug for the user's
safety. Use this power cable with a properly grounded electrical outlet to avoid
electrical shock.
DANGER
To prevent electrical shock hazard, disconnect the power cable from
the electrical outlet before relocating the system.
CAUTION:
This unit has more than one power supply cord. To reduce the risk of
electrical shock, disconnect two power supply cords before servicing.
xRS/6000 Enterprise Server Model H80 Series User's Guide
Laser Safety Information
The optical drive in this system unit is a laser product. The optical drive has a label
that identifies its classification. The label, located on the drive, is shown below.
CLASS 1 LASER PRODUCT
LASER KLASSE 1
LUOKAN 1 LASERLAITE
APPAREIL A LASER DE CLASSE 1
IEC 825:1984 CENELEC EN 60 825:1991
The optical drive in this system unit is certified in the U.S. to conform to the
requirements of the Department of Health and Human Services 21 Code of Federal
Regulations (DHHS 21 CFR) Subchapter J for Class 1 laser products. Elsewhere,
the drive is certified to conform to the requirements of the International
Electrotechnical Commission (IEC) 825 (1st edition 1984) and CENELEC EN 60
825:1991 for Class 1 laser products.
CAUTION:
A class 3 laser is contained in the device. Do not attempt to operate the drive
while it is disassembled. Do not attempt to open the covers of the drive as it
is not serviceable and is to be replaced as a unit.
Class 1 laser products are not considered to be hazardous. The optical drive
contains internally a Class 3B &Type. laser that is nominally 30 milliwatts at 830
nanometers. The design incorporates a combination of enclosures, electronics, and
redundant interlocks such that there is no exposure to laser radiation above a Class
1 level during normal operation, user maintenance, or servicing conditions.
Preface xi
xiiRS/6000 Enterprise Server Model H80 Series User's Guide
Data Integrity and Verification
Notice
IBM computer systems contain mechanisms designed to reduce the
possibility of undetected data corruption or loss. This risk, however,
cannot be eliminated. Users who experience unplanned outages, system
failures, power fluctuations or outages, or component failures must verify
the accuracy of operations performed and data saved or transmitted by the
system at or near the time of the outage or failure. In addition, users must
establish procedures to ensure that there is independent data verification
before relying on such data in sensitive or critical operations. Users should
periodically check the IBM support websites for updated information and
fixes applicable to the system and related software.
Data Integrity and Verificationxiii
xivRS/6000 Enterprise Server Model H80 Series User's Guide
About This Book
This book provides information on how to install and remove options, use the
system, use diagnostics, use service aids, and verify system operation. This book
also provides information to help you solve some of the simpler problems that might
occur.
ISO 9000
ISO 9000 registered quality systems were used in the development and
manufacturing of this product.
Online Publications
RS/6000 publications are available online. To access the online books, visit our Web
site at:http://www.rs6.ibm.com/resource/hardware_docs/
Related Publications
The following publications are available:
The
The
The
The
The
The
Preface xv
System Unit Safety Information
translations of safety information used throughout this book.
RS/6000 Enterprise Server Model H80 Series Installation Guide
number SA38-0575, contains information on how to set up and cable the system,
install and remove options, and verify system operation.
RS/6000 Enterprise Server Model H80 Series Service Guide
SA38-0566, contains reference information, maintenance analysis procedures
(MAPs), error codes, removal and replacement procedures, and a parts catalog.
7014 Model T00 Rack Installation and Service Guide
SA38-0577, contains information regarding the 7014 Model T00 Rack, in which
the RS/6000 Enterprise Server Model H80 Series may be installed.
RS/6000 Diagnostic Information for Multiple Bus Systems
SA38-0509, contains diagnostic information, service request numbers (SRNs),
and failing function codes (FFCs).
, order number SA23-2652, contains
, order
, order number
, order number
, order number
RS/6000 Adapters, Devices, and Cable Information for Multiple Bus
Systems
devices, and cables for your system. This manual is intended to supplement the
service information found in the
Systems
, order number SA38-0516, contains information about adapters,
RS/6000 Diagnostic Information for Multiple Bus
.
The
PCI Adapter Placement Reference
information regarding slot restrictions for adapters that can be used in this
system.
, order number SA38-0538, contains
The
Trademarks
The following terms are trademarks of International Business Machines Corporation
in the United States, other countries, or both:
AIX
RS/6000
Other company, product, and service names may be trademarks or service marks of
others.
Site and Hardware Planning Information
information to help you plan your installation.
, order number SA38-0508, contains
xviRS/6000 Enterprise Server Model H80 Series User's Guide
Chapter 1. Introducing the Model H80
The RS/6000 Enterprise Server Model H80 Series is a multiprocessor, multibus
system packaged in drawers. The processors and memory are packaged in a 5
EIA-unit central electronics complex (CEC drawer), and the I/O devices are in a 5
EIA-unit I/O drawer. The basic system consists of one CEC drawer and one I/O
drawer in the same rack. Connections between the CEC drawer and I/O drawers
are made through a number of cables which include the V/S COMM cable, RIO
(Remote Input Output) cables, and JTAG cable.
The CEC drawer supports one processor card, which can have up to six processors,
each having its own L2 cache. The processors share common system memory.
The system memory resides on one or two memory riser cards in the CEC drawer.
Up to 16 GB of user memory can be installed.
Power is provided to the system rack through an ac power cord that is connected to
a distribution bus. Power is connected from the distribution bus to each drawer
independently. All drawers use 200 - 240 V ac power.
The primary I/O drawer has the following:
14 PCI slots
System operator panel
SPCN controller
Interrupt and system bus control logic
Service Processor
Optional boot DASDs
The PCI bus speeds are as follows:
Up to 33Mhz for the four 32 bit slots at 5 V
Up to 66Mhz for the ten 64 bit slots at 3.3 V
Slots 1-5, 8-10, 13 and 14 support either 32 or 64 bit PCI adapters. The remaining
slots are 32 bit only.
The primary I/O drawer has space for up to three media devices (tape, CD-ROM,
and diskette drive). Two optional disk drives can occupy PCI slots 13 and 14.
Chapter 1. Introducing the Model H801
2RS/6000 Enterprise Server Model H80 Series User's Guide
Chapter 2. Using the Model H80
This chapter discusses system power control and use of options such as the
keyboard, mouse, and drives supported by the Model H80.
Operator Panel
The following diagram shows the locations of the operator panel display and
components of the operator panel.
1
2
3
6
5
4
1Power On/Off Button4Reset Button
2Power On/Off LED5Disturbance or System Attention LED
3Operator Panel Display
R
Scrolling Ball Indicator
The first time ac power is connected to the system, or whenever ac power is
disconnected from the system and then reconnected, a scrolling ball that moves from
left to right may be visible in the operator panel display before the first 8xxx progress
code appears; it might not display long enough to be visible. If seen, it may remain in
the operator panel display for a time of less than one second, or up to three to five
seconds, depending on the system and configuration.
Chapter 2. Using the Model H80
3
Powering Off and Powering On the System
This section provides procedures for powering off and powering on the system.
Powering Off the System
If the system is operating under AIX, type the shutdown -F command to power off
the system.
If you cannot use this method, you can power off the system by using the following
operator-panel power button procedure.
Attention: Using the power button on the primary I/O drawer operator panel to power
off the system might cause unpredictable results in the data files, and the next IPL
will take longer to complete.
1. Open the rack door.
2. Press the power button on the primary I/O drawer operator panel.
The power LED on the primary I/O drawer operator panel starts blinking at a fast
rate. B0FF appears in the primary operator panel display.
When the power-off sequence is complete, the system goes into Standby mode, as
evidenced by the following:
OK message displays in the primary operator panel display.
The primary I/O drawer operator panel LED will start blinking at a slow rate.
The CEC drawer power LED will start blinking at a slow rate.
Powering On the System
Perform the following steps to power on the CEC drawer and attached I/O drawer:
1. Open the rack door. Look for OK in the primary operator panel display, which
indicates that the system is in Standby mode.
2. Press the power button on the primary I/O drawer operator panel.
The power LED on the primary I/O drawer operator panel starts blinking at a fast
rate. Checkpoints codes (9xxx) appear in the operator panel display.
When the power-up sequence is complete; the following events occur:
The power LED on the primary I/O drawer operator panel stops blinking and
stays on.
4RS/6000 Enterprise Server Model H80 Series User's Guide
The power LED on the CEC drawer stops blinking and stays on.
Powering Off and Powering On the System Using the Service Processor
The system can be powered off and on using the System Power Control menu,
which is a Service Processor menu that is available to the privileged user. See
“System Power Control Menu” on page 39.
Chapter 2. Using the Model H805
POST Indicators
POST (Power-On-Self-Test) indicators indicate tests that are being performed as the
system is preparing to load the operating system. The POST indicators are words
that display on the system console. Each time that the system starts a different step
in the POST, a POST indicator word appears on the console. Each word is an
indicator of the tests that are being performed.
The POST screen displays the following words:
Memory Memory test
Keyboard Initialize the keyboard and mouse. The time period for pressing a key to
access the System Management Services, or to boot from a default boot
list is now open. See “POST Keys” on page 7 for more information.
NetworkSelf-test on network adapters
SCSIAdapters are being initialized
SpeakerSounds an audible tone at the end of POST
6RS/6000 Enterprise Server Model H80 Series User's Guide
POST Keys
The POST keys, if pressed
the last POST indicator displays, cause the system to start services or boot modes
used for configuring the system and diagnosing problems. The keys are described
below:
Note: The program function keys (F1-F12) on a keyboard attached to the I/O
drawer are no longer used and will be ignored. After the keyboard POST indicator
displays, you must use the numeric number keys to enter input.
after
the keyboard POST indicator displays and
before
1Key
The numeric 1 key, when pressed during POST, starts the System Management
Services (SMS) interface.
5Key
The numeric 5 key, when pressed during POST, starts the default boot list mode,
located in firmware.
This mode attempts to boot from the first device of each type found in the list. It
does not search for other bootable devices of that type if the first device is not
bootable. Instead, it continues to the next device type in the list. The firmware
supports up to five entries in the boot list.
The default boot sequence is:
1. Diskette
2. CD-ROM
3. Hard File
4. Tape Drive (if installed)
5. Network
a. Token Ring
b. Ethernet
6Key
The numeric 6 key works like the numeric 5 key, except that firmware looks for a
boot record according to the custom bootlist that was set up by System Management
Services.
Chapter 2. Using the Model H807
8Key
To enter the Open Firmware command line, press the numeric 8 key
after
the word
keyboard displays and before the last word (speaker) displays during startup. After
you press the 8 key, the remaining POST indicators display until initialization
completes.
When initialization and POST are complete, the Open Firmware command line (an OK
prompt) displays.
Use the Open Firmware command line to set up adapters that are not configurable
with the System Management Services. Your adapter documentation directs you to
use this option if it is needed.
To exit from the Open Firmware command prompt, type reset-all or power off the
system and reboot.
8RS/6000 Enterprise Server Model H80 Series User's Guide
Console Strategy
The firmware starts a console-selection sequence at system boot time if any of the
following are true:
A console has not yet been selected
A previous console-selection sequence timed-out
A change in the system configuration has affected the console (keyboard
installed/removed, mouse installed/removed, graphics adapter installed/removed
or moved to another PCI slot).
The console-selection sequence allows you to select (from the appropriate input
device) any one of the available console devices. If no console is selected within
approximately 60 seconds, serial port 1 (S1) is selected as the console and the
selection sequence times-out.
Attention: If an ASCII terminal is attached to serial port 1 (S1), and there is any
interaction with this terminal:
After OK displays in the operator panel
AND
Before the power-up sequence is initiated
the firmware will still use this terminal as the console, regardless of the previous
console selection.
After a console has been selected, the console-selection sequence is only started at
boot time if there is a change in the system configuration (as described above), or
the contents of the system's nonvolatile memory (NVRAM) are lost.
Note: Moving an ASCII terminal from one serial port to another (from S1 to S2)
cannot be detected by the firmware, so it does not constitute a configuration change.
You can also initiate a system console-selection sequence from the SMS menus.
Chapter 2. Using the Model H809
Reading the I/O Drawer Operator Panel Display
The operator panel display is used to:
Track the progress of the system unit self tests and configuration program.
Display codes when the operating system comes to an abnormal end.
Display system messages.
Checkpoints
During power-on self-test (POST), four-digit checkpoints are displayed indicating the
progress of the testing. If an error is detected that requires attention, the system unit
halts with an eight-digit error code displayed in the upper row of the operator panel
display, starting in the leftmost position. This eight-digit error code identifies the error
(see
RS/6000 Enterprise Server Model H80 Series Service Guide
error codes).
The four-digit checkpoints are in the form of nnnn, where n is an alphabetic or
numeric character.
for a listing of the
10RS/6000 Enterprise Server Model H80 Series User's Guide
Using the Keyboards
Several keyboards are available for the system unit. The keyboards have various
keys that enter data and control the cursor location. The keyboards can be engraved
for the languages of different countries.
The functions of each keyboard depends on the software used. The character sets
for the keyboards are contained and explained in the documentation for your
operating system.
1
Num
Caps
Print
EscF1F2F3F4F6F7F8F5F9F10 F11 F12Pause
@
#
!
123
Tab
Caps
Lock
Shift
CtrlAlt
$% &
4567
QWERYU ITOP
ASDFGHJKL :
ZXCVBNM
*
8
()
90
<
>
,
_
+
-
=
"
,
;
?
Shift
/.
AltCtrl
Backspace
Enter
Scroll
Screen
Lock
SysRq
Insert
Home
Delete
End
Lock
Break
Page
Up
Page
Down
Lock
Num
/*-
Lock
789
HomePgUp
654
321
End
PgDn
.
0
InsDel
Scroll
Lock
+
Enter
2
3
4
The keyboard is divided into four sections:
1. Function keys are multipurpose keys, and their function is controlled by the
operating system.
2. Typewriter keys are similar to those on a standard typewriter. Their function is
controlled by the software.
3. Control keys move the cursor on the screen and do programmed control
functions. The movement and functions depend on the application used.
4. Numeric keypad is arranged like a calculator to help when typing numbers.
Chapter 2. Using the Model H8011
All of the keyboards adjust for typing comfort. To tilt the keyboard, pull out the
keyboard legs. The legs snap into position. To decrease the tilt of the keyboard,
rotate the keyboard legs until they snap into the bottom of the keyboard case.
The keyboard cable plugs into the keyboard connector at the rear of the I/O drawer.
12RS/6000 Enterprise Server Model H80 Series User's Guide
Using the Three-Button Mouse
The mouse is a hand-operated locating device. A three-button mouse is available for
use with the system unit. Consult your application publication for the exact use of
the three-button mouse.
You can use the mouse to perform such functions as positioning a cursor, selecting
items from a menu, or moving around in your document much easier and faster than
if you used only the keyboard. The cursor moves exactly as you move the mouse on
a flat surface, such as a desktop.
When you move the mouse around on a flat surface, the cursor moves on the
display screen; the movement changes the position of the cursor.
With the mouse buttons, you can perform functions such as selecting and
deselecting options, extending your selection, or choosing a command. The precise
function of your mouse depends on the software you are using.
The mouse has a cable that plugs into the mouse connector at the rear of the
primary I/O drawer.
Handling the Mouse Correctly
For best operation, handle the mouse with care. Incorrect handling can damage the
mouse.
Do not:
Operate the mouse on cloth, unfinished wood, newspaper, or carpet
Drop or hit the mouse
Carry the mouse by holding onto the cable
Expose the mouse to extreme temperatures or direct sunlight
Place the mouse in liquid spills
Chapter 2. Using the Model H8013
Caring for the Mouse
Make sure that the operating surface for the mouse is smooth, clean, and flat. For
example, you can operate the mouse on the following surfaces:
Finished wood
Glass
Enamel
Plastic
Paper (except newspaper)
Metal
Rough surfaces collect contaminants that can be transferred to the interior of the
mouse by the ball. Rough surfaces can also cause the pads located on the bottom
of the mouse to prematurely wear. A deeply pitted surface could cause erratic
operation of the mouse. The surface you use should be free from spills, dirt, dust,
lint, wax, eraser dust, and other foreign matter.
Additional things to check:
Inspect the work surface for spills or other contaminants.
Dust the work surface.
If you are using a paper pad, inspect it for wear and replace it if necessary.
14RS/6000 Enterprise Server Model H80 Series User's Guide
Cleaning the Mouse
To clean the mouse, do the following:
1. Remove the retaining ring (1) by turning it counterclockwise, in the direction of
the arrow as shown in the illustration.
1
2
3
2. Remove the ball (2).
3. Inspect the ball for contaminants. Wipe it clean with a dry, lint-free cloth.
4. If the ball is dirty, wash it in warm, soapy water. Rinse and wipe the ball with a
lint-free cloth until dry.
5. Inspect the ball cavity (3) in the mouse for foreign materials. If there are any
foreign materials, remove them.
6. Replace the ball.
7. Replace the retaining ring on the mouse and align it with the open slots in the
ball cavity.
8. Turn the retaining ring clockwise until the open slots are covered and you hear
the ring snap into place.
Chapter 2. Using the Model H8015
Using the 3.5-Inch Diskette Drive
The Model H80 has a 1.44MB diskette drive installed vertically in the front.
The 1.44MB diskette drive can format, read, and write diskettes compatible with the
following diskettes:
1.0MB diskettes with 720KB formatted data capacity
2.0MB diskettes with 1.44MB formatted data capacity (HD)
Format the diskette according to its specified capacity.
Write-Protecting 3.5-Inch Diskettes
Write-protecting diskettes is necessary so that important information is not
accidentally lost.
When diskettes are write-protected, you can read information from the diskettes, but
you cannot write information onto them.
There is a write-protect tab on the 3.5-inch diskette.
To locate the write-protect tab, turn the diskette over with the label facing down.
To prevent writing onto a diskette, slide the write-protect tab to open the protect
slot, as shown in the following illustration.
16RS/6000 Enterprise Server Model H80 Series User's Guide
To allow writing onto a diskette, slide the write-protect tab to cover the protect
slot, as shown in the following illustration.
Loading and Unloading the 3.5-Inch Diskette
To load a diskette into the drive, insert the diskette in the diskette drive with the
labeled metal shutter first. Push the diskette into the drive until you hear a click.
The click indicates that the diskette is securely in position in the drive.
To unload the diskette, push the diskette-unload button. The diskette unloads
partially from the drive. Remove the diskette.
Chapter 2. Using the Model H8017
Using the CD-ROM Drive
CAUTION:
A Class 3 laser is contained in the device. Do not attempt to operate the drive
while it is disassembled. Do not attempt to open the covers of the drive, as it is
not serviceable and is to be replaced as a unit.
Your CD-ROM drive looks like the one shown in the illustration, and the controls are
located as indicated.
When the CD-ROM is set to On, the status light indicates one of several conditions.
The following are status light states and the respective conditions of the CD-ROM
drive:
Off during standby with the tray loaded or unloaded
Blinks from insertion of the tray to completion of initialization
Blinks slowly when disc is dusty
Blinks fast when in the audio mode
Lights during data transfer operations
Lights steadily when:
– No disc is in the tray
– The disc is in the tray upside down
– Some condition exists that should be checked. If this occurs, contact your
service representative.
18RS/6000 Enterprise Server Model H80 Series User's Guide
Loading the CD-ROM Drive
Press the unload button to open the tray. Place the disc, with the printed side away
from the tray, into the tray. Slip out the bottom tabs to hold the disc in place. Push
gently on the load/unload button. The drive automatically pulls the tray into the drive
and prepares the disc for reading.
Unloading the CD-ROM Drive
Push and hold the unload button until the drawer comes out, and then remove the
disc.
Cleaning the CD-ROM Drive
This CD-ROM drive has an internal head-cleaning mechanism, and therefore does
not require an external cleaning device. The internal cleaning mechanism cleans the
head every time the tray is operated.
Always handle discs carefully by the edges to avoid leaving fingerprints or scratching
the disc. This helps the disc to maintain good readability. Discs can be wiped with a
soft, lint-free cloth or lens tissue. Always wipe in a straight line from the inner hub to
the outer rim.
Chapter 2. Using the Model H8019
Emergency Eject
Note: Execute the following procedure only in an emergency (such as when the
tray will not eject even though you have pressed the unload button).
1. Insert a small diameter rod, such as a straightened paper clip, into the
emergency eject hole. (Refer to the illustration below for the location of the
emergency eject hole.)
2. Push in the tool until you feel resistance.
3. Maintain a small amount of pressure on the rod while pulling on the tray with
your fingernail.
Note: Normally the tray makes a ratcheting sound when you pull it open using the
above procedure.
20RS/6000 Enterprise Server Model H80 Series User's Guide
Ergonomic Information
After you have set up your system, we encourage you to visit the Healthy Computing
Web site. Good ergonomic practice is important to get the most from your
workstation and to avoid discomfort. This means that the equipment and the
workplace should be arranged to suit your individual needs and the kind of work you
do.
The Healthy Computing Web site gives ergonomic guidelines to help you understand
the ergonomic considerations that you should know when working at a computer
workstation. The address is:
http://www.us.pc.ibm.com/healthycomputing
Using the Service Processor and Service Director Features
The Service Processor and Service Director features protect users against
unnecessary system downtime by keeping support personnel (both internal and
external) aware of any unexpected changes in the system environment. In
combination, the two features provide a flexible solution to automated system
maintenance.
Service Processor
The Service Processor runs on its own power boundary and continually monitors
hardware attributes, the AIX operating system, and the environmental conditions
within the system. Any system failure which prevents the system from returning to
an operational state (a fully functional AIX operating system) is reported by the
Service Processor. The Service Processor is controlled by firmware and does not
require the AIX operating system to be operational to perform its tasks. If any
system failures are detected, the Service Processor can take predetermined
corrective actions. The methods of corrective actions are:
Surveillance
Call Home
AIX Operating System Monitoring
Surveillance is a function in which the Service Processor monitors the system
through heartbeat communication with the system firmware. The
periodic signal that the firmware can monitor. During system startup, the firmware
surveillance monitor is automatically enabled to check for heartbeats from the
firmware. If a heartbeat is not detected within a default period, the Service
Processor attempts to reboot the system until the system either restarts successfully,
Chapter 2. Using the Model H8021
heartbeat
is a
or a predetermined retry threshold is reached. In the event the Service Processor is
unsuccessful in bringing the system online (or in the event that the user asked to be
alerted to any Service Processor-assisted restarts), the system can call home to
report the error.
The Call Home function can be initialized to call either a service center telephone
number, a customer administration center, or a digital pager telephone number. The
Service Processor can be configured to stop at the first successful call to any of the
numbers listed, or can be configured to call every number provided. If connected to
the service center, the Service Processor transmits the relevant system information
(the system's serial number and model type) and Service Request Number (SRN). If
connected to a digital pager service, the Service Processor inputs a Customer Voice
Telephone Number defined by the customer. An established sequence of digits or
the telephone number to a phone near the failed system could be used to signal a
system administrator to a potential system failure.
During normal operations, the Service Processor can also be configured to monitor
the AIX operating system. If AIX does not respond to the Service Processor
heartbeat, the Service Processor assumes the operating system is hung. The
service processor can automatically initiate a restart and, if enabled, initiate the call
home function to alert the appropriate people to the system hang.
Unlike the Service Director, the service processor cannot be configured in a
client/server environment where one system can be used to manage all dial-out
functions for a set of systems.
Prior to installing the Service Director feature, ensure that you have the latest level of
system firmware. You also need a properly configured modem. For more
information on configuring a modem, see “Modem Configuration Menu” on page 54.
Service Director
The Service Director is a software extension to the AIX operating system that
monitors the system while the AIX operating system is running. The Service Director
monitors and analyzes all recoverable system failures, and, if needed, can
automatically place a service call to a service center (without user intervention).
The service center receives the machine type/serial number, host name, SRN, and a
problem description. The service center analyzes the problem report and, if
warranted, dispatches a service person to the customer site. The service center also
determines if any hardware components need to be ordered prior to the service
person's arrival.
22RS/6000 Enterprise Server Model H80 Series User's Guide
The Service Director code also gives the user the option to establish a single system
as the problem reporting server. A single system, accessible over the user network,
can be used as the central server for all the other systems on the Local Area
Network (LAN) that are running the Service Director application. If the Service
Director application on a remote client decides a service request needs to be placed,
the client forwards the information to the Service Director server which dials the
service center telephone number from its locally attached modem. In this scenario,
the user only needs to maintain a single analog line for providing call-out capabilities
for a large set of servers.
A modem is required for enabling automated problem reporting to the IBM service
center. Configuration files for several types of modems are included as part of the
Service Director package. Refer to “Modem Configuration Menu” on page 54 for
more information on configuring your modem.
Chapter 2. Using the Model H8023
24RS/6000 Enterprise Server Model H80 Series User's Guide
Chapter 3. Using the Service Processor
The service processor menus make it possible for you to configure service processor
options, as well as enable and disable functions.
Service processor menus are available using an ASCII terminal when OK is displayed
on the primary I/O drawer operator panel or when the service processor has
detected a server problem (such as a surveillance failure).
For a summary of the service processor functions and the methods used to invoke
them, see the following table.
Chapter 3. Using the Service Processor25
Service Processor FunctionsService
Processor
Menus
(ASCII
terminals)
Service
Processor
Service
Aids
(ASCII or
graphics
terminals)
Read VPD image from last system bootY
Read System POST ErrorsY
Read Service Processor Error LogsY
View Progress Indicators from last bootY
Power-off SystemY
Read NVRAMY
Start Talk ModeY
Enable/Disable Console MirroringY
Setup Reboot/Restart PolicyY
Enable/Disable ModemY
Setup Modem ConfigurationY
Setup Dial-out Phone NumbersY
Setup SurveillanceY
Select Modem Line SpeedY
Update System EPROMsY
26RS/6000 Enterprise Server Model H80 Series User's Guide
Service Processor Menus
The service processor menus are divided into two groups:
General user menu - the user must know the general-access password.
Privileged user menus - the user must know the privileged-access password.
If the server is powered off, the service processor menus can be accessed locally or
remotely.
Accessing the Service Processor Menus Locally
Service processor menus can be accessed locally by connecting an ASCII terminal
to serial port 1 (S1), serial port 2 (S2), or serial port 3 (S3). Access to the service
processor menus is not available on serial port 4 (S4). Because the presence of the
ASCII terminal cannot be confirmed by the service processor, you must press a key
(any key) on the ASCII keyboard to confirm its presence after OK appears in the
primary drawer operator panel display.
When you gain access, the service processor prompts you for a password (if set),
and when verified, displays the service processor menus.
The service processor menu prompt, represented by >, 1> or 2> indicates the serial
port to which the terminal is connected.
> indicates serial port 1 (S1)
1> indicates serial port 2 (S2)
2> indicates serial port 3 (S3)
Accessing the Service Processor Menus Remotely
If your system has a modem connected and is configured for call-in (see “Modem
Configuration Menu” on page 54), the service processor menus can be accessed
remotely as follows:
1. With the system powered off, call in from a remote terminal.
2. The Service Processor detects ring-indicate and prompts you for a password (if
set). When verified, the service processor menus display remotely.
Saving and Restoring Service Processor Settings
All the settings you make (except language) from the service processor menus can
be backed up either for recovering from a fault that may corrupt these settings, or for
replicating these settings to other servers that include a service processor.
Chapter 3. Using the Service Processor27
The service aid, Save or Restore Hardware Management Policies, can be used to
save your settings after initial setup or whenever the settings must be changed for
system operation purposes.
It is strongly recommended that you use this service aid for backing up service
processor settings to protect the usefulness of the service processor and the
availability of the server. Refer to “Save or Restore Hardware Management Policies”
on page 143 for information about this service aid.
Menu Inactivity
To prevent loss of control due to power loss or power surges, the service processor
attempts to leave menu mode after 10 minutes of inactivity by simulating the option
99 selection. This attempt is successful on menus containing the 99 option. On the
other menus, the attempt is unsuccessful and the following message displays with
each attempt:
Illegal value entered
Press Return to continue
28RS/6000 Enterprise Server Model H80 Series User's Guide
General User Menu
The menu options presented to the general user are a subset of the options
available to the privileged user. The user must know the general-access password,
if set, to access this menu.
àð
GENERAL USER MENU
1. Power-on System
2. Power-off System
3. Read VPD Image from Last System Boot
4. Read Progress Indicators from Last System Boot
5. Read Service Processor Error Logs
6. Read System POST Errors
99. Exit from Menus
1>
á
Power-on System
ñ
Allows the user to start up the system using the current ASCII terminal as the
active console.
Power-off System
Allows the user to power down the system using the current ASCII terminal as
the active console.
Read VPD Image from Last System Boot
Displays manufacturer vital product data, such as serial numbers, part numbers,
and so on, that were stored from the system boot prior to the one in progress
now.
Chapter 3. Using the Service Processor29
Read Progress Indicators from Last System Boot
Displays a number of the boot progress indicators, which may include service
processor checkpoints, IPL checkpoints, and/or AIX configuration codes, from the
previous system boot. This information can be useful in diagnosing system faults.
The progress indicator codes are listed from top (latest) to bottom (oldest).
This information is not stored in nonvolatile storage. If the system is powered
down using the power-on button on the operator panel, this information is
retained. If the ac power is disconnected from the system, this information will be
lost. For an example, refer to “LCD Progress Indicator Log” on page 77.
Read Service Processor Error Logs
Displays the service processor error logs. For an example, refer to “Service
Processor Error Logs” on page 75.
Read System POST Errors
Displays the results of the System Firmware POST (Power-On Self Test). Your
server may be able to start in the presence of POST errors if there are sufficient
working system resources. If POST errors occur during startup, this error log,
when used with the diagnostics, helps to isolate faults. For an example, refer to
“Service Processor Error Logs” on page 75.
Exit from Menus
Selecting this option will exit the service processor menus. You can re-enter the
menus by pressing any key on the console.
30RS/6000 Enterprise Server Model H80 Series User's Guide
Privileged User Menus
The following menus are available to privileged users only. The user must know the
privileged access password, if set, to access these menus.
Main Menu
At the top of the Main Menu is a listing containing:
Your system's current firmware version
The firmware copyright notice
The system name given to your server during setup
You need the firmware version for reference when you either update or repair the
functions of your service processor.
The system name, an optional field, is the name that your server reports in problem
messages. This name helps your support team (for example, your system
administrator, network administrator, or service representative) to more quickly
identify the location, configuration, and history of your server. The system name is
set from the Main Menu using option 6.
Chapter 3. Using the Service Processor31
Note: The information under the Service Processor Firmware heading in the
following Main Menu illustration is example information only.
àð
VERSION: MM313
SYSTEM NAME
MAIN MENU
1. Service Processor Setup Menu
2. System Power Control Menu
3. System Information Menu
4. Language Selection Menu
5. Call-In/Call-Out Setup Menu
6. Set System Name
99. Exit from Menus
1>
á
Service Processor Firmware
Copyright 1999 IBM Corporation
Service Processor Setup Menu
See “Service Processor Setup Menu” on page 33 for more information.
System Power Control Menu
ñ
See “System Power Control Menu” on page 39 for more information
System Information Menu
See “System Information Menu” on page 44 for more information.
Language Selection Menu
See “Language Selection Menu” on page 52 for more information.
Call-In/Call-Out Setup Menu
See “Call-In/Call-Out Setup Menu” on page 53 for more information.
Set System Name
Allows setting of the system name.
32RS/6000 Enterprise Server Model H80 Series User's Guide
Service Processor Setup Menu
The Service Processor Setup menu shown below is accessed from the Main Menu:
àð
SERVICE PROCESSOR SETUP MENU
1. Change Privileged Access Password
2. Change General Access Password
3. Enable/Disable Console Mirroring:
Currently Enabled
4. Start Talk Mode
5. OS Surveillance Setup Menu
6. Reset Service Processor
7. Reprogram Flash EPROM Menu
8. Serial Port Snoop Setup Menu
98. Return to Previous Menu
99. Exit from Menus
1>
á
ñ
Note: Unless otherwise stated in menu responses, settings become effective when
a menu is exited using option 98 or 99.
Chapter 3. Using the Service Processor33
Passwords
Passwords can be any combination of up to eight alphanumeric characters. You can
enter longer passwords, but the entries are truncated to include only the first eight
characters. The Privileged Access Password can be set from Service Processor
menus or from System Management Services (SMS) utilities (see Chapter 4, “Using
System Management Services” on page 81). The General Access Password can be
set only from Service Processor menus.
For security purposes, the service processor counts the number of attempts to enter
correct passwords. The results of not recognizing a correct password within this
error threshold are different, depending on whether the attempts are being made
locally (at the server) or remotely (through a modem). The error threshold is three
attempts.
If the error threshold is reached by someone entering passwords at the server, the
Service Processor commands the server to resume the initial program load (IPL).
This action is taken based on the assumption that the server is in an adequately
secure location with only authorized users having access. Such users must still
successfully enter a login password to access AIX.
If the error threshold is reached by someone entering passwords remotely, Service
Processor commands the server to power down to prevent potential security attacks
on the server by unauthorized remote users. The following table lists what you can
access with the Privileged Access Password and the General Access Password.
Privileged
Access
Password
NoneNoneService processor MAIN MENU displays
SetNoneUsers with the password see the service processor MAIN
SetSetUsers see menus associated with the entered password
General
Access
Password
34RS/6000 Enterprise Server Model H80 Series User's Guide
Resulting Menu
MENU. Users without password cannot log in.
Change Privileged Access Password
Set or change the Privileged Access Password. It provides the user with the
capability to access all service processor functions. This password is usually
used by the system administrator or root user.
Change General Access Password
Set or change the General Access Password. It provides limited access to
service processor menus, and is usually available to all users who are allowed to
power on the server, especially remotely.
Note: The General Access Password can only be set or changed after the
Privileged Access Password is set.
Enable/Disable Console Mirroring
Console mirroring is supported on serial port 1 (S1), serial port 2 (S2), and serial
port 3 (S3). When Console Mirroring is enabled, the service processor sends
information to all serial ports. The serial port from which console mirroring is
enabled is referred to as the
active port
. The
mirror port
is determined when
keyboard input is detected from one of the other ports. From this point on, the
service processor only sends information to the active port and the mirror port.
This capability can be enabled by local or remote users, providing local users
the capability to monitor remote sessions. Console mirroring can be enabled for
the current session only. For more information, see “Console Mirroring” on
page 74.
Start Talk Mode
In a console-mirroring session, it is useful for those who are monitoring the
session to be able to communicate with each other. Selecting this menu item
activates the keyboards and displays for such communications while console
mirroring is established. This is a full duplex link, so message interference is
possible. Alternating messages between users works best.
Chapter 3. Using the Service Processor35
OS Surveillance Setup Menu
This menu can be used to set up operating system (OS) surveillance.
àð
OS Surveillance Setup Menu
1. Surveillance:
Currently Enabled
2. Surveillance Time Interval:
2 minutes
3. Surveillance Delay:
2 minutes
98. Return to Previous Menu
1>
á
– Surveillance
Can be set to Enabled or Disabled.
– Surveillance Time Interval
Can be set to any number from 2 through 255.
ñ
– Surveillance Delay
Can be set to any number from 0 through 255.
Refer to “Service Processor System Monitoring - Surveillance” on page 71 for
more information about surveillance.
Reset Service Processor
If this option is selected, entering Y will cause the service processor to reboot.
36RS/6000 Enterprise Server Model H80 Series User's Guide
Reprogram Flash EPROM Menu
This option updates the system EPROMs. After entering Y to indicate that you
want to continue, you are prompted to enter the update diskettes. Follow the
instructions on the screen. When the update is complete, the service processor
reboots.
All system EPROMs that can be reprogrammed are updated at the same time.
They are as follows:
– System Power Control Network programming
– Service Processor programming
– IPL programming
– Run-Time Abstraction Services
Chapter 3. Using the Service Processor37
Serial Port Snoop Setup Menu
This menu can be used to set up Serial Port Snooping, in which the user can
configure Serial Port 1 as a "catch-all" reset device.
From the Service Processor Main Menu, select option 1, Service Processor
setup menu, then select option 8 (Serial Port Snoop Setup Menu).
àð
SERIAL PORT SNOOP SETUP MENU
1. System reset string:
Currently Unassigned
2. Snoop Serial Port:
Currently Unassigned
98. Return to Previous Menu
1>
á
Use the system reset string option to enter the system reset string, which resets
the machine when it is detected on the main console on Serial Port 1.
Use the Snoop Serial Port option to select the Serial Port to Snoop.
Note: Only Serial Port 1 is supported.
After Serial Port Snooping is correctly configured, at any point after the system
unit is booted to AIX, whenever the reset string is typed on the main console, the
system unit uses the Service Processor reboot policy to restart.
ñ
Pressing Enter after the reset string is not required, so make sure that the string
is not common or trivial. A mixed-case string is recommended.
38RS/6000 Enterprise Server Model H80 Series User's Guide
System Power Control Menu
This menu is used to set power control options. Other menus that control boot
options are available here:
àð
SYSTEM POWER CONTROL MENU
1. Enable/Disable Unattended Start Mode:
Currently Enabled
2. Ring Indicate Power-On Menu
3. Reboot/Restart Policy Setup Menu
4. Power-On System
5. Power-Off System
6. Enable/Disable Fast System Boot
Currently Enabled
7. Boot Mode Menu
98. Return to Previous Menu
99. Exit from Menus
1>
á
Enable/Disable Unattended Start Mode
Use this option to instruct the service processor to restore the power-state of the
server after a temporary power failure. Unattended Start Mode can also be set
through the System Management Services (SMS) Menus. It is intended to be
used on servers that require automatic power-on after a power failure. For more
information, see “System Power-On Methods” on page 62.
Chapter 3. Using the Service Processor39
ñ
Ring Indicate Power-On Menu
àð
RING INDICATE POWER-ON MENU
1. Ring indicate power-on :
Currently Enabled
2. Number of rings:
Currently 3
3. Refresh Modem Settings
98. Return to Previous Menu
á
ñ
Ring indicate is enabled by default on both serial port 1 (S1) and serial port 2
(S2). When ring indicate power on is enabled, call-in is disabled.
If ring indicate power-on is enabled and call-in is already enabled, you will be
asked to confirm your choice. Refer to the message displayed on your screen.
If the ring indicate power-on setting is changed, you must select option 30,
Refresh Modem Settings to update the modem settings. If option 30, Refresh
Modem Setting is selected, and the modem(s) have not been configured, you will
be asked to configure the modems first. See “Call-In/Call-Out Setup Menu” on
page 53 for information on configuring modems.
Option 2 is used to set the number of rings.
40RS/6000 Enterprise Server Model H80 Series User's Guide
Reboot/Restart Policy Setup Menu
The following menu controls Reboot/Restart Policy:
àð
Reboot/Restart Policy Setup Menu
1. Number of reboot attempts:
Currently 1
2. Use OS-Defined restart policy?
Currently Yes
3. Enable supplemental restart policy?
Currently No
4. Call-Out before restart:
Currently Disabled
98. Return to Previous Menu
1>
á
ñ
Reboot
system reset or power on.
is the process of bringing up the system hardware; for example, from a
Restart
is activating the operating system after the
system hardware is reinitialized. Restart must follow a successful reboot.
– Number of reboot attempts - If the server fails to successfully complete the
boot process, it attempts to reboot the number of times specified. Entry
values equal to or greater than 0 are valid. Only successive failed
reboot/restart attempts count.
– Use OS-Defined restart policy - Allows the service processor to react or not
react in the same way as the operating system to major system faults by
reading the setting of the operating system parameter AutomaticallyRestart/Reboot After a System Crash. This parameter may or may not be
defined, depending on the operating system or its version/level. If the
operating system automatic restart setting is defined, then it can be set to
respond to a major fault by restarting or by not restarting. See your operating
system documentation for details on setting up operating system automatic
restarts. The default value is YES.
Chapter 3. Using the Service Processor41
– Enable supplemental restart policy - The default setting is NO. If set to
YES, the service processor restarts the system when the system loses
control as detected by service processor surveillance, and either:
The Use OS-Defined restart policy is set to NO.
OR
The Use OS-Defined restart policy is set to YES, and the operating
system has NO automatic restart policy.
Refer to “Service Processor Reboot/Restart Recovery” on page 64.
– Call-Out before restart (Enabled/Disabled) - If a restart is necessary due to
a system fault, you can enable the service processor to call out and report
the event. This option can be valuable if the number of these events
becomes excessive, signalling a bigger problem.
Power-On System
Allows immediate power-on of the system. For other power-on methods, see
“Powering On the System” on page 4.
Power-Off System
Allows the user to power-off the system.
Enable/Disable Fast System Boot
Allows the user to select the IPL type, mode, and speed of your boot capabilities.
Attention: Selecting the fast IPL results in several diagnostic tests being skipped
and a shorter memory test being run.
42RS/6000 Enterprise Server Model H80 Series User's Guide
Boot Mode Menu
The Boot Mode Menu allows you to select a boot mode.
àð
Boot Mode Menu
1. Boot to SMS Menu:
Currently Disabled
2. Service Mode Boot from Saved List:
Currently Disabled
3. Service Mode Boot from Default List:
Currently Disabled
4. Boot to Open Firmware Prompt:
Currently Disabled
98. Return to Previous Menu
1>
á
To select a boot mode, select a number and press Enter. The item
corresponding to the selected number toggles from Disabled to Enabled. If the
same number is selected again, the item toggles from Enabled to Disabled. If a
boot mode is Enabled, the Boot mode selected is performed, and the
Disabled/Enabled selection is reset to Disabled. Following is a description for
each boot mode:
ñ
– Boot to SMS Menu
When enabled, the system boots to the System Management Services
(SMS) Menu.
– Service Mode Boot from Saved List
This selection causes the system to boot from the saved service mode boot
list (saved in NVRAM). This is normally used to try to boot Customer
Diagnostics from the CD-ROM drive. If the system boots AIX from the disk
drive and AIX diagnostics are loaded on the disk drive, AIX boots to the
diagnostics menu.
Using this option to boot the system is the preferred way to run Online
diagnostics.
– Service Mode Boot from Default List
This selection is similar to “Service Mode Boot from Saved List,” except the
system boots from the default boot list that is stored in the system firmware.
Using this option to boot the system is the preferred way to run Standalone
diagnostics.
Chapter 3. Using the Service Processor43
– Boot to Open Firmware
When this selection is enabled, the system boots to the Open Firmware
prompt.
System Information Menu
This menu provides access to system configuration information, error logs, system
resources, and processor configuration.
àð
SYSTEM INFORMATION MENU
1. Read VPD Image from Last System Boot
2. Read Progress Indicators from Last System Boot
3. Read Service Processor Error Logs
4. Read System POST Errors
5. Read NVRAM
6. Read Service Processor Configuration
7. Processor Configuration/Deconfiguration Menu
8. Memory Configuration/Deconfiguration Menu
9. Power Control Network Utilities Menu
98. Return to Previous Menu
99. Exit from Menus
1>
á
Read VPD Image from Last System Boot
Displays manufacturer's vital product data (VPD), such as serial numbers, part
numbers, and so on, that was stored from the system boot prior to the one in
progress now.
44RS/6000 Enterprise Server Model H80 Series User's Guide
ñ
Read Progress Indicators from Last System Boot
Displays a number of the boot progress indicators, which may include Service
Processor checkpoints, IPLROS checkpoints, and/or AIX configuration codes,
from the previous system boot. This information can be useful in diagnosing
system faults.
The progress indicator codes are listed from top (latest) to bottom (oldest).
This information is not stored in nonvolatile storage. If the system is powered
down using the power-on button on the operator panel, this information is
retained. If the ac power is disconnected from the system, this information will be
lost. For an example, refer to “LCD Progress Indicator Log” on page 77.
Read Service Processor Error Logs
Displays error conditions detected by the service processor. Refer to “Service
Processor Error Logs” on page 75 for an example of this error log.
Read System POST Errors
Select this item to review the results of the POST (Power-On Self Test). Your
server may be able to start in the presence of POST errors if there are sufficient
working system resources. If POST errors occur during startup, this error log,
when used with the diagnostics, helps to isolate faults. Refer to “System POST
Errors” on page 76 for an example of this error log.
Read NVRAM
Displays Non Volatile Random Access Memory (NVRAM) content.
Read Service Processor Configuration
Displays current service processor configuration.
Chapter 3. Using the Service Processor45
Processor Configuration/Deconfiguration Menu
This menu allows the user to change the system processor configuration. If it is
necessary to take one of the processors offline, this menu allows you to
deconfigure a processor, and then reconfigure the processor at a later time. An
example of this menu is shown below:
àð
PROCESSOR CONFIGURATION/DECONFIGURATION MENU
77. Enable/Disable CPU Repeat Gard: Currently Enabled
1. 3. () Configured by system
2. 1 3.1 (31) Deconfigured by system
3. 2 3.2 () Configured by system
4. 3 3.3 () Configured by system
5. 4 4. () Configured by system
6. 5 4.1 () Configured by system
7. 6 4.2 () Configured by system
8. 7 4.3 () Configured by system
98. Return to Previous Menu
1>
á
Note: This table is built from vital product data collected during the last boot
sequence. The first time the system is powered up, or after the system's
non-volatile ram (NVRAM) has been erased, this table may be empty. The table
is rebuilt during the next boot into AIX.
ñ
The fields of the previous table represent the following:
Column 1 (1.) Menu selection index.
Column 2 (0) Logical processor device number assigned by AIX. You can
display these logical device numbers by issuing the command lsdev
-C ] grep proc on the AIX command line.
Column 3 (3.0) Processor address list used by the service processor and should
be ignored.
Column 4 (00) Error status of the processors.
The error status of the each processor is indicated by AB, where B indicates the
number of errors and A indicates the type of error according to the following
table:
A status of 00 indicates that the CPU has not had any errors logged against it by
the service processor.
46RS/6000 Enterprise Server Model H80 Series User's Guide
To enable or disable CPU Repeat Gard, use menu option 77 of the Processor
Configuration/Deconfiguration Menu.
Note: The processor numbering scheme used by the Service Processor is
different from the numbering scheme used by AIX. Consult the AIX
documentation before configuring or deconfiguring a processor to ensure that the
correct processor is selected.
Memory Configuration/Deconfiguration Menu
These menus allow the user to change the system memory configuration. If it is
necessary to take one of the memory DIMMs offline, this menu allows you to
deconfigure a DIMM, and then reconfigure the DIMM at a later time.
When this option is selected, a menu displays, showing the memory riser cards
in the system. Type the number of the memory riser card on which the DIMM is
plugged that you want to configure or deconfigure. An example of this menu is
shown below:
àð
MEMORY CONFIGURATION/DECONFIGURATION MENU
77. Enable/Disable Memory Repeat Gard: Currently Enabled
1. Memory riser card 1
2. Memory riser card 2
98. Return to Previous Menu
á
Note: If there is only one processor in the system, system memory may reside
on that processor assembly. In that case, you are given the option to select the
processor card from the list. After you select a processor card or a memory riser
card, a menu allowing the selection of a memory DIMM is displayed. An
example of this menu is shown below:
Chapter 3. Using the Service Processor47
ñ
àð
MEMORY CONFIGURATION/DECONFIGURATION MENU
77. Enable/Disable Memory Repeat Gard: Currently Enabled
1. 11.16() Configured by system2. 11.17() Configured by system
3. 11.18() Configured by system4. 11.19() Configured by system
5. 11.2() Configured by system6. 11.21() Configured by system
7. 11.22() Configured by system8. 11.23() Configured by system
9. 11.24() Configured by system 1. 11.25() Configured by system
11. 11.26() Configured by system 12. 11.27() Configured by system
13. 11.28() Configured by system 14. 11.29() Configured by system
15. 11.3() Configured by system 16. 11.31() Configured by system
98. Return to Previous Menu
Memory DIMMs are managed as a group.
Deconfiguring a DIMM will make the whole group unavailable.
>
á
Note: This table is built from vital product data collected during the last boot
sequence. The first time the system is powered up, or after the system's
non-volatile ram (NVRAM) has been erased, this table may be empty. The table
is rebuilt during the next boot into AIX.
The error status of the each memory DIMM is indicated by AB, where B indicates the
number of errors and A indicates the type of error according to the following table:
A status of 00 indicates that the memory DIMM has not had any errors logged
against it by the service processor.
To change the memory configuration, select the number of the memory DIMM. The
memory DIMM state will change from configured to deconfigured or from
deconfigured to configured. Memory DIMMs are managed as a group;
deconfiguring a DIMM makes the whole group of four DIMMs unavailable.
In the example menu shown above, each line shows two DIMMs and whether they
are configured.
The physical location of a DIMM is determined by
location (11 for riser card 1 and 10 for riser card 2) followed by the DIMM location on
the riser card, according to the following two illustrations.
48RS/6000 Enterprise Server Model H80 Series User's Guide
xx.yy
, where xx is the riser card
To enable or disable Memory Repeat Gard, use menu option 77 of the Processor
Configuration/Deconfiguration Menu.
Memory Riser Card 1 Memory DIMM Locations for Service Processor
Menus
15: 11.16
13: 11.20
11: 11.24
9: 11.28
7: 11.29
5: 11.25
3: 11.21
1: 11.17
16: 11.18
14: 11.22
12: 11.26
10: 11.30
8: 11.31
6: 11.27
4: 11.23
2: 11.19
Memory Riser Card 2 Memory DIMM Locations for Service Processor
Menus
15: 10.32
13: 10.36
11: 10.40
16: 10.34
14: 10.38
12: 10.42
9: 10.44
7: 10.45
5: 10.41
3: 10.37
1: 10.33
Chapter 3. Using the Service Processor
10: 10.46
8: 10.47
6: 10.43
4: 10.39
2: 10.35
49
Processor Card Memory DIMM Locations for Service Processor Menus
8: 3.48
7: 3.50
6: 3.52
5: 3.54
4: 3.55
3: 3.53
2: 3.51
1: 3.49
Power Control Network Utilities Menu
This menu allows the user to do a lamp test on the operator panels and display
or change the I/O type.
àð
POWER CONTROL NETWORK UTILITIES MENU
1. Lamp Test for all Operator Panels
2. Display I/O Type
3. Change I/O Type
98. Return to Previous Menu
1>
á
ñ
– Lamp Test for all Operator Panels
Selecting this option tests the operator panel indicators. When selected, the
indicators blink on and off for approximately 30 seconds.
– Display I/O Type
Nothing displays on the operator panel. After 20-30 seconds the Power
Control Network Utilities Menu redisplays on the console.
50RS/6000 Enterprise Server Model H80 Series User's Guide
– Change I/O Type
Use this option to change the I/O type of the primary I/O drawer after a
service action or configuration change if the I/O type is incorrect. If this
option is chosen, you are asked to make two entries.
1. When asked for the drawer number, type 1.
2. When asked for the I/O type, type 85.
If you enter an invalid value, a failure message displays on the console. The
Power Control Network Utilities Menu is then redisplayed on the console.
Chapter 3. Using the Service Processor51
Language Selection Menu
The service processor menus and messages are available in different languages.
This menu allows selecting languages in which the service processor and system
firmware menus and messages are displayed.
àð
LANGUAGE SELECTION MENU
1. English
2. Francais
3. Deutsch
4. Italiano
5. Espanol
6. Svenska
98. Return to Previous Menu
99. Exit from Menus
1>
á
ñ
Note: Your ASCII terminal must support the ISO-8859 character set to correctly
display languages other than English.
52RS/6000 Enterprise Server Model H80 Series User's Guide
Call-In/Call-Out Setup Menu
This menu is used to configure a modem for the service processor to use to support
the system:
àð
CALL-IN/CALL-OUT SETUP MENU
1. Modem Configuration Menu
2. Serial Port Selection Menu
3. Serial Port Speed Setup Menu
4. Telephone Number Setup Menu
5. Call-Out Policy Setup Menu
6. Customer Account Setup Menu
7. Call-Out Test
98. Return to Previous Menu
99. Exit from Menus
1>
á
ñ
Modem Configuration Menu, see “Modem Configuration Menu” on page 54.
Serial Port Selection Menu, see “Serial Port Selection Menu” on page 55.
Serial Port Speed Setup Menu, see “Serial Port Speed Setup Menu” on
page 56.
Telephone Number Setup Menu, see “Telephone Number Setup Menu” on
page 57.
Call-Out Policy Setup Menu, see “Call-Out Policy Setup Menu” on page 59.
Customer Account Setup Menu, see “Customer Account Setup Menu” on
page 60.
Call-Out Test Menu tests the configuration after the modem is installed and
configured correctly.
Chapter 3. Using the Service Processor53
Modem Configuration Menu
The first two lines of the Modem Configuration Menu are status lines showing the
current selections. Selections are made in the two sections labeled Modem Ports and
Modem Configuration File Name. Select the serial port that you want to activate and
then select the modem configuration file for the modem on the port. If you want to
set up all of the serial ports with modems, make your selections one port at a time.
Note: Modem configuration is only supported on serial port 1 (S1) and serial port 2
(S2).
àð
Modem Configuration Menu
Port 1 Modem Configuration File Name:
Port 2 Modem Configuration File Name:
To make changes, First select the port and then the configuration file
name
Modem Ports:
1. Serial port 1
2. Serial port 2
Modem Configuration File Name:
5. none9. modem_z_sp
6. modem_f_sp1. modem_m1_sp
7. modem_f_sp11. modem_m_sp
8. modem_f1_sp12. modem_m1_sp
3. Save configuration to NVRAM and Configure modem
98. Return to Previous Menu
>
á
For information on choosing a modem configuration file, see “Sample Modem
Configuration Files” on page 177, and “Transfer of a Modem Session” on page 183.
54RS/6000 Enterprise Server Model H80 Series User's Guide
ñ
Serial Port Selection Menu
This menu allows you to enable and/or disable the call-in and call-out functions of
each serial port in any combination.
àð
Serial Port Selection Menu
1. Serial Port 1 Call-Out:3. Serial Port 1 Call-In:
Currently DisabledCurrently Disabled
2. Serial Port 2 Call-Out:4. Serial Port 2 Call-In:
Currently DisabledCurrently Disabled
98. Return to Previous Menu
á
Call-in and ring indicate power-on cannot be enabled at the same time. If ring
indicate power on is already enabled and you try to enable call-in, a message
prompts you for confirmation. Refer to the message displayed on the screen.
ñ
Chapter 3. Using the Service Processor55
Serial Port Speed Setup Menu
This menu allows you to set serial port speed to enhance terminal performance or to
accommodate modem capabilities.
àð
Serial Port Speed Setup Menu
1. Serial Port 1 Speed:
Currently 96
2. Serial Port 2 Speed:
Currently 96
3. Serial Port 3 Speed:
Currently 96
98. Return to Previous Menu
1>
á
A serial port speed of 9600 baud or higher is recommended. Valid serial port
speeds are shown below:
56RS/6000 Enterprise Server Model H80 Series User's Guide
Telephone Number Setup Menu
Use this menu to set or change the telephone numbers for reporting a system failure.
àð
Telephone Number Setup Menu
1. Service Center Telephone Number:
Currently Unassigned
2. Customer Administration Center Telephone Number:
Currently Unassigned
3. Digital Pager Telephone Number:
Currently Unassigned
4. Customer Voice Telephone Number:
Currently Unassigned
5. Customer System Telephone Number:
Currently Unassigned
98. Return to Previous Menu
1>
á
ñ
Chapter 3. Using the Service Processor57
Service Center Telephone Number is the number of the service center
computer. The service center usually includes a computer that takes calls from
servers with call-out capability. This computer is referred to as the "catcher."
The catcher expects messages in a specific format to which the service
processor conforms.
For more information about the format and catcher computers, refer to the
README file in the AIX /usr/samples/syscatch directory. Contact your service
provider for the correct service center telephone number to enter. Until you have
that number, leave this field unassigned.
Customer Administration Center Telephone Number is the number of the
System Administration Center computer (catcher) that receives problem calls
from servers. Contact your system administrator for the correct telephone
number to enter here. Until you have that number, leave this field unassigned.
Digital Pager Telephone Number is the number for a numeric pager carried by
someone who responds to problem calls from your server. Contact your
administration center representative for the correct telephone number to enter.
For test purposes, use a test number, which you can change later. See note on
page 73.
Customer Voice Telephone Number is the telephone number of a phone near
the server or answered by someone responsible for the server. This is the
telephone number left on the pager for callback. For test purposes, use a test
number, which you can change later.
Customer System Telephone Number is the telephone number to which your
server's modem is connected. The service or administration center
representatives need this number to make direct contact with your server for
problem investigation. This is also referred to as the
58RS/6000 Enterprise Server Model H80 Series User's Guide
call-in
phone number.
Call-Out Policy Setup Menu
Callout settings can be set using the following menu:
àð
CALL-OUT POLICY SETUP MENU
1. Call-Out policy (First/All):
Currently First
2. Remote timeout, (in seconds):
Currently 12
3. Remote latency, (in seconds):
Currently 2
4. Number of retries:
Currently 2
98. Return to Previous Menu
1>
á
Call-Out policy can be set to first or all. If call-out policy is set to first, the
service processor stops at the first successful call-out to one of the following
numbers in the order listed:
ñ
1. Service Center
2. Customer Administrative Center
3. Pager
If call-out policy is set to all, the service processor attempts a call-out to all of
the following numbers in the order listed:
1. Service Center
2. Customer Administrative Center
3. Pager
Remote timeout and Remote latency are functions of your service provider's
catcher computer. Either use the defaults or contact your service provider for
recommended settings.
Number of retries is the number of times you want the server to retry calls that
failed to complete.
Chapter 3. Using the Service Processor59
Customer Account Setup Menu
This menu allows users to enter information that is specific to their account.
àð
Customer Account Setup Menu
1. Customer Account Number:
Currently Unassigned
2. Customer RETAIN Login userid:
Currently Unassigned
3. Customer RETAIN login password:
Currently Unassigned
98. Return to Previous Menu
1>
á
Customer Account Number is assigned by your service provider for
record-keeping and billing. If you have an account number, enter it. Otherwise,
leave this field unassigned.
Customer RETAIN Login UserID and Customer RETAIN Login Password
apply to a service function to which your service provider may or may not have
access. Leave these fields unassigned if your service provider does not use
RETAIN.
ñ
Service Processor Procedures in Service Mode
When the system is in service mode, the following service-processor parameters are
suspended:
When service mode is exited, the service processor parameters revert to the
customer settings.
60RS/6000 Enterprise Server Model H80 Series User's Guide
Service Processor Functions
This section discusses some of the service processor functions in greater detail. The
service processor supports the functions listed in the following table.
Built-in FunctionsInitialization and TestService Processor BIST
System Chip Initialization
Error Data CollectionBIST/POST errors and status
Isolation to Field Replaceable
Unit (FRU) on failure
ConfigurationCPU Complex validation
VPD Collection
System ManagementReset and Reboot on System
Firmware failure
Reboot on system failure
Local User FunctionUser InterfaceLocal async console
Text based menus with NLS
Operator Panel messages
Power and MiscellaneousPower On/Off
Configurable Reboot Policy
Status and Data AccessVPD
Error data (SP)
Error data (system)
Service Processor Setup
Utilities
Remote User FunctionsCall-Out (Call Home)
Reporting
Call-InPower-on through ring-indicate
Passwords
Phone numbers
Language (NLS) selection
Call-In/Call-Out enable/disable
System Name
Modem Configuration
OS termination
Boot failure
Surveillance failure
Checkstop
Machine check
Identify system by name
Password/security check
Console mirroring
Chapter 3. Using the Service Processor61
System Power-On Methods
This section discusses the following system power-on methods:
Power-on Switch
Service Processor Menus
Privileged users can power-on the system by selecting the System Control
Power Menu option from the main menu and then selecting the Power-on
System option from the system power control menu. General users should select
Power-on System on the general user menu.
Remote Power-on via Ring-Indicate Signal
The server automatically powers on when it detects a "ring indicate" signal from
a modem attached to serial port 1 (S1) or serial port 2 (S2).
A remote user can call the server to activate ring detection by the modem.
Listen for a few more rings than the threshold number for starting the system.
The system powers on without answering the call.
Unattended start mode - refer to Enable/Disable Unattended Start Mode on
page 39.
The Service Processor can be enabled to recover from the loss of ac power (see
Enable/Disable Unattended Power-On Mode in the SYSTEM POWER CONTROL
MENU). When ac power is restored, the system returns to the then current
power state at the time ac loss occurred. For example, if the system was
powered-on when ac loss occurred, it reboots/restarts when power is restored. If
the system was powered-off when ac loss occurred, it remains off when power is
restored.
62RS/6000 Enterprise Server Model H80 Series User's Guide
Timed power-on - refer to the shutdown -t command on servers using AIX.
Working in conjunction with AIX, the Service Processor in your server can
operate a timer, much like the wake-up timer on your clock radio. You can set
the timer so that your server powers on at a certain time after shutting down.
The timer is battery-operated, so power interruptions that occur while the server
is off do not affect its accuracy. Refer to the AIX shutdown -t command for
details on setting the timer.
Note: If an ac power loss is in progress when the Timed Power-On attempt
occurs, the server is not able to power on when ac power is restored.
Follow-up to a Failed Boot Attempt
The service processor initiates a power-on sequence if a failed boot attempt is
detected (due to a hardware or software failure).
Fast/Slow Boot (IPL) Capabilities
Using the service processor menus, you can select the IPL type, mode, and
speed of your boot capabilities.
Attention: Selecting fast IPL results in several diagnostic tests being skipped
and a shorter memory test being run.
Chapter 3. Using the Service Processor63
Service Processor Reboot/Restart Recovery
Reboot
reset or power-on. The boot process ends when control passes to the operating
system process.
Restart
reinitialized. Restart must follow a successful reboot.
describes bringing the system hardware back up; for example, from a system
describes activating the operating system after the system hardware is
Boot (IPL) Speed
When the server enters reboot recovery, slow IPL is automatically started, which
gives the POST an opportunity to locate and report any problems that may otherwise
be untested.
Failure During Boot Process
During the boot process, either initially after system power-on or upon reboot after a
system failure, the service processor monitors the boot progress. If progress stops,
the service processor can reinitiate the boot process (reboot) if enabled to do so.
The service processor can re-attempt this process according to the number of retries
selected in the Reboot/Restart Policy Setup Menu.
Failure During Normal System Operation
When the boot process completes and control transfers to the operating system
(OS), the service processor can monitor operating system activity (see the Set
Surveillance Parameters option in the SERVICE PROCESSOR SETUP MENU). If
OS activity stops due to a hardware- or software-induced failure, the service
processor can initiate a reboot/restart process based on the settings in the Service
Processor Reboot/Restart Policy Setup Menu and the OS automatic restart settings
(see the operating system documentation).
If you are using the AIX operating system, the menu item under SMIT for setting the
restart policy is Automatically Reboot After Crash (True/False). The default is False.
When the setting is True, and if the service processor parameter "Use OS-Defined
Restart Policy" is Yes (the default), Service Processor takes over for AIX to
reboot/restart after a hardware or surveillance failure.
64RS/6000 Enterprise Server Model H80 Series User's Guide
Service Processor Reboot/Restart Policy Controls
The operating system's automatic restart policy (see operating system
documentation) indicates the operating system response to a system crash. The
service processor can be instructed to refer to that policy by the Use OS-Defined
Restart Policy setup menu.
If the operating system has no automatic restart policy, or if it is disabled, then the
service processor-restart policy can be controlled from the service processor menus.
Use the Enable Supplemental Restart Policy selection.
Use OS-Defined restart policy - The default setting is Yes. This causes the service
processor to refer to the OS Automatic Restart Policy setting and take action (the
same action the operating system would take if it could have responded to the
problem causing the restart).
When this setting is No, or if the operating system did not set a policy, the service
processor refers to Enable supplemental restart policy for its action.
Enable supplemental restart policy - The default setting is No. If set to Yes, the
service processor restarts the server when the operating system loses control and
either:
The Use OS-Defined restart policy is set to No.
OR
The Use OS-Defined restart policy is set to Yes and the operating system has
no automatic restart policy.
Refer to “Service Processor Reboot/Restart Recovery” on page 64.
Chapter 3. Using the Service Processor65
The following table describes the relationship among the operating system and
service processor restart controls:
OS Automatic
reboot/restart
after crash setting
Service processor
to use OS-Defined
restart policy?
Service Processor
Enable
supplemental restart
System response
policy?
NoneNoNo
NoneNoYesRestarts
NoneYes
NoneYes
2
False
2
False
2
False
2
False
1
1
NoNo
NoYesRestarts
1
Yes
1
Yes
TrueNoNo
TrueNoYesRestarts
TrueYes
TrueYes
1
Service processor default
2
AIX default
1
1
1
1
No
YesRestarts
1
1
No
Yes
1
1
No
Restarts
YesRestarts
66RS/6000 Enterprise Server Model H80 Series User's Guide
System Firmware Updates
This section describes the methods of reprogramming the system firmware. All types
of firmware that can be reprogrammed are updated at the same time. They are as
follows:
System Power Control Network programming
Service Processor programming
IPL programming
Run-time Abstraction Services
Three user-initiated processes can be used to update the firmware in the system:
Service Processor menu
Service Aids (available in AIX)
AIX command line
A fourth method, called recovery mode, is initiated automatically by the service
processor if the service processor programming has been corrupted. This method is
described later in this section.
Determining the Level of Firmware on the System
Note: This information may be superseded by the information that is available on
the RS/6000 Support Web site shown below. Always check the Web site for the
latest images and instructions for checking the firmware level.
The Web address is:http://www.rs6.ibm.com/support/micro
The firmware level is denoted by XXYYMMDD, where XX = model designation, YY =
year, MM = month, and DD = day of the release.
The firmware level is displayed in two places on the system:
On the AIX command line, enter lscfg -vp]grep ROM. The last line displayed is
the firmware level of the system.
At the top of the Service Processor main menu.
Update Diskette Images and Instructions
Update diskette images and instructions are available from the RS/6000 Support
Web site. The Web address is:http://www.rs6.ibm.com/support/micro
Chapter 3. Using the Service Processor67
It is strongly recommended that you create a set of firmware update diskettes from
the RS/6000 Support Web site as soon as possible after a system is installed. As
noted below, the diskette images can be downloaded to any personal computer or
RS/6000 with Internet access. This method is the only way to acquire a set of
firmware update diskettes; they cannot be created using the Service Processor
menus or using AIX.
Retain and store the latest firmware diskettes each time the firmware gets updated in
the event that the firmware gets corrupted and must be reloaded.
Download the detailed instructions, as well as the latest flash images, from this Web
site. Before doing a system firmware update, check the Web site for the latest code
and images. Do not attempt a firmware update if you are unsure of the image you
have, or of the procedure required for the update.
The images are available on the Web site in either DOS format or AIX (backup)
format:
If downloading to a PC, download the DOS image and follow the instructions.
The diskettes that are created will be in AIX (backup) format.
If downloading to an RS/6000, follow the instructions to create an image on disk
drives or on diskettes.
Updating System Firmware From the Service Processor Menus
The service processor menus are available while the system is powered off. As a
privileged user, from the service processor main menu, select Service ProcessorSetup, then select Reprogram Flash EPROM Menu. The update process requests
update diskettes as needed.
Updating System Firmware from the AIX Service Aids
Note: This service aid is supported only in On-Line Diagnostics.
Refer to the detailed instructions that are provided on the Web site with the latest
image.
68RS/6000 Enterprise Server Model H80 Series User's Guide
Updating System Firmware from the AIX Command Line
Refer to the detailed instructions that are provided on the Web site with the latest
image.
Note: The update process can take up to sixty minutes, and the system will always
reboot itself during the update process.
Recovery Mode
If the service processor detects that its programming has been corrupted when it is
powering up to standby mode, it will enter recovery mode. In recovery mode, it will
ask for the firmware update diskettes by putting eight-digit codes on the operator
panel.
The eight-digit codes are as follows:
CodeAction
A1FD 0000System firmware has been corrupted and must be reflashed.
A1FD 0001Insert update diskette 1.
A1FD 0002Insert update diskette 2.
A1FD 0003Insert update diskette 3.
Notes:
1. If the wrong diskette is inserted at any time, or if the diskette is left in the drive
after it has been read, B1FD 1F is displayed, indicating that the wrong diskette
is in the drive.
2. If B1FD 1A is displayed at any time during the process, the service processor
must be reset by removing and reconnecting ac power.
After the update process is complete, the service processor resets itself and goes
through a complete reboot.
Chapter 3. Using the Service Processor69
Configuring and Deconfiguring Processors or Memory
All failures that crash the system with a machine check or check stop, even if
intermittent, are reported as a diagnostic callout for service repair. To prevent the
recurrence of intermittent problems and improve the availability of the system until a
scheduled maintenance window, processors and memory modules with a failure
history are marked "bad" to prevent their being configured on subsequent boots.
A processor or memory module is marked "bad" under the following circumstances:
A processor or memory module fails built-in self test (BIST) or power-on self test
(POST) testing during boot (as determined by the Service Processor).
A processor or memory module causes a machine check or check stop during
runtime, and the failure can be isolated specifically to that processor or memory
module (as determined by the processor runtime diagnostics in the Service
Processor).
A processor or memory module reaches a threshold of recovered failures that
results in a predictive callout (as determined by the processor runtime
diagnostics in the Service Processor).
During boot time, the Service Processor does not configure processors or memory
modules that are marked "bad," much in the same way that it would deconfigure
them for BIST/POST failures.
If a processor is deconfigured, the processor remains offline for subsequent reboots
until the faulty processor is replaced. The Repeat Gard function also provides the
users with the option of manually deconfiguring a processor, or re-enabling a
previously deconfigured processor. For information on how to configure or
deconfigure a processor, see the Processor Configuration/Deconfiguration Menu on
page 46.
You can enable or disable CPU Repeat Gard or Memory Repeat Gard using the
Processor Configuration/Deconfiguration Menu, which is a submenu under the
System Information Menu.
Run-Time CPU Deconfiguration (CPU Gard)
L1 instruction cache recoverable errors, L1 data cache correctable errors, and L2
cache correctable errors are monitored by the processor runtime diagnostics (PRD)
code running in the Service Processor. When a predefined error threshold is met, an
error log with warning severity and threshold exceeded status is returned to AIX. At
the same time, PRD marks the CPU for deconfiguration at the next boot. AIX will
70RS/6000 Enterprise Server Model H80 Series User's Guide
attempt to migrate all resources associated with that processor to another processor
and then stop the defective processor.
Correctable errors in system memory are monitored and tracked. When a predefined
error threshold is met, an error log with warning severity is returned to AIX. A
memory page will be deallocated by AIX.
Service Processor System Monitoring - Surveillance
Surveillance is a function in which the service processor monitors the system, and
the system monitors the service processor. This monitoring is accomplished by
periodic samplings called
heartbeats
.
Surveillance is available during two phases:
System firmware bringup (automatic)
Operating system runtime (optional)
System Firmware Surveillance
System firmware surveillance is automatically enabled during system power-on. It
cannot be disabled by the user.
If the service processor detects no heartbeats during system IPL (for 7 minutes), it
cycles the system power to attempt a reboot. The maximum number of retries is set
from the service processor menus. If the fail condition persists, the service
processor leaves the machine powered on, logs an error, and displays menus to the
user. If Call-out is enabled, the service processor calls to report the failure and
displays the operating-system surveillance failure code on the operator panel.
Chapter 3. Using the Service Processor71
Operating System Surveillance
Operating system surveillance provides the service processor with a means to detect
hang conditions, as well as hardware or software failures, while the operating system
is running. It also provides the operating system with a means to detect a service
processor failure caused by the lack of a return heartbeat.
Operating system surveillance is not enabled by default, allowing you to run
operating systems that do not support this service processor option.
You can also use Service Processor Menus and Service Processor Service Aids to
enable or disable operating system surveillance.
For operating system surveillance to work correctly, you must set three parameters:
Surveillance enable/disable
Surveillance interval
The maximum time the service processor should wait for a heartbeat from the
operating system before timeout.
Surveillance delay
The length of time to wait from the time the operating system is started to when
the first heartbeat is expected.
Surveillance does not take effect until the next time the operating system is started
after the parameters have been set.
If desired, you can initiate surveillance mode immediately from Service Aids. In
addition to the three options above, a fourth option allows you to select immediate
surveillance, and rebooting of the system is not necessarily required.
If operating system surveillance is enabled (and system firmware has passed control
to the operating system), and the service processor does not detect any heartbeats
from the operating system, the service processor assumes the system is hung and
takes action according to the reboot/restart policy settings. See “Service Processor
Reboot/Restart Recovery” on page 64.
If surveillance is selected from the service processor menus which are only available
at bootup, then surveillance is enabled by default as soon as the system boots.
From Service Aids, the selection is optional.
72RS/6000 Enterprise Server Model H80 Series User's Guide
Call-Out (Call-Home)
The service processor can call out (Call-Home) when it detects one of the following
conditions:
System firmware surveillance failure
Operating system surveillance failure (if supported by operating system)
Restarts
Critical hardware failure
Abnormal operating system termination
To enable the call-out feature, do the following:
1. Connect a modem to any serial port.
2. Set up the following using the Service Processor Menus or Diagnostic Service
Aids:
Enable call-out for the serial port where the modem is connected.
Enter the modem configuration file name.
Set up site-specific parameters (phone numbers for call-out, call-out policy,
number of call-out retries, and so on).
3. To call out before restart, set Call-out before restart to ENABLED from the
Reboot/Restart Policy Setup menu.
Note: Some modems, such as IBM 7857-017, are not designed for the paging
function. Although they can be used for paging, they will return an error
message when they do not get the expected response from another modem.
Therefore, even though the paging was successful, the error message will
cause the Service Processor to retry, continuing to place pager calls for the
number of retries specified in the Call-Out Policy Setup Menu. These retries
result in redundant pages.
Chapter 3. Using the Service Processor73
Console Mirroring
Console mirroring allows a user on one serial port to monitor the service processor
activities on another serial port. This can be done on locally attached ASCII terminals
or terminals remotely attached through modems. Console mirroring is supported on
any combination of locally or remotely attached monitors.
The serial port from which console mirroring is enabled is called the
The mirror port is determined when keyboard input is detected from one of the other
two serial ports. From this point forward, the service processor sends information to
the active port and the mirror port only. Console mirroring ends when the service
processor releases control of the serial ports to the system firmware.
Console mirroring is supported on serial port 1 (S1), serial port 2 (S2), and serial port
3 (S3). Remote attachment through modem is supported on serial port 1 (S1) and
serial port 2(S2).
primary port
.
System Configuration
The following describes the configuration for console mirroring:
Service Processor
Modem connected to one serial port and enabled for incoming calls
Local ASCII terminal connected to the other serial port. This local terminal can
be connected directly to your server or connected through another modem.
There are two scenarios in which console mirroring can be started:
Remote session first, then local session added:
1. Remote session already in progress.
2. Remote user uses service processor menus to enable console mirroring,
allowing both consoles to be active.
Local session first, then remote session added:
1. Local session is already in progress.
2. The service processor receives a call from the remote user.
3. The local user selects the option to enable console mirroring. The service
processor immediately begins mirroring service processor menus.
74RS/6000 Enterprise Server Model H80 Series User's Guide
Service Processor Error Logs
The service processor error logs, an example of which is shown below, contain error
conditions detected by the service processor.
àð
Error Log
1. 11/3/9919:41:56 Service Processor Firmware Failure
B14999
Enter error number for more details.
Press Return to continue, or 'x' to return to menu.
Press "C" to clear error log, any other key to continue. >
á
Note: The time stamp in this error log is Coordinated Universal Time (CUT), which
is also referred to as Greenwich Mean Time (GMT). AIX error logs have additional
information available and can time stamp with local time.
Entering an error number provides nine words of system reference code (SRC) data;
an example screen is shown below.
76RS/6000 Enterprise Server Model H80 Series User's Guide
LCD Progress Indicator Log
The following is an example of the LCD progress indicator log. It shows the types of
entries that may appear in the log, and is for example purposes only.
The progress indicator codes are listed from top (latest) to bottom (oldest).
This section provides a high-level flow of the phases of the Service Processor (SP).
SP Power Applied
Pre-Standby Phase
Standby Phase SP Menus Available
Bring-Up Phase SMS Menus Available
Runtime Phase AIX Login Prompt Available
Pre-Standby Phase: This phase is entered when the server is connected to a
power source. The server may or may not be fully powered on. This phase is exited
when the Power-On Self-Tests (POSTs) and configurations tasks are completed.
The Pre-Standby phase components are:
Service Processor Initialization - Service Processor performs any necessary
hardware and software initializations.
Service Processor POST - Service Processor conducts Power-On Self-Tests on
its various work and code areas.
Service Processor Unattended Start Mode Checks - To assist fault recovery. If
unattended start mode is set, the Service Processor automatically reboots the
server. Service Processor does not wait for user input or power-on command,
but moves through the phase and into the Bring-Up Phase. Access SMS menus
or Service Processor menus to reset the unattended start mode.
78RS/6000 Enterprise Server Model H80 Series User's Guide
Standby Phase: The standby phase can be reached in either of two ways:
With the server off and power connected (the normal path), recognized by OK in
the LCD display.
OR
With the server on after an operating system fault, recognized by an 8-digit code
in the LCD display.
In the Standby phase, the SP takes care of some automatic duties and is available
for menus operation. The SP remains in the standby phase until a power-on request
is detected.
The Standby phase components are as follows:
Modem Configuration
Service Processor configures the modem (if installed) so that incoming calls can
be received, or outgoing calls can be placed.
Dial In
Monitor incoming phone line to answer calls, prompt for a password, verify the
password, and remotely display the standby menu. The remote session can be
mirrored on the local ASCII console if the server is so equipped and the user
enables this function.
Menus
The Service Processor menus are password-protected. Before you can access
them, you need either General User (Power-On Password or POP) or Privileged
User (Privileged Access Password or PAP).
Bring-Up Phase: This phase is entered upon power-on, and exited upon loading
of the operating system.
The bring-up phase components are as follows:
Retry Request Check
The Service Processor checks to see if the previous boot attempt failed. If the
specified number of failures are detected, the SP displays an error code and
places an outgoing call to notify an external party if the user has enabled this
option.
Dial Out
The Service Processor can dial a preprogrammed telephone number in the event
of an IPL failure. The Service Processor issues an error report with the last
reported IPL status indicated and any other available error information.
Chapter 3. Using the Service Processor79
Update Operator Panel
The Service Processor displays Operator Panel data on the ASCII terminal if a
remote connection is active.
Environmental Monitoring
The Service Processor provides expanded error recording and reporting.
System Firmware Surveillance (Heartbeat Monitoring)
The SP monitors and times the interval between system firmware heartbeats.
Responding to System Processor Commands
The SP responds to any command issued by the system processor.
Runtime Phase: This phase includes the tasks that the SP performs during
steady-state execution of the operating system.
Environmental Monitoring
The SP monitors voltages, temperatures, and fan speeds (on some servers).
Responding to System Processor Commands
The SP responds to any command issued by the system processor.
Run-Time Surveillance (Heartbeat Monitoring)
If the device driver is installed and surveillance enabled, the Service Processor
monitors the system heartbeat. If the heartbeat times out, the Service Processor
places an outgoing call. This is different from the bring-up phase scenario,
where the specified number of reboot attempts are made before placing an
outgoing call.
80RS/6000 Enterprise Server Model H80 Series User's Guide
Chapter 4. Using System Management Services
Use the System Management Services menus to view information about your system
and to perform tasks such as setting a password, changing the boot list, and setting
the network parameters.
The text-based Open Firmware command line (OK prompt), which is available from
the System Management Services Utilities menu, allows you to configure certain
adapters.
To start the System Management Services, do the following:
1. Turn on or restart the system.
2. Watch for the banner screen to appear on the display. If an ASCII terminal is
being used, the banner screen will be lines of text in the form of (RS/6000
RS/6000 RS/6000 ....). If a color monitor is attached to a graphics adapter and a
keyboard is connected to the primary I/O drawer, the banner screen will be a
graphical logo.
3. After the banner screen is displayed, the POST indicator words, memory,
keyboard, network, scsi and speaker appear across the bottom of the screen.
Press the numeric 1 key after the word keyboard appears, and before the last
word speaker appears.
For more information on the POST indicator words, refer to “POST Indicators” on
page 6.
Note: The System Management Services can also be started using the Service
Processor Boot Mode Menu. See "Boot Mode Menu" on page 43.
Chapter 4. Using System Management Services81
After the System Management Services starts, the following screen displays:
àð
Utilities
1 Password Utilities
2 Display Error Log
3 Remote Initial Program Load Setup
4 SCSI Utilities
5 Select Console
6 MultiBoot
7 Select Language
8 OK Prompt
.------.
|X=Exit|
------'
===>
á
ñ
82RS/6000 Enterprise Server Model H80 Series User's Guide
Each option on the System Management Services Utilities menu is described in this
section.
Password Utilities
Entering this selection permits access to the options shown on the following screen:
Set Privileged-Access Password: The privileged-access password protects against
the unauthorized starting of the system programs. When this option is selected,
another screen displays that allows the setting of a password.
You can set the password from either the Service Processor or System Management
Services menus. After the password is set, both the Service Processor and System
Management Services automatically use the same password. The password is
limited to eight characters.
Remove Privileged-Access Password: If you previously had set a
privileged-access password and want to remove it, select Remove
Privileged-Access Password.
Unattended Start Mode <ON>: This selection is used to enable or disable
Unattended Start Mode. Use this option to instruct the Service Processor to restore
the power-state of the server after a temporary power failure, which is necessary on
servers that require automatic power-on after a power failure. The default setting is
ON.
Chapter 4. Using System Management Services83
Display Error Log
A screen similar to the following displays when you select this option. You can view
or clear your system error log.
Note: The time stamp in this error log is Coordinated Universal Time (CUT), which
is also referred to as Greenwich Mean Time (GMT). AIX error logs have more
information available and can time stamp with your local time.
84RS/6000 Enterprise Server Model H80 Series User's Guide
Loading...
+ hidden pages
You need points to download manuals.
1 point = 1 manual.
You can buy points or you can get point for every manual you upload.