Parts ofthe productmay bederived fromBerkeley BSDsystems, licensedfrom theUniversity of California.UNIX isa registeredtrademark in
the U.S.and othercountries, exclusivelylicensed throughX/Open Company, Ltd.
Sun, SunMicrosystems, theSun logo,AnswerBook2, docs.sun.com,Solstice DiskSuite,Sun FastEthernet, SunNetra, SunEnterprise, Sun
Enterprise Volume Manager, SunStorEdge, SunUltra, OpenBoot, Solaris,and UltraSparcare trademarks,registeredtrademarks, or service
marks ofSun Microsystems,Inc. inthe U.S.and othercountries. All SPARC trademarksare usedunder licenseand aretrademarks or registered
trademarks ofSPARCInternational, Inc.in theU.S. andother countries. Productsbearing SPARC trademarksare basedupon anarchitecture
developed bySun Microsystems,Inc.
The OPENLOOK andSun™ GraphicalUser Interfacewas developed bySun Microsystems,Inc. forits usersand licensees. Sunacknowledges
the pioneeringefforts ofXerox inresearchingand developing theconcept ofvisual orgraphical user interfaces for thecomputer industry. Sun
holds anon-exclusive licensefrom Xeroxto theXerox GraphicalUser Interface,which licensealso covers Sun’slicensees whoimplement OPEN
LOOK GUIsand otherwisecomply withSun’s writtenlicense agreements.
Federal Acquisitions:Commercial Software—GovernmentUsers Subjectto StandardLicense Terms andConditions.
DOCUMENTATION ISPROVIDED “ASIS” AND ALLEXPRESS ORIMPLIED CONDITIONS,REPRESENTATIONSAND WARRANTIES,
INCLUDING ANYIMPLIED WARRANTY OFMERCHANTABILITY, FITNESSFOR A PARTICULARPURPOSE OR NON-INFRINGEMENT,
ARE DISCLAIMED,EXCEPT TOTHE EXTENTTHATSUCH DISCLAIMERSARE HELD TOBE LEGALLY INVALID.
Copyright 2002Sun Microsystems,Inc., 4150Network Circle,Santa Clara,CA 95054Etats-Unis. Tousdroitsréservés.
Ce produit oudocument estdistribué avecdes licencesqui en restreignent l’utilisation, lacopie, ladistribution, etla décompilation. Aucune
partie dece produitou documentne peutêtre reproduitesous aucuneforme, parquelque moyen que ce soit,sans l’autorisationpréalable et
écrite deSun etde sesbailleurs delicence, s’il yen a.Le logicieldétenu par des tiers, etqui comprendla technologierelative auxpolices de
caractères,est protégépar un copyrightet licenciépar desfournisseurs deSun.
provided thatexisting copyrightnotices are retained in allcopies andthat thisnotice isincluded verbatim inanydistributions. Nowritten
agreement, license,or royaltyfee isrequired forany ofthe authorized uses. Modifications tothis softwaremay becopyrighted by theirauthors
and neednot followthe licensingterms describedhere, providedthat the newterms areclearly indicated onthe firstpage ofeach file where
they apply.
IN NOEVENT SHALLTHE AUTHORSOR DISTRIBUTORSBE LIABLETO ANYPARTY FOR DIRECT, INDIRECT,SPECIAL, INCIDENTAL,
OR CONSEQUENTIALDAMAGES ARISING OUTOF THEUSE OFTHIS SOFTWARE, ITS DOCUMENTATION,OR ANY DERIVATIVES
THEREOF, EVEN IF THEAUTHORS HAVE BEENADVISED OFTHE POSSIBILITY OF SUCH DAMAGE.
This isscotty, asimple tclinterpreter withsome specialcommands toget information aboutTCP/IP networks.Copyright (c)1993, 1994, 1995, J.
Schoenwaelder,TU Braunschweig,Germany,Institute forOperating Systemsand ComputerNetworks. Permission touse, copy, modify, and
distribute thissoftware andits documentationfor anypurpose andwithout fee ishereby granted,provided thatthis copyrightnotice appears
in allcopies. The Universityof Braunschweigmakes norepresentations aboutthe suitabilityof this softwarefor anypurpose. It isprovided “as
is" withoutexpress orimplied warranty.
Contents
Prefacevi
Before You Read This Bookvi
How This Book Is Organizedvi
Using UNIX Commandsvii
Typographic Conventionsvii
Shell Promptsvii
Related Documentationviii
Accessing Sun Documentation Onlineviii
Sun Welcomes Your Commentsix
1.Configuring DR1
DR Models2
Enhancements in DR Model 3.03
Where to Execute DR Commands3
Requirements for Multipathing in DR 3.03
▼To Enable the Kernel Cage7
▼To Set Permanent Driver Parameters for Network Drivers7
▼To Enable Device Suspension for the soc and pln Drivers8
▼To Specify an Unsafe Driver List8
▼To Make an Unsupported Tape Device Detach-Safe9
Preparing for DR Detach Operations9
Configuration Changes During DR Operations10
Controlling Forcible Conditions that Affect System Quiescence10
▼To Manually Suspend a Suspend-Unsafe Device11
▼To Force a System Quiescence12
Target Memory Constraints12
Processors13
Network Devices13
Remote DR Communication14
▼To Troubleshoot a Connection Failure During a Solaris 9 (DR Model3.0)
Operation14
vSun Enterprise 10000 DR Configuration Guide • May 2002
Preface
This guide describes the domain-side configuration of the Sun Enterprise 10000
server Dynamic Reconfiguration (DR) feature. For information about how to use
these features, refer to the appropriate document listed in “Related Documentation”
on page viii.
Before You Read This Book
This guide is intended for the Sun Enterprise 10000 system administrator who has a
working knowledge of UNIX® systems, particularly those based on the Solaris™
operating environment. If you do not have such knowledge, first read the Solaris
user documentation provided with this system, and consider UNIX system
administration training.
Also read and be familiar with the TCP/IP and Data Communications AdministrationGuide that is provided with your server.
How This Book Is Organized
This guide contains one chapter that describes how to configure and reconfigure a
Sun Enterprise 10000 domain before and after a DR operation.
vi
Using UNIX Commands
This document does not contain complete information on basic UNIX commands
and procedures such as shutting down the system, booting the system, and
configuring devices. See the Solaris software documentation that you received with
your system for this information.
Typographic Conventions
Typeface or
Symbol
AaBbCc123The names of commands, files,
AaBbCc123
AaBbCc123Book titles, new words or terms,
MeaningExamples
Edit your .login file.
and directories; on-screen
computer output
What you type, when
contrasted with on-screen
computer output
words to be emphasized.
Replace command-line
variables with real names or
values.
Use ls -a to list all files.
% You have mail.
% su
Password:
Read Chapter 6 in the User’s Guide.
These are called class options.
To delete a file, type rm filename.
Shell Prompts
ShellPrompt
C shellmachine-name%
C shell superusermachine-name#
Bourne shell and Korn shell$
Bourne shell and Korn shell superuser#
viiSun Enterprise 10000 DR Configuration Guide • May 2002
Related Documentation
ApplicationTitlePart Number
UserSun Enterprise 10000 Dynamic
Reconfiguration User Guide
Sun Enterprise 10000 SSP 3.5 User Guide806-7613
System Administration Guide: IP Services806-4075
Sun StorEdge Traffic Manager Software
Installation and Configuration Guide
Sun Enterprise 10000 InterDomain
Networks User Guide
ReferenceSun Enterprise 10000 Dynamic
Reconfiguration Reference Manual
Sun Enterprise 10000 SSP 3.5 Reference
Manual
Sun Enterprise 10000 IDN Configuration
Guide
Sun Enterprise 10000 IDN Error Messages806-5231
TCP/IP and Data Communications
Administration Guide
Installation and Release
Notes
Sun Enterprise 10000 SSP 3.5 Installation
Guide and Release Notes
816-3627
816-1420
806-4131
806-7617
806-7614
806-5230
806-4003
806-7615
Accessing Sun Documentation Online
A broad selection of Sun system documentation is located at:
A complete set of Solaris documentation and many other titles are located at:
http://docs.sun.com
Prefaceviii
Sun Welcomes Your Comments
Sun is interested in improving its documentation and welcomes your comments and
suggestions. You can email your comments to Sun at:
docfeedback@sun.com
Please include the part number (816-3630-10) of your document in the subject line of
your email.
ixSun Enterprise 10000 DR Configuration Guide • May 2002
CHAPTER
1
Configuring DR
This chapter describes key DR functionality and also guides you through the tasks
for configuring DR. You will find
■ Information on DR models
■ Details on getting started with DR configuration
■ An overview of DR configuration tasks
■ Prerequisite tasks to be completed before a DR detach operation
■ Details on the configuration changes that occur during DR detach operations and
how to control certain conditions when a detach operation is in progress
Note – In this document, the phrase “DR detach operation” refers to the complete
detach or removal of a system board. This detach operation can be accomplished by
using the ADR command deleteboard(1M). For instructions on detaching boards
from Solaris 9 domains (which support only DR model 3.0), refer to the SunEnterprise 10000 Dynamic Reconfiguration User Guide (part number 816-3627-10).
1
DR Models
There are two models of DR available for the Sun Enterprise 10000 system. DR
model 2.0 is sometimes referred to as “legacy DR,” and DR model 3.0 is referred to
as “next generation DR.” The following table shows the different versions of the
Solaris operating environment and the SSP software that are used with DR models
2.0 and 3.0:
DR ModelSolaris Software VersionsSSP Software Versions
2.0Solaris 5.1, 6, 7, and 83.3, 3.4, or 3.5
3.0Solaris 8 10/01 and 02/02,
Domains running version 9 of the Solaris software support only DR model 3.0, and
version 3.5 of the SSP software is required.
Only one model of DR can run within a domain at a time. To check the version of
DR that is running, use the domain_status command with its -m option (available
only on domains running version 3.5 of the SSP software). Make sure to verify the
DR model before you execute any DR commands. The following is an example of the
domain_status(1M) output. The DR-MODEL column indicates which model is
enabled
According to this output, domain A is running Solaris version 8 software (OS 5.8)
with DR model 2.0 enabled; domain B is running Solaris version 8 software with DR
model 3.0 enabled; domain C is running Solaris version 7 software (OS 5.7) with DR
model 2.0 enabled; and domain D is running Solaris version 9 software (OS 5.9) with
DR model 3.0 enabled.
Only certain commands are available in each model, and if you execute a command
that is not supported, an error message appears on the console.
2Sun Enterprise 10000 DR Configuration Guide • May 2002
Caution – Before you switch to DR 3.0 in a domain that is running the Solaris 8
10/01 operating environment, you must upgrade the SSP software to version 3.5
because previous versions of SSP do not support DR 3.0 operations.
For more information about using DR 2.0, see the the Sun Enterprise 10000 DynamicReconfiguration (DR) User Guide (part number 806-7616-10). For more information
about using DR 3.0, see the Sun Enterprise 10000 Dynamic Reconfiguration (DR) UserGuide (part number 816-3627-10).
Enhancements in DR Model 3.0
The DR 3.0 model offers the following enhancements to DR 2.0:
■ DR 3.0 has a framework that offers better integration with applications, through
the Reconfiguration Coordination Manager.
■ DR 3.0 supports network multipathing using IPMP.
Where to Execute DR Commands
You execute DR operations from either of two places: from the system service
processor (SSP) by using the SSP commands—addboard(1M), moveboard(1M),
deleteboard(1M), rcfgadm(1M), and showdevices(1M); or from the domain,
using the cfgadm(1M) command.
Requirements for Multipathing in DR 3.0
To use multipathing on DR model 3.0 domains, run IPMP (the IP multi-pathing
software provided with the Solaris 8 operating environment) and MPxIO software,
included in Solaris Kernal Update Patches 111412-02, 111413-02, 111095-02, 111096-02,
and 111097-02.
Getting Started
Before you run DR operations on your domain, you must
■ Be familiar with how devices must be configured before DR detach operations, as
explained in “Device Prerequisites” on page 14.
Chapter 1Configuring DR3
■ Verify that you have sufficient swap space for your domain.
For details, see “Allocating Sufficient Domain Swap Space” on page 15.
■ Qualify any third-party device drivers, as described in “Qualifying Third-Party
Device Drivers” on page 15.
Device Prerequisites
DR requires that drivers for devices on boards involved in DR detach operations be
both:
■ Detach-safe or not currently loaded
A detach-safe driver supports the device driver interface (DDI) function,
DDI_DETACH. This function provides the ability to detach a particular instance of
a driver without affecting other instances that are servicing other devices.
A detach-unsafe driver is one that does not support DDI_DETACH. If a detachunsafe driver is loaded, you must unload it before performing a DR detach
operation. For details on unloading a detach-unsafe device, see “Preparing for DR
Detach Operations” on page 19.
■ Suspend-safe or closed
A suspend-safe device driver supports the quiescence (pausing) of the Solaris
operating environment during the detach of a board that contains nonpageable
OBP or kernel memory. In order for DR to perform the detach, the operating
environment must temporarily suspend all processes, processors, and device
activities to unconfigure the memory component.
A suspend-safe device supports the DDI_SUSPEND/DDI_RESUME function. This
function enables a device to be suspended during a system quiescence and then
resumed. The device managed by the driver will not attempt to access the domain
centerplane (for example, it does not access memory or interrupt the system),
even if the device is open when the suspend request is made. The quiescence only
affects the target domain; other domains are not affected.
If a driver does not support the function DDI_SUSPEND/DDI_RESUME, the device
is considered to be suspend-unsafe because the operating environment cannot
quiesce if a suspend-unsafe device is present. If a system quiescence is required
for a DR detach operation, you must manually suspend a suspend-unsafe device
so that the quiescence can occur. For details, see “To Manually Suspend a
Suspend-Unsafe Device” on page 111.
4Sun Enterprise 10000 DR Configuration Guide • May 2002
Note – The drivers currently released by Sun Microsystems that are known to be
suspend-safe are: st, sd, isp, esp, fas, sbus, pci, pci-pci, qfe,andhme (Sun
FastEthernet™); nf (NPI-FDDI); qe (Quad Ethernet); le (Lance Ethernet); the SSA
drivers (soc, pln, and ssd); and the Sun StorEdge A5000 drivers (sf, socal,and
ses). For additional information about suspend-safe and detach-safe device drivers,
contact your Sun service representative.
Allocating Sufficient Domain Swap Space
The domain swap configuration consists of the swap devices and swapfs (memory).
The domain must contain enough swap space so that it can flush pageable memory.
For example, if you want to remove 1 Gbyte of memory from a 2-Gbyte domain, you
will need 1 Gbyte of swap space, depending on the load. Insufficient swap space can
prevent the completion of a DR operation.
The domain swap space must be configured as multiple partitions on disks attached
to controllers hosted by different boards. With this type of configuration, a particular
swap partition is not a vital resource because swap partitions can be added and
deleted dynamically (refer to the swap(1M) man page for more information).
Note – When memory (swapfs) or swap space on a disk is detached, there must be
enough memory or swap space remaining in the domain to accommodate currently
running programs.
Qualifying Third-Party Device Drivers
Many third-party drivers (those purchased from vendors other than Sun
Microsystems) do not support the standard Solaris modunload(1M) interface, which
is used to unload detach-unsafe or suspend-unsafe device drivers. Conditions that
invoke the driver functions occur infrequently during normal operation and the
functions are sometimes missing or work improperly. Sun Microsystems suggests
that you test these driver functions during the qualification and installation phases
of any third-party device.
Chapter 1Configuring DR5
Overview of DR Configuration Tasks
This section identifies the various configuration tasks that you must complete before
running DR operations on Solaris 9 domains (which support only DR model 3.0).
Note that it may not be necessary to perform all the tasks described in this section,
depending on the types of devices on your system boards and the type of DR
operation to be performed.
After you configure DR or whenever you make changes to the DR configuration,
you must reboot your domain. If you want to minimize the number of domain
reboots, determine which configuration tasks are applicable to your DR environment
and then perform the appropriate set of configuration tasks before rebooting your
domain.
1. If you intend to perform DR detach operations, enable the kernel cage, as
explained in “To Enable the Kernel Cage” on page 17.
2. For devices, do the following:
■ If you set network configuration parameters manually, make these settings
permanent as described in “To Set Permanent Driver Parameters for Network
Drivers” on page 17.
■ If you have soc and pln devices, enable device suspension, as described in
“To Enable Device Suspension for the soc and pln Drivers” on page 18.
■ If you have suspend-unsafe devices, specify those devices in the unsafe driver
list, which blocks a quiesce from starting.
For details, see “To Specify an Unsafe Driver List” on page 18.
■ If you have tape devices that are not supported by Sun Microsystems, make
those devices detach-safe.
For details, see “To Make an Unsupported Tape Device Detach-Safe” on page
19.
3. If you want to use multipathing, configure your domain for multipathing and run
the appropriate multipathing software on the domain.
4. Reboot the domain to process the configuration changes.
Note – You must reboot the domain after any changes to the DR configuration. If
you want to minimize the number of reboots, you may want to perform various
configuration tasks then reboot the domain.
5. After the reboot completes successfully, review the /var/adm/messages file for
messages that verify the DR configuration changes.
6Sun Enterprise 10000 DR Configuration Guide • May 2002
For example, if you enabled the kernel cage, the following message is generated:
NOTICE: DR Kernel Cage is Enabled
▼ To Enable the Kernel Cage
A caged kernel confines the nonpageable memory to a minimal (most often one)
number of systems boards. By default the kernel cage is disabled, preventing DR
detach operations. If you plan to perform DR detach operations, you must enable the
kernel cage by using the system(4) variable kernel_cage_enable, as explained in
the following procedure.
Be aware that DR attach or addboard operations are enabled by default, regardless
of the setting of the kernel_cage_enable variable.
Note – Before the release of version 7 of the Solaris software, the dr-max-mem
variable was used to enable DR. The dr-max-mem variable is not used to enable DR
in version 7 and subsequent versions of the Solaris software.
1. Using a text editor, edit the /etc/system file for the domain so that
kernel_cage_enable equals 1.
set kernel_cage_enable=1
2. After all DR configuration tasks are complete, be sure to reboot the domain so that
this configuration can take effect.
3. Verify this configuration change in the /var/adm/messages file.
The following example is a section from a messages file, which indicates that the
kernel cage was enabled:
NOTICE: DR Kernel Cage is Enabled
▼ To Set Permanent Driver Parameters for
Network Drivers
If you use the ndd(1M) command to set the driver configuration parameters for
network devices, those parameters may not persist after a DR operation.
Chapter 1Configuring DR7
● If you want to set the driver configuration parameters permanently, set the
parameters in the /etc/system file or the driver.conf file for a specific driver.
▼ To Enable Device Suspension for the soc and
pln Drivers
If your system boards contain soc and pln devices, perform the following steps to
make those drivers suspend-safe.
1. Using a text editor, edit the /etc/system file so that the
pln_enable_detach_suspend and soc_enable_detach_suspend variables are
set to 1, as in the following example:
set pln:pln_enable_detach_suspend=1
set soc:soc_enable_detach_suspend=1
2. After all DR configuration tasks are complete, reboot the domain so that this
configuration can take effect.
▼ To Specify an Unsafe Driver List
You can provide the Solaris operating environment with information regarding any
suspend-unsafe devices in the system by specifying an unsafe-driver list in the
ngdr.conf file.
DR reads this list when it prepares to suspend the operating environment so that a
board containing nonpageable memory can be detached. If DR finds an active driver
in the unsafe driver list, it aborts the operation and returns an error message. The
message identifies the active, unsafe driver. You must manually suspend the device
so that the DR operation can be performed.
1. Using a text editor, edit the /platform/SUNW,Ultra-Enterprise10000/kernel/drv/ngdr.conf file and specify suspend-unsafe device drivers
where driverx represents each suspend-unsafe device driver.
2. After all DR configuration tasks are complete, reboot the domain so that this
configuration can take effect.
8Sun Enterprise 10000 DR Configuration Guide • May 2002
▼ To Make an Unsupported Tape Device Detach-
Safe
For the Solaris 9 operating environment, tape devices that are natively supported by
Sun Microsystems are suspend-safe and detach-safe. For details, refer to the st(7D)
man page for a list of natively-supported drives. If a system board to be detached
contains a natively-supported tape device, you can safely detach the board without
suspending the device.
However, if you want to use a tape device that is not natively supported by Sun
Microsystems, you can use it, but you must make it detach-safe by performing the
following steps.
1. Edit the /kernel/drv/st.conf file with the ST_UNLOADABLE (0x0400) flag set in
the entry. Refer to the st(7D) man page for more information.
2. After all DR configuration tasks are complete, be sure to reboot the domain so that
this configuration can take effect.
Preparing for DR Detach Operations
You must prepare a board for DR detach operations by following the steps described
below. Although the following list of tasks implies a sequence of order, strict
adherence to the order is not necessary. These steps apply to boards containing I/O
or non-network devices.
.
1. Unmount file systems.
For example, if you are using Solstice DiskSuite metadevices, you must unmount
the file system for metadevices that have a board-resident partition (for example,
umount /partit).
If you have suspend-unsafe devices that manage file systems, unmount those file
systems before a detach operation. If have to manually suspend unsafe devices
that manage file systems, lock those file systems using the lockfs(1M) command
before manually suspending the unsafe devices.
Caution – Unmounting shared file systems by using the share(1M) utility may
affect NFS client systems.
2. Remove disk partitions from the swap configuration by using swap(1M).
Chapter 1Configuring DR9
3. If you want to detach a board that hosts Sun StorEdge A3000 controllers, make
those controllers idle or take them offline manually using the rm6 or rdacutil
programs.
The Sun StorEdge A3000 (formerly known as the RSM Array 2000) has dual
controller paths with automatic load balancing and automatic failover
functionality.
4. Close all non-network devices by doing the following:
■ Close all instances of a device by killing any processes that directly open the
device or raw partition, or by directing the process to close an open device on
the board.
■Run modunload(1M) to unload each detach-unsafe or loaded device driver.
Note – In situations where you cannot unload a device that has an unsafe driver,
you can blacklist the board that contains the unsafe device and then reboot the
domain. You can remove the board later. For details on blacklisting, refer to the
blacklist(1M) man page.
5. Processes bound to the processors of a board prevent that board from being
detached. You can use pbind(1M) to rebind them to other processors.
Configuration Changes During DR
Operations
This section describes
■ How you can control forcible conditions that affect system quiescence during a
DR detach operation in progress
■ Various configuration changes performed by DR during DR detach operations
Controlling Forcible Conditions that Affect
System Quiescence
If the Solaris operating environment cannot quiesce during a DR detach operation
involving a board with nonpageable memory, it displays the reason why it cannot
quiesce. For example,a suspend-unsafe device is open that cannot be quiesced by the
operating environment.
10Sun Enterprise 10000 DR Configuration Guide • May 2002
A failure to quiesce due to open suspend-unsafe devices is known as a forcible
condition. You have the option to retry the operation, or you can try to force the
quiescence. The conditions that cause processes not to suspend are generally
temporary in nature. You can retry the operation until the quiescence succeeds.
When you try to force the quiescence, you give the operating environment
permission to continue with the quiescence even if forcible conditions are still
present. Doing this forces the operating environment to permit the detach. Note that,
although a detach can be forced to proceed when there are open suspend-unsafe
devices in the system, it is not possible to force a detach when a detach-unsafe
device resides on the board and its driver is loaded
Note – Real-time processes do not prevent the operating environment from
quiescence.
The most straightforward way to quiesce a domain is to close any suspend-unsafe
devices. For each network driver you must execute the ifconfig(1M) command
with its down parameter, then again with its unplumb parameter (refer to the
ifconfig(1M) man page for more information).
Note – It should be possible to unplumb all network drivers. However, this action is
rarely tested in normal environments and may result in driver error conditions. If
you use DR, Sun Microsystems suggests that you test these driver functions during
the qualification and installation phases of any suspend-unsafe device.
If a suspend-unsafe device is open and cannot be closed, you can manually suspend
the device, and then force the operating environment to quiesce. After the operating
environment resumes, you can manually resume the device as explained below.
Note – If you cannot make a device suspend its access to the domain centerplane,
do not force the operating environment to quiesce. Doing so could cause a domain to
crash or hang. Instead, postpone the DR operation until the suspend-unsafe device is
no longer open.
▼ To Manually Suspend a Suspend-Unsafe Device
1. Remove the usage of the device by performing one or more of the following:
a. Close the device by killing the processes using the device.
b. Ask users not to utilize the device.
Chapter 1Configuring DR11
c. Disconnect the cables to the device.
For example, if a device that allows asynchronous unsolicited input is open, you
can disconnect its cables prior to quiescing the operating environment, preventing
traffic from arriving at the device and the device from accessing the domain
centerplane. You can reconnect the cables after the operating environment
resumes.
d. Unload the device driver by using the modunload(1M) command.
2. Perform the DR operation again.
3. Do the following:
a. Reload the device by using the modload(1M) command.
b. Reconnect the cables to the device.
c. Inform users that the device can be used again.
d. Restart any processes associated with the device.
Caution – If you attempt a forced quiesce operation while activity is occurring on a
suspend-unsafe device, the domain may hang. However, if the domain hangs, it will
not affect other domains that are running on the Sun Enterprise 10000 system.
▼ To Force a System Quiescence
Caution – Exercise care when using the force option. To successfully force the
operating environment to quiesce, you must first manually quiesce the controller.
Procedures to do that, if any, are device-specific. The device must not transfer any
data, reference memory, or generate interrupts during the operation. Be sure to test
any procedures used to quiesce the controller while it is open before running them
on a production system. Using the force option to quiesce the operating
environment, without first successfully quiescing the controller, can result in a
domain failure and subsequent reboot.
● For Solaris 9 (DR model 3.0) operations, run the deleteboard(1M) or
moveboard(1M) command with the -f option.
Target Memory Constraints
When detaching a board with nonpageable memory, DR locates an alternative
(target) memory board to which to copy the nonpageable memory.
12Sun Enterprise 10000 DR Configuration Guide • May 2002
If no target board is found for a copy rename operation, the deleteboard(1M) and
moveboard(1M) commands display the following error messages, respectively:
deleteboard: unconfigure SB2: No available memory target: dr@0:SB2::memory
moveboard: unconfigure SB2: No available memory target: dr@0:SB2::memory
Processors
The boot processor is responsible for maintaining the netcon BBSRAM buffer.
Before detaching a board on which the boot processor resides, DR assigns the boot
processor role to another active (online) processor.
Network Devices
A detach operation will fail if there are any network interfaces on the board that fit
the following conditions. In these cases, the detach operation fails and DR displays
an error message.
■ The interface is the primary network interface for the domain; that is, the
interface whose IP address corresponds to the network interface name contained
in the file /etc/nodename.
Note that bringing down the primary network interface for the domain prevents
network information name services from operating, which results in the inability
to make network connections to remote hosts using applications such as ftp(1),
rsh(1), rcp(1), rlogin(1). NFS client and server operations are also affected.
■ The interface is on the same subnet as the SSP host for the system; that is, the
subnet of the IP address that corresponds to the SSP host name found in
/etc/ssphostname.
Bringing down this interface interrupts communication between the host and SSP.
Because DR operations are initiated on the SSP, control of the detach process
would be lost. Note that the /etc/ssphostname file contains the name of the
SSP that controls the host; therefore, if you rename the SSP, you must manually
update the /etc/ssphostname file.
Caution – Detaching network interfaces may affect NFS client systems.
Chapter 1Configuring DR13
Remote DR Communication
In Solaris 9 domains, the domain configuration server, dcs(1M), controls DR
operations.
▼ To Troubleshoot a Connection Failure During a Solaris 9
(DR Model 3.0) Operation
1. Check the domain.
dcs(1M) must be configured in the /etc/inetd.conf file of the domain. The
following lines must be present in the file:
2. If the dcs daemon is configured in /etc/inetd.conf, kill dcs(1M) if it is
currently running, and send a HUP signal to the inetd(1M) daemon to cause it to
re-read the inetd.conf(4) configuration file:
# kill -9 dcs_pid
# kill -HUP inetd_pid
where dcs_pid is the process ID of the dcs(1M) daemon and inetd_pid is the process
ID of the inetd(1M) daemon.
3. Check the /var/adm/messages file for possible error messages from inetd(1M)
if it is having trouble starting dcs(1M).
The dcs(1M) daemon executable file resides in the /usr/lib directory.
4. Try the DR operation again, starting from the beginning.
14Sun Enterprise 10000 DR Configuration Guide • May 2002