Hp COMPAQ PROLIANT 6400R, PROLIANT DL580 Parallel Database Cluster Model PDC/O2000 for Oracle8i and Windows 2000 Administrator Guide

Parallel Database Cluster Model PDC/O2000 for Oracle8i and Windows 2000
Administrator Guide
Second Edition (June 2001) Part Number 225082-002 Compaq Computer Corporation

Notice

© 2001 Compaq Computer Corporation
Compaq, the Compaq logo, Compaq Insight Manager, SmartStart, ROMPaq, ProLiant, and StorageWorks Registered in U.S. Patent and Trademark Office. ActiveAnswers is a trademark of Compaq Information Technologies Group, L.P. in the United States and other countries.
Microsoft, Windows, and Windows NT are trademarks of Microsoft Corporation in the United States and other countries.
All other product names mentioned herein may be trademarks of their respective companies.
Compaq shall not be liable for technical or editorial errors or omissions contained herein. The information in this document is provided “as is” without warranty of any kind and is subject to change without notice. The warranties for Compaq products are set forth in the express limited warranty statements accompanying such products. Nothing herein should be construed as constituting an additional warranty.
Parallel Database Cluster Model PDC/02000 for Oracle8i and Windows 2000 Second Edition (June 2001) Part Number 225082-002

Contents

About This Guide
Purpose .................................................................................................................... xiii
Audience.................................................................................................................. xiii
Scope ........................................................................................................................xiv
Referenced Manuals ..................................................................................................xv
Supplemental Documents .........................................................................................xvi
Text Conventions.....................................................................................................xvii
Symbols in Text.......................................................................................................xvii
Symbols on Equipment.......................................................................................... xviii
Rack Stability ...........................................................................................................xix
Getting Help .............................................................................................................xix
Compaq Technical Support ...............................................................................xix
Compaq Website.................................................................................................xx
Compaq Authorized Reseller..............................................................................xx
Chapter 1
Clustering Overview
Clusters Defined ...................................................................................................... 1-2
Availability .............................................................................................................. 1-3
Scalability ................................................................................................................ 1-3
Compaq Parallel Database Cluster Overview.......................................................... 1-4
iv Compaq Parallel Database Cluster Model PDC/02000 for Oracle8i and Windows 2000 Administrator Guide
Chapter 2
Cluster Architecture
Compaq ProLiant Servers ........................................................................................ 2-2
High-Availability Features of ProLiant Servers ............................................... 2-3
Shared Storage Components.................................................................................... 2-3
RA4000 Array................................................................................................... 2-4
RA4100 Array................................................................................................... 2-4
RA4000 Array Controllers................................................................................ 2-5
Fibre Channel SAN Switches ........................................................................... 2-6
FC-AL Switches ............................................................................................... 2-7
Storage Hubs..................................................................................................... 2-7
Fibre Host Adapters.......................................................................................... 2-8
Gigabit Interface Converter-Shortwave Modules ............................................. 2-8
Fibre Channel Cables........................................................................................ 2-9
I/O Path Configurations for Redundant Fibre Channel Fabrics ............................... 2-9
Overview of Fibre Channel Fabric SAN Topology .......................................... 2-9
Redundant Fibre Channel Fabrics..................................................................... 2-9
Multiple Redundant Fibre Channel Fabrics.................................................... 2-11
Maximum Distances Between Nodes and Shared Storage Components
in a Redundant Fibre Channel Fabric ............................................................. 2-13
I/O Data Paths in a Redundant Fibre Channel Fabric ..................................... 2-14
I/O Path Configuration Guidelines for Redundant Fibre Channel Fabrics............. 2-15
I/O Path Configuration Rules for Redundant Fibre Channel Fabrics.............. 2-19
Active/Standby Configuration Examples for Redundant Fibre Channel
Fabrics............................................................................................................. 2-20
Active/Active Configuration Examples for Redundant Fibre Channel
Fabrics............................................................................................................. 2-27
Summary of I/O Path Failure and Failover Scenarios for Redundant Fibre
Channel Fabrics ..............................................................................................2-33
I/O Path Configurations for Redundant Fibre Channel Arbitrated Loops.............. 2-39
Overview of FC-AL SAN Topology .............................................................. 2-39
Redundant Fibre Channel Arbitrated Loops ................................................... 2-40
Multiple Redundant Fibre Channel Arbitrated Loops..................................... 2-42
Maximum Distances Between Nodes and Shared Storage Components in a
Redundant FC-AL........................................................................................... 2-44
I/O Data Paths in a Redundant FC-AL ...........................................................2-45
I/O Path Configuration Guidelines for Redundant Fibre Channel Arbitrated
Loops ..................................................................................................................... 2-47
I/O Path Configuration Rules for Redundant FC-ALs.................................... 2-50
Active/Standby Configuration Examples for Redundant FC-ALs.................. 2-51
Active/Active Configuration Examples for Redundant FC-ALs .................... 2-58
Summary of I/O Path Failure and Failover Scenarios for Redundant
FC-ALs ........................................................................................................... 2-64
Cluster Architecture
continued
Cluster Interconnect Options ................................................................................. 2-70
Ethernet Cluster Interconnect ......................................................................... 2-70
Local Area Network ....................................................................................... 2-76
Chapter 3
Cluster Software Components
Overview of the Cluster Software............................................................................ 3-1
Microsoft Windows 2000 Advanced Server............................................................ 3-1
Compaq Software .................................................................................................... 3-2
Compaq SmartStart and Support Software....................................................... 3-2
Compaq System Configuration Utility ............................................................. 3-3
Compaq Array Configuration Utility................................................................ 3-3
Fibre Channel Fault Isolation Utility................................................................ 3-3
Compaq Insight Manager ................................................................................. 3-4
Compaq Insight Manager XE ........................................................................... 3-4
Compaq Options ROMPaq............................................................................... 3-4
Compaq StorageWorks Secure Path for Windows 2000 .................................. 3-5
Compaq Operating System Dependent Modules.............................................. 3-5
Oracle Software ....................................................................................................... 3-6
Oracle8i Server Enterprise Edition................................................................... 3-6
Oracle8i Server................................................................................................. 3-6
Oracle8i Parallel Server Option........................................................................ 3-6
Oracle8i Enterprise Manager............................................................................ 3-7
Oracle8i Certification ....................................................................................... 3-7
Application Failover and Reconnection Software ................................................... 3-8
Contents v
Chapter 4
Cluster Planning
Site Planning............................................................................................................ 4-2
Capacity Planning for Cluster Hardware ................................................................. 4-3
Compaq ProLiant Servers................................................................................. 4-3
Planning Shared Storage Components for Redundant Fibre Channel
Fabrics .............................................................................................................. 4-3
Planning Shared Storage Components for Redundant Fibre Channel
Arbitrated Loops............................................................................................... 4-5
Planning Cluster Interconnect and Client LAN Components........................... 4-6
Planning Cluster Configurations for Redundant Fibre Channel Fabrics.................. 4-7
Sample Small Configuration in a Redundant Fibre Channel Fabric................. 4-7
Sample Large Configuration in a Redundant Fibre Channel Fabric................. 4-9
vi Compaq Parallel Database Cluster Model PDC/02000 for Oracle8i and Windows 2000 Administrator Guide
Cluster Planning
continued
Planning Cluster Configurations for Redundant Fibre Channel Arbitrated
Loops ..................................................................................................................... 4-11
Sample Small Configuration in a Redundant FC-AL ..................................... 4-11
Sample Large Configuration in a Redundant FC-AL ..................................... 4-13
RAID Planning....................................................................................................... 4-14
Supported RAID Levels.................................................................................. 4-16
Raw Data Storage and Database Size ............................................................. 4-17
Selecting the Appropriate RAID Levels ......................................................... 4-17
Planning the Grouping of Physical Disk Storage Space ........................................4-18
Disk Drive Planning............................................................................................... 4-19
Nonshared Disk Drives................................................................................... 4-19
Shared Disk Drives ......................................................................................... 4-20
Network Planning .................................................................................................. 4-20
Windows 2000 Advanced Server Hosts Files for an Ethernet Cluster
Interconnect ....................................................................................................4-20
Client LAN ..................................................................................................... 4-21
Chapter 5
Installation and Configuration
Installation Overview............................................................................................... 5-2
Installing the Hardware............................................................................................ 5-4
Setting Up the Nodes ........................................................................................ 5-4
Installing the Fibre Host Adapters .................................................................... 5-4
Installing GBIC-SW Modules for the Fibre Host Adapters.............................. 5-5
Cabling the Fibre Host Adapters to the Storage Hubs or Switches................... 5-5
Installing the Cluster Interconnect Adapters..................................................... 5-6
Installing the Client LAN Adapters .................................................................. 5-7
Setting Up the RA4000/RA4100 Arrays........................................................... 5-7
Installing GBIC-SW Modules for the RA4000 Array Controllers.................... 5-9
Cabling the Storage Hubs or Switches to the RA4000 Array Controllers ........ 5-9
Installing Additional Redundant Fibre Channel Fabrics................................. 5-15
Installing Additional Redundant FC-ALs....................................................... 5-15
Cabling the Ethernet Cluster Interconnect...................................................... 5-16
Cabling the Client LAN.................................................................................. 5-20
Installing the Operating System Software and Configuring the
RA4000/RA4100 Arrays........................................................................................ 5-21
Guidelines for Clusters ...................................................................................5-21
Automated Installation Using SmartStart ....................................................... 5-22
Installing Secure Path Software for Windows 2000 .............................................. 5-26
Installing the Secure Path Server Software..................................................... 5-26
Installing the Secure Path Client Software...................................................... 5-27
Verifying Shared Disk Storage Using Secure Path Manager.......................... 5-27
Defining Active Array Controllers .................................................................5-28
Installing Compaq OSDs ....................................................................................... 5-30
Verifying Cluster Communications................................................................ 5-31
Mounting Remote Drives and Verifying Administrator Privileges................ 5-32
Installing the Ethernet OSDs .......................................................................... 5-33
Installing Oracle Software ..................................................................................... 5-44
Configuring Oracle Software................................................................................. 5-45
Installing Object Link Manager............................................................................. 5-45
Additional Notes on Configuring Oracle Software ........................................ 5-46
Verifying the Hardware and Software Installation ................................................ 5-47
Cluster Communications ................................................................................ 5-47
Access to Shared Storage from All Nodes...................................................... 5-47
OSDs .............................................................................................................. 5-47
Other Verification Tasks ................................................................................ 5-48
Power Distribution and Power Sequencing Guidelines ......................................... 5-48
Server Power Distribution .............................................................................. 5-49
RA4000/RA4100 Array Power Distribution .................................................. 5-49
Power Sequencing .......................................................................................... 5-50
Chapter 6
Cluster Management
Cluster Management Concepts ................................................................................ 6-2
Powering Off a Node Without Interrupting Cluster Services ........................... 6-2
Managing a Cluster in a Degraded Condition................................................... 6-2
Managing Network Clients Connected to a Cluster ......................................... 6-3
Cluster Events................................................................................................... 6-3
Management Applications ....................................................................................... 6-4
Monitoring Server and Network Hardware ...................................................... 6-4
Managing Shared Drives .................................................................................. 6-5
Monitoring Redundant Fibre Channel Fabrics ................................................. 6-5
Monitoring Redundant Fibre Channel Arbitrated Loops.................................. 6-6
Monitoring the Database .................................................................................. 6-7
Remotely Managing a Cluster .......................................................................... 6-7
Software Maintenance for Oracle8i......................................................................... 6-8
Deinstalling the OSDs ...................................................................................... 6-8
Upgrading Oracle8i Server ............................................................................. 6-11
Upgrading the OSDs....................................................................................... 6-11
Deinstalling a Partial OSD Installation........................................................... 6-13
Upgrading Oracle8i Server ............................................................................. 6-14
Managing Changes to Shared Storage Components.............................................. 6-14
Replacing a Failed Disk.................................................................................. 6-14
Adding Disk Drives to Increase Storage Capacity ......................................... 6-15
Adding an RA4000/RA4100 Array................................................................ 6-15
Replacing a Failed Fibre Host Adapter........................................................... 6-16
Contents vii
viii Compaq Parallel Database Cluster Model PDC/02000 for Oracle8i and Windows 2000 Administrator Guide
Cluster Management
continued
Replacing a Cluster Node ...................................................................................... 6-17
Removing the Node ........................................................................................ 6-17
Adding the Replacement Node ....................................................................... 6-18
Adding a Cluster Node........................................................................................... 6-21
Preparing the New Node................................................................................. 6-22
Preparing the Existing Cluster Nodes ............................................................. 6-23
Installing the Cluster Software for Oracle8i ................................................... 6-23
Monitoring Cluster Operation................................................................................ 6-25
Tools Overview............................................................................................... 6-25
Using Secure Path Manager............................................................................ 6-25
Uninstalling Secure Path................................................................................. 6-28
Chapter 7
Troubleshooting
Basic Troubleshooting Tips ..................................................................................... 7-2
Power ................................................................................................................ 7-2
Physical Connections........................................................................................ 7-2
Access to Cluster Components .........................................................................7-3
Software Revisions ........................................................................................... 7-3
Firmware Revisions .......................................................................................... 7-4
Troubleshooting Oracle8i and OSD Installation Problems and Error Messages ..... 7-5
Potential Difficulties Installing the OSDs with the Oracle Universal
Installer ............................................................................................................. 7-5
Unable to Start OracleCMService..................................................................... 7-6
Unable to Start OracleNMService .................................................................... 7-7
Unable to Start the Database............................................................................. 7-7
Initialization of the Dynamic Link Library NM.DLL Failed............................ 7-8
Troubleshooting Node-to-Node Connectivity Problems.......................................... 7-8
Nodes Are Unable to Communicate with Each Other ...................................... 7-8
Unable to Ping the Cluster Interconnect or the Client LAN ............................. 7-9
Node or Nodes Unable to Rejoin the Cluster.................................................... 7-9
Troubleshooting Client-to-Cluster Connectivity Problems.................................... 7-10
A Network Client Cannot Communicate with the Cluster.............................. 7-10
Troubleshooting Shared Storage Problems............................................................ 7-11
Verifying Connectivity to a Redundant Fibre Channel Fabric........................ 7-11
Verifying Connectivity to a Redundant Fibre Channel Arbitrated Loop........ 7-12
Shared Disks in the RA4000/RA4100 Arrays Are Not Recognized By One
or More Nodes ................................................................................................ 7-12
A Cluster Node Cannot Connect to the Shared Drives ................................... 7-14
Troubleshooting
continued
Troubleshooting Secure Path................................................................................. 7-14
Secure Path Guidelines for Windows 2000 Advanced Server........................ 7-14
Secure Path Manager Cannot Start With Hosts That Use Hyphenated Host
Names............................................................................................................. 7-15
Secure Path Manager Is Delayed In Reporting Path Failure Information....... 7-16
The Addition of New LUNs Causes an Error................................................. 7-16
A Configuration of More Than 64 LUNs Prevents the Secure Path Agent
From Starting.................................................................................................. 7-16
Appendix A
Diagnosing and Resolving Shared Disk Problems
Introduction .............................................................................................................A-1
Run Object Link Manager on All Nodes .................................................................A-3
Restart All Affected Nodes in the Cluster ...............................................................A-4
Rerun and Validate Object Link Manager On All Affected Nodes .........................A-4
Run and Validate Secure Path Manager On All Nodes ...........................................A-5
Run Disk Management On All Nodes .....................................................................A-5
Run and Validate the Array Configuration Utility On All Nodes............................A-6
Perform Cluster Software and Firmware Checks ....................................................A-6
Perform Cluster Hardware Checks ..........................................................................A-7
Contact Your Compaq Support Representative.......................................................A-8
Contents ix
Glossary
Index
x Compaq Parallel Database Cluster Model PDC/02000 for Oracle8i and Windows 2000 Administrator Guide
List of Figures
Figure 1-1. Example of a two-node Compaq Parallel Database
Model PDC/O2000 cluster ................................................................................. 1-2
Figure 2-1. Two-node PDC/O2000 with a two-fabric redundant Fibre
Channel Fabric ................................................................................................. 2-10
Figure 2-2. Two-node PDC/O2000 with two redundant Fibre Channel
Fabrics.............................................................................................................. 2-12
Figure 2-3. Maximum distances between PDC/O2000 cluster nodes and
shared storage subsystem components in a redundant Fibre Channel
Fabric................................................................................................................ 2-13
Figure 2-4. Fibre Host Adapter-to-Fibre Channel SAN Switch data paths........... 2-14
Figure 2-5. Fibre Channel SAN Switch-to-RA4100/4000 Array data paths......... 2-15
Figure 2-6. Active/standby configuration with one RA4000/RA4100 Array ....... 2-22
Figure 2-7. Active/standby configuration with two RA4000/RA4100
Arrays............................................................................................................... 2-23
Figure 2-8. Active/standby configuration with three RA4000/RA4100
Arrays............................................................................................................... 2-24
Figure 2-9. Active/standby configuration with four RA4000/RA4100
Arrays............................................................................................................... 2-25
Figure 2-10. Active/standby configuration with five RA4000/RA4100
Arrays............................................................................................................... 2-26
Figure 2-11. Active/active configuration with two RA4000/RA4100 Arrays ...... 2-29
Figure 2-12. Active/active configuration with three RA4000/RA4100
Arrays............................................................................................................... 2-30
Figure 2-13. Active/active configuration with four RA4000/RA4100
Arrays............................................................................................................... 2-31
Figure 2-14. Active/active configuration with five RA4000/RA4100 Arrays ...... 2-32
Figure 2-15. Two-node PDC/O2000 with a two-loop redundant Fibre
Channel Arbitrated Loop.................................................................................. 2-41
Figure 2-16. Two-node PDC/O2000 with two redundant Fibre Channel
Arbitrated Loops .............................................................................................. 2-43
Figure 2-17. Maximum distances between PDC/O2000 cluster nodes and
shared storage subsystem components in a redundant FC-AL......................... 2-44
Figure 2-18. Fibre Host Adapter-to-FC-AL Switch/Storage Hub data paths........ 2-45
Figure 2-19. FC-AL Switch/Storage Hub-to-RA4000/RA4100 Array data
paths ................................................................................................................. 2-46
Figure 2-20. Active/standby configuration with one RA4000/RA4100
Array ................................................................................................................ 2-53
Figure 2-21. Active/standby configuration with two RA4000/RA4100
Arrays............................................................................................................... 2-54
Figure 2-22. Active/standby configuration with three RA4000/RA4100
Arrays............................................................................................................... 2-55
Figure 2-23. Active/standby configuration with four RA4000/RA4100
Arrays............................................................................................................... 2-56
Figure 2-24. Active/standby configuration with five RA4000/RA4100
Arrays............................................................................................................... 2-57
Contents xi
Figure 2-25. Active/active configuration with two RA4000/RA4100 Arrays ...... 2-60
Figure 2-26. Active/active configuration with three RA4000/RA4100
Arrays............................................................................................................... 2-61
Figure 2-27. Active/active configuration with four RA4000/RA4100
Arrays............................................................................................................... 2-62
Figure 2-28. Active/active configuration with five RA4000/RA4100 Arrays...... 2-63
Figure 2-29. Non-redundant Ethernet cluster interconnect using a
crossover cable................................................................................................. 2-73
Figure 2-30. Non-redundant Ethernet cluster using an Ethernet switch or
hub ................................................................................................................... 2-74
Figure 2-31. Redundant Ethernet cluster interconnect for a two-node
PDC/O2000 cluster .......................................................................................... 2-75
Figure 4-1. Two-node PDC/O2000 cluster with one redundant Fibre
Channel Fabric and one RA4000/RA4100 Array .............................................. 4-7
Figure 4-2. Six-node PDC/O2000 cluster with one redundant Fibre Channel
Fabric and five RA4000/RA4100 Arrays........................................................... 4-9
Figure 4-3. Two-node PDC/O2000 cluster with one redundant FC-AL and
one RA4000/RA4100 Array ............................................................................ 4-11
Figure 4-4. Six-node PDC/O2000 cluster with one redundant FC-AL and
five RA4000/RA4100 Arrays .......................................................................... 4-13
Figure 4-5. RA4000/RA4100 Array disk grouping for a PDC/O2000 cluster...... 4-18
Figure 5-1. Connecting Fibre Host Adapters to Storage Hubs, FC-AL
Switches, or Fibre Channel SAN Switches........................................................ 5-6
Figure 5-2. RA4000/RA4100 Arrays connected to clustered servers
through one redundant Fibre Channel Fabric or redundant FC-AL................... 5-8
Figure 5-3. Cabling Storage Hubs, FC-AL Switches, or Fibre Channel SAN
Switches to RA4000 Array Controllers in an active/standby
configuration.................................................................................................... 5-11
Figure 5-4. Method 1: cabling an active/active configuration with two
RA4000/RA4100 Arrays.................................................................................. 5-13
Figure 5-5. Method 2: cabling an active/active configuration with two
RA4000/RA4100 Arrays.................................................................................. 5-14
Figure 5-6. Non-redundant Ethernet cluster interconnect using a crossover
cable................................................................................................................. 5-17
Figure 5-7. Non-redundant Ethernet cluster interconnect using an Ethernet
switch or hub.................................................................................................... 5-18
Figure 5-8. Redundant Ethernet cluster interconnect for a two-node
PDC/O2000 cluster .......................................................................................... 5-19
Figure 5-9. Server power distribution in a three-node cluster............................... 5-49
Figure A-1. Tasks for diagnosing and resolving shared storage problems .............A-2
xii Compaq Parallel Database Cluster Model PDC/02000 for Oracle8i and Windows 2000 Administrator Guide
List of Tables
Table 2-1 High-Availability Components of ProLiant Servers................................ 2-3
Table 2-2 Features of Active/Standby and Active/Active Configurations for
Redundant Fibre Channel Fabrics .................................................................... 2-17
Table 2-3 I/O Path Failure and Failover Scenarios in Redundant Fibre
Channel Fabrics for Active/Standby Configurations With One
RA4000/RA4100 Array ................................................................................... 2-33
Table 2-4 I/O Path Failure and Failover Scenarios in Redundant Fibre
Channel Fabrics for Active/Standby Configurations With Two or More
RA4000/RA4100 Arrays.................................................................................. 2-35
Table 2-5 I/O Path Failure and Failover Scenarios in Redundant Fibre
Channel Fabrics for Active/Active Configurations With Two or More
RA4000/RA4100 Arrays.................................................................................. 2-37
Table 2-6 Features of Active/Standby and Active/Active Configurations for
Redundant Fibre Channel Arbitrated Loops..................................................... 2-48
Table 2-7 I/O Path Failure and Failover Scenarios in Redundant FC-ALs for
Active/Standby Configurations With One RA4000/RA4100 Array ................ 2-64
Table 2-8 I/O Path Failure and Failover Scenarios in Redundant FC-ALs for
Active/Standby Configurations With Two or More RA4000/RA4100
Arrays............................................................................................................... 2-66
Table 2-9 I/O Path Failure and Failover Scenarios in Redundant FC-ALs for
Active/Active Configurations With Two or More RA4000/RA4100
Arrays............................................................................................................... 2-68
Table 5-1 Active/Active Cabling Methods ............................................................ 5-12
Table 5-2 Active Array Controller Locations ........................................................ 5-29

Purpose

Audience

About This Guide

This administrator guide provides information about the planning, installation, configuration, implementation, management, and troubleshooting of the Compaq Parallel Database Cluster Model PDC/O2000 running Oracle8i software on the Microsoft Windows 2000 Advanced Server operating system.
The expected audience of this guide consists primarily of MIS professionals whose jobs include designing, installing, configuring, and maintaining Compaq Parallel Database Clusters.
The audience of this guide must have a working knowledge of Microsoft Windows 2000 Advanced Server and of Oracle databases or have the assistance of a database administrator.
This guide contains information for network administrators, database administrators, installation technicians, systems integrators, and other technical personnel in the enterprise environment for the purpose of cluster planning, installation, implementation, and maintenance.
IMPORTANT: This guide contains installation, configuration, and maintenance information that can be valuable for a variety of users. If you are installing the PDC/O2000 but will not be administering the cluster on a daily basis, please make this guide available to the person or persons who will be responsible for the clustered servers after you have completed the installation.
xiv Compaq Parallel Database Cluster Model PDC/02000 for Oracle8i and Windows 2000 Administrator Guide

Scope

This guide offers significant background information about clusters as well as basic concepts associated with designing clusters. It also contains detailed product descriptions and installation steps.
This administrator guide is designed to assist you in the following objectives:
Understanding basic concepts of clustering technology
Recognizing and using the high-availability features of the PDC/O2000
Planning and designing a PDC/O2000 cluster configuration to meet your
business needs
Installing and configuring PDC/O2000 hardware and software
Managing the PDC/O2000
Troubleshooting the PDC/O2000
The following summarizes the contents of this guide:
Chapter 1, “Clustering Overview,” provides an introduction to
clustering technology features and benefits.
Chapter 2, “Cluster Architecture, describes the hardware components
of the PDC/O2000 and provides detailed I/O path configuration information.
Chapter 3, “Cluster Software Components, describes software
components used with the PDC/O2000.
Chapter 4, “Cluster Planning, outlines an approach to planning and
designing cluster configurations that meet your business needs.
Chapter 5, “Installation and Configuration, outlines the steps you will
take to install and configure the PDC/O2000 hardware and software.
Chapter 6, “Cluster Management, includes techniques for managing
and maintaining the PDC/O2000.
Chapter 7, “Troubleshooting, contains troubleshooting information for
the PDC/O2000.
Appendix A, “Diagnosing and Resolving Shared Disk Problems,
describes procedures to diagnose and resolve shared disk problems.
Glossary contains definitions of terms used in this guide.
Some clustering topics are mentioned, but not detailed, in this guide. For example, this guide does not describe how to install and configure Oracle8i on a cluster. For information about these topics, see the documents referenced in the guide sections or refer to the documentation provided with the Oracle software.

Referenced Manuals

For additional information, refer to documentation related to the specific hardware and software components of the Compaq Parallel Database Cluster. These related manuals include, but are not limited to:
Documentation related to the ProLiant servers you are clustering
(for example, guides, posters, and performance and tuning guides)
Compaq StorageWorks documentation
G Compaq StorageWorks RAID Array 4000 User Guide
G Compaq StorageWorks RAID Array 4100 User Guide
G Compaq StorageWorks Fibre Channel Storage Hub 7
Installation Guide
G Compaq StorageWorks Fibre Channel Storage Hub 12
Installation Guide
About This Guide xv
G Compaq StorageWorks Fibre Channel Host Bus Adapter
Installation Guide
G Compaq StorageWorks 64-Bit/66-MHz Fibre Channel Host Adapter
Installation Guide
Microsoft Windows 2000 Advanced Server documentation
G Microsoft Windows 2000 Advanced Server Administrator’s Guide
xvi Compaq Parallel Database Cluster Model PDC/02000 for Oracle8i and Windows 2000 Administrator Guide
Oracle8i documentation, including:
G Oracle8i Parallel Server Setup and Configuration Guide
G Oracle8i Parallel Server Concepts
G Oracle8i Parallel Server Administration, Deployment, and
Performance
G Oracle Enterprise Manager Administrator’s Guide
G Oracle Enterprise Manager Configuration Guide
G Oracle Enterprise Manager Concepts Guide

Supplemental Documents

The following technical documents contain important supplemental information for the Compaq Parallel Database Cluster Model PDC/O2000:
Supported Ethernet Interconnects for Compaq Parallel Database
Clusters Using Oracle Parallel Server (ECG062/0299), at
www.compaq.com/support/techpubs/whitepapers
Compaq Parallel Database Cluster Model PDC/O2000 Certification
Matrix for Windows 2000, at
www.compaq.com/enterprise/ha-pdc.html
Various technical white papers on Oracle and cluster sizing, which are
available from Compaq ActiveAnswers website, at
www.compaq.com/activeanswers

Text Conventions

This document uses the following conventions to distinguish elements of text:
User Input, GUI Selections
About This Guide xvii
Text a user types or enters appears in boldface. Items a user selects from a GUI, such as tabs, buttons, or menu items, also appear in boldface. User input and GUI selections can appear in uppercase and lowercase letters.
File Names, Command Names, Directory Names, Drive Names
Menu Options, Dialog Box Names
Type When you are instructed to type information, type
Enter When you are instructed to enter information, type

Symbols in Text

These symbols may be found in the text of this guide. They have the following meanings:
These elements can appear in uppercase and lowercase letters.
These elements appear in initial capital letters and may appear in bold for emphasis.
the information without pressing the Enter key.
the information and then press the Enter key.
WARNING: Text set off in this manner indicates that failure to follow directions in the warning could result in bodily harm or loss of life.
CAUTION: Text set off in this manner indicates that failure to follow directions could result in damage to equipment or loss of information.
IMPORTANT: Text set off in this manner presents clarifying information or specific instructions.
NOTE: Text set off in this manner presents commentary, sidelights, or interesting points of information.
xviii Compaq Parallel Database Cluster Model PDC/02000 for Oracle8i and Windows 2000 Administrator Guide

Symbols on Equipment

These icons may be located on equipment in areas where hazardous conditions may exist.
Any surface or area of the equipment marked with these symbols indicates the presence of electrical shock hazards. Enclosed area contains no operator serviceable parts. WARNING: To reduce the risk of injury from electrical shock hazards, do not open this enclosure.
Any RJ-45 receptacle marked with these symbols indicates a Network Interface Connection. WARNING: To reduce the risk of electrical shock, fire, or damage to the equipment, do not plug telephone or telecommunications connectors into this receptacle.
Any surface or area of the equipment marked with these symbols indicates the presence of a hot surface or hot component. If this surface is contacted, the potential for injury exists. WARNING: To reduce the risk of injury from a hot component, allow the surface to cool before touching.
Power Supplies or Systems marked with these symbols indicate the equipment is supplied by multiple sources of power.
WARNING: To reduce the risk of injury from electrical shock, remove all power cords to completely disconnect power from the system.

Rack Stability

WARNING: To reduce the risk of personal injury or damage to the equipment,
be sure that:
The leveling jacks are extended to the floor.
The full weight of the rack rests on the leveling jacks.
The stabilizing feet are attached to the rack if it is a single rack
installations.
The racks are coupled together in multiple rack installations.
Only one component is extended at a time. A rack may become unstable if
more than one component is extended for any reason.

Getting Help

If you have a problem and have exhausted the information in this guide, you can get further information and other help in the following locations.

Compaq Technical Support

About This Guide xix
In North America, call the Compaq Technical Phone Support Center at 1-800-OK-COMPAQ. This service is available 24 hours a day, 7 days a week. For continuous quality improvement, calls may be recorded or monitored.
Outside North America, call the nearest Compaq Technical Support Phone Center. Telephone numbers for worldwide Technical Support Centers are listed on the Compaq website. Access the Compaq website by logging on to the Internet at
www.compaq.com
Be sure to have the following information available before you call Compaq:
Technical support registration number (if applicable)
Product serial number
Product model name and number
Applicable error messages
Add-on boards or hardware
Third-party hardware or software
Operating system type and revision level
xx Compaq Parallel Database Cluster Model PDC/02000 for Oracle8i and Windows 2000 Administrator Guide

Compaq Website

The Compaq website has information on this product as well as the latest drivers and Flash ROM images. You can access the Compaq website by logging on to the Internet at
www.compaq.com

Compaq Authorized Reseller

For the name of your nearest Compaq Authorized Reseller:
In the United States, call 1-800-345-1518.
In Canada, call 1-800-263-5868.
Elsewhere, see the Compaq website for locations and telephone
numbers.
Chapter 1
Clustering Overview
For many years, companies have depended on clustered computer systems to fulfill two key requirements: to ensure users can access and process information that is critical to the ongoing operation of their business, and to increase the performance and throughput of their computer systems at minimal cost. These requirements are known as availability and scalability, respectively.
Historically, these requirements have been fulfilled with clustered systems built on proprietary technology. Over the years, open systems have progressively and aggressively moved proprietary technologies into industry-standard products. Clustering is no exception. Its primary features, availability and scalability, have been moving into client/server products for the last few years.
The absorption of clustering technologies into open systems products is creating less expensive, non-proprietary solutions that deliver levels of function commonly found in traditional clusters. While some uses of the proprietary solutions will always exist, such as those controlling stock exchange trading floors and aerospace mission controls, many critical applications can reach the desired levels of availability and scalability with non-proprietary client/server-based clustering.
These clustering solutions use industry-standard hardware and software, thereby providing key clustering features at a lower price than proprietary clustering systems. Before examining the features and benefits of the Compaq Parallel Database Cluster Model PDC/O2000 (referred to here as the PDC/O2000), it is helpful to understand the concepts and terminology of clustered systems.
1-2 Compaq Parallel Database Cluster Model PDC/O2000 for Oracle8i and Windows 2000 Administrator Guide

Clusters Defined

A cluster is an integration of software and hardware products that enables a set of loosely coupled servers and shared storage subsystem components to present a single system image to clients and to operate as a single system. As a cluster, the group of servers and shared storage subsystem components offers a level of availability and scalability far exceeding that obtained if each cluster node operated as a stand-alone server.
The PDC/O2000 uses the Oracle8i Parallel Server software, which is a parallel database that can distribute its workload among the cluster nodes. Refer to Chapter 3, “Cluster Software Components” to determine the specific releases your cluster kit supports.
Figure 1-1 shows an example of a PDC/O2000 that includes two nodes (ProLiant (RA4000/RA4100 Arrays), two Compaq StorageWorks
TM
servers), two Compaq StorageWorks RAID Array 4000s or 4100s
TM
Fibre Channel Storage Hubs, Compaq StorageWorks FC-AL Switches, or Compaq StorageWorks Fibre Channel SAN Switches, a cluster interconnect, and a client local area network (LAN).
Client LAN
Switch/Hub
Fibre Host
Adapters (2)
Node 1 Node 2
Storage
Hub/Switch #1
RA4000/4100 Array #1
Figure 1-1. Example of a two-node Compaq Parallel Database Model PDC/O2000 cluster
Cluster
Interconnect
Fibre Host
Adapters (2)
Storage
Hub/Switch #2
RA4000/4100 Array #2
The PDC/O2000 can use redundant Fibre Channel Fabric Storage Area Network (SAN) and redundant Fibre Channel Arbitrated Loop (FC-AL) SAN topologies. These two SAN topologies support the use of multiple redundant fabrics or loops, respectively. In the example shown in Figure 1-1, the clustered nodes are connected to the database on the shared storage subsystems through a redundant Fibre Channel Fabric or redundant FC-AL. Clients access the database through the client LAN, and the cluster nodes communicate across an Ethernet cluster interconnect.

Availability

When computer systems experience outages, the amount of time the system is unavailable is referred to as downtime. Downtime has several primary causes: hardware faults, software faults, planned service, operator error, and environmental factors. Minimizing downtime is a primary goal of a cluster.
Simply defined, availability is the measure of how well a computer system can continuously deliver services to clients.
Availability is a system-wide endeavor. The hardware, operating system, and applications must be designed for availability. Clustering requires stability in these components, then couples them in such a way that failure of one item does not render the system unusable. By using redundant components and mechanisms that detect and recover from faults, clusters can greatly increase the availability of applications critical to business operations.

Scalability

Simply defined, scalability is a computer system characteristic that enables improved performance or throughput when supplementary hardware resources are added. Scalable systems allow increased throughput by adding components to an existing system without the expense of adding an entire new system.
Clustering Overview 1-3
In a stand-alone server configuration, scalable systems allow increased throughput by adding processors or more memory. In a cluster configuration, this result is usually obtained by adding cluster nodes.
Not only must the hardware benefit from additional components, but also software must be constructed in such a way as to take advantage of the additional processing power. Oracle8i Parallel Server distributes the workload among the cluster nodes. As more nodes are added to the cluster, cluster-aware applications can use the parallel features of Oracle8i Parallel Server to distribute workload among more servers, thereby obtaining greater throughput.
1-4 Compaq Parallel Database Cluster Model PDC/O2000 for Oracle8i and Windows 2000 Administrator Guide

Compaq Parallel Database Cluster Overview

As traditional clustering technology has moved into the open systems of client/server computing, Compaq has provided innovative, customer-focused solutions. The PDC/O2000 moves client/server computing one step closer to the capabilities found in expensive, proprietary cluster solutions, at a fraction of the cost.
The PDC/O2000 combines the popular Microsoft Windows 2000 Advanced Server operating system and the industry-leading Oracle8i Parallel Server with award-winning Compaq ProLiant servers and shared storage subsystems.
Together, these hardware and software components provide improved performance through a truly scalable parallel application and improved availability using clustering software that rapidly recovers from detectable faults. These components also provide improved availability through concurrent multinode database access using Oracle8i Parallel Server.
Chapter 2
Cluster Architecture
The Compaq Parallel Database Cluster Model PDC/O2000 (referred to here as the PDC/O2000) is an integration of a number of different hardware and software products. This chapter discusses how these products play a role in bringing a complete clustering solution to your computing environment.
The hardware products include:
Compaq ProLiant servers
Shared storage components
G Compaq StorageWorks RAID Array 4100s (RA4100 Arrays) or
Compaq StorageWorks RAID Array 4000s (RA4000 Arrays)
G Two Compaq StorageWorks RAID Array 4000 Controllers (RA4000
Array Controllers) installed in each RA4000 Array or RA4100 Array
G Compaq StorageWorks Fibre Channel SAN Switches (Fibre Channel
SAN Switches) for redundant Fibre Channel Fabrics
G Compaq StorageWorks Storage Hubs (Storage Hubs) or Compaq
StorageWorks FC-AL Switches (FC-AL Switches) for redundant Fibre Channel Arbitrated Loops
G Compaq StorageWorks 64-bit/66 MHz Fibre Channel Host Adapters
or Compaq StorageWorks Fibre Channel Host Adapter/Ps (Fibre Host Adapters) installed in each server
G Gigabit Interface Converter-Shortwave (GBIC-SW) modules
G Fibre Channel cables
2-2 Compaq Parallel Database Cluster Model PDC/O2000 for Oracle8i and Windows 2000 Administrator Guide
Cluster interconnect components
G Ethernet NIC adapters
G Ethernet cables
G Ethernet switches/hubs
The software products include:
Microsoft Windows 2000 Advanced Server with Service Pack 1 or later
Compaq drivers and utilities
Oracle8i Enterprise Edition with the Oracle8i Parallel Server Option
Refer to Chapter 3, “Cluster Software Components,” for a description of the software products used with the PDC/O2000.

Compaq ProLiant Servers

A primary component of any cluster is the server. Each PDC/O2000 consists of two or more cluster nodes. Each node is a Compaq ProLiant server.
With some exceptions, all nodes in a PDC/O2000 cluster must be identical in model. In addition, all components common to all nodes in a cluster, such as memory, number of CPUs, and the interconnect adapters, must be identical and identically configured.
NOTE: Certain restrictions apply to the server models and server configurations that are supported by the PDC/O2000. For a current list of PDC-certified servers and details on supported configurations, refer to the Compaq Parallel Database Cluster Model PDC/O2000 Certification Matrix for Windows 2000 at
www.compaq.com/solutions/enterprise/ha-pdc.html
High-Availability Features of ProLiant Servers
In addition to the increased application and data availability enabled by clustering, ProLiant servers include many reliability features that provide a solid foundation for effective clustered server solutions. The PDC/O2000 is based on ProLiant servers, most of which offer excellent reliability through redundant power supplies, redundant cooling fans, and Error Checking and Correcting (ECC) memory. The high-availability features of ProLiant servers are a critical foundation of Compaq clustering products. Table 2-1 lists the high-availability features found in many ProLiant servers.
Table 2-1
High-Availability Components of ProLiant Servers
Hot-pluggable hard drives Redundant power supplies
Digital Linear Tape (DLT) Array (optional) ECC-protected processor-memory bus
Uninterruptible power supplies (optional) Redundant processor power modules
ECC memory PCI Hot Plug slots (in some servers)
Offline backup processor Redundant cooling fans
Cluster Architecture 2-3

Shared Storage Components

The PDC/O2000 is based on a cluster architecture known as “shared storage clustering,” in which clustered nodes share access to a common set of shared disk drives. For the PDC/O2000, the shared storage includes these hardware components:
RA4000 Arrays or RA4100 Arrays
RA4000 Array Controllers
Fibre Channel SAN Switches for each redundant Fibre Channel Fabric
2-4 Compaq Parallel Database Cluster Model PDC/O2000 for Oracle8i and Windows 2000 Administrator Guide
Storage Hubs or FC-AL Switches for each redundant Fibre Channel Arbitrated Loop (FC-AL)
Fibre Host Adapters
Gigabit Interface Converter-Shortwave (GBIC-SW) modules
Fibre Channel cables

RA4000 Array

The RA4000 Array is one shared storage solution for the PDC/O2000. Each redundant Fibre Channel Fabric or redundant FC-AL supports one or more RA4000 Arrays. Each RA4000 Array contains two single-port RA4000 Array Controllers. Each array controller connects the RA4000 Array to one Storage Hub, FC-AL Switch, or Fibre Channel SAN Switch.
The RA4000 Array can hold up to twelve 1-inch or eight 1.6-inch Wide-Ultra SCSI drives. The drives must be mounted on Compaq hot-pluggable drive trays. SCSI IDs are assigned automatically according to their drive location, allowing 1-inch and 1.6-inch drives to be intermixed within the same RA4000 Array.
The RA4000 Array comes in either a rack-mountable or a tower model.
For more information about the RA4000 Array, refer to the Compaq StorageWorks RAID Array 4000 User Guide.

RA4100 Array

The RA4100 Array is another shared storage solution for the PDC/O2000. Each redundant Fibre Channel Fabric or redundant FC-AL supports one or more RA4100 Arrays. Each RA4100 Array contains two single-port RA4000 Array Controllers. Each array controller connects the RA4100 Array to one Storage Hub, FC-AL Switch, or Fibre Channel SAN Switch.
The RA4100 Array can hold up to twelve 1-inch Compaq Hot Plug Ultra2 Disk Drives. The drives must be mounted on Compaq hot-pluggable drive trays. SCSI IDs are assigned automatically according to their drive location.
The RA4100 Array comes in a rack-mountable model.
For more information about the RA4100 Array, refer to the Compaq StorageWorks RAID Array 4100 User Guide.

RA4000 Array Controllers

To ensure redundant I/O paths, two single-port RA4000 Array Controllers are installed in each RA4000 Array or RA4100 Array. Only one array controller can be active at any given time. One array controller is configured as the active controller, and the other is the standby controller. To ensure fault tolerance of shared storage on the RA4000 Array or RA4100 Array, the two array controllers must be connected to a different Storage Hub, FC-AL Switch, or Fibre Channel SAN Switch.
From the perspective of the cluster nodes, each RA4000 Array Controller is simply another device connected to one of the cluster’s I/O paths. Consequently, each node sends its I/O requests to the active RA4000 Array Controller just as it would to any SCSI device. The RA4000 Array Controller receives the I/O requests from the nodes and directs them to the shared storage disks to which it has been configured. Because the array controller processes the I/O requests, the cluster nodes are not burdened with the I/O processing tasks associated with reading and writing data to multiple shared storage devices.
When an RA4000/RA4100 Array and the cluster nodes to which it is physically connected are first powered on, the RA4000/RA4100 Array communicates with the nodes to identify which of its two array controller slots contains the active array controller. The array controller that is installed in the active slot is automatically assigned active status by Compaq Secure Path, without the need for any further configuration. To determine which of the two array controllers in an RA4000/RA4100 Array is currently active, find the controller on which the ninth green LED is lit; this LED identifies the active array controller.
Cluster Architecture 2-5
To change the active slot location, use Secure Path Manager to make the array controller in the other slot the active controller. For information about configuring the standby array controller to be active, refer to Defining Active Array Controllers in Chapter 5, Installation and Configuration.
If the active RA4000 Array Controller in an RA4000/RA4100 Array fails, Secure Path causes the standby controller to become the active array controller.
2-6 Compaq Parallel Database Cluster Model PDC/O2000 for Oracle8i and Windows 2000 Administrator Guide
Access to the same logical disks is provided to both RA4000 Array Controllers to allow for successful failovers. In this configuration, both the active and standby array controllers are configured to receive and transmit data for the same logical disks.
For more information about the RA4000 Array Controller, refer to the
Compaq StorageWorks RAID Array 4000 User Guide or the Compaq StorageWorks RAID Array 4100 User Guide.

Fibre Channel SAN Switches

IMPORTANT: For detailed information about cascading two Fibre Channel SAN Switches,
refer to the latest Compaq StorageWorks documentation. This guide does not document cascaded configurations for the Fibre Channel SAN Switch.
Fibre Channel SAN Switches are installed between cluster nodes and shared storage subsystems in PDC/O2000 clusters to create redundant Fibre Channel Fabrics.
An 8-port Fibre Channel SAN Switch and 16-port Fibre Channel SAN Switch are supported. From two to four Fibre Channel SAN Switches can be used in each redundant Fibre Channel Fabric.
Fibre Channel SAN Switches are used to connect the Fibre Host Adapters in a PDC/O2000’s redundant Fibre Channel Fabric to the array controllers in the RA4000/RA4100 Arrays. Two or more Fibre Channel SAN Switches are used in each redundant Fibre Channel Fabric. Using at least two Fibre Channel SAN Switches provides fault tolerance and supports the redundant architecture described in Redundant Fibre Channel Fabrics in this chapter.
Fibre Channel SAN Switches provide full 100 MBps bandwidth on every port. Adding new devices to Fibre Channel SAN Switch ports increases the aggregate bandwidth.
For further information, refer to these manuals provided with each Fibre Channel SAN Switch:
Compaq StorageWorks Fibre Channel SAN Switch 8 Installation and
Hardware Guide
Compaq StorageWorks Fibre Channel SAN Switch 16 Installation and
Hardware Guide
Compaq StorageWorks Fibre Channel SAN Switch Management Guide
provided with the Fibre Channel SAN Switch
Loading...
+ 217 hidden pages