Automatic Configuration Validationfor Oracle RAC

Oracle RAC

You’ve invested a great deal of time and effort in building highly available Oracle RAC clusters. But will those clusters work when you need them most? Are they guaranteed to fail-over flawlessly no matter what happens?

Let’s face it: it is difficult to keep your cluster configuration perfectly aligned with vendor best-practices, and in sync with changes in the other IT layers that Oracle RAC interfaces with (such as OS, Storage, Networking and more). Unfortunately, even a small misconfiguration or discrepancy between cluster nodes can lead to unsuccessful fail-overs and painful outages at the worst possible time.

AvailabilityGuard automatically detects Oracle RAC nodes, as well as detailed cluster and resource configuration on each node. This includes automatic detection of ASM configuration within standalone database servers and cluster nodes, as well as capture and display of the relationship and configuration of nodes.

Oracle RAC Configuration Alignment with Storage and Replication

AvailabilityGuard routinely verifies that your underlying storage devices are accessible and configured to provide equal levels of availability and service. With AvailabilityGuard, you can be confident that clusters will fail-over successfully, mount storage volumes and volume groups, and start applications – whether running on physical servers or logical partitions.

AvailabilitGuard detects many issues in order to ensure Oracle RAC’s performance:

  • LUNS inaccessible to cluster nodes (local or remote nodes), or accessible to unauthorized hosts
  • SCSI-3 reservation best practice violations
  • Data misplaced on incorrect storage tier, or on un-shared volumes.
  • Fabric single point of failure or masking/zoning misconfigurations that will fail fail-over
  • Shares storage and ASM configuration best practices
  • ASM storage alignment issues (I/O capacity, RAID type, storage frame, etc.)
  • ASM Replication issues (if storage-based; completeness, consistency, device mapping, etc.)
  • And more.

Oracle RAC Configuration Alignment with Server and Application level settings

AvailabilityGuard analyzes the configuration of the different components within the domain of the Oracle RAC, including logical partitions, operating systems, volume groups, file systems, Oracle database files and more. AvailabilityGuard verifies that the cluster configuration and the settings of each of these components are aligned and well-orchestrated. Any mismatch may lead to failed switch-overs.

A few sample configurations that are validated:

  • Oracle RAC configuration (cluster components in bad state, DB parameters configuration,…)
  • Cluster and resource best practices (e.g., resources with hard dependencies that are missing pullup and / or hard start dependencies, start dependencies with type modifier syntax errors. inconsistent action scripts)
  • Host-level configuration (OS version, SP, patch, kernel parameters, network configuration, …)
  • Oracle-level configuration
  • Voting-disk best practices
  • Database Configuration best practices
  • Mismatch between OS mount configuration and cluster mount resource config
  • LVM mirroring
  • Existence of key directories/files as defined in resources (Oracle listener.ora, Apache httpDir, SYMCLI, …)
  • Resource-specific best practices (volume group, logical volume, file system, application, Service IP labels, Tape resources)
  • Server network configuration – NIC bonding, private and public network connections, etc.
  • VSCSI and NPIV guidelines for availability and data protection
  • And more.

Oracle RAC Node Alignment

Using an intelligent comparison engine, AvailabilityGuard assists the cluster administrator to identify major differences between cluster nodes. Such inconsistencies often lead to unexpected behavior at and following a cluster fail-over.

Sample issues:

  • Differences in OS version, technology level, installed products, patches, user and group config, kernel parameters, services, network options, configuration files, etc.
  • Difference in FC Adapter settings, Network adapters, time and ntp settings, etc.
  • Difference in multipath config – hdisk number of path, algorithm, queue depth, reserve policy and more.
  • Differences in WebSphere/Weblogic/Tomcat deployments (binaries, domains, Java, etc.)
  • And more.

Oracle RAC Configuration Vulnerabilities

AvailabilityGuard analyzes the configuration of Oracle RAC itself, and verifies that it complies with Oracle’s guidelines and with community-driven best practices. The analysis includes comprehensive investigation of resource groups, resources, network interface, Heartbeat management, and additional components.

AvailabilityGuard validates your Oracle RAC’s:

  • Resource and resource dependency configuration
  • Network configuration best practices
  • States for resource, group and systems
  • And more.