<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE html
PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html lang="en-us" xml:lang="en-us">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
<meta name="security" content="public" />
<meta name="Robots" content="index,follow" />
<meta http-equiv="PICS-Label" content='(PICS-1.1 "http://www.icra.org/ratingsv02.html" l gen true r (cz 1 lz 1 nz 1 oz 1 vz 1) "http://www.rsac.org/ratingsv01.html" l gen true r (n 0 s 0 v 0 l 0) "http://www.classify.org/safesurf/" l gen true r (SS~~000 1))' />
<meta name="DC.Type" content="task" />
<meta name="DC.Title" content="Tips: Cluster partitions" />
<meta name="abstract" content="Use these tips for cluster partitions." />
<meta name="description" content="Use these tips for cluster partitions." />
<meta name="DC.Relation" scheme="URI" content="rzaigtroubleshootpartitionerrors.htm" />
<meta name="DC.Relation" scheme="URI" content="rzaigtroubleshootchangepartitionednodes.htm" />
<meta name="DC.Relation" scheme="URI" content="../apis/clcntdltc.htm" />
<meta name="copyright" content="(C) Copyright IBM Corporation 1998, 2006" />
<meta name="DC.Rights.Owner" content="(C) Copyright IBM Corporation 1998, 2006" />
<meta name="DC.Format" content="XHTML" />
<meta name="DC.Identifier" content="rzaigtroubleshoottipclusterpartitions" />
<meta name="DC.Language" content="en-us" />
<!-- All rights reserved. Licensed Materials Property of IBM -->
<!-- US Government Users Restricted Rights -->
<!-- Use, duplication or disclosure restricted by -->
<!-- GSA ADP Schedule Contract with IBM Corp. -->
<link rel="stylesheet" type="text/css" href="./ibmdita.css" />
<link rel="stylesheet" type="text/css" href="./ic.css" />
<title>Tips: Cluster partitions</title>
</head>
<body id="rzaigtroubleshoottipclusterpartitions"><a name="rzaigtroubleshoottipclusterpartitions"><!-- --></a>
<!-- Java sync-link --><script language="Javascript" src="../rzahg/synch.js" type="text/javascript"></script>
<h1 class="topictitle1">Tips: Cluster partitions</h1>
<div><p>Use these tips for cluster partitions.</p>
<ol><li class="stepexpand"><span>The rules for restricting operations within a partition are designed
to make merging the partitions feasible. Without these restrictions, reconstructing
the cluster requires extensive work.</span></li>
<li class="stepexpand"><span>If the nodes in the primary partition have been destroyed, special
processing may be necessary in a secondary partition. The most common cause
of this condition is the loss of the site that made up the
primary partition. Using the example in recovering from partition errors,
assume that Partition 1 was destroyed. In this case, the primary node for
Cluster Resource Groups B, C, and D must be located in Partition 2. The simplest
recovery is to use Change Cluster Node Entry to set both Node A and Node B
to failed. See <a href="rzaigtroubleshootchangepartitionednodes.htm">Change partitioned nodes to failed</a> for more information about
how to do this. You can also recover manually by performing
these operations:</span><ol type="a"><li><span>Remove Nodes A and B from the cluster in Partition 2. Partition
2 is now the cluster.</span></li>
<li><span>Establish any logical replication environments that the new cluster needs,
for example by using the Start Cluster Resource Group API or CL command.</span></li>
</ol>
<p>Because nodes have been removed from the cluster definition in Partition
2, an attempt to merge Partition 1 and Partition 2 will fail. To
correct the mismatch in cluster definitions, run the <a href="../apis/clcntdltc.htm"><span class="apiname">Delete Cluster (QcstDeleteCluster)</span> API</a> on
each node in Partition 1. Then add the nodes from Partition 1 to the cluster,
and reestablish all the cluster resource group definitions, recovery domains,
and logical replication. This procedure requires a great deal of work and is prone to errors,
so use it only in a site loss situation.</p>
</li>
<li class="stepexpand"><span>How a start node operation is processed depends on the status of
the node that is being started: </span> <p>The node either failed or was ended by an
End Node operation:</p>
<ol type="a"><li><span>Cluster resource services is started on the node that is being
started.</span></li>
<li><span>The cluster definition is copied from an active node in the cluster
to the node that is being started.</span></li>
<li><span>Any cluster resource group that has the node being started in
the recovery domain is copied from an active node in the cluster to the node
being started. No cluster resource groups are copied from the node that is
being started to an active node in the cluster.</span></li>
</ol>
<p>The node is a partitioned node:</p>
<ol type="a"><li><span>The cluster definition of an active node is compared to the
cluster definition of the node that is being started. If the definitions are
the same, the start will continue as a merge operation. If the definitions
do not match, the merge will stop, and the user will need to intervene.</span></li>
<li><span>If the merge continues, the node that is being started is set
to an active status.</span></li>
<li><span>Any cluster resource group that has the node being started in
the recovery domain is copied from the primary partition of the cluster resource
group to the secondary partition of the cluster resource group. Cluster resource
groups may be copied from the node that is being started to nodes that are
already active in the cluster.</span></li>
</ol>
</li>
</ol>
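<p>The start node processing described above can be sketched as a small Python model.
This is an illustration only, not how cluster resource services is implemented: the
<code>Node</code>, <code>Crg</code>, and <code>Status</code> types, the <code>start_node</code> function,
its return strings, and the convention that the first entry of a recovery domain is the primary node
are all assumptions made for this example. The sketch also assumes that both partitions agree on which node
is the primary for each cluster resource group.</p>

```python
from dataclasses import dataclass, field
from enum import Enum


class Status(Enum):
    ACTIVE = "active"
    FAILED = "failed"
    ENDED = "ended"
    PARTITIONED = "partitioned"


@dataclass(frozen=True)
class Crg:
    name: str
    recovery_domain: tuple  # node names; first entry is assumed to be the primary
    copy_of: str            # hypothetical label: which node this copy came from


@dataclass
class Node:
    name: str
    status: Status
    definition: frozenset                     # node names in this node's cluster definition
    partition: frozenset = frozenset()        # nodes it could reach while partitioned
    crgs: dict = field(default_factory=dict)  # CRG name to Crg


def start_node(active, starting):
    """Model starting `starting` against a node that is already active."""
    if starting.status in (Status.FAILED, Status.ENDED):
        # The definition and the relevant CRGs flow one way only:
        # from the active node to the node being started.
        starting.definition = active.definition
        starting.crgs = {
            name: crg for name, crg in active.crgs.items()
            if starting.name in crg.recovery_domain
        }
        starting.status = Status.ACTIVE
        return "started"

    if starting.status is Status.PARTITIONED:
        # A merge is attempted only when both cluster definitions match.
        if starting.definition != active.definition:
            return "merge stopped"
        starting.status = Status.ACTIVE
        for name in sorted(set(active.crgs) | set(starting.crgs)):
            crg = active.crgs.get(name) or starting.crgs.get(name)
            if starting.name not in crg.recovery_domain:
                continue
            # Copy from the CRG's primary partition to its secondary partition,
            # which may mean copying from the starting node to the active node.
            primary = crg.recovery_domain[0]
            if primary in starting.partition:
                source, target = starting, active
            else:
                source, target = active, starting
            if name in source.crgs:
                target.crgs[name] = source.crgs[name]
        starting.partition = frozenset()
        return "merged"

    return "already active"
```

<p>In this model, a node from Partition 1 that still lists Nodes A, B, C, and D in its cluster definition
gets the result "merge stopped" when it is started against a node whose definition lists only Nodes C and D.
That is the mismatch that arises after Nodes A and B have been removed from the cluster in Partition 2.</p>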
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="rzaigtroubleshootpartitionerrors.htm" title="Certain cluster conditions are easily corrected. If a cluster partition has occurred, you can learn how to recover. This topic also tells you how to avoid a cluster partition and gives you an example of how to merge partitions back together.">Partition errors</a></div>
</div>
<div class="reltasks"><strong>Related tasks</strong><br />
<div><a href="rzaigtroubleshootchangepartitionednodes.htm" title="Sometimes, a partitioned condition is reported when there really was a node outage. This can occur when cluster resource services loses communications with one or more nodes, but cannot detect if the nodes are still operational. When this condition occurs, a simple mechanism exists for you to indicate that the node has failed.">Change partitioned nodes to failed</a></div>
</div>
<div class="relref"><strong>Related reference</strong><br />
<div><a href="../apis/clcntdltc.htm">Delete Cluster (QcstDeleteCluster) API</a></div>
</div>
</div>
</body>
</html>