<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE html
PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html lang="en-us" xml:lang="en-us">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
<meta name="security" content="public" />
<meta name="Robots" content="index,follow" />
<meta http-equiv="PICS-Label" content='(PICS-1.1 "http://www.icra.org/ratingsv02.html" l gen true r (cz 1 lz 1 nz 1 oz 1 vz 1) "http://www.rsac.org/ratingsv01.html" l gen true r (n 0 s 0 v 0 l 0) "http://www.classify.org/safesurf/" l gen true r (SS~~000 1))' />
<meta name="DC.Type" content="task" />
<meta name="DC.Title" content="Recover from cluster job failures" />
<meta name="abstract" content="Failure of a cluster resource services job is usually indicative of some other problem." />
<meta name="description" content="Failure of a cluster resource services job is usually indicative of some other problem." />
<meta name="DC.Relation" scheme="URI" content="rzaigtroubleshootclusterrecovery.htm" />
<meta name="DC.Relation" scheme="URI" content="rzaigmanageendnode.htm" />
<meta name="DC.Relation" scheme="URI" content="rzaigmanagestartnode.htm" />
<meta name="DC.Relation" scheme="URI" content="rzaigmanagejobstructure.htm" />
<meta name="copyright" content="(C) Copyright IBM Corporation 1998, 2006" />
<meta name="DC.Rights.Owner" content="(C) Copyright IBM Corporation 1998, 2006" />
<meta name="DC.Format" content="XHTML" />
<meta name="DC.Identifier" content="rzaigtroubleshootrecoverjobfailure" />
<meta name="DC.Language" content="en-us" />
<!-- All rights reserved. Licensed Materials Property of IBM -->
<!-- US Government Users Restricted Rights -->
<!-- Use, duplication or disclosure restricted by -->
<!-- GSA ADP Schedule Contract with IBM Corp. -->
<link rel="stylesheet" type="text/css" href="./ibmdita.css" />
<link rel="stylesheet" type="text/css" href="./ic.css" />
<title>Recover from cluster job failures</title>
</head>
<body id="rzaigtroubleshootrecoverjobfailure"><a name="rzaigtroubleshootrecoverjobfailure"><!-- --></a>
<!-- Java sync-link --><script language="Javascript" src="../rzahg/synch.js" type="text/javascript"></script>
<h1 class="topictitle1">Recover from cluster job failures</h1>
<div><p>Failure of a cluster resource services job usually indicates
some other problem.</p>
<div class="section">Find the job log associated with the failed job and
look for messages that describe why it failed. Correct any error conditions. <p><img src="./delta.gif" alt="Start of change" />You can use the <a href="../cl/chgclurcy.htm">Change Cluster Recovery (CHGCLURCY) command</a> to restart
a cluster resource group job that was ended, without having to end and restart
clustering on the node. <img src="./deltaend.gif" alt="End of change" /></p>
</div>
<ol><li><img src="./delta.gif" alt="Start of change" /><span><samp class="codeph">CHGCLURCY CLUSTER(EXAMPLE) CRG(CRG1) NODE(NODE1) ACTION(*STRCRGJOB)</samp> This
command submits the cluster resource group job CRG1 on node NODE1.
Clustering must be active on NODE1 for the cluster resource group job
to start. </span><img src="./deltaend.gif" alt="End of change" /></li>
<li><span>Restart clustering on the node.</span></li>
</ol>
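<div class="section"><p>As a minimal sketch of step 2, assuming the same illustrative names used in step 1 (cluster EXAMPLE, node NODE1), restarting clustering on the node might look like the following pair of commands, run from an active node in the cluster. The names are examples only, and any optional parameters are left at their defaults here; check the End Cluster Node (ENDCLUNOD) and Start Cluster Node (STRCLUNOD) command documentation for your release before using them.</p>
<p><samp class="codeph">ENDCLUNOD CLUSTER(EXAMPLE) NODE(NODE1)</samp></p>
<p><samp class="codeph">STRCLUNOD CLUSTER(EXAMPLE) NODE(NODE1)</samp></p>
</div>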
<div class="section"><p>If you are using an IBM Business Partner cluster management product,
refer to the documentation that came with the product.</p>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="rzaigtroubleshootclusterrecovery.htm" title="Read about how to recover from other cluster failures that may occur.">Cluster recovery</a></div>
</div>
<div class="relconcepts"><strong>Related concepts</strong><br />
<div><a href="rzaigmanagejobstructure.htm" title="When managing a cluster, you need to know about job structures and user queues.">Job structure and user queues</a></div>
</div>
<div class="reltasks"><strong>Related tasks</strong><br />
<div><a href="rzaigmanageendnode.htm" title="Stopping or ending a node stops cluster resource services on that node.">End a cluster node</a></div>
<div><a href="rzaigmanagestartnode.htm" title="Starting a cluster node starts cluster resource services on a node in the cluster. Beginning with cluster version 3, a node can start itself and will be able to rejoin the current active cluster, provided it can find an active node in the cluster.">Start a cluster node</a></div>
</div>
</div>
</body>
</html>