ibm-information-center/dist/eclipse/plugins/i5OS.ic.rzaig_5.4.0.1/rzaigtroubleshootrecoverjobfailure.htm

64 lines
4.4 KiB
HTML
Raw Normal View History

2024-04-02 14:02:31 +00:00
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE html
PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html lang="en-us" xml:lang="en-us">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
<meta name="security" content="public" />
<meta name="Robots" content="index,follow" />
<meta http-equiv="PICS-Label" content='(PICS-1.1 "http://www.icra.org/ratingsv02.html" l gen true r (cz 1 lz 1 nz 1 oz 1 vz 1) "http://www.rsac.org/ratingsv01.html" l gen true r (n 0 s 0 v 0 l 0) "http://www.classify.org/safesurf/" l gen true r (SS~~000 1))' />
<meta name="DC.Type" content="task" />
<meta name="DC.Title" content="Recover from cluster job failures" />
<meta name="abstract" content="Failure of a cluster resource services job is usually indicative of some other problem." />
<meta name="description" content="Failure of a cluster resource services job is usually indicative of some other problem." />
<meta name="DC.Relation" scheme="URI" content="rzaigtroubleshootclusterrecovery.htm" />
<meta name="DC.Relation" scheme="URI" content="rzaigmanageendnode.htm" />
<meta name="DC.Relation" scheme="URI" content="rzaigmanagestartnode.htm" />
<meta name="DC.Relation" scheme="URI" content="rzaigmanagejobstructure.htm" />
<meta name="copyright" content="(C) Copyright IBM Corporation 1998, 2006" />
<meta name="DC.Rights.Owner" content="(C) Copyright IBM Corporation 1998, 2006" />
<meta name="DC.Format" content="XHTML" />
<meta name="DC.Identifier" content="rzaigtroubleshootrecoverjobfailure" />
<meta name="DC.Language" content="en-us" />
<!-- All rights reserved. Licensed Materials Property of IBM -->
<!-- US Government Users Restricted Rights -->
<!-- Use, duplication or disclosure restricted by -->
<!-- GSA ADP Schedule Contract with IBM Corp. -->
<link rel="stylesheet" type="text/css" href="./ibmdita.css" />
<link rel="stylesheet" type="text/css" href="./ic.css" />
<title>Recover from cluster job failures</title>
</head>
<body id="rzaigtroubleshootrecoverjobfailure"><a name="rzaigtroubleshootrecoverjobfailure"><!-- --></a>
<!-- Java sync-link --><script language="Javascript" src="../rzahg/synch.js" type="text/javascript"></script>
<h1 class="topictitle1">Recover from cluster job failures</h1>
<div><p>Failure of a cluster resource services job is usually indicative
of some other problem.</p>
<div class="section">You should look for the job log associated with the failed job and
look for messages that describe why it failed. Correct any error situations. <p><img src="./delta.gif" alt="Start of change" />You can use the <a href="../cl/chgclurcy.htm">Change Cluster Recovery (CHGCLURCY) command</a> to restart
a cluster resource group job that was ended without having to end and restart
clustering on a node. <img src="./deltaend.gif" alt="End of change" /></p>
</div>
<ol><li><img src="./delta.gif" alt="Start of change" /><span><samp class="codeph">CHGCLURCY CLUSTER(EXAMPLE)CRG(CRG1)NODE(NODE1)ACTION(*STRCRGJOB)</samp> This
command will cause cluster resource group job, CRG1, on node NODE1 to be submitted.
To start the cluster resource group job on NODE1 requires clustering to be
active on NODE1. </span><img src="./deltaend.gif" alt="End of change" /></li>
<li><span>Restart clustering on the node.</span></li>
</ol>
<div class="section"><p>If you are using a IBM Business Partner cluster management product,
refer to the documentation that came with the product.</p>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="rzaigtroubleshootclusterrecovery.htm" title="Read about how to recover from other cluster failures that may occur.">Cluster recovery</a></div>
</div>
<div class="relconcepts"><strong>Related concepts</strong><br />
<div><a href="rzaigmanagejobstructure.htm" title="When managing cluster, you need to know about job structures and user queues.">Job structure and user queues</a></div>
</div>
<div class="reltasks"><strong>Related tasks</strong><br />
<div><a href="rzaigmanageendnode.htm" title="Stopping or ending a node stops cluster resource services on that node.">End a cluster node</a></div>
<div><a href="rzaigmanagestartnode.htm" title="Starting a cluster node starts cluster resource services on a node in the cluster. Beginning with cluster version 3, a node can start itself and will be able to rejoin the current active cluster, provided it can find an active node in the cluster.">Start a cluster node</a></div>
</div>
</div>
</body>
</html>