ibm-information-center/dist/eclipse/plugins/i5OS.ic.rzaie_5.4.0.1/rzaieconfigthesaurus.htm

171 lines
9.7 KiB
HTML

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE html
PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html lang="en-us" xml:lang="en-us">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
<meta name="security" content="public" />
<meta name="Robots" content="index,follow" />
<meta http-equiv="PICS-Label" content='(PICS-1.1 "http://www.icra.org/ratingsv02.html" l gen true r (cz 1 lz 1 nz 1 oz 1 vz 1) "http://www.rsac.org/ratingsv01.html" l gen true r (n 0 s 0 v 0 l 0) "http://www.classify.org/safesurf/" l gen true r (SS~~000 1))' />
<meta name="DC.Type" content="topic" />
<meta name="DC.Title" content="Set up a thesaurus dictionary for the Webserver search engine on HTTP Server" />
<meta name="abstract" content="This topic provides information about how to set up a thesaurus dictionary file for use with the Webserver search engine with the IBM Web Administration for i5/OS interface." />
<meta name="description" content="This topic provides information about how to set up a thesaurus dictionary file for use with the Webserver search engine with the IBM Web Administration for i5/OS interface." />
<meta name="DC.Relation" scheme="URI" content="rzaieparsearch.htm" />
<meta name="copyright" content="(C) Copyright IBM Corporation 2002,2006" />
<meta name="DC.Rights.Owner" content="(C) Copyright IBM Corporation 2002,2006" />
<meta name="DC.Format" content="XHTML" />
<meta name="DC.Identifier" content="rzaieconfigthesaurus" />
<meta name="DC.Language" content="en-us" />
<!-- All rights reserved. Licensed Materials Property of IBM -->
<!-- US Government Users Restricted Rights -->
<!-- Use, duplication or disclosure restricted by -->
<!-- GSA ADP Schedule Contract with IBM Corp. -->
<link rel="stylesheet" type="text/css" href="./ibmdita.css" />
<link rel="stylesheet" type="text/css" href="./ic.css" />
<title>Set up a thesaurus dictionary for the Webserver search engine on HTTP
Server</title>
</head>
<body id="rzaieconfigthesaurus"><a name="rzaieconfigthesaurus"><!-- --></a>
<!-- Java sync-link --><script language="Javascript" src="../rzahg/synch.js" type="text/javascript"></script>
<h1 class="topictitle1">Set up a thesaurus dictionary for the Webserver search engine on HTTP
Server</h1>
<div><p>This topic provides information about how to set up a thesaurus
dictionary file for use with the Webserver search engine with the <span>IBM<sup>®</sup> Web Administration for i5/OS™ interface</span>.</p>
<div class="important"><span class="importanttitle">Important:</span> Information
for this topic supports the latest PTF levels for HTTP Server for i5/OS .
It is recommended that you install the latest PTFs to upgrade to the latest
level of the HTTP Server for i5/OS. Some of the topics documented here are
not available prior to this update. See <a href="http://www-03.ibm.com/servers/eserver/iseries/software/http/services/service.html" target="_blank">http://www.ibm.com/servers/eserver/iseries/software/http/services/service.htm</a> <img src="www.gif" alt="Link outside Information Center" /> for more information. </div>
<p>The thesaurus support allows you to automatically expand a search query
by using a thesaurus. To make sure information is always found in your indexed
documents, you can create your own thesaurus in which you can list common
terms and associate them with the terms that exist in your documents. For
example, if a person typically searches for PC but your documents only refer
to a personal computer, just add <em>PC</em> to your thesaurus as a synonym
of <em>personal computer</em>. You must first create a thesaurus definition
file that contains terms that are related. Then you build the thesaurus dictionary
to be used by the Webserver search engine. See <a href="#build">Build the thesaurus dictionary</a> for
more information.</p>
<div class="note"><span class="notetitle">Note:</span> The thesaurus definition file can be created in IFS or QSYS.LIB.</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="rzaieparsearch.htm" title="This topic provides step-by-step tasks for the Webserver search engine.">Search tasks</a></div>
</div>
</div><div class="nested1" id="create"><a name="create"><!-- --></a><h2 class="topictitle2">Create a thesaurus definition file</h2>
<div><p>To create a thesaurus definition file, do the following: </p>
<ol><li>Open a text editor on the iSeries™, such as <tt>edtf</tt>.<div class="note"><span class="notetitle">Note:</span> Use <tt>edtf</tt> or
some other iSeries editor rather than a PC editor. It is important that the
file is tagged with the correct CCSID since the words will be matched with
words in the documents themselves.</div>
</li>
<li>Create the content of the thesaurus definition file using the following
file format: <ol type="a"><li>Open a text editor on the iSeries, such as <tt>edtf</tt>.<div class="note"><span class="notetitle">Note:</span> Use <tt>edtf</tt> or
some other iSeries editor rather than a PC editor. It is important that the
file is tagged with the correct CCSID since the words will be matched with
words in the documents themselves. </div>
</li>
<li>Create the content of the thesaurus definition file using the following
file format: <p>A thesaurus definition file consists of blocks containing
elements. Each element of the block is defined by a capitalized keyword. The
block also contains terms that are single or multiple words. For example "cake"
and "chocolate cake" are terms. </p>
<p>Each block starts with <tt>:WORDS</tt> followed
on the same line by one of the following: </p>
<pre class="block">RELATED
Where :RELATED indicates related terms that are not synonyms.
:SYNONYM
Where :SYNONYM indicates terms that are synonyms. </pre>
<p>Member
terms are listed in the block starting on the second line of the block, one
term per line. For example: </p>
<pre class="block">:WORDS:SYNONYM
PC
personal computer</pre>
<p>The following relationships can also be specified
within the block: </p>
<dl><dt class="dlterm">.LOWER_THAN </dt>
<dd>Where the block member terms are more specific in meaning than the term
following <tt>.LOWER_THAN</tt>. </dd>
</dl>
<dl><dt class="dlterm">.HIGHER_THAN </dt>
<dd>Where the block member terms are less specific in meaning than the term
following <tt>.HIGHER_THAN</tt>. </dd>
</dl>
<dl><dt class="dlterm">.RELATED_TO </dt>
<dd>Block member terms are related to this term. </dd>
</dl>
<dl><dt class="dlterm">.SYNONYM_OF </dt>
<dd>Block member terms are synonyms of this term. </dd>
</dl>
<p>A related term is specified on the same line as the relationship.
A term is a single or multiple words. The relationships can be specified in
any order within the block. For example the two following blocks are interpreted
exactly the same: </p>
<pre class="block">:WORDS
rain
snow
hail
.LOWER_THAN precipitation
.RELATED_TO weather
:WORDS
.LOWER_THAN precipitation
.RELATED_TO weather
rain
snow
hail</pre>
<p>When creating a thesaurus definition file, keep the
following in mind:</p>
<ul><li>Preceding and trailing blanks are removed. </li>
<li>Preceding and trailing control characters are removed. </li>
<li>Terms beginning with a period (.) or a colon (:) are not allowed. </li>
<li>Capital letters and small letters of the same character are treated as
the same character. </li>
<li>Leave the keywords that are UPPERCASED as-is. </li>
<li>Terms in the file may be in any language. </li>
<li>The maximum length of a term is 64 characters or 64 bytes. </li>
</ul>
<div class="note"><span class="notetitle">Note:</span> A sample thesaurus definition file is stored in<tt> /QIBM/ProdData/HTTP/Public/HTTPSVR/sample_thesaurus.txt</tt>.</div>
</li>
<li>Once you have created the thesaurus definition file, save it as a text
file (txt). <div class="note"><span class="notetitle">Note:</span> Terms can be added in any supported language; however, the
keywords (:RELATED, :SYNONYM, .LOWER_THAN, .HIGHER_THAN, .RELATED_TO, .SYNONYM_OF,
and :WORDS) can not be changed in order for the definition file to work. </div>
</li>
</ol>
</li>
</ol>
<p>After you have created a thesaurus dictionary, you can manage a thesaurus
dictionary . See <a href="rzaiemngthesdict.htm">Manage a thesaurus dictionary for the Webserver search engine on HTTP Server</a> for
more information. To use the dictionary on a search, select your index and
the search option. After you have selected to do a simple or advanced search,
you will reach a form that allows you to add a thesaurus dictionary to your
search. </p>
</div>
</div>
<div class="nested1" id="build"><a name="build"><!-- --></a><h2 class="topictitle2">Build the thesaurus dictionary</h2>
<div><p>To build the thesaurus dictionary, allowing it to be used by the Webserver
search engine, do the following: </p>
<ol><li>Click the <span class="uicontrol">Advanced</span> tab.</li>
<li>Click the <span class="uicontrol">Search Setup</span> subtab.</li>
<li>Expand <span class="uicontrol">Search Engine Setup</span>.</li>
<li>Click <span class="uicontrol">Build thesaurus dictionary</span>.</li>
<li>Enter the directory and name of the thesaurus definition file that contains
relationship data for generating a thesaurus dictionary in the <strong>Thesaurus
definition file</strong> field. A definition file is a simple text file with formatting
tags to indicate word relationships.</li>
<li>Enter a name for the thesaurus dictionary in the <strong>Thesaurus dictionary
name</strong> field. For example, <tt>mydict</tt>. </li>
<li>Enter the directory that is used to hold the thesaurus dictionary files
that are created in the Thesaurus dictionary directory field. Possible values
include <tt>/QIBM/UserData/HTTPSVR/search</tt> (the default setting), or any
valid directory path. </li>
<li>Click <span class="uicontrol">Apply</span>.</li>
</ol>
</div>
</div>
</body>
</html>