<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE html
PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html lang="en-us" xml:lang="en-us">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
<meta name="security" content="public" />
<meta name="Robots" content="index,follow" />
<meta http-equiv="PICS-Label" content='(PICS-1.1 "http://www.icra.org/ratingsv02.html" l gen true r (cz 1 lz 1 nz 1 oz 1 vz 1) "http://www.rsac.org/ratingsv01.html" l gen true r (n 0 s 0 v 0 l 0) "http://www.classify.org/safesurf/" l gen true r (SS~~000 1))' />
<meta name="DC.Type" content="topic" />
<meta name="DC.Title" content="Set up a thesaurus dictionary for the Webserver search engine on HTTP Server" />
<meta name="abstract" content="This topic provides information about how to set up a thesaurus dictionary file for use with the Webserver search engine with the IBM Web Administration for i5/OS interface." />
<meta name="description" content="This topic provides information about how to set up a thesaurus dictionary file for use with the Webserver search engine with the IBM Web Administration for i5/OS interface." />
<meta name="DC.Relation" scheme="URI" content="rzaieparsearch.htm" />
<meta name="copyright" content="(C) Copyright IBM Corporation 2002,2006" />
<meta name="DC.Rights.Owner" content="(C) Copyright IBM Corporation 2002,2006" />
<meta name="DC.Format" content="XHTML" />
<meta name="DC.Identifier" content="rzaieconfigthesaurus" />
<meta name="DC.Language" content="en-us" />
<!-- All rights reserved. Licensed Materials Property of IBM -->
<!-- US Government Users Restricted Rights -->
<!-- Use, duplication or disclosure restricted by -->
<!-- GSA ADP Schedule Contract with IBM Corp. -->
<link rel="stylesheet" type="text/css" href="./ibmdita.css" />
<link rel="stylesheet" type="text/css" href="./ic.css" />
<title>Set up a thesaurus dictionary for the Webserver search engine on HTTP
Server</title>
</head>
<body id="rzaieconfigthesaurus"><a name="rzaieconfigthesaurus"><!-- --></a>
<!-- Java sync-link --><script language="Javascript" src="../rzahg/synch.js" type="text/javascript"></script>
<h1 class="topictitle1">Set up a thesaurus dictionary for the Webserver search engine on HTTP
Server</h1>
<div><p>This topic describes how to set up a thesaurus
dictionary file for the Webserver search engine by using the <span>IBM<sup>®</sup> Web Administration for i5/OS™ interface</span>.</p>
<div class="important"><span class="importanttitle">Important:</span> Information
for this topic supports the latest PTF levels for HTTP Server for i5/OS.
It is recommended that you install the latest PTFs to upgrade to the latest
level of the HTTP Server for i5/OS. Some of the topics documented here are
not available prior to this update. See <a href="http://www-03.ibm.com/servers/eserver/iseries/software/http/services/service.html" target="_blank">http://www.ibm.com/servers/eserver/iseries/software/http/services/service.htm</a> <img src="www.gif" alt="Link outside Information Center" /> for more information. </div>
<p>Thesaurus support automatically expands a search query
with related terms from a thesaurus. To help users find information in your indexed
documents even when they search with different wording, you can create your own thesaurus that lists common
search terms and associates them with the terms that appear in your documents. For
example, if users typically search for PC but your documents refer only
to a personal computer, add <em>PC</em> to your thesaurus as a synonym
of <em>personal computer</em>. First, create a thesaurus definition
file that contains the related terms. Then build the thesaurus dictionary
that the Webserver search engine uses. See <a href="#build">Build the thesaurus dictionary</a> for
more information.</p>
<div class="note"><span class="notetitle">Note:</span> The thesaurus definition file can be created in IFS or QSYS.LIB.</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="rzaieparsearch.htm" title="This topic provides step-by-step tasks for the Webserver search engine.">Search tasks</a></div>
</div>
</div><div class="nested1" id="create"><a name="create"><!-- --></a><h2 class="topictitle2">Create a thesaurus definition file</h2>
<div><p>To create a thesaurus definition file, do the following: </p>
<ol><li>Open a text editor on the iSeries™, such as <tt>edtf</tt>.<div class="note"><span class="notetitle">Note:</span> Use <tt>edtf</tt> or
some other iSeries editor rather than a PC editor. The
file must be tagged with the correct CCSID because its words are matched against
the words in the documents themselves.</div>
</li>
<li>Create and save the thesaurus definition file: <ol type="a"><li>Create the content of the thesaurus definition file using the following
file format: <p>A thesaurus definition file consists of blocks containing
elements. Each element of the block is defined by an uppercase keyword. The
block also contains terms, which can be single words or multiple words. For example, "cake"
and "chocolate cake" are terms. </p>
<p>Each block starts with <tt>:WORDS</tt> followed
on the same line by one of the following: </p>
<pre class="block">:RELATED
Where :RELATED indicates related terms that are not synonyms.
:SYNONYM
Where :SYNONYM indicates terms that are synonyms. </pre>
<p>Member terms are listed in the block starting on the second line of the block, one
term per line. For example: </p>
<pre class="block">:WORDS:SYNONYM
PC
personal computer</pre>
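<p>A block of related terms that are not synonyms might look like the following sketch. It assumes that <tt>:RELATED</tt> is written on the <tt>:WORDS</tt> line in the same way as <tt>:SYNONYM</tt> in the example above. The terms are illustrative and are not taken from the sample file. </p>
<pre class="block">:WORDS:RELATED
laptop
personal computer</pre>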
<p>The following relationships can also be specified
within the block: </p>
<dl><dt class="dlterm">.LOWER_THAN </dt>
<dd>Block member terms are more specific in meaning than the term
following <tt>.LOWER_THAN</tt>. </dd>
</dl>
<dl><dt class="dlterm">.HIGHER_THAN </dt>
<dd>Block member terms are less specific in meaning than the term
following <tt>.HIGHER_THAN</tt>. </dd>
</dl>
<dl><dt class="dlterm">.RELATED_TO </dt>
<dd>Block member terms are related to this term. </dd>
</dl>
<dl><dt class="dlterm">.SYNONYM_OF </dt>
<dd>Block member terms are synonyms of this term. </dd>
</dl>
<p>The related term is specified on the same line as the relationship keyword.
A term can be a single word or multiple words. The relationships can be specified in
any order within the block. For example, the following two blocks are interpreted
in exactly the same way: </p>
<pre class="block">:WORDS
rain
snow
hail
.LOWER_THAN precipitation
.RELATED_TO weather

:WORDS
.LOWER_THAN precipitation
.RELATED_TO weather
rain
snow
hail</pre>
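<p>The <tt>.HIGHER_THAN</tt> and <tt>.SYNONYM_OF</tt> relationships can be sketched in the same way. The terms below are illustrative assumptions, not taken from the sample file. The first block marks <em>precipitation</em> as less specific than <em>rain</em>; the second marks <em>PC</em> as a synonym of <em>personal computer</em>. </p>
<pre class="block">:WORDS
precipitation
.HIGHER_THAN rain

:WORDS
PC
.SYNONYM_OF personal computer</pre>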
<p>When creating a thesaurus definition file, keep the
following in mind:</p>
<ul><li>Leading and trailing blanks are removed. </li>
<li>Leading and trailing control characters are removed. </li>
<li>Terms beginning with a period (.) or a colon (:) are not allowed. </li>
<li>Uppercase and lowercase forms of the same letter are treated as
the same character. </li>
<li>Do not change the uppercase keywords; use them exactly as shown. </li>
<li>Terms in the file may be in any language. </li>
<li>The maximum length of a term is 64 characters or 64 bytes. </li>
</ul>
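<p>As a hedged illustration of these rules, the two blocks in the following sketch should be interpreted the same way, assuming that the blank-trimming and case rules apply to member terms as described: the second block differs only in leading blanks and letter case. The terms are illustrative. By contrast, a term such as <em>.net</em> could not be listed as a member, because it begins with a period. </p>
<pre class="block">:WORDS:SYNONYM
PC
personal computer

:WORDS:SYNONYM
   pc
   Personal Computer</pre>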
<div class="note"><span class="notetitle">Note:</span> A sample thesaurus definition file is stored in <tt>/QIBM/ProdData/HTTP/Public/HTTPSVR/sample_thesaurus.txt</tt>.</div>
</li>
<li>Once you have created the thesaurus definition file, save it as a text
file (.txt). <div class="note"><span class="notetitle">Note:</span> Terms can be added in any supported language; however, the
keywords (:RELATED, :SYNONYM, .LOWER_THAN, .HIGHER_THAN, .RELATED_TO, .SYNONYM_OF,
and :WORDS) must not be changed, or the definition file will not work. </div>
</li>
</ol>
</li>
</ol>
<p>After you have created a thesaurus dictionary, you can manage it. See <a href="rzaiemngthesdict.htm">Manage a thesaurus dictionary for the Webserver search engine on HTTP Server</a> for
more information. To use the dictionary in a search, select your index and
the search option. After you choose a simple or advanced search,
a form is displayed that allows you to add a thesaurus dictionary to your
search. </p>
</div>
</div>
<div class="nested1" id="build"><a name="build"><!-- --></a><h2 class="topictitle2">Build the thesaurus dictionary</h2>
<div><p>To build the thesaurus dictionary so that the Webserver
search engine can use it, do the following: </p>
<ol><li>Click the <span class="uicontrol">Advanced</span> tab.</li>
<li>Click the <span class="uicontrol">Search Setup</span> subtab.</li>
<li>Expand <span class="uicontrol">Search Engine Setup</span>.</li>
<li>Click <span class="uicontrol">Build thesaurus dictionary</span>.</li>
<li>In the <strong>Thesaurus
definition file</strong> field, enter the directory and name of the thesaurus definition file that contains
the relationship data for generating the thesaurus dictionary. A definition file is a simple text file with formatting
tags that indicate word relationships.</li>
<li>In the <strong>Thesaurus dictionary
name</strong> field, enter a name for the thesaurus dictionary. For example, <tt>mydict</tt>. </li>
<li>In the <strong>Thesaurus dictionary directory</strong> field, enter the directory that will hold
the generated thesaurus dictionary files. Possible values
include <tt>/QIBM/UserData/HTTPSVR/search</tt> (the default setting), or any
valid directory path. </li>
<li>Click <span class="uicontrol">Apply</span>.</li>
</ol>
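<p>As a sketch of the steps above, the three fields might be filled in as follows. The definition file path is a hypothetical example; the dictionary name and directory are the example and default values mentioned in the steps. </p>
<pre class="block">Thesaurus definition file:       /home/myuser/mythesaurus.txt
Thesaurus dictionary name:       mydict
Thesaurus dictionary directory:  /QIBM/UserData/HTTPSVR/search</pre>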
</div>
</div>

</body>
</html>