321 lines
15 KiB
HTML
321 lines
15 KiB
HTML
|
<?xml version="1.0" encoding="UTF-8"?>
|
||
|
<!DOCTYPE html
|
||
|
PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
|
||
|
<html lang="en-us" xml:lang="en-us">
|
||
|
<head>
|
||
|
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
|
||
|
<meta name="security" content="public" />
|
||
|
<meta name="Robots" content="index,follow" />
|
||
|
<meta http-equiv="PICS-Label" content='(PICS-1.1 "http://www.icra.org/ratingsv02.html" l gen true r (cz 1 lz 1 nz 1 oz 1 vz 1) "http://www.rsac.org/ratingsv01.html" l gen true r (n 0 s 0 v 0 l 0) "http://www.classify.org/safesurf/" l gen true r (SS~~000 1))' />
|
||
|
<meta name="DC.Type" content="concept" />
|
||
|
<meta name="DC.Title" content="Conversion of character data" />
|
||
|
<meta name="abstract" content="The Character Data Representation Architecture (CDRA) system of tags ensures that you can convert character data in a predictable, repeatable way." />
|
||
|
<meta name="description" content="The Character Data Representation Architecture (CDRA) system of tags ensures that you can convert character data in a predictable, repeatable way." />
|
||
|
<meta name="DC.Relation" scheme="URI" content="rbagscharrepre.htm" />
|
||
|
<meta name="DC.Relation" scheme="URI" content="rbagsccsidgenrecom.htm" />
|
||
|
<meta name="copyright" content="(C) Copyright IBM Corporation 1998, 2006" />
|
||
|
<meta name="DC.Rights.Owner" content="(C) Copyright IBM Corporation 1998, 2006" />
|
||
|
<meta name="DC.Format" content="XHTML" />
|
||
|
<meta name="DC.Identifier" content="rbagschardataconvers" />
|
||
|
<meta name="DC.Language" content="en-us" />
|
||
|
<!-- All rights reserved. Licensed Materials Property of IBM -->
|
||
|
<!-- US Government Users Restricted Rights -->
|
||
|
<!-- Use, duplication or disclosure restricted by -->
|
||
|
<!-- GSA ADP Schedule Contract with IBM Corp. -->
|
||
|
<link rel="stylesheet" type="text/css" href="./ibmdita.css" />
|
||
|
<link rel="stylesheet" type="text/css" href="./ic.css" />
|
||
|
<title>Conversion of character data</title>
|
||
|
</head>
|
||
|
<body id="rbagschardataconvers"><a name="rbagschardataconvers"><!-- --></a>
|
||
|
<!-- Java sync-link --><script language="Javascript" src="../rzahg/synch.js" type="text/javascript"></script>
|
||
|
<h1 class="topictitle1">Conversion of character data</h1>
|
||
|
<div><p>The Character Data Representation Architecture (CDRA) system of
|
||
|
tags ensures that you can convert character data in a predictable, repeatable
|
||
|
way.</p>
|
||
|
<p>Conversion pertains to converting the code points assigned to one or more
|
||
|
characters in one code page to their corresponding code points in another
|
||
|
code page. The conversion might cause a single character to map to a sequence
|
||
|
of characters, or a sequence of characters to map to a single character. Conversion
|
||
|
should not be equated to translating from one language to another.</p>
|
||
|
<div class="section"><h4 class="sectiontitle">Conversion methods</h4><p>The following methods are used
|
||
|
for conversion:</p>
|
||
|
<ul><li><em>Round-trip conversion.</em> The integrity of all character data is maintained
|
||
|
from the source coded character set identifier (CCSID) to the target CCSID
|
||
|
and back to the source. <p>When performing a round-trip conversion, you might
|
||
|
see incorrect representation of the characters displayed in the target CCSID.
|
||
|
The integrity is preserved, however. When the characters are converted back
|
||
|
to the source CCSID, they regain their original hexadecimal values and representation.</p>
|
||
|
</li>
|
||
|
<li><em>Enforced subset match conversion (substitution).</em> Characters that
|
||
|
exist in both the source and target CCSID have their integrity maintained.
|
||
|
Characters in the source CCSID but not in the target CCSID are replaced. Replaced
|
||
|
values are also referred to as substitution characters. For EBCDIC encoding,
|
||
|
these appear on most display stations as a solid block. For ASCII encoding,
|
||
|
these substitution characters appear differently. <p>This substitution is
|
||
|
permanent when converting back to the source CCSID because it is not possible
|
||
|
to retrieve the original hexadecimal values.</p>
|
||
|
<p>For a list of CCSID conversions
|
||
|
that result in substitution characters, see the Default conversion that might
|
||
|
use substitution table.</p>
|
||
|
</li>
|
||
|
<li><em>Linguistic conversion.</em> Also known as best-fit conversion, a partial
|
||
|
mapping is done from the source code page to the target code page. The integrity
|
||
|
of characters that are in both the target CCSID and the source CCSID are preserved.
|
||
|
Characters that are not in the target CCSID are mapped to the most culturally
|
||
|
acceptable alternative for that character. <p>For example, the source CCSID
|
||
|
might support an A grave ( <img src="la140000.gif" alt="A grave character" />). The target CCSID may not support this character. During the conversion,
|
||
|
the most linguistically acceptable character (a Latin capital A) is substituted
|
||
|
for the A grave. After the conversion, characters that are not included in
|
||
|
the target CCSID are presented to the user as the most linguistically acceptable
|
||
|
substitution characters. This substitution is permanent. Any loss of character
|
||
|
integrity is permanent.</p>
|
||
|
<p>Through an application programming interface
|
||
|
(API), linguistic conversion is available from any supported single-byte CCSID
|
||
|
to any other supported single-byte CCSID.</p>
|
||
|
</li>
|
||
|
</ul>
|
||
|
</div>
|
||
|
<div class="section"><h4 class="sectiontitle">Default conversion that can use substitution</h4><p>The
|
||
|
default CCSID conversions in this topic use substitution because the character
|
||
|
sets within the CCSIDs are different.</p>
|
||
|
|
||
|
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" frame="border" border="1" rules="all"><thead align="left"><tr><th valign="top" id="d0e58">From CCSID</th>
|
||
|
<th valign="top" id="d0e60">To CCSID</th>
|
||
|
</tr>
|
||
|
</thead>
|
||
|
<tbody><tr><td valign="top" headers="d0e58 ">00037</td>
|
||
|
<td valign="top" headers="d0e60 ">00290, 00833, 00836, 00838, 00930, 00933, 00835, 00939, 00948, 01027,
|
||
|
01043, 05026, 05035</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00256</td>
|
||
|
<td valign="top" headers="d0e60 ">00290, 00833, 00836, 00838, 00930, 00933, 00935, 00939, 01027, 05026,
|
||
|
05035</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00273</td>
|
||
|
<td valign="top" headers="d0e60 ">00290, 00833, 00836, 00838, 00930, 00933, 00935, 00939, 01027, 05026,
|
||
|
05035</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00277</td>
|
||
|
<td valign="top" headers="d0e60 ">00290, 00833, 00836, 00838, 00930, 00933, 00935, 00939, 05026, 05035</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00278</td>
|
||
|
<td valign="top" headers="d0e60 ">00290, 00833, 00836, 00838, 00930, 00933, 00935, 00939, 01027, 05026,
|
||
|
05035</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00280</td>
|
||
|
<td valign="top" headers="d0e60 ">00290, 00833, 00836, 00838, 00930, 00933, 00935, 00939, 01027, 05026,
|
||
|
05035</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00284</td>
|
||
|
<td valign="top" headers="d0e60 ">00290, 00833, 00836, 00838, 00930, 00933, 00935, 00939, 01027, 05026,
|
||
|
05035</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00285</td>
|
||
|
<td valign="top" headers="d0e60 ">00290, 00833, 00836, 00838, 00930, 00933, 00935, 00939, 01027, 05026,
|
||
|
05035</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00290</td>
|
||
|
<td valign="top" headers="d0e60 ">00037, 00256, 00273, 00277, 00278, 00280, 00284, 00285, 00297, 00500,
|
||
|
00833, 00836, 00850 00871, 00897, 00933, 00935, 00937, 00942, 01041</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00297</td>
|
||
|
<td valign="top" headers="d0e60 ">00290, 00833, 00836, 00838, 00930, 00933, 00935, 00939, 01027, 05026,
|
||
|
05035</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00300</td>
|
||
|
<td valign="top" headers="d0e60 ">00301</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00301</td>
|
||
|
<td valign="top" headers="d0e60 ">00300, 04396</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00420</td>
|
||
|
<td valign="top" headers="d0e60 ">04960</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00437</td>
|
||
|
<td valign="top" headers="d0e60 ">00870, 00871, 00880, 01025</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00500</td>
|
||
|
<td valign="top" headers="d0e60 ">00290, 00367, 00833, 00836, 00838, 00930, 00933, 00935, 00939, 01010,
|
||
|
01011, 01012, 01013, 01014, 01015, 01016, 01017, 01018, 01019, 01027, 05026,
|
||
|
05035</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00833</td>
|
||
|
<td valign="top" headers="d0e60 ">00037, 00256, 00273, 00277, 00278, 00280, 00284, 00285, 00290, 00297,
|
||
|
00500, 00836, 00850, 00871, 00930, 00935, 00937, 00939, 00944, 01027, 01040,
|
||
|
05026, 05035</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00836</td>
|
||
|
<td valign="top" headers="d0e60 ">00037, 00256, 00273, 00277, 00278, 00280, 00284, 00285, 00290, 00297,
|
||
|
00500, 00833, 00871, 00903, 00930, 00933, 00937, 00939, 00946, 01027, 01042,
|
||
|
01115, 05026, 05035</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00837</td>
|
||
|
<td valign="top" headers="d0e60 ">00928</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00838</td>
|
||
|
<td valign="top" headers="d0e60 ">00037, 00256, 00273, 00277, 00278, 00280, 00284, 00285, 00297, 00500,
|
||
|
00871, 00937</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00850</td>
|
||
|
<td valign="top" headers="d0e60 ">00290, 00833, 00870, 00880, 00930, 00933, 00939, 01025, 01027, 01041,
|
||
|
05026, 05035</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00870</td>
|
||
|
<td valign="top" headers="d0e60 ">00437, 00880, 00915, 01025, 04951</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00871</td>
|
||
|
<td valign="top" headers="d0e60 ">00290, 00437, 00833, 00836, 00838, 00930, 00933, 00935, 00939, 01027,
|
||
|
05026, 05035</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00880</td>
|
||
|
<td valign="top" headers="d0e60 ">00437, 00850, 00870, 00912, 00915, 04948, 04951</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00897</td>
|
||
|
<td valign="top" headers="d0e60 ">01027</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00903</td>
|
||
|
<td valign="top" headers="d0e60 ">00836</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00912</td>
|
||
|
<td valign="top" headers="d0e60 ">00880, 01025</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00915</td>
|
||
|
<td valign="top" headers="d0e60 ">00870, 00880</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00930</td>
|
||
|
<td valign="top" headers="d0e60 ">00037, 00256, 00273, 00277, 00278, 00280, 00284, 00285, 00297, 00500,
|
||
|
00833, 00836, 00850, 00871, 00933, 00935, 00937, 00942, 01041</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00933</td>
|
||
|
<td valign="top" headers="d0e60 ">00037, 00256, 00273, 00277, 00278, 00280, 00284, 00285, 00290, 00297,
|
||
|
00500, 00836, 00850, 00871, 00930, 00934, 00935, 00937, 00939, 00944, 01027,
|
||
|
01040, 05026, 05035</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00934</td>
|
||
|
<td valign="top" headers="d0e60 ">00933</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00935</td>
|
||
|
<td valign="top" headers="d0e60 ">00037, 00256, 00277, 00278, 00280, 00284, 00285, 00290, 00297, 00500,
|
||
|
00833, 00871, 00930, 00933, 00936, 00937, 00939, 00946, 01027, 01042, 05026,
|
||
|
05035</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00936</td>
|
||
|
<td valign="top" headers="d0e60 ">00935</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00937</td>
|
||
|
<td valign="top" headers="d0e60 ">00290, 00833, 00836, 00930, 00933, 00935, 00938, 00939, 00948, 01027,
|
||
|
01043, 05026, 05035</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00938</td>
|
||
|
<td valign="top" headers="d0e60 ">00937</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00939</td>
|
||
|
<td valign="top" headers="d0e60 ">00037, 00256, 00273, 00277, 00278, 00280, 00284, 00285, 00297, 00500,
|
||
|
00833, 00836, 00850, 00871, 00933, 00935, 00937, 00942, 01041</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00942</td>
|
||
|
<td valign="top" headers="d0e60 ">00290, 00930, 00939, 01027, 05026, 05035</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00944</td>
|
||
|
<td valign="top" headers="d0e60 ">00833, 00933</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00946</td>
|
||
|
<td valign="top" headers="d0e60 ">00836, 00935</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">00948</td>
|
||
|
<td valign="top" headers="d0e60 ">00037, 00937</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">01010</td>
|
||
|
<td valign="top" headers="d0e60 ">00500</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">01011</td>
|
||
|
<td valign="top" headers="d0e60 ">00500</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">01012</td>
|
||
|
<td valign="top" headers="d0e60 ">00500</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">01013</td>
|
||
|
<td valign="top" headers="d0e60 ">00500</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">01014</td>
|
||
|
<td valign="top" headers="d0e60 ">00500</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">01015</td>
|
||
|
<td valign="top" headers="d0e60 ">00500</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">01016</td>
|
||
|
<td valign="top" headers="d0e60 ">00500</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">01017</td>
|
||
|
<td valign="top" headers="d0e60 ">00500</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">01018</td>
|
||
|
<td valign="top" headers="d0e60 ">00500</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">01019</td>
|
||
|
<td valign="top" headers="d0e60 ">00500</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">01025</td>
|
||
|
<td valign="top" headers="d0e60 ">00437, 00850, 00870, 00912, 04948</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">01027</td>
|
||
|
<td valign="top" headers="d0e60 ">00037, 00256, 00273, 00277, 00278, 00280, 00284, 00285, 00297, 00500,
|
||
|
00833, 00836, 00850, 00871, 00933, 00935, 00937, 00942, 01041</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">01040</td>
|
||
|
<td valign="top" headers="d0e60 ">00833, 00933</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">01041</td>
|
||
|
<td valign="top" headers="d0e60 ">00290, 00850, 00930, 00939, 01027, 05026, 05035</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">01042</td>
|
||
|
<td valign="top" headers="d0e60 ">00836, 00935</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">01043</td>
|
||
|
<td valign="top" headers="d0e60 ">00037, 00937</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">01114</td>
|
||
|
<td valign="top" headers="d0e60 ">28709</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">01115</td>
|
||
|
<td valign="top" headers="d0e60 ">00836</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">04396</td>
|
||
|
<td valign="top" headers="d0e60 ">00301</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">04948</td>
|
||
|
<td valign="top" headers="d0e60 ">00880, 01025</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">04951</td>
|
||
|
<td valign="top" headers="d0e60 ">00870, 00880</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">04960</td>
|
||
|
<td valign="top" headers="d0e60 ">00420</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">05026</td>
|
||
|
<td valign="top" headers="d0e60 ">00037, 00256, 00273, 00277, 00278, 00280, 00284, 00285, 00297, 00500,
|
||
|
00833, 00836, 00850, 00871, 00933, 00935, 00937, 00942, 01041</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">05035</td>
|
||
|
<td valign="top" headers="d0e60 ">00037, 00256, 00273, 00277, 00278, 00280, 00284, 00285, 00297, 00500,
|
||
|
00833, 00836, 00850, 00871, 00933, 00935, 00937, 00942, 01041</td>
|
||
|
</tr>
|
||
|
<tr><td valign="top" headers="d0e58 ">28709</td>
|
||
|
<td valign="top" headers="d0e60 ">00290, 00833, 00836, 00838, 00930, 00933, 00935, 00939, 00948, 01027,
|
||
|
01043, 05026, 05035</td>
|
||
|
</tr>
|
||
|
</tbody>
|
||
|
</table>
|
||
|
</div>
|
||
|
</div>
|
||
|
</div>
|
||
|
<div>
|
||
|
<div class="familylinks">
|
||
|
<div class="parentlink"><strong>Parent topic:</strong> <a href="rbagscharrepre.htm" title="Character representation in the server is controlled by the following elements of the Character Data Representation Architecture (CDRA).">Character representation</a></div>
|
||
|
</div>
|
||
|
<div class="relconcepts"><strong>Related concepts</strong><br />
|
||
|
<div><a href="rbagsccsidgenrecom.htm" title="These recommendations are useful when writing globalized applications.">Recommendations and guidelines for using CCSIDs</a></div>
|
||
|
</div>
|
||
|
</div>
|
||
|
</body>
|
||
|
</html>
|