This table shows the relation between possible file.encoding values and the closest matching iSeries™ coded character set identifier (CCSID).
For more information regarding file.encoding support, see Supported encodings by Sun Microsystems, Inc.
file.encoding | CCSID | Description |
---|---|---|
ASCII | 367 | American Standard Code for Information Interchange |
Big5 | 950 | 8-bit ASCII T-Chinese BIG-5 |
Big5_HKSCS | 950 | Big5_HKSCS |
Big5_Solaris | 950 | Big5 with seven additional Hanzi ideograph character mappings for the Solaris zh_TW.BIG5 locale |
CNS11643 | 964 | Chinese National Character Set for traditional Chinese |
Cp037 | 037 | IBM® EBCDIC US, Canada, Netherlands |
Cp273 | 273 | IBM EBCDIC Germany, Austria |
Cp277 | 277 | IBM EBCDIC Denmark, Norway |
Cp278 | 278 | IBM EBCDIC Finland, Sweden |
Cp280 | 280 | IBM EBCDIC Italy |
Cp284 | 284 | IBM EBCDIC Spanish, Latin America |
Cp285 | 285 | IBM EBCDIC UK |
Cp297 | 297 | IBM EBCDIC France |
Cp420 | 420 | IBM EBCDIC Arabic |
Cp424 | 424 | IBM EBCDIC Hebrew |
Cp437 | 437 | 8-bit ASCII US PC |
Cp500 | 500 | IBM EBCDIC International |
Cp737 | 737 | 8-bit ASCII Greek MS-DOS |
Cp775 | 775 | 8-bit ASCII Baltic MS-DOS |
Cp838 | 838 | IBM EBCDIC Thailand |
Cp850 | 850 | 8-bit ASCII Latin-1 Multinational |
Cp852 | 852 | 8-bit ASCII Latin-2 |
Cp855 | 855 | 8-bit ASCII Cyrillic |
Cp856 | 0 | 8-bit ASCII Hebrew |
Cp857 | 857 | 8-bit ASCII Latin-5 |
Cp860 | 860 | 8-bit ASCII Portugal |
Cp861 | 861 | 8-bit ASCII Iceland |
Cp862 | 862 | 8-bit ASCII Hebrew |
Cp863 | 863 | 8-bit ASCII Canada |
Cp864 | 864 | 8-bit ASCII Arabic |
Cp865 | 865 | 8-bit ASCII Denmark, Norway |
Cp866 | 866 | 8-bit ASCII Cyrillic |
Cp868 | 868 | 8-bit ASCII Urdu |
Cp869 | 869 | 8-bit ASCII Greek |
Cp870 | 870 | IBM EBCDIC Latin-2 |
Cp871 | 871 | IBM EBCDIC Iceland |
Cp874 | 874 | 8-bit ASCII Thailand |
Cp875 | 875 | IBM EBCDIC Greek |
Cp918 | 918 | IBM EBCDIC Urdu |
Cp921 | 921 | 8-bit ASCII Baltic |
Cp922 | 922 | 8-bit ASCII Estonia |
Cp930 | 930 | IBM EBCDIC Japanese Extended Katakana |
Cp933 | 933 | IBM EBCDIC Korean |
Cp935 | 935 | IBM EBCDIC Simplified Chinese |
Cp937 | 937 | IBM EBCDIC Traditional Chinese |
Cp939 | 939 | IBM EBCDIC Japanese Extended Latin |
Cp942 | 942 | 8-bit ASCII Japanese |
Cp942C | 942 | Variant of Cp942 |
Cp943 | 943 | Japanese PC data mixed for open env |
Cp943C | 943 | Japanese PC data mixed for open env |
Cp948 | 948 | 8-bit ASCII IBM Traditional Chinese |
Cp949 | 944 | 8-bit ASCII Korean KSC5601 |
Cp949C | 949 | Variant of Cp949 |
Cp950 | 950 | 8-bit ASCII T-Chinese BIG-5 |
Cp964 | 964 | EUC Traditional Chinese |
Cp970 | 970 | EUC Korean |
Cp1006 | 1006 | ISO 8-bit Urdu |
Cp1025 | 1025 | IBM EBCDIC Cyrillic |
Cp1026 | 1026 | IBM EBCDIC Turkey |
Cp1046 | 1046 | 8-bit ASCII Arabic |
Cp1097 | 1097 | IBM EBCDIC Farsi |
Cp1098 | 1098 | 8-bit ASCII Farsi |
Cp1112 | 1112 | IBM EBCDIC Baltic |
Cp1122 | 1122 | IBM EBCDIC Estonia |
Cp1123 | 1123 | IBM EBCDIC Ukraine |
Cp1124 | 0 | ISO 8-bit Ukraine |
Cp1140 | 1140 | Variant of Cp037 with Euro character |
Cp1141 | 1141 | Variant of Cp273 with Euro character |
Cp1142 | 1142 | Variant of Cp277 with Euro character |
Cp1143 | 1143 | Variant of Cp278 with Euro character |
Cp1144 | 1144 | Variant of Cp280 with Euro character |
Cp1145 | 1145 | Variant of Cp284 with Euro character |
Cp1146 | 1146 | Variant of Cp285 with Euro character |
Cp1147 | 1147 | Variant of Cp297 with Euro character |
Cp1148 | 1148 | Variant of Cp500 with Euro character |
Cp1149 | 1149 | Variant of Cp871 with Euro character |
Cp1250 | 1250 | MS-Win Latin-2 |
Cp1251 | 1251 | MS-Win Cyrillic |
Cp1252 | 1252 | MS-Win Latin-1 |
Cp1253 | 1253 | MS-Win Greek |
Cp1254 | 1254 | MS-Win Turkish |
Cp1255 | 1255 | MS-Win Hebrew |
Cp1256 | 1256 | MS-Win Arabic |
Cp1257 | 1257 | MS-Win Baltic |
Cp1258 | 1251 | MS-Win Russian |
Cp1381 | 1381 | 8-bit ASCII S-Chinese GB |
Cp1383 | 1383 | EUC Simplified Chinese |
Cp33722 | 33722 | EUC Japanese |
EUC_CN | 1383 | EUC for Simplified Chinese |
EUC_JP | 5050 | EUC for Japanese |
EUC_JP_LINUX | 0 | JIS X 0201, 0208, EUC encoding Japanese |
EUC_KR | 970 | EUC for Korean |
EUC_TW | 964 | EUC for Traditional Chinese |
GB2312 | 1381 | 8-bit ASCII S-Chinese GB |
GB18030 | 1392 | Simplified Chinese, PRC standard |
GBK | 1386 | New simplified Chinese 8-bit ASCII 9 |
ISCII91 | 806 | ISCII91 encoding of Indic scripts |
ISO2022CN | 965 | ISO 2022 CN, Chinese (conversion to Unicode only) |
ISO2022_CN_CNS | 965 | CNS11643 in ISO 2022 CN form, Traditional Chinese (conversion from Unicode only) |
ISO2022_CN_GB | 1383 | GB2312 in ISO 2022 CN form, Simplified Chinese (conversion from Unicode only) |
ISO2022CN_CNS | 965 | 7-bit ASCII for Traditional Chinese |
ISO2022CN_GB | 1383 | 7-bit ASCII for Simplified Chinese |
ISO2022JP | 5054 | 7-bit ASCII for Japanese |
ISO2022KR | 25546 | 7-bit ASCII for Korean |
ISO8859_1 | 819 | ISO 8859-1 Latin Alphabet No. 1 |
ISO8859_2 | 912 | ISO 8859-2 ISO Latin-2 |
ISO8859_3 | 0 | ISO 8859-3 ISO Latin-3 |
ISO8859_4 | 914 | ISO 8859-4 ISO Latin-4 |
ISO8859_5 | 915 | ISO 8859-5 ISO Latin-5 |
ISO8859_6 | 1089 | ISO 8859-6 ISO Latin-6 (Arabic) |
ISO8859_7 | 813 | ISO 8859-7 ISO Latin-7 (Greek/Latin) |
ISO8859_8 | 916 | ISO 8859-8 ISO Latin-8 (Hebrew) |
ISO8859_9 | 920 | ISO 8859-9 ISO Latin-9 (ECMA-128, Turkey) |
ISO8859_13 | 0 | Latin Alphabet No. 7 |
ISO8859_15 | 923 | ISO8859_15 |
ISO8859_15_FDIS | 923 | ISO 8859-15, Latin alphabet No. 9 |
ISO-8859-15 | 923 | ISO 8859-15, Latin Alphabet No. 9 |
JIS0201 | 897 | Japanese industry standard X0201 |
JIS0208 | 5052 | Japanese industry standard X0208 |
JIS0212 | 0 | Japanese industry standard X0212 |
JISAutoDetect | 0 | Detects and converts from Shift-JIS, EUC-JP, ISO 2022 JP (conversion to Unicode only) |
Johab | 0 | Korean composition Hangul encoding (full) |
KOI8_R | 878 | Cyrillic |
KSC5601 | 949 | 8-bit ASCII Korean |
MacArabic | 1256 | Macintosh Arabic |
MacCentralEurope | 1282 | Macintosh Latin-2 |
MacCroatian | 1284 | Macintosh Croatian |
MacCyrillic | 1283 | Macintosh Cyrillic |
MacDingbat | 0 | Macintosh Dingbat |
MacGreek | 1280 | Macintosh Greek |
MacHebrew | 1255 | Macintosh Hebrew |
MacIceland | 1286 | Macintosh Iceland |
MacRoman | 0 | Macintosh Roman |
MacRomania | 1285 | Macintosh Romania |
MacSymbol | 0 | Macintosh Symbol |
MacThai | 0 | Macintosh Thai |
MacTurkish | 1281 | Macintosh Turkish |
MacUkraine | 1283 | Macintosh Ukraine |
MS874 | 874 | MS-Win Thailand |
MS932 | 943 | Windows® Japanese |
MS936 | 936 | Windows Simplified Chinese |
MS949 | 949 | Windows Korean |
MS950 | 950 | Windows Traditional Chinese |
MS950_HKSCS | NA | Windows Traditional Chinese with Hong Kong S.A.R. of China extensions |
SJIS | 932 | 8-bit ASCII Japanese |
TIS620 | 874 | Thai industry standard 620 |
US-ASCII | 367 | American Standard Code for Information Interchange |
UTF8 | 1208 | UTF-8 (IBM CCSID 1208, which is not yet available on the iSeries server) |
UTF-16 | 1200 | Sixteen-bit UCS Transformation Format, byte order identified by an optional byte-order mark |
UTF-16BE | 1200 | Sixteen-bit Unicode Transformation Format, big-endian byte order |
UTF-16LE | 1200 | Sixteen-bit Unicode Transformation Format, little-endian byte order |
UTF-8 | 1208 | Eight-bit UCS Transformation Format |
Unicode | 13488 | UNICODE, UCS-2 |
UnicodeBig | 13488 | Same as Unicode |
UnicodeBigUnmarked | | Unicode with no byte-order mark |
UnicodeLittle | | Unicode with little-endian byte order |
UnicodeLittleUnmarked | | UnicodeLittle with no byte-order mark |
For default values, see Default file.encoding values.
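
As a rough illustration of how the table might be consulted from Java code, the following sketch reads the `file.encoding` system property and looks up the corresponding CCSID. It copies only a handful of rows from the table; the class name `CcsidLookup` and the method `ccsidFor` are hypothetical names chosen for this example and are not part of any IBM or Sun API. The lookup is an exact, case-sensitive match on the encoding name as listed in the table.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.OptionalInt;

// Illustrative helper only: the class and method names are hypothetical.
final class CcsidLookup {
    // A small excerpt of the table above: file.encoding value -> CCSID.
    private static final Map<String, Integer> TABLE = new LinkedHashMap<>();
    static {
        TABLE.put("ASCII", 367);
        TABLE.put("Cp037", 37);
        TABLE.put("Cp500", 500);
        TABLE.put("Cp1252", 1252);
        TABLE.put("ISO8859_1", 819);
        TABLE.put("SJIS", 932);
        TABLE.put("UTF8", 1208);
        TABLE.put("UTF-8", 1208);
        TABLE.put("Unicode", 13488);
    }

    // Returns the CCSID for an encoding name, or an empty result when the
    // name is null or is not one of the rows copied into this excerpt.
    static OptionalInt ccsidFor(String encoding) {
        Integer ccsid = (encoding == null) ? null : TABLE.get(encoding);
        return (ccsid == null) ? OptionalInt.empty() : OptionalInt.of(ccsid);
    }

    public static void main(String[] args) {
        // file.encoding is the standard Java system property named in the table header.
        String encoding = System.getProperty("file.encoding");
        OptionalInt ccsid = ccsidFor(encoding);
        if (ccsid.isPresent()) {
            System.out.println(encoding + " -> CCSID " + ccsid.getAsInt());
        } else {
            System.out.println(encoding + " is not in this excerpt of the table");
        }
    }
}
```

A fuller version would load every row of the table. It would also need to decide how to treat rows whose CCSID column is 0, NA, or empty, which this sketch sidesteps by omitting them.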