Unicode considerations for database files

Unicode is a universal encoding scheme for written characters and text that enables the exchange of data internationally. Follow this topic to learn about how to specify DDS position 30 through 37 and position 45 through 80 for describing database files. Positions not mentioned have no special considerations for Unicode.

A Unicode field can contain all types of characters used on an IBM® iSeries™ server, including double-byte character set (DBCS) characters. Unicode data is composed of code units, which represent the minimal byte combination that can represent a unit of text.

There are three transformation formats (encoding forms) of Unicode that are supported with physical and logical file DDS:

Note: In this topic, references to UTF-16 imply UCS-2 as well.