| |
| |
|
 |
This enclycopedia is a work in progress. Please do not hesitate to email me if you disagree with this article or found an any error in this page. |
 |
 |
 |
 |
|
|
alias
It seems that no 2 softwares use the same names to designate a given character set. As an illustration here are the preferred names for the 2 common character sets to be used in Java, in the content type header of an HTML page and to setup your Oracle 9i database:
| Common name | Java preferred name | IANA preferred name | Oracle 9i preferred name | | UTF-8 | UTF8 | UTF-8 | AL32UTF8 | | Latin 1 | ISO8859_1 | ISO_8859-1:1987 | WE8ISO8859P1 |
To resolve this nightmare software often recognize more than one name for each character set referred to as alias. For example Java will recognize UTF-8 as meaning UTF8 and ISO-8859-1 as meaning ISO8859_1.
On a side note it is interesting to note that Oracle does recognize UTF8 as a valid character set different from AL32UTF8, yet only this latest follows the Unicode specifications. |
Related directory categories: Fonts Testing > Language samples > Thai Languages > Thai
Article submitted by: Thierry Sourbier
Last modified: 2008-05-03
Rate this article:
|