i18ngurus.com - Open internationalization resources directory
directorydirectory   add a linkadd a link   searchsearch   mapmap   recentrecent   syndicatesyndicate   encyclopedia <font color=Red><b>Beta</b></font>encyclopedia Beta   feedbackfeedback   faqfaq  
 
   
border
This enclycopedia is a work in progress. Please do not hesitate to email me if you disagree with this article or found an any error in this page.
border
border border border
 
back to index


alias

It seems that no 2 softwares use the same names to designate a given character set. As an illustration here are the preferred names for the 2 common character sets to be used in Java, in the content type header of an HTML page and to setup your Oracle 9i database:

Common nameJava preferred nameIANA preferred nameOracle 9i preferred name
UTF-8UTF8UTF-8AL32UTF8
Latin 1ISO8859_1ISO_8859-1:1987WE8ISO8859P1


To resolve this nightmare software often recognize more than one name for each character set referred to as alias. For example Java will recognize UTF-8 as meaning UTF8 and ISO-8859-1 as meaning ISO8859_1.

On a side note it is interesting to note that Oracle does recognize UTF8 as a valid character set different from AL32UTF8, yet only this latest follows the Unicode specifications.


Related directory categories:
Fonts
Testing > Language samples > Thai
Languages > Thai


Article submitted by: Thierry Sourbier
Last modified: 2008-05-03


Rate this article:
min. max.


(include your email address if you would like a response)