i18ngurus.com - Open internationalization resources directory
directorydirectory   add a linkadd a link   searchsearch   mapmap   recentrecent   syndicatesyndicate   encyclopedia <font color=Red><b>Beta</b></font>encyclopedia Beta   feedbackfeedback   faqfaq  
 
   
border
This enclycopedia is a work in progress. Please do not hesitate to email me if you disagree with this article or found an any error in this page.
border
border border border
 
back to index


byte order mark

Also known as BOM.

Name given to the Unicode character U+FEFF when used at the beginning of a Unicode byte stream. This invisible character generally know as ZERO WIDTH NO-BREAK SPACE (ZWNBSP) serves to identify unambiguously the Unicode transformation form used (and especially the byte order) for the stream. Indeed U+FFFE is a noncharacter so there is no risk of misinterpretation.

The following represents the byte signature of the character U+FEFF with the various Unicode Transformation Forms:

BytesEncoding
00 00 FE FFUTF-32, big-endian
FF FE 00 00UTF-32, little-endian
FE FFUTF-16, big-endian
FF FEUTF-16, little-endian
EF BB BFUTF-8


Historically the ZWNBSP was also used to indicate non-breaking but this use is now deprecated and replaced by the character U+2060 for that purpose.


Article submitted by: Thierry Sourbier
Last modified: 2008-05-03


Rate this article:
min. max.


(include your email address if you would like a response)