
24 Nov 2005, 8:43 p.m.
Martin Wille wrote:
Hi,
http://boost.org/libs/serialization/doc/contents.html shows three strange characters at the top of the page ().
Apparently, those characters have the values EF BB BF in hexadecimal notation. Interpreted as a UTF-8 sequence, they encode the Unicode code point U+FEFF. This code point is used as a marker for the endianness in encodings other than UTF-8. I doubt such a marker would be legal in, or would make sense for, a UTF-8 encoded document.
I believe that the BOM (byte order mark) is legal in all Unicode encodings, including UTF-8.
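For anyone who wants to check the bytes themselves, here is a small sketch in Python (my own illustration, not anything taken from the Boost documentation or its build tools). It decodes EF BB BF as UTF-8, which yields U+FEFF (U+FFFE is the byte-swapped value), and shows one way a reader could skip a leading BOM. The `page` contents are a made-up stand-in for the start of an HTML file.

```python
import codecs

# The three bytes reported at the top of the page.
raw = bytes([0xEF, 0xBB, 0xBF])

# The plain "utf-8" codec does not strip a BOM, so we get the code point itself.
decoded = raw.decode("utf-8")
codepoint = ord(decoded)
print(f"U+{codepoint:04X}")

# The standard library exposes the same byte sequence as a constant.
print(raw == codecs.BOM_UTF8)

# Hypothetical page content: a BOM followed by the start of a document.
page = raw + b"<html>"

# The "utf-8-sig" codec drops a leading BOM if one is present.
stripped = page.decode("utf-8-sig")
print(stripped)
```

So the sequence is a well-formed UTF-8 encoding of the BOM. Whether a given browser or tool hides it or shows it as three stray characters depends on how that tool decodes the file.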