
14 Jan 2011, 11:33 p.m.
On Fri, Jan 14, 2011 at 5:52 PM, Robert Kawulak <robert.kawulak@gmail.com> wrote:
IIUC you can't assume a fixed size for a character even with UTF-32. In UTF-32 only _codepoints_ have a fixed size, yet one character may be composed of several codepoints, e.g. a Latin letter followed by a diacritical mark, which together make up one character (http://en.wikipedia.org/wiki/Combining_character).
Best regards, Robert
I stand corrected. This sort of thing is the reason I start with disclaimers like, "I'm not an expert, so take this with a grain of salt." Anyhow, thanks for the info.
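
For anyone following along, here is a small Python sketch of Robert's point. The helper names (utf32_size, count_non_combining) are made up for illustration, and counting non-combining codepoints is only a rough stand-in for counting user-perceived characters; it is not full grapheme cluster segmentation.

```python
import unicodedata

# Two ways to write the same visible character "é".
composed = "\u00e9"      # U+00E9 LATIN SMALL LETTER E WITH ACUTE (one codepoint)
decomposed = "e\u0301"   # U+0065 followed by U+0301 COMBINING ACUTE ACCENT


def utf32_size(text):
    """Size in bytes of text encoded as UTF-32 (little-endian, no BOM)."""
    return len(text.encode("utf-32-le"))


def count_non_combining(text):
    """Hypothetical helper: count codepoints whose combining class is 0.

    This is only an approximation of "characters as a reader sees them".
    """
    return sum(1 for ch in text if unicodedata.combining(ch) == 0)


print(len(composed), utf32_size(composed))      # 1 4
print(len(decomposed), utf32_size(decomposed))  # 2 8
print(count_non_combining(decomposed))          # 1
print(unicodedata.normalize("NFC", decomposed) == composed)  # True
```

So every codepoint takes 4 bytes in UTF-32, but the same one-character string can take 4 or 8 bytes depending on whether it is stored composed or decomposed.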