
On Thu, 17 Mar 2005 17:52:25 +0100, Erik Wien <wien@start.no> wrote:
What exactly do mean by the term "character"? Abstract characters?
I really need to remember the correct terminology - what I mean is the thing "a user thinks of as a character", a "grapheme cluster", of which the Unicode standard says: "[T]here is a core concept of "characters that should be kept together" that can be defined for the Unicode Standard in a language-independent way. This core concept is known as a grapheme cluster, and it consists of any combining character sequence that contains only nonspacing combining marks, or any sequence of characters that constitutes a Hangul syllable (possibly followed by one or more nonspacing marks)." I believe this is what a Unicode library should use as its basic unit. Sorry for any confusion caused, Rogier