Re: [boost] Re: Any interest in adding unicode support to boost?

20 Oct 2004


On Wed, 20 Oct 2004 12:20:22 -0400, Miro Jurisic <macdev@meeroh.org> wrote:
...
> In article <e094f9eb04102006096b92c870@mail.gmail.com>,
>  Rogier van Dalen <rogiervd@gmail.com> wrote:
...
> > My plan was to decompose all characters in unicode::string. This makes
> > manipulation of diacritics easier. Correct me if I'm wrong, but your
> > example of finding "ü" in a string would come down to finding the
> > code point sequence "U+0075 U+0308" and checking that it is not
> > followed by another combining character, which is still pretty trivial.
>
> You have to not only decompose them but also put them in canonical
> order for that to work.

Yes, of course. I left it out thinking it was trivial (which it may
be; you'd need a small part of the Unicode Character Database though).
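
To make the idea concrete, here is a minimal sketch in Python rather than C++, purely for illustration. It assumes Python's standard unicodedata module as a stand-in for the "small part of the Unicode Database" (decomposition mappings and canonical combining classes); find_grapheme is a hypothetical helper name, and nothing here models unicode::string itself. NFD normalization both decomposes and applies canonical ordering, after which the search is a plain substring match plus a check on the following code point.

```python
import unicodedata


def find_grapheme(haystack: str, needle: str) -> int:
    """Return the index, within the NFD form of haystack, of the first
    occurrence of needle that is not followed by a further combining
    character, or -1 if there is none. (Hypothetical helper.)"""
    # NFD decomposes precomposed characters and puts combining marks
    # into canonical order, so equivalent inputs compare equal.
    h = unicodedata.normalize("NFD", haystack)
    n = unicodedata.normalize("NFD", needle)
    start = 0
    while True:
        i = h.find(n, start)
        if i == -1:
            return -1
        end = i + len(n)
        # A non-zero combining class means the next code point is a
        # combining mark that still belongs to the matched base character.
        if end == len(h) or unicodedata.combining(h[end]) == 0:
            return i
        start = i + 1


# U+00FC (precomposed "ü") is found as U+0075 U+0308 after decomposition.
print(find_grapheme("M\u00fcller", "\u00fc"))
# "u" + diaeresis + acute is a different character, so "ü" must not match.
print(find_grapheme("Mu\u0308\u0301ller", "\u00fc"))
```

The canonical ordering step matters when marks of different combining classes are typed in different orders: U+0308 (class 230) followed by U+0323 (class 220) is reordered so that U+0323 comes first, and only then do the two spellings compare equal.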

Regards,
Rogier