RFC: interest in Unicode codecs?

Dear Cory, Your UTF standard conversion class looks very interesting. It does however, not give access to the unicode data for grapheme or glyph work. I posted a full unicode character support library at http://www.boostpro.com/vault/ unicode_lib.zip but Rogier and I ran out of time before completing the string class to go with it. If you feel up to it and want to develop a full Unicode support library in addition to your codec, feel free ! Yours, Graham Barnett
Is there interest in having a Unicode codec library submitted to Boost?
Here is what I have (only tested with GCC 4.3 on Debian, and VC++ 2008): http://svn.int64.org/viewvc/int64/snips/unicode.hpp
Right now it is pretty simple to use:
transcode<utf8, utf16le>(forwarditerator, forwarditeratorend, outputiterator, traits [, maximum]); transcode<wchar_encoding, utf32be>(inputrange, outputrange, traits [,maximum]);
There is also a codecvt facet which supports any-to-any.
Supports UTF-8, UTF-16, and UTF-32, in little or big endian. Has a special wchar_encoding that maps to UTF-16 or UTF-32 depending on your platform. A traits class controls error handling.
-- Cory Nelson
participants (1)
-
Graham