On Thu, Jun 24, 2010 at 5:17 AM, Rodrigo Madera <rodrigo.madera@gmail.com> wrote:
There are John Maddock's iterator adapters, those of the Boost.Unicode library under development that are similar to them, and there is also Boost.Locale, another library under development that is a frontend to ICU, that provides function to do conversions (but it works with memory buffers).
I see that Maddock's adapters does the job, but does it provide the reliability of ICU? For example, when it comes to round-trip conversions?
Each code point only has a single valid representation in any of the UTF encodings, so anything but perfect round-trip transcoding would be a bug. Overlong encodings are invalid, and normalization forms are a separate issue outside of UTF transcoding. -- Cory Nelson http://int64.org