
Sebastian Redl wrote:
A little side question: is there any Unicode character support in boost? No. Regex, when built with Unicode support, requires ICU for that. Boost doesn't have its own Unicode stuff. (There's something in the vault,
Péter Szilágyi wrote: though.)
STL includes the wstring but that is 4 byte unicode (and quite hard to mingle with simple strings). Actually, it is whatever the compiler decides it should be. On Linux systems with a default GCC, yes, that's UTF-32, but under Windows it's typically UCS-2 or UTF-16 (with or without surrogate support, that is).
More specifically, 'wchar_t' and derived types have nothing to do with Unicode. The two are orthogonal concepts. As per the spec, wchar_t has to be large enough to hold the extended character set specified by the supported locales. There is no mention in the spec that this character set has to be Unicode. Regards, Stefan -- ...ich hab' noch einen Koffer in Berlin...