
Hi. I am in the process of planning a library for handling unicode strings in C++ ... .... functionality which does not, ie exceptions and file names and locale message files, and assume that only narrow characters exist in its usage. I am still amazed that programmers from countries which would normally use wide characters as Unicode encodings, such as the Japanese, have not made more of an issue with this, but perhaps they are so used to their far more difficult DBCS roots that pursuing wide characters everywhere, much less a real Unicode encoding, is a minor issue with them.
For Japanese strings I use either Shift-JIS or UTF-8 encodings, both of which work fine with std::string for most tasks. Matching sub-strings could give a false match however. When talking about Unicode does that include UTF-8? Darren