
19 Jun
2007
19 Jun
'07
5:47 p.m.
Jeremy Maitin-Shepard wrote:
It occurs to me that perhaps it is not unreasonable after all to restrict the library to supporting Unicode encodings for in-memory character representation.
I personally believe Unicode (not only the character set, but also its collations and algorithms) is the only viable way to represent characters, and thus should be the way strings work with. (get out evil locales and other stuff!) Of course, various encodings can still be used for serialization. Unfortunately, C++ is quite far from having good Unicode tools (not that other programming languages are really better -- Unicode is simply quite complicated, because human languages just are) ICU has most of the stuff, but not with the right interfaces.