
Pavol Droba wrote:
Florin Trofin wrote:
Hello,
I was looking at the string algo library hoping that I can use it with my own string class and I have some questions:
1. Is the library designed to work with variable length encoding characters (like UTF8, UTF16). The answer seems to be no, but wanted to find for sure. Are there any plans to make these algorithms compatible with this type of sequences?
Generaly no. The preamble of the library specifies that "a string" is an arbitrary sequence of characters. If you store variable length character string into a char* array, it will definitely not work. Since your container will store byte-codes, not characters. If you design a utf8 encoded string class with iterators that will iterate over real characters then there is a good chance that the string library will be functional.
Though I don't know the developing status, <boost/regex/pending/unicode_iterator.hpp> seems to nicely work with Boost.StringAlgorithm. I tend to prefer the iterators to locale. Regards, -- Shunsuke Sogame