
14 Jan
2011
14 Jan
'11
6:36 p.m.
2011/1/14 John B. Turpish <jbturp@gmail.com>:
I'm not an expert, so take this with a grain of salt. But couldn't it just as easily be said that UTF-8 is a waste of CPU? There are a number of operations that are constant time if you can assume a fixed size for a character that I would think would have to be linear for UTF-8, for example accessing the Nth character.
John, As I understand the choice is between UTF-8 and UTF-16, since UTF-32 is a waste of memory. Given that, there is never fixed size for a character or linear times - both UTF-8 and UTF-16 are variable-size encodings of UTF-32. Alexander Churanov