
14 Jan
2011
14 Jan
'11
5:13 p.m.
JensFinkhäuser wrote:
Please excuse me if I'm stating the obvious, but I feel I should mention that binary sorting is not collation.
Yes, you're right. Sorting (lexicographically) UTF-8 as sequences of 8-bit unsigned integers gives the same result as sorting their UCS-32 equivalents as sequences of 32 bit unsigned integers.