
In article <87r7ur4kmy.fsf@jbms.ath.cx>, Jeremy Maitin-Shepard <jbms@attbi.com> wrote:
Miro Jurisic <macdev@meeroh.org> writes:
In article <87isg3skr6.fsf@jbms.ath.cx>, Jeremy Maitin-Shepard <jbms@attbi.com> wrote:
[snip]
- For the purpose of string construction, the Unicode specification explicitly states that any sequence of code points is well formed, and so this provides the smallest unit by which guaranteed-well-formed strings can be formed.
Can you refer me to a specific point in the spec where this is stated?
In Unicode 4.0.1, Chapter 3.9, D30a
Right, ok, everything you said so far makes sense; I agree that operating on encoded characters (as sequences of code points) is useful in a number of contexts, and (as I already pointed out), operating on abstract characters is useful in other contexts. meeroh -- If this message helped you, consider buying an item from my wish list: <http://web.meeroh.org/wishlist>