Message 81580 - Python tracker


Author	loewis
Recipients	PeterL, loewis
Date	2009-02-10.18:59:21
Message-id	<1234292363.63.0.869504877449.issue5200@psf.upfronthosting.co.za>

Content
It is not true that normalize produces "aaoAAO". Instead, it produces

u'a\u030aa\u0308o\u0308A\u030aA\u0308O\u0308'

This is the correct result, according to the Unicode specification. It
would be incorrect to leave the characters unchanged under Unicode
Normalization Form D (canonical decomposition); the canonical
decomposition of 'LATIN SMALL LETTER A WITH RING ABOVE' (for example)
is 'LATIN SMALL LETTER A' + 'COMBINING RING ABOVE'.
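
The decomposition can be checked with a short sketch along these lines. It is written for Python 3 (the report predates it), and the input string is an assumption reconstructed from the output quoted above, not copied from the original report. The last step, dropping combining marks to get plain letters, is only an illustration of what has to be done explicitly if "aaoAAO" is the result that is wanted; it is not part of normalization.

```python
import unicodedata

# Assumed input, reconstructed from the decomposed output quoted above:
# a-ring, a-diaeresis, o-diaeresis, then the same three in upper case.
text = "\u00e5\u00e4\u00f6\u00c5\u00c4\u00d6"

# NFD replaces each precomposed letter with base letter + combining mark.
decomposed = unicodedata.normalize("NFD", text)
print(ascii(decomposed))
# 'a\u030aa\u0308o\u0308A\u030aA\u0308O\u0308'

# Inspect the code points that a single precomposed letter decomposes into.
for ch in unicodedata.normalize("NFD", "\u00e5"):
    print(f"U+{ord(ch):04X} {unicodedata.name(ch)}")
# U+0061 LATIN SMALL LETTER A
# U+030A COMBINING RING ABOVE

# Normalization never discards the marks. Getting bare base letters means
# filtering out the combining characters as a separate, explicit step.
stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
print(stripped)
# aaoAAO
```

Recomposing with unicodedata.normalize("NFC", decomposed) gives back the assumed input, which shows that NFD changes the representation and not the text.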

The Wikipedia article is irrelevant; refer to the Unicode specification
for a normative reference.

Closing as invalid.

History
Date	User	Action	Args
2009-02-10 18:59:23	loewis	set	recipients: + loewis, PeterL
2009-02-10 18:59:23	loewis	set	messageid: <1234292363.63.0.869504877449.issue5200@psf.upfronthosting.co.za>
2009-02-10 18:59:22	loewis	link	issue5200 messages
2009-02-10 18:59:21	loewis	create