This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author loewis
Recipients PeterL, loewis
Date 2009-02-10.18:59:21
SpamBayes Score 0.0006705987
Marked as misclassified No
Message-id <1234292363.63.0.869504877449.issue5200@psf.upfronthosting.co.za>
In-reply-to
Content
It is not true that normalize produces "aaoAAO". Instead, it produces

u'a\u030aa\u0308o\u0308A\u030aA\u0308O\u0308'

This is the correct result, according to the Unicode specification. It
would be incorrect to normalize them unchanged under the Unicode Normal
Form D (for decomposed); the decomposed character for 'LATIN SMALL
LETTER A WITH RING ABOVE' (for example) is 'LATIN SMALL LETTER A' +
'COMBINING RING ABOVE'.

The wikipedia article is irrelevant; refer to the Unicode specification
for a normative reference.

Closing as invalid.
History
Date User Action Args
2009-02-10 18:59:23loewissetrecipients: + loewis, PeterL
2009-02-10 18:59:23loewissetmessageid: <1234292363.63.0.869504877449.issue5200@psf.upfronthosting.co.za>
2009-02-10 18:59:22loewislinkissue5200 messages
2009-02-10 18:59:21loewiscreate