Thanks for the fast response.
I understand that python follows the unicode specification. I think the unicode standard is not correct in this case for the Swedish letters. I have asked unicode.org for an explanation.
Should not the Danish letter "Ø" be normalized as "O"? I get "Ø" for all NFC/NFD/NFKC/NFKD normalizations?
Regards,
Peter Landgren
> Martin v. Löwis <martin@v.loewis.de> added the comment:
>
> It is not true that normalize produces "aaoAAO". Instead, it produces
>
> u'a\u030aa\u0308o\u0308A\u030aA\u0308O\u0308'
>
> This is the correct result, according to the Unicode specification. It
> would be incorrect to normalize them unchanged under the Unicode Normal
> Form D (for decomposed); the decomposed character for 'LATIN SMALL
> LETTER A WITH RING ABOVE' (for example) is 'LATIN SMALL LETTER A' +
> 'COMBINING RING ABOVE'.
>
> The wikipedia article is irrelevant; refer to the Unicode specification
> for a normative reference.
>
> Closing as invalid.
>
> ----------
> nosy: +loewis
> resolution: -> invalid
> status: open -> closed
>
> _______________________________________
> Python tracker <report@bugs.python.org>
> <http://bugs.python.org/issue5200>
> _______________________________________
--
Peter Landgren
Talken Hagen
671 94 BRUNSKOG
0570-530 21
070-635 4719
peter.talken@telia.com
Skype: pgl4820.2