Thanks for the fast response.

I understand that python follows the unicode specification. I think the unicode standard is not correct in this case for the Swedish letters. I have asked unicode.org for an explanation.

Should not the Danish letter "Ø" be normalized as "O"? I get "Ø" for all NFC/NFD/NFKC/NFKD normalizations?

Regards,

Peter Landgren

> Martin v. Löwis <martin@v.loewis.de> added the comment:

>

> It is not true that normalize produces "aaoAAO". Instead, it produces

>

> u'a\u030aa\u0308o\u0308A\u030aA\u0308O\u0308'

>

> This is the correct result, according to the Unicode specification. It

> would be incorrect to normalize them unchanged under the Unicode Normal

> Form D (for decomposed); the decomposed character for 'LATIN SMALL

> LETTER A WITH RING ABOVE' (for example) is 'LATIN SMALL LETTER A' +

> 'COMBINING RING ABOVE'.

>

> The wikipedia article is irrelevant; refer to the Unicode specification

> for a normative reference.

>

> Closing as invalid.

>

> ----------

> nosy: +loewis

> resolution: -> invalid

> status: open -> closed

>

> _______________________________________

> Python tracker <report@bugs.python.org>

> <http://bugs.python.org/issue5200>

> _______________________________________

--

Peter Landgren

Talken Hagen

671 94 BRUNSKOG

0570-530 21

070-635 4719

peter.talken@telia.com

Skype: pgl4820.2