Author pitrou
Recipients benjamin.peterson, ezio.melotti, firatozgul, lemburg, pitrou, r.david.murray, vstinner
Date 2013-02-20.12:28:21
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1361363302.21.0.0557867636893.issue17252@psf.upfronthosting.co.za>
In-reply-to
Content
Yes, I think 3.3 is correct here. I think it was Benjamin who fixed/improved the behaviour of casing methods. Compare 3.3:

>>> "ß".upper()
'SS'

with 3.2:

>>> "ß".upper()
'ß'

Also, 3.2 loses information:

>>> "KİTAP".lower().upper()
'KITAP'
>>> ascii("KİTAP".lower().upper())
"'KITAP'"

while 3.3 retains it:

>>> "KİTAP".lower().upper()
'KİTAP'
>>> ascii("KİTAP".lower().upper())
"'KI\\u0307TAP'"

You can get the combined form again with unicodedata.normalize:

>>> unicodedata.normalize("NFC", "KİTAP".lower().upper())
'KİTAP'
History
Date User Action Args
2013-02-20 12:28:22pitrousetrecipients: + pitrou, lemburg, vstinner, benjamin.peterson, ezio.melotti, r.david.murray, firatozgul
2013-02-20 12:28:22pitrousetmessageid: <1361363302.21.0.0557867636893.issue17252@psf.upfronthosting.co.za>
2013-02-20 12:28:22pitroulinkissue17252 messages
2013-02-20 12:28:21pitroucreate