Author zamsalak
Recipients ezio.melotti, vstinner, zamsalak
Date 2018-09-18.14:02:16
Message-id <1537279336.26.0.956365154283.issue34723@psf.upfronthosting.co.za>
In-reply-to
Content
Hey there,

I believe I've come across a bug. It occurs when you call lower() on the Turkish uppercase letter "İ". I'll explain it with example code since that's easier:

>>> len("Ş")
1
>>> len("Ş".lower())
1
>>> len("Ğ")
1
>>> len("Ğ".lower())
1
>>> len("Ö")
1
>>> len("Ö".lower())
1
>>> len("Ç")
1
>>> len("Ç".lower())
1
>>> len("İ")
1
>>> len("İ".lower())
2

When you call lower() on the Turkish uppercase letter “İ”, it returns a two-character string: the first character is “i” and the second is chr(775), i.e. U+0307 COMBINING DOT ABOVE.

Should it not simply return “i”?
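
For reference, here is a minimal sketch that prints the code points in the lowered string and tries one possible workaround. The helper name lower_tr is hypothetical (it is not part of Python), and it assumes the text is Turkish and uses the precomposed "İ" (U+0130) rather than "I" followed by a combining dot.

```python
import unicodedata

# "İ" is U+0130, LATIN CAPITAL LETTER I WITH DOT ABOVE.
lowered = "\u0130".lower()

# Print each code point of the result with its Unicode name.
for ch in lowered:
    print(f"U+{ord(ch):04X} {unicodedata.name(ch)}")


def lower_tr(text):
    """Hypothetical helper: lowercase using Turkish dotted/dotless i rules.

    Assumption: the input is Turkish text. "İ" is mapped to "i" and
    "I" to "ı" (U+0131) before the default lower() handles the rest.
    """
    return text.replace("\u0130", "i").replace("I", "\u0131").lower()


print(len(lower_tr("\u0130")))
```

With this sketch, lower_tr("İ") is a single "i", but it would give the wrong result for non-Turkish text, since a plain "I" becomes "ı".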
History
Date                 User      Action  Args
2018-09-18 14:02:16  zamsalak  set     recipients: + zamsalak, vstinner, ezio.melotti
2018-09-18 14:02:16  zamsalak  set     messageid: <1537279336.26.0.956365154283.issue34723@psf.upfronthosting.co.za>
2018-09-18 14:02:16  zamsalak  link    issue34723 messages
2018-09-18 14:02:16  zamsalak  create