Author kingofsevens
Recipients kingofsevens
Date 2019-01-02.12:03:43
Message-id <1546430623.48.0.462352175173.issue35639@roundup.psfhosted.org>
In-reply-to
Content
assert 'ŞİŞLİ'.lower() == 'şişli'

Lowercasing the capital İ (I with a dot above, \u0130) produces 'i' followed by the combining character \u0307. If another character follows, the dot (\u0307) appears over that character instead. The behavior is different in Python 2.7.10, where the dot ends up on top of the 'i'.

According to the Unicode specification, character \u0130 should be converted to character \u0069.
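
To make the report easier to reproduce, here is a minimal sketch of what I see on Python 3, written with escapes so the code points are unambiguous. The turkish_lower helper is a hypothetical workaround of my own, not part of Python; it assumes the text is Turkish and maps the dotted and dotless I by hand before calling lower().

```python
# 'ŞİŞLİ' written with escapes: U+015E, U+0130, U+015E, 'L', U+0130
WORD = '\u015e\u0130\u015eL\u0130'

# On Python 3, str.lower() turns U+0130 into 'i' + U+0307 (two code points).
lowered = WORD.lower()
assert lowered == '\u015fi\u0307\u015fli\u0307'
assert len(WORD) == 5 and len(lowered) == 7


def turkish_lower(text):
    """Hypothetical helper: lowercase using Turkish rules for I and İ.

    Assumes the input is Turkish text; it is wrong for other languages,
    where 'I' should lowercase to a plain 'i'.
    """
    # Map İ (U+0130) to 'i' and I to ı (U+0131) before the generic lower().
    return text.replace('\u0130', 'i').replace('I', '\u0131').lower()


# The result I expected from the original assertion: 'şişli'
assert turkish_lower(WORD) == '\u015fi\u015fli'
```

With this helper the original assertion passes, but it is only a workaround for Turkish input and says nothing about what lower() itself should return.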
History
Date User Action Args
2019-01-02 12:03:47 kingofsevens set recipients: + kingofsevens
2019-01-02 12:03:43 kingofsevens set messageid: <1546430623.48.0.462352175173.issue35639@roundup.psfhosted.org>
2019-01-02 12:03:43 kingofsevens link issue35639 messages
2019-01-02 12:03:43 kingofsevens create