Author firatozgul
Recipients ezio.melotti, firatozgul, r.david.murray
Date 2013-02-20.11:31:59
Message-id <1361359919.37.0.228016952353.issue17252@psf.upfronthosting.co.za>
In-reply-to
Content
In Python, operations such as case conversion and sorting have always been problematic for Turkish. For instance, whatever the locale is, you cannot correctly lowercase the Turkish word 'KADIN' (woman)::

    >>> "KADIN".lower()
    'kadin'

... which is wrong. It should be 'kadın' ('kad\u0131n'). Likewise, 'kitap' (book) cannot be uppercased correctly::

    >>> "kitap".upper()
    'KITAP'

... which is wrong. That should be 'KİTAP' ('K\u0130TAP').

As for the subject of this issue, Python 3.3 does something completely different::

    >>> "KİTAP".lower()
    'ki\u0307tap'  # wrong

In Python 3.2, this was::

    >>> "KİTAP".lower()
    'kitap'  # correct

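'i' and 'i\u0307' are not the same: the latter is 'i' followed by U+0307 (COMBINING DOT ABOVE), so it is two code points rather than one.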
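
The difference is easy to check. The following is a minimal sketch, assuming Python 3.3 or later (where `str.lower()` applies the full Unicode case mappings); the variable names are only illustrative:

```python
import unicodedata

# U+0130 is LATIN CAPITAL LETTER I WITH DOT ABOVE ('İ').
lowered = "\u0130".lower()

# Look up the Unicode name of every code point in the result.
code_point_names = [unicodedata.name(ch) for ch in lowered]

print(len(lowered))        # number of code points in the lowercased string
print(code_point_names)    # names of those code points
print(lowered == "i")      # whether the result is a plain 'i'
```

On 3.3+ the lowercased string should consist of LATIN SMALL LETTER I followed by COMBINING DOT ABOVE, so a plain comparison against 'i' fails.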

As a result, Turkish Python programmers end up defining their own upper(), lower(), title(), swapcase() and casefold() methods and using their own sorting techniques.
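
For illustration, such workarounds usually look something like the sketch below. This is only one possible approach, not code from the standard library: `tr_lower`, `tr_upper`, `tr_sort_key` and `TR_ALPHABET` are hypothetical names, and the helpers assume the input uses the precomposed characters 'İ' (U+0130) and 'ı' (U+0131) rather than combining sequences:

```python
# Lowercase letters of the Turkish alphabet, in dictionary order.
TR_ALPHABET = "abcçdefgğhıijklmnoöprsştuüvyz"
_RANK = {ch: index for index, ch in enumerate(TR_ALPHABET)}


def tr_lower(text):
    """Lowercase text, mapping 'İ' to 'i' and 'I' to 'ı' first."""
    # Handle the two Turkish-specific pairs, then defer to str.lower().
    return text.replace("\u0130", "i").replace("I", "\u0131").lower()


def tr_upper(text):
    """Uppercase text, mapping 'i' to 'İ' and 'ı' to 'I' first."""
    return text.replace("i", "\u0130").replace("\u0131", "I").upper()


def tr_sort_key(word):
    """Sort key that ranks letters by their position in TR_ALPHABET."""
    # Characters outside the alphabet are ranked after all Turkish letters.
    return [_RANK.get(ch, len(TR_ALPHABET) + ord(ch)) for ch in tr_lower(word)]


print(tr_lower("KADIN"))
print(tr_upper("kitap"))
print(sorted(["zar", "şal", "iz", "ılık", "çay", "cam"], key=tr_sort_key))
```

The replacements have to run before the built-in call, because `str.lower()` and `str.upper()` apply the locale-independent mappings to 'I' and 'i'.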