classification
Title: case conversion problems in Turkish
Type: behavior
Stage:
Components: Unicode
Versions: Python 3.0
process
Status: closed
Resolution: wont fix
Dependencies:
Superseder:
Assigned To:
Nosy List: georg.brandl, ozan
Priority: normal
Keywords:

Created on 2008-08-10 12:06 by ozan, last changed 2010-04-29 17:42 by terry.reedy. This issue is now closed.

Messages (2)
msg70972 - Author: Ozan Çağlayan (ozan) Date: 2008-08-10 12:06
>>> import sys, unicodedata
>>> sys.getdefaultencoding()
'utf-8'

>>> s = 'iı'
>>> s.upper()
'II' # should be 'İI'

>>> t = 'Iİ'
>>> t.lower()
'ii' # should be 'ıi'

>>> unicodedata.name('ı')      # The small dotless one
'LATIN SMALL LETTER DOTLESS I'
>>> unicodedata.name('I')      # The capital dotless one
'LATIN CAPITAL LETTER I'


>>> unicodedata.name('i')      # The small 'i'
'LATIN SMALL LETTER I'
>>> unicodedata.name('İ')      # The corresponding capital one
'LATIN CAPITAL LETTER I WITH DOT ABOVE'

The other non-ASCII Turkish characters 'şŞğĞöÖçÇüÜ' are handled
correctly by the case conversion methods.
msg70976 - Author: Georg Brandl (georg.brandl) * (Python committer) Date: 2008-08-10 12:24
Being Unicode strings, Py3k strings use the Unicode database's
lowercase<->uppercase mapping, which is not context sensitive.

See #1528802 for more discussion.
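
Since the built-in mapping is not context sensitive, Turkish-aware conversion has to be done in application code. The following is only a minimal sketch of one possible workaround, not something proposed in this issue: the helper names `turkish_upper` and `turkish_lower` are hypothetical and not part of the standard library. It assumes the input is known to be Turkish text and remaps the four dotted/dotless letters before falling back to the default methods.

```python
# Hypothetical helpers (not part of the standard library): remap the
# Turkish dotted/dotless i pairs first, then let the default,
# context-insensitive str.upper()/str.lower() handle everything else.

# i -> İ (dotted capital), ı -> I (dotless capital)
_TR_UPPER = str.maketrans({'i': 'İ', 'ı': 'I'})
# İ -> i (dotted small), I -> ı (dotless small)
_TR_LOWER = str.maketrans({'İ': 'i', 'I': 'ı'})


def turkish_upper(s):
    """Uppercase s using the Turkish i/İ and ı/I pairing."""
    return s.translate(_TR_UPPER).upper()


def turkish_lower(s):
    """Lowercase s using the Turkish I/ı and İ/i pairing."""
    return s.translate(_TR_LOWER).lower()


print(turkish_upper('iı'))  # İI
print(turkish_lower('Iİ'))  # ıi
```

Doing the remapping before calling `upper()` or `lower()` keeps the default mapping from ever seeing the four affected letters in the wrong case. Applying these helpers to non-Turkish text would give wrong results (for example, English 'i' would become 'İ'), so the caller has to know the language of the string.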
History
Date                 User          Action  Args
2010-04-29 17:42:43  terry.reedy   set     status: pending -> closed
2008-08-10 12:24:41  georg.brandl  set     status: open -> pending
                                           resolution: wont fix
                                           messages: + msg70976
                                           nosy: + georg.brandl
2008-08-10 12:06:31  ozan          create