Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

case conversion problems in Turkish #47786

Closed
ozan mannequin opened this issue Aug 10, 2008 · 2 comments
Closed

case conversion problems in Turkish #47786

ozan mannequin opened this issue Aug 10, 2008 · 2 comments
Labels
topic-unicode type-bug An unexpected behavior, bug, or error

Comments

@ozan
Copy link
Mannequin

ozan mannequin commented Aug 10, 2008

BPO 3536
Nosy @birkenfeld

Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.

Show more details

GitHub fields:

assignee = None
closed_at = <Date 2010-04-29.17:42:43.304>
created_at = <Date 2008-08-10.12:06:31.960>
labels = ['type-bug', 'expert-unicode']
title = 'case conversion problems in Turkish'
updated_at = <Date 2010-04-29.17:42:43.303>
user = 'https://bugs.python.org/ozan'

bugs.python.org fields:

activity = <Date 2010-04-29.17:42:43.303>
actor = 'terry.reedy'
assignee = 'none'
closed = True
closed_date = <Date 2010-04-29.17:42:43.304>
closer = 'terry.reedy'
components = ['Unicode']
creation = <Date 2008-08-10.12:06:31.960>
creator = 'ozan'
dependencies = []
files = []
hgrepos = []
issue_num = 3536
keywords = []
message_count = 2.0
messages = ['70972', '70976']
nosy_count = 2.0
nosy_names = ['georg.brandl', 'ozan']
pr_nums = []
priority = 'normal'
resolution = 'wont fix'
stage = None
status = 'closed'
superseder = None
type = 'behavior'
url = 'https://bugs.python.org/issue3536'
versions = ['Python 3.0']

@ozan
Copy link
Mannequin Author

ozan mannequin commented Aug 10, 2008

>>> sys.getdefaultencoding()
'utf-8'

>>> s = ''
>>> s.upper()
'II' # should be 'İI'

>>> t = ''
>>> t.lower()
'ii' # should be 'ıi'

>>> unicodedata.name('ı')      # The small dotless one
'LATIN SMALL LETTER DOTLESS I'
>>> unicodedata.name('I')      # The capital dotless one
'LATIN CAPITAL LETTER I'


>>> unicodedata.name('i')      # The small 'i'
'LATIN SMALL LETTER I'
>>> unicodedata.name('İ')      # The corresponding capital one
'LATIN CAPITAL LETTER I WITH DOT ABOVE'

The other non-ascii turkish characters 'şŞğĞöÖçÇüÜ' are correctly
handled by case conversion methods.

@ozan ozan mannequin added topic-unicode type-bug An unexpected behavior, bug, or error labels Aug 10, 2008
@birkenfeld
Copy link
Member

Being Unicode strings, Py3k strings use the Unicode database's
lowercase<->uppercase mapping, which is not context sensitive.

See bpo-1528802 for more discussion.

@ezio-melotti ezio-melotti transferred this issue from another repository Apr 10, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
topic-unicode type-bug An unexpected behavior, bug, or error
Projects
None yet
Development

No branches or pull requests

2 participants