This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author lemburg
Recipients alexs, lemburg, loewis
Date 2008-12-10.09:44:09
SpamBayes Score 3.39104e-09
Marked as misclassified No
Message-id <1228902251.27.0.855534999134.issue4610@psf.upfronthosting.co.za>
In-reply-to
Content
Python uses the Unicode database for the mapping and this only contains
1-1 mappings. The special cases (mostly 1-2 mappings) are not included.

It would be nice to have them available as well, but I guess we'd have
to write them in code rather than invent a new mapping table for them.

Furthermore, there are a few cases like e.g. the Turkish i where case
mappings depend on external context such as the language the code point
is used in - those cases are difficult to get right.

We may need to extend the .lower()/.upper()/.title() methods with an
optional parameter that allow providing this extra context information
to the methods.

BTW: 'ß' is being phased out in German. The new writing rules encourage
using 'ss' or 'SS' instead (which is not entirely correct, since 'ß'
originated from 'sz' used some hundred or so years ago, but those are
just details ;-).
History
Date User Action Args
2008-12-10 09:44:11lemburgsetrecipients: + lemburg, loewis, alexs
2008-12-10 09:44:11lemburgsetmessageid: <1228902251.27.0.855534999134.issue4610@psf.upfronthosting.co.za>
2008-12-10 09:44:10lemburglinkissue4610 messages
2008-12-10 09:44:09lemburgcreate