
classification
Title: normalization is affected by unicode width
Type: behavior Stage:
Components: Extension Modules, Unicode Versions: Python 3.3
process
Status: closed Resolution: fixed
Dependencies: Superseder:
Assigned To: Nosy List: Arfrever, benjamin.peterson, ezio.melotti, gvanrossum, lemburg, loewis, tchrist, vstinner
Priority: normal Keywords:

Created on 2011-08-13 04:12 by benjamin.peterson, last changed 2022-04-11 14:57 by admin. This issue is now closed.

Messages (6)
msg142008 - Author: Benjamin Peterson (benjamin.peterson) (Python committer) Date: 2011-08-13 04:12
Narrow build:
>>> unicodedata.normalize("NFKC", "𝔘𝔫𝔦𝔠𝔬𝔡𝔢")
'𝔘𝔫𝔦𝔠𝔬𝔡𝔢'

Wide build:
>>> unicodedata.normalize("NFKC", "𝔘𝔫𝔦𝔠𝔬𝔡𝔢")
'Unicode'

Normalization needs to properly decode characters in the supplementary plane.
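To illustrate the kind of decoding the report asks for, here is a minimal sketch in pure Python. It is not the actual unicodedata fix; the helper names `utf16_units`, `combine_surrogates`, and `normalize_units` are hypothetical and exist only for this example. It models a narrow build's storage as a list of UTF-16 code units, joins surrogate pairs back into supplementary-plane code points, and only then normalizes.

```python
import unicodedata


def utf16_units(text):
    """Return the UTF-16 code units of text (hypothetical helper that
    mimics how a narrow build stored strings)."""
    data = text.encode("utf-16-le")
    return [int.from_bytes(data[i:i + 2], "little")
            for i in range(0, len(data), 2)]


def combine_surrogates(units):
    """Join each high/low surrogate pair into a single code point."""
    code_points = []
    i = 0
    while i < len(units):
        unit = units[i]
        is_high = 0xD800 <= unit <= 0xDBFF
        has_low = i + 1 < len(units) and 0xDC00 <= units[i + 1] <= 0xDFFF
        if is_high and has_low:
            low = units[i + 1]
            code_points.append(
                0x10000 + ((unit - 0xD800) << 10) + (low - 0xDC00))
            i += 2
        else:
            # BMP character (or a lone surrogate, passed through unchanged).
            code_points.append(unit)
            i += 1
    return code_points


def normalize_units(form, units):
    """Normalize a code-unit sequence after decoding surrogate pairs."""
    text = "".join(chr(cp) for cp in combine_surrogates(units))
    return unicodedata.normalize(form, text)


units = utf16_units("𝔘𝔫𝔦𝔠𝔬𝔡𝔢")
print(len(units))                      # two code units per character
print(normalize_units("NFKC", units))  # matches the wide-build result
```

Normalizing the code units one at a time, without the pairing step, leaves every surrogate untouched, which is consistent with the narrow-build output shown above.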
msg142105 - Author: STINNER Victor (vstinner) (Python committer) Date: 2011-08-15 08:28
See also #12737.
msg143039 - Author: Guido van Rossum (gvanrossum) (Python committer) Date: 2011-08-26 21:18
Yeah, we should fix this.  At least in 3.3, but (without knowing what exactly is involved) I think backporting to 2.7 and 3.2 makes sense too.
msg144563 - Author: Benjamin Peterson (benjamin.peterson) (Python committer) Date: 2011-09-28 12:15
Not anymore, though. :)
msg144564 - Author: Benjamin Peterson (benjamin.peterson) (Python committer) Date: 2011-09-28 13:05
Almost, that is. The unicodedata module still needs to use the new PEP 393 API.
msg144610 - Author: STINNER Victor (vstinner) (Python committer) Date: 2011-09-29 18:06
Issue fixed in Python 3.3 by Martin, changesets 40cf7204ca2b and 50f008019213.
History
Date                 User               Action  Args
2022-04-11 14:57:20  admin              set     github: 56955
2011-09-29 18:06:25  vstinner           set     status: open -> closed; nosy: + loewis; messages: + msg144610; resolution: fixed
2011-09-28 13:05:39  benjamin.peterson  set     status: closed -> open; resolution: fixed -> (no value); messages: + msg144564
2011-09-28 12:41:53  vstinner           set     versions: - Python 2.7, Python 3.2
2011-09-28 12:15:35  benjamin.peterson  set     status: open -> closed; resolution: fixed; messages: + msg144563
2011-08-26 21:18:05  gvanrossum         set     nosy: + gvanrossum; messages: + msg143039
2011-08-15 19:28:23  Arfrever           set     nosy: + Arfrever
2011-08-15 09:00:31  tchrist            set     nosy: + tchrist
2011-08-15 08:28:57  vstinner           set     messages: + msg142105
2011-08-13 11:55:39  pitrou             set     nosy: + lemburg, vstinner
2011-08-13 08:02:13  ezio.melotti       set     nosy: + ezio.melotti; components: + Unicode
2011-08-13 04:12:00  benjamin.peterson  create