Issue 16010: Some Unicode in identifiers improperly rejected

➜

This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

This issue has been migrated to GitHub: https://github.com/python/cpython/issues/60214

classification

Title:	Some Unicode in identifiers improperly rejected
Type:		Stage:	resolved
Components:	Interpreter Core	Versions:	Python 3.2

process

Created on 2012-09-23 23:45 by Joshua.Landau, last changed 2022-04-11 14:57 by admin. This issue is now closed.

Messages (2)
msg171082 - (view)	Author: Joshua Landau (Joshua.Landau) *	Date: 2012-09-23 23:45
"a¹ = None" is not valid, even though unicodedata.normalize("NFKC", "¹") == "1". One would expect "a¹ = None" and "a1 = None" to be equivalent in this case, as with "aⁱ = None" and "ai = None". I am not sure how many other characters exhibit the same problem. References: http://docs.python.org/py3k/reference/lexical_analysis.html#identifiers http://mail.python.org/pipermail/python-list/2012-September/631420.html "¹" === "\u00b9" "ⁱ" === "\u2071"
msg171089 - (view)	Author: R. David Murray (r.david.murray) *	Date: 2012-09-24 01:40
I find it unexpected that aⁱ and ai name the same variable, but I suppose that is a consequence of the unicode normalization rules (meaning what I really find surprising is the normalization). As for the '¹', its category is No, which does not appear in the list in the identifiers section you link to, while 'ⁱ' is Lm, which does. So there is no bug here.

History
Date	User	Action	Args
2022-04-11 14:57:36	admin	set	github: 60214
2012-09-25 21:08:15	terry.reedy	set	resolution: not a bug
2012-09-24 01:40:36	r.david.murray	set	status: open -> closed nosy: + r.david.murray messages: + msg171089 stage: resolved
2012-09-23 23:45:22	Joshua.Landau	create