Message 382142 - Python tracker

➜

This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author	vstinner
Recipients	m.s.sharifzade, vstinner
Date	2020-11-30.10:53:55
SpamBayes Score	-1.0
Marked as misclassified	Yes
Message-id	<1606733635.14.0.49026274413.issue42511@roundup.psfhosted.org>
In-reply-to

Content
The two strings a different, that's why Python returns False. a = U+0627 U+0646 U+062f U+064a U+0645 U+0634 U+06a9 b = U+0627 U+0646 U+062f U+06cc U+0645 U+0634 U+06a9 U+064a != U+06cc >>> unicodedata.name('\u064a') 'ARABIC LETTER YEH' >>> unicodedata.name('\u06cc') 'ARABIC LETTER FARSI YEH' Python doesn't know arabic, it only compares code pointers: the number 0x064a is not equal to the number 0x06cc. It's not a bug, but a deliberate choice.

The two strings a different, that's why Python returns False.

a = U+0627 U+0646 U+062f U+064a U+0645 U+0634 U+06a9
b = U+0627 U+0646 U+062f U+06cc U+0645 U+0634 U+06a9

U+064a != U+06cc

>>> unicodedata.name('\u064a')
'ARABIC LETTER YEH'
>>> unicodedata.name('\u06cc')
'ARABIC LETTER FARSI YEH'

Python doesn't know arabic, it only compares code pointers: the number 0x064a is not equal to the number 0x06cc.

It's not a bug, but a deliberate choice.

History
Date	User	Action	Args
2020-11-30 10:53:55	vstinner	set	recipients: + vstinner, m.s.sharifzade
2020-11-30 10:53:55	vstinner	set	messageid: <1606733635.14.0.49026274413.issue42511@roundup.psfhosted.org>
2020-11-30 10:53:55	vstinner	link	issue42511 messages
2020-11-30 10:53:55	vstinner	create