This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author vstinner
Recipients m.s.sharifzade, vstinner
Date 2020-11-30.10:53:55
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1606733635.14.0.49026274413.issue42511@roundup.psfhosted.org>
In-reply-to
Content
The two strings a different, that's why Python returns False.

a = U+0627 U+0646 U+062f U+064a U+0645 U+0634 U+06a9
b = U+0627 U+0646 U+062f U+06cc U+0645 U+0634 U+06a9

U+064a != U+06cc

>>> unicodedata.name('\u064a')
'ARABIC LETTER YEH'
>>> unicodedata.name('\u06cc')
'ARABIC LETTER FARSI YEH'

Python doesn't know arabic, it only compares code pointers: the number 0x064a is not equal to the number 0x06cc.

It's not a bug, but a deliberate choice.
History
Date User Action Args
2020-11-30 10:53:55vstinnersetrecipients: + vstinner, m.s.sharifzade
2020-11-30 10:53:55vstinnersetmessageid: <1606733635.14.0.49026274413.issue42511@roundup.psfhosted.org>
2020-11-30 10:53:55vstinnerlinkissue42511 messages
2020-11-30 10:53:55vstinnercreate