Author mrabarnett
Recipients ThiefMaster, davidism, ezio.melotti, mrabarnett, serhiy.storchaka, vstinner
Date 2017-07-03.20:21:45
Message-id <1499113305.95.0.193293638206.issue30838@psf.upfronthosting.co.za>
In-reply-to
Content
In Unicode 9.0.0, U+1885 and U+1886 changed from being General_Category=Other_Letter (Lo) to General_Category=Nonspacing_Mark (Mn).

U+2118 is General_Category=Math_Symbol (Sm) and U+212E is General_Category=Other_Symbol (So).

\w doesn't match characters in Mn, Sm or So, so it doesn't match any of these four codepoints.

The str.isidentifier method uses the Unicode properties XID_Start and XID_Continue, which include these codepoints.
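
The mismatch can be checked with a short script. This is only a sketch: the helper name `describe` is made up for illustration, and the reported categories depend on the Unicode database bundled with the interpreter, so U+1885 and U+1886 show up as Mn only on builds that use Unicode 9.0.0 or later.

```python
import re
import unicodedata

# The four codepoints discussed in this message.
CODEPOINTS = ["\u1885", "\u1886", "\u2118", "\u212E"]


def describe(ch):
    """Return (general category, isidentifier() result, whether \\w matches)."""
    return (
        unicodedata.category(ch),
        ch.isidentifier(),
        re.fullmatch(r"\w", ch) is not None,
    )


for ch in CODEPOINTS:
    category, is_ident, matches_w = describe(ch)
    print(f"U+{ord(ch):04X} {category} isidentifier={is_ident} \\w={matches_w}")
```

For each of these codepoints, isidentifier should report True while \w reports no match, which is the discrepancy this issue is about.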
History
Date User Action Args
2017-07-03 20:21:45 mrabarnett set recipients: + mrabarnett, vstinner, ezio.melotti, serhiy.storchaka, ThiefMaster, davidism
2017-07-03 20:21:45 mrabarnett set messageid: <1499113305.95.0.193293638206.issue30838@psf.upfronthosting.co.za>
2017-07-03 20:21:45 mrabarnett link issue30838 messages
2017-07-03 20:21:45 mrabarnett create