This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author mrabarnett
Recipients Arfrever, ezio.melotti, jkloth, mrabarnett, pitrou, r.david.murray, rhettinger, tchrist, terry.reedy
Date 2011-08-14.19:36:25
SpamBayes Score 0.0025321099
Marked as misclassified No
Message-id <1313350586.62.0.349107172718.issue12749@psf.upfronthosting.co.za>
In-reply-to
Content
On a narrow build, "\N{MATHEMATICAL SCRIPT CAPITAL A}" is stored as 2 code units, and neither re nor regex recombine them when compiling a regex or looking for a match.

regex supports \xNN, \uNNNN and \UNNNNNNNN and \N{XYZ} itself, so they can be used in a raw string literal, but it doesn't recombine code units.

I could add recombination to regex at some point if time has passed and no further progress has been made in the language's support for Unicode.
History
Date User Action Args
2011-08-14 19:36:26mrabarnettsetrecipients: + mrabarnett, rhettinger, terry.reedy, pitrou, jkloth, ezio.melotti, Arfrever, r.david.murray, tchrist
2011-08-14 19:36:26mrabarnettsetmessageid: <1313350586.62.0.349107172718.issue12749@psf.upfronthosting.co.za>
2011-08-14 19:36:26mrabarnettlinkissue12749 messages
2011-08-14 19:36:25mrabarnettcreate