Message142077
On a narrow build, "\N{MATHEMATICAL SCRIPT CAPITAL A}" is stored as 2 code units (a surrogate pair), and neither re nor regex recombines them when compiling a pattern or looking for a match.
regex itself supports \xNN, \uNNNN, \UNNNNNNNN and \N{XYZ}, so these escapes can be used in a raw string literal, but it doesn't recombine the resulting code units.
I could add recombination to regex at some point if, after some time has passed, no further progress has been made in the language's support for Unicode.
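For illustration, here is a rough sketch of what "recombination" means here. It is not code from re or regex; the function names to_utf16_code_units and recombine_surrogates are made up for this example, and the narrow-build storage is simulated with a list of integers so the sketch also runs on builds without surrogate pairs.

```python
def to_utf16_code_units(text):
    """Split text into UTF-16 code units, as a narrow build would store it.

    Hypothetical helper for this illustration only.
    """
    units = []
    for ch in text:
        cp = ord(ch)
        if cp > 0xFFFF:
            # Non-BMP code point: encode as a high/low surrogate pair.
            cp -= 0x10000
            units.append(0xD800 + (cp >> 10))
            units.append(0xDC00 + (cp & 0x3FF))
        else:
            units.append(cp)
    return units


def recombine_surrogates(units):
    """Merge each high+low surrogate pair back into one code point.

    Hypothetical helper; lone surrogates are passed through unchanged.
    """
    code_points = []
    i = 0
    while i < len(units):
        unit = units[i]
        is_high = 0xD800 <= unit <= 0xDBFF
        has_low = i + 1 < len(units) and 0xDC00 <= units[i + 1] <= 0xDFFF
        if is_high and has_low:
            low = units[i + 1]
            code_points.append(0x10000 + ((unit - 0xD800) << 10) + (low - 0xDC00))
            i += 2
        else:
            code_points.append(unit)
            i += 1
    return code_points


units = to_utf16_code_units("\N{MATHEMATICAL SCRIPT CAPITAL A}")
print([hex(u) for u in units])                         # 2 code units
print([hex(c) for c in recombine_surrogates(units)])   # 1 code point
```

A matcher working on the recombined sequence would see one item for the character instead of two, which is what a character class or "." would need in order to treat it as a single character.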
Date | User | Action | Args
2011-08-14 19:36:26 | mrabarnett | set | recipients: + mrabarnett, rhettinger, terry.reedy, pitrou, jkloth, ezio.melotti, Arfrever, r.david.murray, tchrist
2011-08-14 19:36:26 | mrabarnett | set | messageid: <1313350586.62.0.349107172718.issue12749@psf.upfronthosting.co.za>
2011-08-14 19:36:26 | mrabarnett | link | issue12749 messages
2011-08-14 19:36:25 | mrabarnett | create |