Message170605
The '\\g<startquote>' in the below does not work:
>>> repr( re.compile( '\\<\\!ENTITY[ \\011\\012\\015]+\\%[ \\011\\012\\015]*(?P<entityName>[A-Za-z][A-Za-z0-9\\.\\-\\_\\:]*)[ \\011\\012\\015]*(?P<startquote>[\\042\\047])(?P<entityText>.+?)\\g<startquote>[ \\011\\012\\015]*\\>', re.IGNORECASE | re.DOTALL).search( '<!ENTITY % m.mixedContent "( #PCDATA | i | b)">'))
'None'
In the following, the '\\g<startquote>' has been replaced by '\\2'. It works.
>>> repr( re.compile( '\\<\\!ENTITY[ \\011\\012\\015]+\\%[ \\011\\012\\015]*(?P<entityName>[A-Za-z][A-Za-z0-9\\.\\-\\_\\:]*)[ \\011\\012\\015]*(?P<startquote>[\\042\\047])(?P<entityText>.+?)\\2[ \\011\\012\\015]*\\>', re.IGNORECASE | re.DOTALL).search( '<!ENTITY % m.mixedContent "( #PCDATA | i | b)">'))
'<_sre.SRE_Match object at 0x7f77503d1918>'
Either this feature is broken or the re module documentation is somehow misleading me.
(Yes, I know there is an XML error in the above. That's because it's SGML.) |
|
Date |
User |
Action |
Args |
2012-09-17 13:33:47 | steve.newcomb | set | recipients:
+ steve.newcomb, ezio.melotti, mrabarnett |
2012-09-17 13:33:47 | steve.newcomb | set | messageid: <1347888827.23.0.363134887653.issue15956@psf.upfronthosting.co.za> |
2012-09-17 13:33:23 | steve.newcomb | link | issue15956 messages |
2012-09-17 13:33:22 | steve.newcomb | create | |
|