Author serhiy.storchaka
Recipients ezio.melotti, mrabarnett, serhiy.storchaka
Date 2017-10-04.14:02:55
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1507125776.04.0.213398074469.issue31690@psf.upfronthosting.co.za>
In-reply-to
Content
Currently re supports local inline flags. 'a(?i:b)' matches 'a' cases-sensitively, but 'b' case-insensitively. But flags 'a' and 'L' can't be scoped to a subpattern. The 'u' flag currently just redundant, it doesn't make effect in string patterns, and is not allowed in bytes patterns. They can be applied only to the whole pattern. I think it would be nice to make them local.

The example of the problem that this can solve is issue31672. Currently '[a-z]' in Unicode case-insensitive mode matches not only Latin letters from ;a' to 'z' and from 'A' to 'Z', but also characters 'İ', 'ı', 'ſ' and 'K' which are equivalent to 'i', 's' and 'k' correspondingly. With local 'a' and 'u' flags you can use ASCII and Unicode ranges in the same pattern.

I'm working on the patch.
History
Date User Action Args
2017-10-04 14:02:56serhiy.storchakasetrecipients: + serhiy.storchaka, ezio.melotti, mrabarnett
2017-10-04 14:02:56serhiy.storchakasetmessageid: <1507125776.04.0.213398074469.issue31690@psf.upfronthosting.co.za>
2017-10-04 14:02:55serhiy.storchakalinkissue31690 messages
2017-10-04 14:02:55serhiy.storchakacreate