
Author pitrou
Recipients Arfrever, docs@python, ezio.melotti, mrabarnett, pitrou, tchrist, terry.reedy, vstinner
Date 2011-08-13.17:36:36
Message-id <1313256874.3553.2.camel@localhost.localdomain>
In-reply-to <27372.1313194678@chthon>
Content
> However, because the \w&c issues are bigger, Java addressed the tr18 RL1.2a
> issues differently, this time by creating a new compilation flag called
> UNICODE_CHARACTER_CLASSES (with corresponding embedded "(?U)" regex flag.)
> 
> Truth be told, even Perl has secret pattern compilation flags to govern
> this sort of thing (ascii, locale, unicode), but we (well, I) hope you
> never have to use or even notice them.  
> 
> That too might be a route forward for Python, although I am not quite sure
> how much flexibility and control of your lexical scope you have.  However,
> the "from __future__" imports suggest you may have enough to do something
> slick so that only people who ask for it get it, and also importantly that
> they get it all over the place so don't have to add an extra flag or u'...'
> or whatever every single time.  

If the current behaviour is buggy or sub-optimal, I think we should
simply fix it (which might be done by replacing "re" with "regex" if
someone wants to shepherd its inclusion in the stdlib).

By the way, thanks for the detailed explanations, Tom.
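
For reference, the kind of per-pattern flag discussed in the quoted text has a rough counterpart in Python 3's "re" module. The following is a minimal sketch, assuming Python 3 and str (not bytes) patterns; the sample string is made up for illustration, and the example shows only how \w changes with the flag, not whether the behaviour meets the tr18 requirements debated in this issue.

```python
import re

# Illustrative sample text (an assumption, not taken from the issue).
# Escapes are used so the accented letters are single precomposed code points.
text = "na\u00efve caf\u00e9 123"

# Python 3 default for str patterns: \w matches Unicode word characters.
unicode_words = re.findall(r"\w+", text)

# re.ASCII restricts \w to [a-zA-Z0-9_]; "(?a)" is the inline form of the same flag.
ascii_words = re.findall(r"\w+", text, flags=re.ASCII)
inline_ascii_words = re.findall(r"(?a)\w+", text)

print(unicode_words)
print(ascii_words)
```

With the flag set, the accented letters stop counting as word characters, so the words are split at those positions. The inline form gives the same result as passing re.ASCII explicitly.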
History
Date User Action Args
2011-08-13 17:36:37 pitrou set recipients: + pitrou, terry.reedy, vstinner, ezio.melotti, mrabarnett, Arfrever, docs@python, tchrist
2011-08-13 17:36:36 pitrou link issue12731 messages
2011-08-13 17:36:36 pitrou create