Author terry.reedy
Recipients ezio.melotti, lemburg, loewis, taleinat, terry.reedy
Date 2014-07-11.17:32:40
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1405099960.39.0.858333791657.issue21765@psf.upfronthosting.co.za>
In-reply-to
Content
I did not open another issue yet because I did not want to split the general discussion of parsing 3.x unicode-based python. We might also need another issue for idlelib/PyParse, and that might need to come first.

What I think is that Idle should have at 1 definition of 'identifier' and not a least 3, in 3 separate places ;-). Hyperparser seems like the most appropriate place if there is to be more than str.isidentifier.

That is not to say that equivalents versions might be needed for different uses. ColorDelegator, as it is, needs an re version to be combined with other regexes. I have not looked yet at what UndoDelegator does with its character classification function and whether str.isidentifier could be used.  Hyperparsar seems the hardest since it parses backwards.

I do not consider this issue to be the highest priority, at the moment, but we have collected enough info with help from Ezio and Martin to do some experiments.
History
Date User Action Args
2014-07-11 17:32:40terry.reedysetrecipients: + terry.reedy, lemburg, loewis, taleinat, ezio.melotti
2014-07-11 17:32:40terry.reedysetmessageid: <1405099960.39.0.858333791657.issue21765@psf.upfronthosting.co.za>
2014-07-11 17:32:40terry.reedylinkissue21765 messages
2014-07-11 17:32:40terry.reedycreate