This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author sjt
Recipients Rosuav, berker.peksag, ncoghlan, sjt
Date 2016-07-21.17:15:04
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1469121304.17.0.528382017648.issue27582@psf.upfronthosting.co.za>
In-reply-to
Content
I still think the easiest thing to do would be to make all non-ASCII characters instances of "invalid_character_token", self-delimiting in the same way that operators are.  That would automatically point to exactly the right place in the token stream, and requires zero changes to the error handling code.

I don't have time to look at the code, but I suspect that you could handle this exactly the same way that ? and $ are handled, and maybe even use the same token type.
History
Date User Action Args
2016-07-21 17:15:04sjtsetrecipients: + sjt, ncoghlan, Rosuav, berker.peksag
2016-07-21 17:15:04sjtsetmessageid: <1469121304.17.0.528382017648.issue27582@psf.upfronthosting.co.za>
2016-07-21 17:15:04sjtlinkissue27582 messages
2016-07-21 17:15:04sjtcreate