This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author mrabarnett
Recipients acdha, ezio.melotti, mrabarnett
Date 2013-03-07.23:19:55
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1362698396.08.0.158677158813.issue17381@psf.upfronthosting.co.za>
In-reply-to
Content
The way the re handles ranges is to convert the two endpoints to lowercase and then check whether the lowercase form of the character in the text is in that range.

For example, [A-Z] is converted to the range [\x41-\x5A], and the lowercase form of 'Q' ('\x51') is 'q' ('\x7A'), which is in the range.

In your example, [\u0400-\u0527] is converted to the range [\u0450-\u0527], but the lowercase form of 'А' ('\u0410') is 'а' ('\u0430'), which isn't in the range.

This is the same as issue #3511, but a worse failure.
History
Date User Action Args
2013-03-07 23:19:56mrabarnettsetrecipients: + mrabarnett, ezio.melotti, acdha
2013-03-07 23:19:56mrabarnettsetmessageid: <1362698396.08.0.158677158813.issue17381@psf.upfronthosting.co.za>
2013-03-07 23:19:56mrabarnettlinkissue17381 messages
2013-03-07 23:19:55mrabarnettcreate