
Author amaury.forgeotdarc
Recipients akuchling, amaury.forgeotdarc, belopolsky, benjamin.peterson, donlorenzo, mortenlj, pitrou, rsc, zanella
Date 2008-06-26.15:18:37
Message-id <1214493519.39.0.824007509472.issue2650@psf.upfronthosting.co.za>
Content
The escaped regexp is not valid UTF-8 (why should it be?), since re.escape
puts a backslash before each non-alphanumeric byte, but it still matches the
same bytes in the searched text, which has to be UTF-8 encoded anyway:

>>> import re
>>> text = u"été".encode('utf-8')
>>> regexp = u"é".encode('utf-8')
>>> re.findall(regexp, text)
['\xc3\xa9', '\xc3\xa9']
>>> escaped_regexp = re.escape(regexp)
>>> re.findall(escaped_regexp, text)
['\xc3\xa9', '\xc3\xa9']
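
For readers on Python 3, the same check can be sketched with bytes patterns. This is only an illustration: the variable names plain_matches and escaped_matches are mine, and the exact bytes that re.escape returns differ between Python versions, so the sketch compares the match results only.

```python
import re

# "\u00e9" is "é"; escapes are used so the example does not depend on
# the encoding of the source file.
text = "\u00e9t\u00e9".encode("utf-8")    # b'\xc3\xa9t\xc3\xa9'
regexp = "\u00e9".encode("utf-8")         # b'\xc3\xa9'

# Search with the raw UTF-8 bytes as the pattern.
plain_matches = re.findall(regexp, text)

# Search again with the escaped pattern. Whatever backslashes
# re.escape adds, the pattern still matches the same two bytes.
escaped_regexp = re.escape(regexp)
escaped_matches = re.findall(escaped_regexp, text)

print(plain_matches)
print(escaped_matches)
print(plain_matches == escaped_matches)
```

Both searches should find the two encoded occurrences of "é", whether or not the escaped pattern is itself decodable as UTF-8.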