This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author ezio.melotti
Recipients belopolsky, eric.araujo, ezio.melotti, fdrake, pluskid, v+python
Date 2011-03-26.10:17:23
SpamBayes Score 4.92448e-06
Marked as misclassified No
Message-id <1301134644.32.0.515359058579.issue7311@psf.upfronthosting.co.za>
In-reply-to
Content
The attached patch changes the regex to allow non-ascii letters in attribute values (using \w with the re.UNICODE flag instead of [a-zA-Z0-9_]).

Using [^>\s] (or even [^> ]) might be OK too, since that's what browsers seem to use (e.g. Firefox and Chrome show "テ<ス＀☃ト   -d-fg" as title of '<a href="" title=テ<ス＀☃ト   -d-fg href="">foo</a>', including the non-ascii spaces in the middle).
History
Date User Action Args
2011-03-26 10:17:24ezio.melottisetrecipients: + ezio.melotti, fdrake, belopolsky, eric.araujo, v+python, pluskid
2011-03-26 10:17:24ezio.melottisetmessageid: <1301134644.32.0.515359058579.issue7311@psf.upfronthosting.co.za>
2011-03-26 10:17:23ezio.melottilinkissue7311 messages
2011-03-26 10:17:23ezio.melotticreate