This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author ezio.melotti
Recipients belopolsky, eric.araujo, ezio.melotti, fdrake, pluskid, v+python
Date 2011-03-27.13:57:20
SpamBayes Score 0.00067364925
Marked as misclassified No
Message-id <1301234241.69.0.369672102987.issue7311@psf.upfronthosting.co.za>
In-reply-to
Content
The HTML 4.01 specifications says[0]:
"""
In certain cases, authors may specify the value of an attribute without any quotation marks. The attribute value may only contain letters (a-z and A-Z), digits (0-9), hyphens (ASCII decimal 45), periods (ASCII decimal 46), underscores (ASCII decimal 95), and colons (ASCII decimal 58). We recommend using quotation marks even when it is possible to eliminate them.
"""

The HTML 5 draft says[1]:
"""
The attribute name, followed by zero or more space characters, followed by a single U+003D EQUALS SIGN character, followed by zero or more space characters, followed by the attribute value, which, in addition to the requirements given above for attribute values, must not contain any literal space characters, any U+0022 QUOTATION MARK characters ("), U+0027 APOSTROPHE characters ('), U+003D EQUALS SIGN characters (=), U+003C LESS-THAN SIGN characters (<), U+003E GREATER-THAN SIGN characters (>), or U+0060 GRAVE ACCENT characters (`), and must not be the empty string.
"""

So maybe [^>\s] is a little too permissive here.

[0]: http://www.w3.org/TR/html4/intro/sgmltut.html#h-3.2.2
[1]: http://dev.w3.org/html5/spec/Overview.html#attributes-0
History
Date User Action Args
2011-03-27 13:57:21ezio.melottisetrecipients: + ezio.melotti, fdrake, belopolsky, eric.araujo, v+python, pluskid
2011-03-27 13:57:21ezio.melottisetmessageid: <1301234241.69.0.369672102987.issue7311@psf.upfronthosting.co.za>
2011-03-27 13:57:21ezio.melottilinkissue7311 messages
2011-03-27 13:57:20ezio.melotticreate