Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

HTMLParser lukewarm on bogus bare attribute chars #40417

Closed
mkc mannequin opened this issue Jun 18, 2004 · 5 comments
Closed

HTMLParser lukewarm on bogus bare attribute chars #40417

mkc mannequin opened this issue Jun 18, 2004 · 5 comments
Labels
stdlib Python modules in the Lib dir type-feature A feature request or enhancement

Comments

@mkc
Copy link
Mannequin

mkc mannequin commented Jun 18, 2004

BPO 975556
Nosy @devdanzin, @bitdancer
Superseder
  • bpo-1486713: HTMLParser : A auto-tolerant parsing mode
  • Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.

    Show more details

    GitHub fields:

    assignee = None
    closed_at = <Date 2010-12-03.04:14:13.145>
    created_at = <Date 2004-06-18.19:33:18.000>
    labels = ['type-feature', 'library']
    title = 'HTMLParser lukewarm on bogus bare attribute chars'
    updated_at = <Date 2010-12-03.04:14:13.143>
    user = 'https://bugs.python.org/mkc'

    bugs.python.org fields:

    activity = <Date 2010-12-03.04:14:13.143>
    actor = 'r.david.murray'
    assignee = 'none'
    closed = True
    closed_date = <Date 2010-12-03.04:14:13.145>
    closer = 'r.david.murray'
    components = ['Library (Lib)']
    creation = <Date 2004-06-18.19:33:18.000>
    creator = 'mkc'
    dependencies = []
    files = []
    hgrepos = []
    issue_num = 975556
    keywords = []
    message_count = 5.0
    messages = ['60515', '60516', '81438', '121676', '123175']
    nosy_count = 5.0
    nosy_names = ['mkc', 'nnseva', 'ajaksu2', 'Neil Muller', 'r.david.murray']
    pr_nums = []
    priority = 'normal'
    resolution = 'accepted'
    stage = 'resolved'
    status = 'closed'
    superseder = '1486713'
    type = 'enhancement'
    url = 'https://bugs.python.org/issue975556'
    versions = ['Python 2.7']

    @mkc
    Copy link
    Mannequin Author

    mkc mannequin commented Jun 18, 2004

    I tripped over the same problem mentioned in bug
    bpo-921657 (HTMLParser.py), except that my bogus attribute
    char is '|' instead of '@'.

    May I suggest that HTMLParser either require strict
    compliance with the HTML spec, or alternatively that it
    accept everything reasonable? The latter approach
    would be much more useful, and it would also be
    valuable to have this decision documented.

    In particular, 'attrfind' needs to be changed to accept
    (following the '=\s*') something like the subpattern
    given for 'locatestarttagend' (see the "bare value" line).

    @mkc mkc mannequin added the stdlib Python modules in the Lib dir label Jun 18, 2004
    @nnseva
    Copy link
    Mannequin

    nnseva mannequin commented Oct 13, 2004

    Logged In: YES
    user_id=325678

    see request bpo-1046092 to fix it

    @devdanzin
    Copy link
    Mannequin

    devdanzin mannequin commented Feb 9, 2009

    Per bpo-921657, looks like the current behavior is correct.

    @devdanzin devdanzin mannequin added the type-feature A feature request or enhancement label Feb 9, 2009
    @NeilMuller
    Copy link
    Mannequin

    NeilMuller mannequin commented Nov 20, 2010

    This should probably be solved as part of bpo-1486713 .

    @bitdancer
    Copy link
    Member

    The new strict=False mode from bpo-1486713 handles this case.

    @ezio-melotti ezio-melotti transferred this issue from another repository Apr 9, 2022
    Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
    Labels
    stdlib Python modules in the Lib dir type-feature A feature request or enhancement
    Projects
    None yet
    Development

    No branches or pull requests

    1 participant