Handling of broken end tags in HTMLParser #58201

Closed
ezio-melotti opened this issue Feb 11, 2012 · 4 comments
Assignees
Labels
stdlib (Python modules in the Lib dir), type-bug (An unexpected behavior, bug, or error)

Comments

@ezio-melotti
Member

BPO 13993
Nosy @ezio-melotti, @merwok
Files
  • issue13993.diff: Patch against 3.2
  • issue13993-2.diff: Patch against 3.2
Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.

    GitHub fields:

    assignee = 'https://github.com/ezio-melotti'
    closed_at = <Date 2012-02-13.14:29:51.045>
    created_at = <Date 2012-02-11.13:49:57.338>
    labels = ['type-bug', 'library']
    title = 'Handling of broken end tags in HTMLParser'
    updated_at = <Date 2012-02-13.14:29:51.043>
    user = 'https://github.com/ezio-melotti'

    bugs.python.org fields:

    activity = <Date 2012-02-13.14:29:51.043>
    actor = 'ezio.melotti'
    assignee = 'ezio.melotti'
    closed = True
    closed_date = <Date 2012-02-13.14:29:51.045>
    closer = 'ezio.melotti'
    components = ['Library (Lib)']
    creation = <Date 2012-02-11.13:49:57.338>
    creator = 'ezio.melotti'
    dependencies = []
    files = ['24487', '24488']
    hgrepos = []
    issue_num = 13993
    keywords = ['patch']
    message_count = 4.0
    messages = ['153126', '153256', '153273', '153274']
    nosy_count = 3.0
    nosy_names = ['ezio.melotti', 'eric.araujo', 'python-dev']
    pr_nums = []
    priority = 'normal'
    resolution = 'fixed'
    stage = 'resolved'
    status = 'closed'
    superseder = None
    type = 'behavior'
    url = 'https://bugs.python.org/issue13993'
    versions = ['Python 2.7', 'Python 3.2', 'Python 3.3']

    @ezio-melotti
    Member Author

    The attached patch fixes the parser to handle broken end tags like:
    </label</p>
    </div end tmAd-leaderBoard>
    </<h4>
    </li class="unit">
    </li\r\n\t\t\t\t\t\t</ul>
    </><
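To see how the parser treats inputs like these, the sketch below feeds each sample to a small `HTMLParser` subclass and records the callbacks it fires. It assumes Python 3.5 or later, where the `strict` argument no longer exists and the parser is always tolerant. `EventCollector`, `collect_events`, and `BROKEN_END_TAGS` are hypothetical names introduced for illustration; they are not part of the patch. The exact events reported for each sample can differ between Python versions, so the sketch prints them rather than asserting a fixed result.

```python
from html.parser import HTMLParser


class EventCollector(HTMLParser):
    """Record the parser callbacks as (kind, value) tuples (illustrative helper)."""

    def __init__(self):
        super().__init__()
        self.events = []

    def handle_starttag(self, tag, attrs):
        self.events.append(("start", tag))

    def handle_endtag(self, tag):
        self.events.append(("end", tag))

    def handle_data(self, data):
        self.events.append(("data", data))

    def handle_comment(self, data):
        self.events.append(("comment", data))


def collect_events(markup):
    """Parse markup and return the list of recorded events."""
    parser = EventCollector()
    parser.feed(markup)
    parser.close()  # flush any text still buffered by the parser
    return parser.events


# The broken end tags listed in the report; the escapes in the fifth
# sample are written here as real control characters.
BROKEN_END_TAGS = [
    "</label</p>",
    "</div end tmAd-leaderBoard>",
    "</<h4>",
    '</li class="unit">',
    "</li\r\n\t\t\t\t\t\t</ul>",
    "</><",
]

if __name__ == "__main__":
    for sample in BROKEN_END_TAGS:
        print(repr(sample), "->", collect_events(sample))
```

Running the script shows which samples are reported as end tags, comments, or plain data on the interpreter in use.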

    @ezio-melotti ezio-melotti self-assigned this Feb 11, 2012
    @ezio-melotti ezio-melotti added the stdlib (Python modules in the Lib dir) and type-bug (An unexpected behavior, bug, or error) labels Feb 11, 2012
    @python-dev
    Mannequin

    python-dev mannequin commented Feb 13, 2012

    New changeset df5e5eea7833 by Ezio Melotti in branch '3.2':
    bpo-13993: HTMLParser is now able to handle broken end tags when strict=False.
    http://hg.python.org/cpython/rev/df5e5eea7833

    New changeset 2e66baa1d86a by Ezio Melotti in branch 'default':
    bpo-13993: merge with 3.2.
    http://hg.python.org/cpython/rev/2e66baa1d86a

    @python-dev
    Mannequin

    python-dev mannequin commented Feb 13, 2012

    New changeset a349448474ea by Ezio Melotti in branch '2.7':
    bpo-13993: HTMLParser is now able to handle broken end tags.
    http://hg.python.org/cpython/rev/a349448474ea

    @ezio-melotti
    Member Author

    This is now fixed.

    @ezio-melotti ezio-melotti transferred this issue from another repository Apr 10, 2022