This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author r.david.murray
Recipients Fedele Mantuano, adepasquale, barry, maciej.szulik, r.david.murray
Date 2016-09-07.20:14:38
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1473279278.37.0.778994711338.issue27010@psf.upfronthosting.co.za>
In-reply-to
Content
Andrea: yes, your patch is different from what I had in mind.  The idea would be to recognize the "nested part with duplicate boundary", register the new defect, but produce a Message object with a structure that looked like this:

  multipart/mixed
    multipart/alternative
        text/plain
        text/html
    image/gif

What your patch produces is:

  multipart/mixed
    multipart/alternative
    text/plain
    text/html

which is not recognizing the nested multipart or the final MIME part (which is the OPs goal).

In principle it should be possible to parse the nesting despite the bad boundary (other MIME parsers do it, as documented here), but I'm not sure how hard it will be to modify Feedparser to do it.  Looking at the code it seems like it shouldn't be that hard to make it work, but I haven't dug deeply enough to be sure.
History
Date User Action Args
2016-09-07 20:14:38r.david.murraysetrecipients: + r.david.murray, barry, maciej.szulik, Fedele Mantuano, adepasquale
2016-09-07 20:14:38r.david.murraysetmessageid: <1473279278.37.0.778994711338.issue27010@psf.upfronthosting.co.za>
2016-09-07 20:14:38r.david.murraylinkissue27010 messages
2016-09-07 20:14:38r.david.murraycreate