Message88864
A simple workaround for BeautifulSoup is the following wrapper. It
sanitizes the JavaScript code before passing it to the parser by joining
the split string literals, so that "</scr"+"ipt>" becomes "</script>".
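import copy
import re

from BeautifulSoup import BeautifulSoup  # BeautifulSoup 3.x

def bs(input):
    # Match the "+" that joins two adjacent string literals.
    pattern = re.compile(r'"\+"')
    match = lambda x: ""  # replace each match with nothing
    massage = copy.copy(BeautifulSoup.MARKUP_MASSAGE)
    massage.extend([(pattern, match)])
    return BeautifulSoup(input, markupMassage=massage)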
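The effect of the massage rule can be checked without BeautifulSoup. The following is only a sketch of the substitution step, using the same pattern as the wrapper; the helper name join_split_strings and the sample markup are made up for illustration.

```python
import re

# Same pattern as in the wrapper: closing quote, plus sign, opening quote.
SPLIT_STRING = re.compile(r'"\+"')

def join_split_strings(markup):
    """Hypothetical helper: drop every "+" join so the two literals merge."""
    return SPLIT_STRING.sub("", markup)

sample = '<script>document.write("<scr"+"ipt></scr"+"ipt>");</script>'
print(join_split_strings(sample))
```

Note that the pattern only matches double-quoted literals joined with no whitespace around the plus sign; a string written as "</scr" + "ipt>" or with single quotes would pass through unchanged.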
Date | User | Action | Args
2009-06-04 06:34:15 | momat | set | recipients: + momat, fdrake, georg.brandl, fantoozler, gsf, cpalmer
2009-06-04 06:34:13 | momat | set | messageid: <1244097253.39.0.143495255584.issue670664@psf.upfronthosting.co.za>
2009-06-04 06:34:08 | momat | link | issue670664 messages
2009-06-04 06:34:00 | momat | create |