Message58823
---------- Forwarded message ----------
From: diego <report@bugs.python.org>
Date: 19-Dec-2007 17:05
Subject: [issue1663] Modification HTMLParser.py
To: diego.arias@gmail.com
New submission from diego:
Hello, my name is Diego. I needed to parse HTML to retrieve only the text,
but I could not figure out how to do it with the HTMLParser class, so I
changed the class to make it possible. The code to use is:
class ParsearHTML(HTMLParser.HTMLParser):
    def __init__(self, datos):
        HTMLParser.HTMLParser.__init__(self)
        self.feed(datos)
        self.close()

    def handle_data(self, data):
        return data

parser = ParsearHTML(onTmp)
data = parser.feed(onTmp)
The changes to the class are attached. Thank you very much. Diego.
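
For comparison, text can usually be collected without modifying the library class. The stock feed() returns None and ignores whatever handle_data() returns, so a subclass has to store the text itself. The following is only a minimal sketch of that approach, not the attached patch. It assumes Python 3, where the module is html.parser (in Python 2.4 it is HTMLParser), and the names TextExtractor and extract_text are hypothetical:

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects the text nodes of a document (hypothetical helper name)."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        # feed() discards handler return values, so keep the text on the instance.
        self.chunks.append(data)

    def get_text(self):
        return "".join(self.chunks)


def extract_text(html):
    """Parse an HTML string and return only its text content."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()  # flush any text still buffered by the parser
    return parser.get_text()
```

Calling extract_text(onTmp) would then take the place of the two usage lines above. Text inside script and style elements is also passed to handle_data(), so a real filter would probably need to skip those.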
----------
components: None
files: HTMLParser.py
messages: 58821
nosy: diegorubenarias
severity: normal
status: open
title: Modification HTMLParser.py
type: resource usage
versions: Python 2.4
Added file: http://bugs.python.org/file9000/HTMLParser.py
__________________________________
Tracker <report@bugs.python.org>
<http://bugs.python.org/issue1663>
__________________________________

File name | Uploaded
HTMLParser.py | diegorubenarias, 2007-12-19.20:07:19
unnamed | diegorubenarias, 2007-12-19.20:07:19

Date | User | Action | Args
2007-12-19 20:07:19 | diegorubenarias | set | spambayes_score: 0.107215 -> 0.10721525; recipients: + diegorubenarias
2007-12-19 20:07:19 | diegorubenarias | link | issue1663 messages
2007-12-19 20:07:19 | diegorubenarias | create |