
Author ygale
Recipients ygale
Date 2008-02-24.13:52:30
Message-id <1203861152.32.0.18277152276.issue2174@psf.upfronthosting.co.za>
In-reply-to
Content
The documentation for xml.sax.xmlreader.InputSource objects
(section 8.12.4 of the Library Reference) says that
users of an InputSource object should obtain their input
data in the following order:

1. If the InputSource has a character stream, use that.
2. Otherwise, if the InputSource has a byte stream, use that.
3. Otherwise, open a URI connection to the system ID.

The parse() method of IncrementalParser skips step 1.
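
For illustration, here is a minimal sketch of the documented lookup order, written for Python 3. The helper name pick_stream is hypothetical and is not part of xml.sax; step 3 assumes the system ID is a URL that urllib.request.urlopen accepts, which is a simplification.

```python
from urllib.request import urlopen
from xml.sax.xmlreader import InputSource


def pick_stream(source):
    """Return (kind, stream) for an InputSource, in the documented order.

    Hypothetical helper for illustration only; not part of xml.sax.
    """
    # Step 1: a character stream takes priority if one is set.
    char_stream = source.getCharacterStream()
    if char_stream is not None:
        return "character", char_stream

    # Step 2: otherwise fall back to the byte stream.
    byte_stream = source.getByteStream()
    if byte_stream is not None:
        return "byte", byte_stream

    # Step 3: otherwise open a connection to the system ID.
    # Assumption: the system ID is a URL that urlopen() can handle.
    return "uri", urlopen(source.getSystemId())
```

With both a byte stream and a character stream set on the same InputSource, this helper returns the character stream, which is the behavior the documentation describes.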

In addition, we need to add a getSourceEncoding()
method to the XMLReader interface. If it returns
something other than None, that tells the parser that
the input is a byte stream in the given encoding.

The documentation should also say what the parser
should do if the XML document itself declares a
different encoding. I propose that the parser should
be required to raise an error in that case.
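
A rough sketch of the proposed check, for Python 3. Everything here is an assumption for illustration: check_declared_encoding is a hypothetical helper, the externally supplied encoding is passed in as a plain argument (the proposed getSourceEncoding() method does not exist), and the regular expression only handles ASCII-compatible encodings where the XML declaration can be read directly from the bytes.

```python
import codecs
import re
from xml.sax import SAXException

# Matches the encoding pseudo-attribute of an XML declaration at the very
# start of the input. Only works for ASCII-compatible encodings.
_DECL_ENCODING = re.compile(
    rb"""^<\?xml[^>]*?\bencoding\s*=\s*["']([A-Za-z][A-Za-z0-9._-]*)["']"""
)


def check_declared_encoding(head, source_encoding):
    """Raise SAXException if the declaration disagrees with source_encoding.

    head is the first chunk of the byte stream; source_encoding is the
    encoding supplied from outside the document. Hypothetical helper.
    """
    match = _DECL_ENCODING.match(head)
    if match is None:
        return  # No encoding declared, so there is nothing to conflict with.

    declared = match.group(1).decode("ascii")
    # Compare canonical codec names so that "UTF-8" and "utf8" count as equal.
    # codecs.lookup() raises LookupError for names Python does not know.
    if codecs.lookup(declared).name != codecs.lookup(source_encoding).name:
        raise SAXException(
            "document declares encoding %r but the source encoding is %r"
            % (declared, source_encoding)
        )
```

Comparing normalized codec names rather than raw strings avoids spurious errors for aliases; whether the real parser should be this lenient is an open question for the documentation.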

See also #1483.