Message110163
When parsing a UTF-16 little-endian encoded XML file containing some japanese characters, the xml.sax.parse function raises a SAXParseException exception saying "no element found". Problem arises with/on:
Python 2.5.2/Windows XP Pro SP3 32 bit
Python 2.6.4/Windows XP Pro SP3 32 bit
Python 2.5.2/Windows 2008 Server SP2 64 bit
The same file is successfully processed with/on:
Python 2.4.3/CentOS 5.4
Python 2.6.3/CentOS 5.4
I've attached a minimal XML file that contains a single U+FF1A japanese character that triggers the exception. Code for parsing the file follows:
import xml.sax
xml.sax.parse(open("ff1a.xml"), xml.sax.ContentHandler())
Best regards,
Gianfranco |
|
Date |
User |
Action |
Args |
2010-07-13 09:04:36 | gianzula | set | recipients:
+ gianzula |
2010-07-13 09:04:36 | gianzula | set | messageid: <1279011876.88.0.998505025151.issue9241@psf.upfronthosting.co.za> |
2010-07-13 09:04:34 | gianzula | link | issue9241 messages |
2010-07-13 09:04:34 | gianzula | create | |
|