This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author Maciek.J
Recipients Maciek.J, r.david.murray
Date 2010-10-20.18:05:27
SpamBayes Score 1.2617755e-05
Marked as misclassified No
Message-id <1287597929.66.0.0602225715872.issue10149@psf.upfronthosting.co.za>
In-reply-to
Content
Hm... It turns out that there is a "buffer_text" attribute:
http://docs.python.org/library/pyexpat.html#xml.parsers.expat.xmlparser.buffer_text
And setting this attribute to "True" seems to solve the problem.

It solves my problem, but docs are still very confusing. I see two things that should be fixed:
1. In CharacterDataHandler description it should be explicitly noted that data may be chunked even if it is short(!).
2. Description of buffer_text attribute should contain a notice that data may also be arbitrary chunked if this is set to False. My data _was_not_ chunked at new line characters (as the description suggest). It was chunked in the middle of a sentence (there were no whitespace in it!).
History
Date User Action Args
2010-10-20 18:05:29Maciek.Jsetrecipients: + Maciek.J, r.david.murray
2010-10-20 18:05:29Maciek.Jsetmessageid: <1287597929.66.0.0602225715872.issue10149@psf.upfronthosting.co.za>
2010-10-20 18:05:27Maciek.Jlinkissue10149 messages
2010-10-20 18:05:27Maciek.Jcreate