Message 119229 - Python tracker

➜

This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author	Maciek.J
Recipients	Maciek.J, r.david.murray
Date	2010-10-20.18:05:27
SpamBayes Score	1.2617755e-05
Marked as misclassified	No
Message-id	<1287597929.66.0.0602225715872.issue10149@psf.upfronthosting.co.za>
In-reply-to

Content
Hm... It turns out that there is a "buffer_text" attribute: http://docs.python.org/library/pyexpat.html#xml.parsers.expat.xmlparser.buffer_text And setting this attribute to "True" seems to solve the problem. It solves my problem, but docs are still very confusing. I see two things that should be fixed: 1. In CharacterDataHandler description it should be explicitly noted that data may be chunked even if it is short(!). 2. Description of buffer_text attribute should contain a notice that data may also be arbitrary chunked if this is set to False. My data _was_not_ chunked at new line characters (as the description suggest). It was chunked in the middle of a sentence (there were no whitespace in it!).

Hm... It turns out that there is a "buffer_text" attribute:
http://docs.python.org/library/pyexpat.html#xml.parsers.expat.xmlparser.buffer_text
And setting this attribute to "True" seems to solve the problem.

It solves my problem, but docs are still very confusing. I see two things that should be fixed:
1. In CharacterDataHandler description it should be explicitly noted that data may be chunked even if it is short(!).
2. Description of buffer_text attribute should contain a notice that data may also be arbitrary chunked if this is set to False. My data _was_not_ chunked at new line characters (as the description suggest). It was chunked in the middle of a sentence (there were no whitespace in it!).

History
Date	User	Action	Args
2010-10-20 18:05:29	Maciek.J	set	recipients: + Maciek.J, r.david.murray
2010-10-20 18:05:29	Maciek.J	set	messageid: <1287597929.66.0.0602225715872.issue10149@psf.upfronthosting.co.za>
2010-10-20 18:05:27	Maciek.J	link	issue10149 messages
2010-10-20 18:05:27	Maciek.J	create