This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author neologix
Recipients Eric.Wolf, neologix, niemeyer, wrobell
Date 2011-03-01.11:27:55
SpamBayes Score 5.10372e-06
Marked as misclassified No
Message-id <1298978877.31.0.201492140333.issue10900@psf.upfronthosting.co.za>
In-reply-to
Content
@Eric.Wolf

Could you try with this:

            # Read in anohter chunk of the file
            # NOTE: It's possible that an XML tag will be greater than buffsize
            #       This will break in that situation
-            newb = self.fp.read(self.bufpos)
+            newb = self.fp.read(self.buffsize)

Also, could you provide the output of
strace -emmap2,sbrk,brk python <script>

I could be completely wrong, but both in your case and in wrobell's case, there's a lot of _PyBytes_Resize going on, and given how PyObject_Realloc is implemented, this could lead to heavy heap fragmentation.
History
Date User Action Args
2011-03-01 11:27:57neologixsetrecipients: + neologix, niemeyer, wrobell, Eric.Wolf
2011-03-01 11:27:57neologixsetmessageid: <1298978877.31.0.201492140333.issue10900@psf.upfronthosting.co.za>
2011-03-01 11:27:55neologixlinkissue10900 messages
2011-03-01 11:27:55neologixcreate