This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author lars.gustaebel
Recipients lars.gustaebel
Date 2008-02-10.11:43:59
SpamBayes Score 0.10428157
Marked as misclassified No
Message-id <1202643843.0.0.134352659823.issue2058@psf.upfronthosting.co.za>
In-reply-to
Content
tarfile.py wastes lots of memory resources. The memory consumption does
not depend on the size of an archive but on the numbers of members in it.
The attached patch reduces memory usage by about 60% and consists of two
independent strategies (each with about 30% reduction):

1. Add __slots__ to the TarInfo class. This was proposed in issue1540385
a while ago but rejected due to backward-compatibility issues.

2. Remove the undocumented buf attribute of the TarInfo class. buf
stores the original 512-byte header block read from the archive. This
was introduced in r45954 and is rather useless except for GNUTYPE_SPARSE
processing. This might as well be a candidate for backporting to 2.6.
History
Date User Action Args
2008-02-10 11:44:03lars.gustaebelsetspambayes_score: 0.104282 -> 0.10428157
recipients: + lars.gustaebel
2008-02-10 11:44:03lars.gustaebelsetspambayes_score: 0.104282 -> 0.104282
messageid: <1202643843.0.0.134352659823.issue2058@psf.upfronthosting.co.za>
2008-02-10 11:44:01lars.gustaebellinkissue2058 messages
2008-02-10 11:44:01lars.gustaebelcreate