This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author serhiy.storchaka
Recipients lars.gustaebel, nadeem.vawda, r.david.murray, serhiy.storchaka, teamnoir
Date 2013-08-16.20:59:35
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1376686775.42.0.915162102884.issue18744@psf.upfronthosting.co.za>
In-reply-to
Content
Thank you for the script Richard.

If you say about performance degradation when extracting a tarfile in changed order this behavior is expected. When you read a gzip file in random order you need seek in it. A gzip file is a singe-direction road. For seeking in a gzip file you need decompress all data between you current position (or from the file start) and target position. In case of random order you need decompress 1/3 tarfile in the mean for every extracted file.

THe tarfile module can't do anything with this. It can't first extract all file in the memory because uncompressed file can be too big. It can't resort a list of extracted file in natural order because it can change semantic (a tarfile can contains duplicates and symlinks). Just don't do this. Don't extract a large number of files from compressed tarfile in changed order.
History
Date User Action Args
2013-08-16 20:59:35serhiy.storchakasetrecipients: + serhiy.storchaka, lars.gustaebel, nadeem.vawda, r.david.murray, teamnoir
2013-08-16 20:59:35serhiy.storchakasetmessageid: <1376686775.42.0.915162102884.issue18744@psf.upfronthosting.co.za>
2013-08-16 20:59:35serhiy.storchakalinkissue18744 messages
2013-08-16 20:59:35serhiy.storchakacreate