This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author donut
Recipients donut
Date 2008-06-02.02:24:48
SpamBayes Score 0.042010687
Marked as misclassified No
Message-id <1212373498.52.0.766248013838.issue3026@psf.upfronthosting.co.za>
In-reply-to
Content
mmap on large files on 64 bit platforms in python >=2.5 returns some
sort of garbage.  In 2.4 it would just throw an exception.  Now I get
something like this (script runs md5.md5 on mmap object, and then runs
os.system md5sum for comparison):

This is python2.5 from Ubuntu 8.04 AMD64
/tmp$ python2.5 testbigfile.py 
python mmap md5: 1230552d39b7c1751f86bae5205ec0c8
abe59e28c9a3f11b883f62c80a3833a5 *bigfile


This is python svn as of 20080601, compiled the on same system.
/tmp$ python2.6 testbigfile.py
testbigfile.py:5: DeprecationWarning: the md5 module is deprecated; use
hashlib instead
  import md5
python mmap md5: 1230552d39b7c1751f86bae5205ec0c8
abe59e28c9a3f11b883f62c80a3833a5 *bigfile


Also note how the python md5 call returns immediately, not something you
would expect when md5ing 4GB of data.
History
Date User Action Args
2008-06-02 02:24:59donutsetspambayes_score: 0.0420107 -> 0.042010687
recipients: + donut
2008-06-02 02:24:58donutsetspambayes_score: 0.0420107 -> 0.0420107
messageid: <1212373498.52.0.766248013838.issue3026@psf.upfronthosting.co.za>
2008-06-02 02:24:56donutlinkissue3026 messages
2008-06-02 02:24:56donutcreate