
Author Eric.Wolf
Recipients Eric.Wolf, neologix, niemeyer, wrobell
Date 2011-03-01.21:22:24
Message-id <1299014545.23.0.590778714643.issue10900@psf.upfronthosting.co.za>
Content
Stupid questions are always worth asking. I did check the MD5 sum earlier and just checked it again (since I copied the file from one machine to another):

ebwolf@ubuntu:/opt$ md5sum /host/full-planet-110115-1800.osm.bz2 
0e3f81ef0dd415d8f90f1378666a400c  /host/full-planet-110115-1800.osm.bz2
ebwolf@ubuntu:/opt$ cat full-planet-110115-1800.osm.bz2.md5 
0e3f81ef0dd415d8f90f1378666a400c  full-planet-110115-1800.osm.bz2
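
For anyone who wants to repeat this check without md5sum, here is a rough Python sketch of the same comparison. The helper name md5_of_file and the 1 MiB chunk size are my own choices for illustration, not anything from the bz2 module; it reads the file in pieces so a file of this size never has to fit in memory.

```python
import hashlib


def md5_of_file(path, chunk_size=1024 * 1024):
    """Return the hex MD5 digest of a file, reading it in fixed-size chunks.

    md5_of_file and chunk_size are illustrative names, not part of any library.
    """
    digest = hashlib.md5()
    with open(path, "rb") as f:
        # f.read() returns b"" at end of file, which stops the iterator.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

The result can be compared against the first field of the .md5 file shown above.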

There you have it. I was able to convert the bz2 to gzip with no errors:

bzcat full-planet-110115-1800.osm.bz2 | gzip > full-planet.osm.gz
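
A Python version of that pipeline might look roughly like the sketch below. It assumes Python 3.3 or later (for bz2.open), and the function name bz2_to_gzip and the chunk size are made up for illustration. I have not run it against the planet file, so treat it as a starting point rather than a tested replacement for bzcat.

```python
import bz2
import gzip
import shutil


def bz2_to_gzip(src_path, dst_path, chunk_size=1024 * 1024):
    """Stream-decompress a .bz2 file and recompress it as .gz.

    bz2_to_gzip is a hypothetical helper; only chunk_size bytes of
    decompressed data are held in memory at a time.
    """
    with bz2.open(src_path, "rb") as src, gzip.open(dst_path, "wb") as dst:
        shutil.copyfileobj(src, dst, chunk_size)
```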

FYI: This problem came up last year with no resolution:

http://mail.python.org/pipermail/tutor/2010-February/074610.html

Thanks for looking at this. Let me know if there's anything else you'd like me to try. In general, is it best to always read the same number of bytes? And what is the best value to pass for buffering in BZ2File? I just made up something hoping it would work.
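
To make the reading question concrete, this is the kind of loop I have in mind, written as a sketch for Python 3.3 or later. The function name count_decompressed_bytes and the 64 KiB chunk size are arbitrary choices of mine, and the sketch leaves buffering at its default. As far as I understand it, read(n) may return fewer than n bytes and returns an empty bytes object at end of file, so the loop only checks for an empty result and does not depend on every read being the same size.

```python
import bz2


def count_decompressed_bytes(path, chunk_size=64 * 1024):
    """Count the decompressed bytes in a .bz2 file using chunked reads.

    count_decompressed_bytes and chunk_size are illustrative, not library names.
    """
    total = 0
    with bz2.open(path, "rb") as f:
        while True:
            chunk = f.read(chunk_size)
            if not chunk:  # empty bytes object signals end of file
                break
            total += len(chunk)
    return total
```

Comparing the returned total with the output of bzcat piped to wc -c would show whether the Python reader stops early.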

I'm still waiting for the bzcat to /dev/null to finish.
History
Date                 User       Action  Args
2011-03-01 21:22:25  Eric.Wolf  set     recipients: + Eric.Wolf, niemeyer, wrobell, neologix
2011-03-01 21:22:25  Eric.Wolf  set     messageid: <1299014545.23.0.590778714643.issue10900@psf.upfronthosting.co.za>
2011-03-01 21:22:24  Eric.Wolf  link    issue10900 messages
2011-03-01 21:22:24  Eric.Wolf  create