Message218961
I ran into this:
>>> gzout = gzip.open('test.gz','wb')
>>> gzout.write('abcdefgh') # write expects bytes not str
Traceback (most recent call last):
File "<pyshell#2>", line 1, in <module>
gzout.write('abcdefgh')
File "/usr/lib/python3.4/gzip.py", line 343, in write
self.crc = zlib.crc32(data, self.crc) & 0xffffffff
TypeError: 'str' does not support the buffer interface
>>> gzout.write(b'abcdefgh') # ok, use bytes instead
8
>>> gzout.close()
But now the file is not recognized as valid gzip format anymore (neither by the gzip module nor by external software):
>>> gzin = gzip.open('test.gz','rb')
>>> next(gzin)
Traceback (most recent call last):
File "<pyshell#32>", line 1, in <module>
next(gzin)
File "/usr/lib/python3.4/gzip.py", line 594, in readline
c = self.read(readsize)
File "/usr/lib/python3.4/gzip.py", line 365, in read
if not self._read(readsize):
File "/usr/lib/python3.4/gzip.py", line 465, in _read
self._read_eof()
File "/usr/lib/python3.4/gzip.py", line 487, in _read_eof
raise OSError("Incorrect length of data produced")
OSError: Incorrect length of data produced
Turns out that gzip.write increased the ISIZE field value by 8 already during the failed call with the str object, so it is now 16 instead of 8:
>>> raw = open('test.gz','rb')
>>> [n for n in raw.read()] # ISIZE is the fourth last element
[31, 139, 8, 8, 51, 46, 127, 83, 2, 255, 116, 101, 115, 116, 0, 75, 76, 74, 78, 73, 77, 75, 207, 0, 0, 80, 42, 239, 174, 16, 0, 0, 0]
in other words: gzip.GzipFile.write() leaps (and modifies) before it checks its input argument. |
|
Date |
User |
Action |
Args |
2014-05-23 11:22:57 | wolma | set | recipients:
+ wolma |
2014-05-23 11:22:57 | wolma | set | messageid: <1400844177.32.0.625819528356.issue21560@psf.upfronthosting.co.za> |
2014-05-23 11:22:57 | wolma | link | issue21560 messages |
2014-05-23 11:22:55 | wolma | create | |
|