
Author malin
Recipients Esa.Peuha, Jeffrey.Kintscher, akira, gregory.p.smith, josh.r, kenorb, malin, maubp, miss-islington, nadeem.vawda, peremen, serhiy.storchaka, vnummela
Date 2019-09-14.04:31:45
Message-id <1568435505.34.0.581997018173.issue21872@roundup.psfhosted.org>
Content
Some notes:

1. In liblzma, the missing bytes are copied inside the `dict_repeat` function:

        case SEQ_COPY:
                // Repeat len bytes from distance of rep0.
                if (unlikely(dict_repeat(&dict, rep0, &len))) {

See liblzma's source code (xz-5.2 branch):
https://git.tukaani.org/?p=xz.git;a=blob;f=src/liblzma/lzma/lzma_decoder.c
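
To make this concrete, here is a minimal Python model of a repeat-copy step of this kind. It is a sketch, not liblzma's code: the name `repeat_from_history`, its parameters, and the convention that a distance of 1 means the most recently written byte are assumptions made for illustration. As I understand it, the relevant behaviour is that when the output window fills up in the middle of a copy, the rest of the length stays pending in the decoder, so those bytes only appear once the caller offers more output space.

```python
def repeat_from_history(out: bytearray, distance: int, length: int, limit: int) -> int:
    """Append up to `length` bytes, each copied from `distance` bytes back in `out`.

    Copying stops once `out` holds `limit` bytes. The return value is the
    number of bytes that are still pending (0 means the copy finished).
    Hypothetical helper: a simplified model, not liblzma's dict_repeat.
    """
    if not 1 <= distance <= len(out):
        raise ValueError("distance points outside the history written so far")
    room = max(limit - len(out), 0)
    count = min(room, length)
    for _ in range(count):
        # Byte-by-byte copy so that overlapping ranges repeat correctly.
        out.append(out[-distance])
    return length - count


window = bytearray(b"abc")
# Only 5 bytes of room are left, so 2 of the 7 requested bytes stay pending.
pending = repeat_from_history(window, distance=3, length=7, limit=8)
# With more room, the pending bytes can be produced without any new input.
pending_after = repeat_from_history(window, distance=3, length=pending, limit=16)
```

In this model, a caller that stops as soon as its input is used up would never make the second call and would lose the last two bytes.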

2. Earlier replies noted that xz's command-line tools can extract the problematic files successfully.

This is because xz checks `if (avail_out == 0)` first, and only then checks `if (avail_in == 0)`.
See the `uncompress` function in this source file (xz-5.2 branch):
https://git.tukaani.org/?p=xz.git;a=blob;f=src/xzdec/xzdec.c;hb=refs/heads/v5.2

This check order happens to avoid the problem.
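
The same idea can be sketched from the Python side. The loop below is an illustration only, assuming a CPython version where `lzma.LZMADecompressor.decompress` accepts `max_length` and exposes `needs_input` and `eof`. The function name `decompress_in_chunks` and the chunk sizes are hypothetical. It keeps draining output with empty input for as long as the decompressor reports that it does not need more, and only treats the input as exhausted once the decompressor actually asks for more.

```python
import lzma


def decompress_in_chunks(payload: bytes, in_chunk: int = 64, out_chunk: int = 32) -> bytes:
    """Decompress `payload` using a bounded output size per call.

    Hypothetical helper for illustration; the chunk sizes are arbitrary.
    """
    decomp = lzma.LZMADecompressor()
    out = bytearray()
    pos = 0
    while not decomp.eof:
        if decomp.needs_input:
            # Only now does running out of input count as an error.
            if pos >= len(payload):
                raise EOFError("compressed stream ended before the end marker")
            chunk = payload[pos:pos + in_chunk]
            pos += len(chunk)
        else:
            # Output was limited on the previous call: drain what is
            # already buffered before feeding any new input.
            chunk = b""
        out += decomp.decompress(chunk, max_length=out_chunk)
    return bytes(out)


original = bytes(range(256)) * 40
restored = decompress_in_chunks(lzma.compress(original))
```

The design choice here is to let the decompressor's own state, rather than the position in the input, decide when the loop is finished.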