Author rhettinger
Recipients Mariatta, dfrojas, eli.bendersky, lukasz.langa, matrixise, mdk, nedbat, rhettinger, scoder, serhiy.storchaka, sivert, taleinat, vstinner
Date 2019-03-19.16:58:02
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1553014682.17.0.534609190401.issue34160@roundup.psfhosted.org>
In-reply-to
Content
> how that's acceptable?

For docutils, we'll most likely propose some variant of St├ęphane Wirtel's script to test semantic equivalence for docutils.  For other cases, Serhiy is working on a C14N canonicalization tool which is specifically designed for the task of creating reproducible output, in a cross-language standards compliant way.

As Stefan Behnel clearly articulated, there are multiple reasons why Python should not guarantee byte-for-byte serialization across point releases.  That said, we'll likely make the guarantee across micro-releases.  That will make it possible a third mitigation strategy of generating new baseline files for a new point releases and adding a version check to decide which baseline to test against.

FWIW, we had a similar discussion regarding hash randomization.  While there are a number of significant differences, the outcome is relevantL  User tests that depended on non-guaranteed implementation details had to be fixed.
History
Date User Action Args
2019-03-19 16:58:02rhettingersetrecipients: + rhettinger, scoder, vstinner, taleinat, nedbat, eli.bendersky, lukasz.langa, serhiy.storchaka, matrixise, sivert, mdk, Mariatta, dfrojas
2019-03-19 16:58:02rhettingersetmessageid: <1553014682.17.0.534609190401.issue34160@roundup.psfhosted.org>
2019-03-19 16:58:02rhettingerlinkissue34160 messages
2019-03-19 16:58:02rhettingercreate