regrtest/buildbot: test run marked as failure even when re-run succeeds #68939

zware · 2015-07-29T21:25:56Z

BPO	24751
Nosy	@db3l, @bitdancer, @zware
Files	issue24751.diff

^{Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.}

Show more details

GitHub fields:

assignee = 'https://github.com/zware'
closed_at = <Date 2015-08-09.03:05:42.344>
created_at = <Date 2015-07-29.21:25:56.440>
labels = ['easy', 'type-bug', 'tests']
title = 'regrtest/buildbot: test run marked as failure even when re-run succeeds'
updated_at = <Date 2015-08-09.03:05:48.065>
user = 'https://github.com/zware'

bugs.python.org fields:

activity = <Date 2015-08-09.03:05:48.065>
actor = 'python-dev'
assignee = 'zach.ware'
closed = True
closed_date = <Date 2015-08-09.03:05:42.344>
closer = 'zach.ware'
components = ['Tests']
creation = <Date 2015-07-29.21:25:56.440>
creator = 'zach.ware'
dependencies = []
files = ['40076']
hgrepos = []
issue_num = 24751
keywords = ['patch', 'easy', 'buildbot']
message_count = 7.0
messages = ['247632', '247634', '247728', '248013', '248291', '248308', '248311']
nosy_count = 4.0
nosy_names = ['db3l', 'r.david.murray', 'python-dev', 'zach.ware']
pr_nums = []
priority = 'low'
resolution = 'fixed'
stage = 'resolved'
status = 'closed'
superseder = None
type = 'behavior'
url = 'https://bugs.python.org/issue24751'
versions = ['Python 2.7', 'Python 3.4', 'Python 3.5', 'Python 3.6']

zware · 2015-07-29T21:25:56Z

The buildbots all run the test suite with the '-w', which re-runs any tests that failed in the main test sequence at a higher verbosity level. More often than not it seems the re-run tests succeed, but the exit code is still 1 so the build is marked as a failure.

The simplest action I'd like would be to exit(0) iff all re-run tests pass on the re-run. Alternatively, we could try to get a bit fancier and exit with some other return code, and adjust the build master to interpret that return code as "passed, with warnings" and mark the build as amber rather than red.

bitdancer · 2015-07-29T21:37:48Z

I think option 1 is to be preferred. One of the things we've been talking about for the workflow is gating on the buildbots passing, and the way that works with flaky tests is if the check fails, you just run the test again so you get a green and the patch can be gated in. So from that perspective if the tests pass on rerun the result is most useful if it is green.

Unless we want to say amber is OK for gating...but in terms of cognative load I think green is better. After all, our current green state is morally equivalent to running the tests again and having them pass..

zware · 2015-07-31T02:52:36Z

Here's a patch.

python-dev · 2015-08-05T03:00:20Z

New changeset 6987a9c7dde9 by Zachary Ware in branch '2.7':
Issue bpo-24751: When running regrtest with '-w', don't fail if re-run succeeds.
https://hg.python.org/cpython/rev/6987a9c7dde9

New changeset 9964edf2fd1e by Zachary Ware in branch '3.4':
Issue bpo-24751: When running regrtest with '-w', don't fail if re-run succeeds.
https://hg.python.org/cpython/rev/9964edf2fd1e

New changeset 9d1f6022261d by Zachary Ware in branch '3.5':
Issue bpo-24751: Merge with 3.4
https://hg.python.org/cpython/rev/9d1f6022261d

New changeset 6f67c74608b6 by Zachary Ware in branch 'default':
Closes bpo-24751: Merge with 3.5
https://hg.python.org/cpython/rev/6f67c74608b6

db3l · 2015-08-08T19:48:48Z

While running a manual test (make buildbottest) on my 2.7 Ubuntu buildbot, I ran into an exception in this patch:

The tail end of the test run:

[401/401/1] test_signal
379 tests OK.
1 test failed:
    test_curses
21 tests skipped:
    test_aepack test_al test_applesingle test_bsddb185 test_cd test_cl
    test_dl test_gl test_imgfile test_kqueue test_linuxaudiodev
    test_macos test_macostools test_msilib test_ossaudiodev
    test_scriptpackages test_startfile test_sunaudiodev test_winreg
    test_winsound test_zipfile64
Those skips are all expected on linux2.
Re-running failed tests in verbose mode
Traceback (most recent call last):
  File "./Lib/test/regrtest.py", line 1598, in <module>
    main()
  File "./Lib/test/regrtest.py", line 655, in main
    for test in bad[:]:
TypeError: 'set' object has no attribute '__getitem__'

The code is attempting to iterate over a sliced copy of bad (bad[:]) due to later possible mutation, but by that point, if you had failures, bad is a set, from the block shortly above where it subtracts out the environment changed list. I was testing 2.7, but I think the issue affects all branches.

Perhaps list(bad) instead of bad[:]?

zware · 2015-08-09T02:24:28Z

Ah. The problem is on 2.7 only; 3.x calls sorted() on the set operation. The set operation should just go away, though; we don't count ENV_CHANGED as 'bad' anymore.

Will fix shortly.

python-dev · 2015-08-09T03:05:48Z

New changeset 7d69b214e668 by Zachary Ware in branch '2.7':
Issue bpo-24751: Fix running regrtest with '-w' flag in case of test failures.
https://hg.python.org/cpython/rev/7d69b214e668

zware added the easy label Jul 29, 2015

zware added tests Tests in the Lib/test dir type-bug An unexpected behavior, bug, or error labels Jul 31, 2015

python-dev mannequin closed this as completed Aug 5, 2015

bitdancer reopened this Aug 8, 2015

zware closed this as completed Aug 9, 2015

zware self-assigned this Aug 9, 2015

ezio-melotti transferred this issue from another repository Apr 10, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

regrtest/buildbot: test run marked as failure even when re-run succeeds #68939

regrtest/buildbot: test run marked as failure even when re-run succeeds #68939

zware commented Jul 29, 2015

zware commented Jul 29, 2015

bitdancer commented Jul 29, 2015

zware commented Jul 31, 2015

python-dev mannequin commented Aug 5, 2015

db3l commented Aug 8, 2015

zware commented Aug 9, 2015

python-dev mannequin commented Aug 9, 2015

regrtest/buildbot: test run marked as failure even when re-run succeeds #68939

regrtest/buildbot: test run marked as failure even when re-run succeeds #68939

Comments

zware commented Jul 29, 2015

zware commented Jul 29, 2015

bitdancer commented Jul 29, 2015

zware commented Jul 31, 2015

python-dev mannequin commented Aug 5, 2015

db3l commented Aug 8, 2015

zware commented Aug 9, 2015

python-dev mannequin commented Aug 9, 2015