test_pty fails when using setsid() #82728

vstinner · 2019-10-21T11:38:40Z

BPO	38547
Nosy	@db3l, @vstinner, @pablogsal, @miss-islington
PRs	bpo-38547: Fix test_pty if the process is the session leader #17519 [3.8] bpo-38547: Fix test_pty if the process is the session leader (GH-17519) #17520 [3.7] bpo-38547: Fix test_pty if the process is the session leader (GH-17519) #17521

^{Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.}

Show more details

GitHub fields:

assignee = None
closed_at = <Date 2019-12-09.11:00:12.673>
created_at = <Date 2019-10-21.11:38:39.966>
labels = ['tests', '3.9']
title = 'test_pty fails when using setsid()'
updated_at = <Date 2019-12-09.11:15:27.344>
user = 'https://github.com/vstinner'

bugs.python.org fields:

activity = <Date 2019-12-09.11:15:27.344>
actor = 'miss-islington'
assignee = 'none'
closed = True
closed_date = <Date 2019-12-09.11:00:12.673>
closer = 'vstinner'
components = ['Tests']
creation = <Date 2019-10-21.11:38:39.966>
creator = 'vstinner'
dependencies = []
files = []
hgrepos = []
issue_num = 38547
keywords = ['patch']
message_count = 12.0
messages = ['355059', '357402', '357403', '357414', '357415', '357416', '357419', '358065', '358066', '358067', '358068', '358070']
nosy_count = 4.0
nosy_names = ['db3l', 'vstinner', 'pablogsal', 'miss-islington']
pr_nums = ['17519', '17520', '17521']
priority = 'normal'
resolution = 'fixed'
stage = 'resolved'
status = 'closed'
superseder = None
type = None
url = 'https://bugs.python.org/issue38547'
versions = ['Python 3.9']

vstinner · 2019-10-21T11:38:40Z

regrtest has been modified in bpo-38502 to use setsid() when using multiprocessing mode (-jN command line option).

Problem: David Bolen identified that test_pty started to fail on his bolen-ubuntu worker (Ubuntu 18.04.3) since my commit ecb035c.

https://buildbot.python.org/all/#/builders/141/builds/2679

0:19:05 load avg: 1.81 [234/419/1] test_pty crashed (Exit code -1) -- running: test_unicodedata (55.5 sec)

I can reproduce the issue locally:

---

$ ./python -m test -j2 test_pty -v
== CPython 3.9.0a0 (heads/urlparse_ipv6:cc733a8cb6, Oct 21 2019, 11:34:36) [GCC 9.2.1 20190827 (Red Hat 9.2.1-1)]
== Linux-5.2.18-200.fc30.x86_64-x86_64-with-glibc2.29 little-endian
== cwd: /home/vstinner/python/master/build/test_python_20242
== CPU count: 8
== encodings: locale=UTF-8, FS=utf-8
0:00:00 load avg: 0.70 Run tests in parallel using 2 child processes
0:00:00 load avg: 0.70 [1/1/1] test_pty crashed (Exit code -1)
test_basic (test.test_pty.PtyTest) ...

== Tests result: FAILURE ==

1 test failed:
test_pty

Total duration: 383 ms
Tests result: FAILURE
---

It's surprising that there is no output!

I would prefer to keep process groups in regrtest, it's really helpful to be able to kill all processes spawned by a test worker process.

I'm not sure how/why PTY depends is incompatible with setsid().

pablogsal · 2019-11-24T17:55:26Z

This also happens when running the test suite with high parallelism:

./python -m test -j 20

This fails with:

== Tests result: FAILURE ==

398 tests OK.

2 tests failed:
test_embed test_pty

pablogsal · 2019-11-24T17:58:04Z

Indeed, almost all buildbots have to repeat the test_pty.

I think this needs to be fixed or process groups in regrtest should be limited or reverted.

vstinner · 2019-11-24T22:16:04Z

I think this needs to be fixed or process groups in regrtest should be limited or reverted.

What do you mean by "limited"?

Process groups really help to prevent to leak grandchild processes in multiprocessing tests, when tests are interrupted manually by CTRL+C or by a timeout (sadly, only when the timeout is handled by regrtest, not when it's handled by faulthandler).

See bpo-38502 for the rationale.

pablogsal · 2019-11-24T22:46:50Z

What do you mean by "limited"?

I mean to deactivate it by default and make opt-in.

Process groups really help to prevent to leak grandchild processes in multiprocessing tests, when tests are interrupted manually by CTRL+C or by a timeout (sadly, only when the timeout is handled by regrtest, not when it's handled by faulthandler).

I love process groups and they are awesome. But having these test being re-run on every buildbot and failing on my machine when just running test with -j is very annoying.

vstinner · 2019-11-24T22:50:43Z

Can't we fix test_pty?

db3l · 2019-11-24T23:27:30Z

I think fixing the underlying pty issue should certainly be the goal, but the question is whether the process group change should remain active in the meantime, as its presence is causing a regression in the tests. I think such cases in the past are usually rolled back, right?

I was originally on the fence since process groups address a real problem, especially in interactive testing, while creating an arguably aesthetic issue for my case of the buildbots (a warning rather than failure).

But Pablo's point about a normal manual full test run failing (not a warning as with the buildbots) feels persuasive since that's probably as common as the issue being addressed by the change. Even if pre-existing, the pty failure is exposed by the process group change, so it might be best for the process group change to wait on fixing the pty issue.

I don't know how to weigh the relative impact though, e.g,. how many people are likely to run into each failure case. There's probably more people doing a normal test run than breaking out of such tests though. At the least, it's a worst impact than just the warnings on the buildbots.

Perhaps an intermediate fallback could be gating the process group change behind a regrtest option (opt-in) which could then preserve its benefits upon request, without negatively impacting the default test process, whether manual or on the buildbots.

At least until resource is available to resolve the pty issue.

vstinner · 2019-12-09T10:36:19Z

I think fixing the underlying pty issue should certainly be the goal (...)

I wrote PR 17519 which fix the bug. We just have to ignore SIGHUP signal.

vstinner · 2019-12-09T10:57:18Z

New changeset a1838ec by Victor Stinner in branch 'master':
bpo-38547: Fix test_pty if the process is the session leader (GH-17519)
a1838ec

vstinner · 2019-12-09T11:00:13Z

Ok, I merged my fix to master. The backport to 3.7 and 3.8 will follow quickly. I close the issue.

Sorry for the inconvenience ;-)

miss-islington · 2019-12-09T11:15:11Z

New changeset b9f4b49 by Miss Islington (bot) in branch '3.7':
bpo-38547: Fix test_pty if the process is the session leader (GH-17519)
b9f4b49

miss-islington · 2019-12-09T11:15:27Z

New changeset d08fd29 by Miss Islington (bot) in branch '3.8':
bpo-38547: Fix test_pty if the process is the session leader (GH-17519)
d08fd29

vstinner added 3.9 only security fixes tests Tests in the Lib/test dir labels Oct 21, 2019

vstinner closed this as completed Dec 9, 2019

ezio-melotti transferred this issue from another repository Apr 10, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test_pty fails when using setsid() #82728

test_pty fails when using setsid() #82728

vstinner commented Oct 21, 2019

vstinner commented Oct 21, 2019

pablogsal commented Nov 24, 2019

pablogsal commented Nov 24, 2019

vstinner commented Nov 24, 2019

pablogsal commented Nov 24, 2019

vstinner commented Nov 24, 2019

db3l commented Nov 24, 2019

vstinner commented Dec 9, 2019

vstinner commented Dec 9, 2019

vstinner commented Dec 9, 2019

miss-islington commented Dec 9, 2019

miss-islington commented Dec 9, 2019

test_pty fails when using setsid() #82728

test_pty fails when using setsid() #82728

Comments

vstinner commented Oct 21, 2019

vstinner commented Oct 21, 2019

pablogsal commented Nov 24, 2019

pablogsal commented Nov 24, 2019

vstinner commented Nov 24, 2019

pablogsal commented Nov 24, 2019

vstinner commented Nov 24, 2019

db3l commented Nov 24, 2019

vstinner commented Dec 9, 2019

vstinner commented Dec 9, 2019

vstinner commented Dec 9, 2019

miss-islington commented Dec 9, 2019

miss-islington commented Dec 9, 2019