msg201860 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2013-10-31 23:39 |
Each Python thread holds references to objects, in its current frame for example. At Python shutdown, clearing threads state happens very late: the import machinery is already dead, types are finalized, etc. If a thread has an object with a destructor, bad things may happen.
For example, when open files are destroyed, a warning is emitted. The problem is that you cannot emits warnings because the warnings module has been unloaded, and so you miss warnings. See the issue #19442 for this specific case.
It is possible to clear the Python threads earlier since Python 3.2, because Python threads will now exit cleanly when they try to acquire the GIL: see PyEval_RestoreThread(). The value of the tstate pointer is used in PyEval_RestoreThread(), but the pointer is not dereferenced (the content of a Python state is not read). So it is even possible to free the memory of the threads state (not only to clear the threads state).
Attached patch implements destroy the all threads except the current thread, and clear the state of the current thread.
The patch calls also the garbage collector before flushing stdout and stderr, and disable signal handlers just before PyImport_Cleanup(). The main idea is to reorder functions like this:
- clear state of all threads to release strong references -> may call destructores
- force a garbage collector to release more references -> may call destructores
- flush stdout and stderr -> write pending warnings and any other buffered messages
- disable signal handler -> only at the end so previous steps can still be interrupted by CTRL+c
The side effect of the patch is that the destructor of destroyed objects are now called, especially for daemon threads. If you try the warn_shutdown.py script attached to #19442, you now get the warning with the patch! As a result, test_threading.test_4_daemon_threads() is also modified by my patch to ignore warnings :-)
If I understood correctly, the patch only has an impact on daemon threads, the behaviour of classic threads is unchanged.
|
msg202155 - (view) |
Author: Serhiy Storchaka (serhiy.storchaka) * |
Date: 2013-11-04 19:07 |
Is it possible to write a test for this behavior?
|
msg202156 - (view) |
Author: Antoine Pitrou (pitrou) * |
Date: 2013-11-04 19:12 |
I'm afraid clearing thread states is a bit too brutal. What if some destructor relies on contents of the thread states (e.g. thread locals)?
|
msg202164 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2013-11-04 20:22 |
2013/11/4 Antoine Pitrou <report@bugs.python.org>:
> I'm afraid clearing thread states is a bit too brutal. What if some destructor relies on contents of the thread states (e.g. thread locals)?
When Py_Finalize() is called, only one Python thread hold the GIL.
After _Py_Finalizing=tstate is set, no other thread can hold the GIL.
If another Python tries to lock the GIL, it is "killed" by
PyEval_RestoreThread().
Is that correct? If yes, in which thread would the destructor be
called? Would it read Python thread locals without holding the GIL?
|
msg202165 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2013-11-04 20:24 |
2013/11/4 Serhiy Storchaka <report@bugs.python.org>:
> Is it possible to write a test for this behavior?
It is possible to test it manually using warn_shutdown.py attached to
#19442. The warnings module may be used to test the change.
|
msg202166 - (view) |
Author: Antoine Pitrou (pitrou) * |
Date: 2013-11-04 20:28 |
I am referring to the following part of your patch:
+ /* Clear the state of the current thread */
+ PyThreadState_Clear(tstate);
You are clearing the thread state of the currently executing thread, which doesn't sound right.
|
msg202169 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2013-11-04 20:56 |
> You are clearing the thread state of the currently executing
> thread, which doesn't sound right.
Oh, I didn't realize that PyThreadState_Clear() clears also Python thread locals.
Here is a new patch without PyThreadState_Clear(tstate) and with two unit tests:
- ensure that Python thread locals are not destroyed before destructors are called
- ensure that object destructors are called before Python thread states are destroyed
|
msg202667 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2013-11-12 02:13 |
neologix made a review on rietveld:
http://bugs.python.org/review/19466/#ps9818
There was an issue in finalize_threads-2.patch: the test didn't pass in release mode. I fixed it by adding -Wd to the command line option.
I also replaced threading.Lock with threading.Event in the unit test to synchronize the two threads.
=> finalize_threads-3.patch
|
msg202699 - (view) |
Author: Roundup Robot (python-dev) |
Date: 2013-11-12 15:45 |
New changeset c2a13acd5e2b by Victor Stinner in branch 'default':
Close #19466: Clear the frames of daemon threads earlier during the Python
http://hg.python.org/cpython/rev/c2a13acd5e2b
|
msg202704 - (view) |
Author: Vajrasky Kok (vajrasky) * |
Date: 2013-11-12 16:17 |
Typo:
s/immediatly/immediately.
I attached the patch.
|
msg202705 - (view) |
Author: Roundup Robot (python-dev) |
Date: 2013-11-12 16:19 |
New changeset 10a8e676b87b by Victor Stinner in branch 'default':
Issue #19466: Fix typo. Patch written by Vajrasky Kok.
http://hg.python.org/cpython/rev/10a8e676b87b
|
msg202912 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2013-11-15 00:47 |
The fix raised a new issue: #19565.
I also saw a crash on Windows which is probably related:
http://buildbot.python.org/all/builders/x86 Windows Server 2003 [SB] 3.x/builds/1717/steps/test/logs/stdio
======================================================================
FAIL: test_4_daemon_threads (test.test_threading.ThreadJoinOnShutdown)
----------------------------------------------------------------------
Traceback (most recent call last):
File "E:\Data\buildslave\cpython\3.x.snakebite-win2k3r2sp2-x86\build\lib\test\test_threading.py", line 783, in test_4_daemon_threads
rc, out, err = assert_python_ok('-c', script)
File "E:\Data\buildslave\cpython\3.x.snakebite-win2k3r2sp2-x86\build\lib\test\script_helper.py", line 69, in assert_python_ok
return _assert_python(True, *args, **env_vars)
File "E:\Data\buildslave\cpython\3.x.snakebite-win2k3r2sp2-x86\build\lib\test\script_helper.py", line 55, in _assert_python
"stderr follows:\n%s" % (rc, err.decode('ascii', 'ignore')))
AssertionError: Process return code is 3221225477, stderr follows:
----------------------------------------------------------------------
|
msg206009 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2013-12-13 01:47 |
I'm unable to reproduce the test_4_daemon_threads failure, can anyone try on Windows?
|
msg206028 - (view) |
Author: Charles-François Natali (neologix) * |
Date: 2013-12-13 08:30 |
Hum...
Correct me if I'm wrong, but destroying the thread state of daemon threads
while they're running is really a bad idea in fact: for example, if
warnings are now emitted for unclosed file objects, this means that the
file object, and all associated buffers, are destroyed.
But even though the daemon thread doesn't hold the GIL, he might still be
using this object.
For example, if we have:
daemon thread:
readinto:
release_gil
read(fd, fileobj->buffer, fileobj->size)
acquire_gil
and the main thread exits, then fileobj will be deallocated.
It's actually fairly easy to write a short crasher launching a deamon
thread reading from e.g. /dev/zero in loop, with the main thread exiting
right in the middle.
|
msg206032 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2013-12-13 09:35 |
"It's actually fairly easy to write a short crasher launching a deamon
thread reading from e.g. /dev/zero in loop, with the main thread exiting
right in the middle."
I'm unable to write such crasher, can you please give an example? Are you able to crash Python on Linux or on Python 3.3?
Does the threading test makes sense?
|
msg210993 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2014-02-11 17:52 |
I didn't see test_threading failing anymore recently, so I'm closing the issue. Thanks.
|
msg210998 - (view) |
Author: Charles-François Natali (neologix) * |
Date: 2014-02-11 18:55 |
> I didn't see test_threading failing anymore recently, so I'm closing the
issue. Thanks.
Hm...
Could someone review my message http://bugs.python.org/msg206028?
Because AFAICT, this will lead to random crashes with daemon threads.
|
msg211011 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2014-02-11 20:33 |
> Could someone review my message http://bugs.python.org/msg206028?
I replied with questions and you didn't reply :-)
|
msg211072 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2014-02-12 09:55 |
A new bug was found because of changes of this issue: #20526. IMO it is safer to revert changes of this issue in Python 3.4, it's too late to analyze such tricky bug. See #20526 for the discussion.
|
msg211149 - (view) |
Author: Roundup Robot (python-dev) |
Date: 2014-02-13 11:52 |
New changeset 1166b3321012 by Victor Stinner in branch 'default':
Issue #20526, #19466: Revert changes of issue #19466 which introduces a
http://hg.python.org/cpython/rev/1166b3321012
|
msg211154 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2014-02-13 13:35 |
I reverted the changes because it introduced regressions.
|
msg213823 - (view) |
Author: Roundup Robot (python-dev) |
Date: 2014-03-17 06:31 |
New changeset 9ce58a73b6b5 by Victor Stinner in branch '3.4':
Issue #20526, #19466: Revert changes of issue #19466 which introduces a
http://hg.python.org/cpython/rev/9ce58a73b6b5
|
msg363651 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2020-03-08 11:15 |
The change has been reverted in 2014. I reopen the issue since using daemon thraeds became safer in Python 3.9. As soon as a daemon thread attempts to take the GIL, it does exit immediately.
With the commit eb4e2ae2b8486e8ee4249218b95d94a9f0cc513e (bpo-39877), it should be even more reliable.
Let me retry to fix this issue again. I reopen the issue and I wrote a new PR.
|
msg363654 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2020-03-08 11:29 |
asyncio_gc.py: script to reproduce bpo-20526 (copied from there), but ported to Python 3.9 (replace asyncio.async with asyncio.ensure_future).
Sadly, Python with PR 18848 does crash when running asyncio_gc.py if I press CTRL+c twice:
* A thread does crash in _PyObject_GC_TRACK_impl() called indirectly by select_epoll_poll_impl(), in a Python thread
* while main thread is exiting Python: Py_FinalizeEx() is calling _PyImport_Cleanup() which blocks on take_gil()
It seems like the main thread does *not* hold the hold, the thread which crashed does.
It should not happen: _PyRuntimeState_SetFinalizing(runtime, tstate) has been called and so no other thread should be able to take the GIL anymore, but exit immediately.
|
msg363779 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2020-03-09 22:37 |
New changeset 9ad58acbe8b90b4d0f2d2e139e38bb5aa32b7fb6 by Victor Stinner in branch 'master':
bpo-19466: Py_Finalize() clears daemon threads earlier (GH-18848)
https://github.com/python/cpython/commit/9ad58acbe8b90b4d0f2d2e139e38bb5aa32b7fb6
|
msg363781 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2020-03-09 22:44 |
I merged more changes in bpo-39877 which made possible to finally fix this issue.
|
|
Date |
User |
Action |
Args |
2022-04-11 14:57:52 | admin | set | github: 63665 |
2020-03-09 22:44:45 | vstinner | set | status: open -> closed versions:
+ Python 3.9, - Python 3.4 messages:
+ msg363781
components:
+ Interpreter Core resolution: fixed stage: patch review -> resolved |
2020-03-09 22:37:53 | vstinner | set | messages:
+ msg363779 |
2020-03-08 17:26:52 | vstinner | set | pull_requests:
+ pull_request18212 |
2020-03-08 17:26:27 | vstinner | set | files:
+ asyncio_gc.py |
2020-03-08 11:29:18 | vstinner | set | messages:
+ msg363654 |
2020-03-08 11:20:05 | vstinner | set | stage: resolved -> patch review pull_requests:
+ pull_request18205 |
2020-03-08 11:15:55 | vstinner | set | status: closed -> open resolution: fixed -> (no value) messages:
+ msg363651
|
2014-03-17 06:31:03 | python-dev | set | messages:
+ msg213823 |
2014-02-13 13:35:11 | vstinner | set | status: open -> closed resolution: fixed messages:
+ msg211154
|
2014-02-13 11:52:29 | python-dev | set | messages:
+ msg211149 |
2014-02-12 09:55:27 | vstinner | set | status: closed -> open resolution: fixed -> (no value) messages:
+ msg211072
|
2014-02-11 20:33:45 | vstinner | set | messages:
+ msg211011 |
2014-02-11 18:55:56 | neologix | set | messages:
+ msg210998 |
2014-02-11 17:52:39 | vstinner | set | status: open -> closed resolution: fixed messages:
+ msg210993
|
2013-12-13 09:35:20 | vstinner | set | messages:
+ msg206032 |
2013-12-13 08:30:38 | neologix | set | messages:
+ msg206028 |
2013-12-13 01:47:42 | vstinner | set | messages:
+ msg206009 |
2013-11-15 00:47:29 | vstinner | set | status: closed -> open resolution: fixed -> (no value) messages:
+ msg202912
|
2013-11-12 16:19:00 | python-dev | set | messages:
+ msg202705 |
2013-11-12 16:17:01 | vajrasky | set | files:
+ fix_typo_19466.patch nosy:
+ vajrasky messages:
+ msg202704
|
2013-11-12 15:45:41 | python-dev | set | status: open -> closed
nosy:
+ python-dev messages:
+ msg202699
resolution: fixed stage: resolved |
2013-11-12 02:13:13 | vstinner | set | files:
+ finalize_threads-3.patch
messages:
+ msg202667 |
2013-11-04 20:56:45 | vstinner | set | files:
+ finalize_threads-2.patch
messages:
+ msg202169 |
2013-11-04 20:28:21 | pitrou | set | nosy:
+ neologix messages:
+ msg202166
|
2013-11-04 20:24:01 | vstinner | set | messages:
+ msg202165 |
2013-11-04 20:22:12 | vstinner | set | messages:
+ msg202164 |
2013-11-04 19:12:17 | pitrou | set | messages:
+ msg202156 |
2013-11-04 19:07:31 | serhiy.storchaka | set | messages:
+ msg202155 |
2013-11-04 10:47:29 | vstinner | set | nosy:
+ serhiy.storchaka
|
2013-11-01 00:38:57 | Arfrever | set | nosy:
+ Arfrever
|
2013-10-31 23:39:46 | vstinner | create | |