This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

classification
Title: expose posix_spawn(p)
Type: enhancement Stage: resolved
Components: Extension Modules Versions: Python 3.8
process
Status: closed Resolution: fixed
Dependencies: 33191 33441 Superseder:
Assigned To: gregory.p.smith Nosy List: John Jones, alex, benjamin.peterson, dhduvall, gennad, gregory.p.smith, martin.panter, miss-islington, ned.deily, neologix, pablogsal, serhiy.storchaka, vstinner
Priority: Keywords: patch

Created on 2014-01-01 18:13 by benjamin.peterson, last changed 2022-04-11 14:57 by admin. This issue is now closed.

Pull Requests
URL Status Linked Edit
PR 5109 merged pablogsal, 2018-01-06 01:17
PR 5109 merged pablogsal, 2018-01-06 01:17
PR 5418 merged pablogsal, 2018-01-29 08:36
PR 6332 merged serhiy.storchaka, 2018-04-01 16:23
PR 6334 merged gregory.p.smith, 2018-04-01 18:55
PR 6336 merged miss-islington, 2018-04-01 19:02
PR 6673 merged miss-islington, 2018-05-01 13:45
PR 6693 merged pablogsal, 2018-05-02 10:54
PR 6725 merged serhiy.storchaka, 2018-05-08 09:47
PR 6794 merged pablogsal, 2018-05-14 14:08
Messages (70)
msg207137 - (view) Author: Benjamin Peterson (benjamin.peterson) * (Python committer) Date: 2014-01-01 18:13
posix_spawn is a nice, efficient replacement for fork()/exec(). We should expose it and possibly use it in subprocess where possible.
msg207139 - (view) Author: Gregory P. Smith (gregory.p.smith) * (Python committer) Date: 2014-01-01 21:17
Unless it could replace the fork+exec code path in its entirety, which I do not believe is possible, I see posix_spawn() as a distraction and additional maintenance burden with no benefit.

http://pubs.opengroup.org/onlinepubs/7999959899/functions/posix_spawn.html

Read the RATIONALE section.  The posix_spawn API was not created to make subprocess creation easier (i'd argue that it is the same burden to setup a proper call to posix_spawn as it is to do everything right for fork and exec).

One notable thing posix_spawn() does not support: setsid() (start_new_session=True) of the child process.  Obviously it also couldn't handle the arbitrary preexec_fn but preexec_fn is in general considered harmful.
msg213344 - (view) Author: Benjamin Peterson (benjamin.peterson) * (Python committer) Date: 2014-03-13 02:25
Okay, this seems like a bad idea.
msg222570 - (view) Author: Danek Duvall (dhduvall) Date: 2014-07-08 17:23
Our project (the Solaris packaging system, IPS), relies on posix_spawn() primarily for the ability to fork without making a large memory reservation (and possibly failing) because the forking process was itself very large.  That's the (a?) bug benefit of posix_spawn() -- it's not a benefit for the programmer using it (who might have to fall back to fork/exec), but for the end-user that benefits from its streamlined operation.

You're right that it doesn't handle everything that subprocess.Popen() does -- though at least on Solaris there's a way to change the cwd in the file actions, and I'm sure we'd consider adding a way to do the setsid() as well.  The rest should be possible cross-platform.
msg222571 - (view) Author: Alex Gaynor (alex) * (Python committer) Date: 2014-07-08 17:26
Danek, you might find https://github.com/dreid/posix_spawn/ useful, it provides bindings and a public API over posix_spawn (it's not complete yet, but if there's stuff missing, feel free to file a ticket!)
msg222572 - (view) Author: Danek Duvall (dhduvall) Date: 2014-07-08 17:38
Cool.  We implemented our own version as a straight-up native module (https://java.net/projects/ips/sources/pkg-gate/content/src/modules/pspawn.c), and our Popen replacement is not at present a complete replacement for the one in the stdlib, but it does what we need it to do.  We'd absolutely switch if it came in to the stdlib, though, and I think there would be plenty of programs that would benefit from it (certainly a number of large Python programs in Solaris have run into exactly this problem).
msg269684 - (view) Author: Danek Duvall (dhduvall) Date: 2016-07-01 19:02
Oh, for what it's worth, Solaris added setsid support to posix_spawn a few years ago, as a result of this conversation.  I still think it would be worthwhile supporting this in the stdlib, since we keep running into processes which have a lot of memory reserved and can't fork some small program because they're running in a memory-limited environment.
msg289758 - (view) Author: John Jones (John Jones) Date: 2017-03-17 12:08
To prevent subprocess/os.fork() doubling my memory after loading a large numpy array into memory, I now have to start my script with 650 calls to subprocess.Popen(), which just sit their waiting for some stdin to start doing something. This doesn't happen until after the numpy array is loaded, else I quickly run out of memory.

I think this sort of situation results in more unnecessary complexity compared to using posix_spawn, in spite of the concerns about using posix_spawn. Perhaps subprocess.Popen just needs a "posix_spawn=True" parameter?
msg289769 - (view) Author: Gregory P. Smith (gregory.p.smith) * (Python committer) Date: 2017-03-17 16:33
I think someone wanting this will need to put forward a patch adding it to be reviewed and mulled over.  As Alex mentioned in msg22571 - https://github.com/dreid/posix_spawn/ exists as does the code Danek pointed at in the next comment.  try those.

I suggest someone who actively cares about a limited available process address space environment contribute this.  (ie: not your typical 64-bit system)

fork()+exec() does not cause significant memory allocation, only a brief ~doubling of mapped address space, with backing pages being the originals just marked copy on write, but never written to by the child. The exec undoes that mapping.

It is technically possible to cause the copy on writes to happen if you immediately write to a large amount of memory in the parent process after the fork has happened before the exec has, but that seems like a rare timing problem that could even be worked around by monitoring the forked child to see that the exec has occurred before continuing.
msg289775 - (view) Author: John Jones (John Jones) Date: 2017-03-17 18:49
I agree with everything you're saying Gregory, however I don't think the significance of the memory doubling is as inconsequential as you might first think. For example, i have on my 64bit Linux system 128Gb of RAM, and a numpy table that's around 70Gb. Spawning a subprocess, even though memory is doubled for a very short period of time, is enough to raise a MemoryError, despite the subprocess i'm spawning using only 2 or 3Mb after the exec().

I do appreciate that for most Python users however, they will not see much benefit from what I imagine is quite a lot of development work.

FWIW, I did try the posix_spawn module, but i couldn't figure out how to write data to the stdin of a posix_spawn subprocess, and gave up in place of the commonly recommended solution to this problem (via StackExchange) of spawning lots of subprocesses before you put stuff in memory. Fortunately, for my problem, this was a possible solution. For others I think they're going to have to use posix_spawn, or an entirely different programming language if that doesn't work.
msg289791 - (view) Author: Gregory P. Smith (gregory.p.smith) * (Python committer) Date: 2017-03-18 02:34
All I'm really saying is that someone who wants this should provide a
patch/PR with unittests. :)  I can help review and go from there.

It does make sense to me for it to be available as part of the subprocess
API if it is available at all, likely an alternative implementation of or
behavior flag to _posixsubprocess.fork_exec() with appropriate autoconf and
conditional compilation based on availability ifdefs.

On Fri, Mar 17, 2017 at 11:49 AM John Jones <report@bugs.python.org> wrote:

>
> John Jones added the comment:
>
> I agree with everything you're saying Gregory, however I don't think the
> significance of the memory doubling is as inconsequential as you might
> first think. For example, i have on my 64bit Linux system 128Gb of RAM, and
> a numpy table that's around 70Gb. Spawning a subprocess, even though memory
> is doubled for a very short period of time, is enough to raise a
> MemoryError, despite the subprocess i'm spawning using only 2 or 3Mb after
> the exec().
>
> I do appreciate that for most Python users however, they will not see much
> benefit from what I imagine is quite a lot of development work.
>
> FWIW, I did try the posix_spawn module, but i couldn't figure out how to
> write data to the stdin of a posix_spawn subprocess, and gave up in place
> of the commonly recommended solution to this problem (via StackExchange) of
> spawning lots of subprocesses before you put stuff in memory. Fortunately,
> for my problem, this was a possible solution. For others I think they're
> going to have to use posix_spawn, or an entirely different programming
> language if that doesn't work.
>
> ----------
>
> _______________________________________
> Python tracker <report@bugs.python.org>
> <http://bugs.python.org/issue20104>
> _______________________________________
>
msg311040 - (view) Author: Gregory P. Smith (gregory.p.smith) * (Python committer) Date: 2018-01-29 01:56
New changeset 6c6ddf97c402709713d668d0ed53836a7749ba99 by Gregory P. Smith (Pablo Galindo) in branch 'master':
bpo-20104: Expose `posix_spawn` in the os module (GH-5109)
https://github.com/python/cpython/commit/6c6ddf97c402709713d668d0ed53836a7749ba99
msg311041 - (view) Author: Gregory P. Smith (gregory.p.smith) * (Python committer) Date: 2018-01-29 01:58
remaining work before closing this:
  Doc/library/os.rst needs to describe os.posix_spawn
msg311051 - (view) Author: Martin Panter (martin.panter) * (Python committer) Date: 2018-01-29 03:54
Pablo’s code looked unfinished to me. As well as missing documentation, I suspect there may be memory leaks and poor error handling.

The two calls above the “fail:” label look like dead code. The “parse_envlist” result appears to be leaked.

I’m curious why you never call “posix_spawn_file_actions_destroy”. I saw on Open BSD <https://man.openbsd.org/posix_spawn_file_actions_init.3> it reclaims memory, and it seems sensible to call it on other platforms as well.

No error checking on any of the “posix_spawn_file_actions_” methods.

If “posix_spawn” fails, the “pid” variable will be returned uninitialized.
msg311081 - (view) Author: Pablo Galindo Salgado (pablogsal) * (Python committer) Date: 2018-01-29 08:37
I have opened a PR to address this issues:

https://github.com/python/cpython/pull/5415
msg311086 - (view) Author: Martin Panter (martin.panter) * (Python committer) Date: 2018-01-29 10:03
Does the PySequence_Fast result need releasing if the following “for” loop fails? There is a Py_DECREF only in the successful case, which seems inconsistent.

Does Python still support non-UTF-8 locales and bytes filenames? I haven’t been keeping up, but I assumed these things were still supported in many cases. It seems strange to only support UTF-8 in the “addopen” file action.

Pablo’s second PR currently <https://github.com/python/cpython/pull/5418/commits/e50bdb9> calls PyErr_SetString(PyExc_OSError, . . .) with a custom error message depending on which OS call failed. But I wonder if it is more important to set the “errno” attribute (which I think should choose an OSError subclass if appropriate). Perhaps you can do that by assigning the return values to “errno” and then calling PyErr_SetFromErrno. A disadvantage might be less context about which stage went wrong.
msg311087 - (view) Author: Martin Panter (martin.panter) * (Python committer) Date: 2018-01-29 10:34
TypeError if “posix_spawn_file_actions_init” fails doesn’t seem right. I suggest OSError, MemoryError, or even plain Exception instead.

“File_actionsp” is set to point to a local variable “_file_actions”, but the variable goes out of scope before the pointer is used. This is technically undefined behaviour (even if it happens to work). Simplest solution would be to move the variables into the same outer scope.
msg311089 - (view) Author: Pablo Galindo Salgado (pablogsal) * (Python committer) Date: 2018-01-29 10:56
Regarding the leak, I was under the assumption that as File_actionsp is pointing to a stack initialized _file_actions and is this last variable the one that is passed to posix_spawn_file_actions_init, it was not needed to explicitly call posix_spawn_file_actions_destroy as the variable will go out of scope and the pointer is pointing to a stack variable. Am I missing something?
msg311094 - (view) Author: Martin Panter (martin.panter) * (Python committer) Date: 2018-01-29 11:23
Your assumption about calling “file_actions_destroy” would be okay if the posix_spawn_file_actions_t object was a simple object or structure. But I imagine most implementations would allocate memory when you call one of the “add” methods. Especially “addopen”, which is supposed to copy the filename string somewhere. Looking at “uclibc” <https://repo.or.cz/uclibc-ng.git/blob/v1.0.28:/include/spawn.h#l263>, I see it calls “free”, because it allocates an array for all the file actions.
msg311100 - (view) Author: STINNER Victor (vstinner) * (Python committer) Date: 2018-01-29 11:48
Python master doesn't build on macOS Tiger anymore:

http://buildbot.python.org/all/#/builders/30/builds/581

./Modules/posixmodule.c:251:19: error: spawn.h: No such file or directory
./Modules/posixmodule.c: In function ‘os_posix_spawn_impl’:
./Modules/posixmodule.c:5174: error: ‘posix_spawn_file_actions_t’ undeclared (first use in this function)
./Modules/posixmodule.c:5174: error: (Each undeclared identifier is reported only once
./Modules/posixmodule.c:5174: error: for each function it appears in.)
./Modules/posixmodule.c:5174: error: ‘file_actionsp’ undeclared (first use in this function)
./Modules/posixmodule.c:5176: error: parse error before ‘_file_actions’
./Modules/posixmodule.c:5177: warning: implicit declaration of function ‘posix_spawn_file_actions_init’
./Modules/posixmodule.c:5177: error: ‘_file_actions’ undeclared (first use in this function)
./Modules/posixmodule.c:5232: warning: implicit declaration of function ‘posix_spawn_file_actions_addopen’
./Modules/posixmodule.c:5245: warning: implicit declaration of function ‘posix_spawn_file_actions_addclose’
./Modules/posixmodule.c:5262: warning: implicit declaration of function ‘posix_spawn_file_actions_adddup2’
./Modules/posixmodule.c:5274: warning: implicit declaration of function ‘posix_spawn’
make: *** [Modules/posixmodule.o] Error 1

See also bpo-32705: "Current Android does not have posix_spawn" (now fixed). In PR 5413 of this bpo, it was also asked if "#define HAVE_POSIX_SPAWN 1" of posixmodule.c makes sence since there is already a configure check for posix_spawn()?

https://github.com/python/cpython/pull/5413#issuecomment-361183294
msg311102 - (view) Author: Pablo Galindo Salgado (pablogsal) * (Python committer) Date: 2018-01-29 12:03
@vstinner I have removed the #define HAVE_POSIX_SPAWN 1 in PR 5418.
msg311144 - (view) Author: STINNER Victor (vstinner) * (Python committer) Date: 2018-01-29 15:39
FYI bolen-dmg-3.x was also broken by this change:

http://buildbot.python.org/all/#/builders/69/builds/114
msg311164 - (view) Author: Yury Selivanov (yselivanov) * (Python committer) Date: 2018-01-29 20:25
FYI

New defect(s) Reported-by: Coverity Scan Showing 2 of 2 defect(s)
** CID 1428733: Memory - illegal accesses (RETURN_LOCAL) /Modules/posixmodule.c: 5281 in os_posix_spawn_impl()
________________________________________________________________________________________________________ *** CID 1428733: Memory - illegal accesses (RETURN_LOCAL)
/Modules/posixmodule.c: 5281 in os_posix_spawn_impl()
5275 }
5276 }
5277 Py_DECREF(seq);
5278 }
5279
5280 _Py_BEGIN_SUPPRESS_IPH
5281 posix_spawn(&pid, path->narrow, file_actionsp, NULL, argvlist, envlist);
5282 return PyLong_FromPid(pid);
5283 _Py_END_SUPPRESS_IPH
5284
5285 path_error(path); 5286
** CID 1428732: Control flow issues (UNREACHABLE) /Modules/posixmodule.c: 5285 in os_posix_spawn_impl()
________________________________________________________________________________________________________ *** CID 1428732: Control flow issues (UNREACHABLE)
/Modules/posixmodule.c: 5285 in os_posix_spawn_impl()
5279
5280 _Py_BEGIN_SUPPRESS_IPH
5281 posix_spawn(&pid, path->narrow, file_actionsp, NULL, argvlist, envlist);
5282 return PyLong_FromPid(pid);
5283 _Py_END_SUPPRESS_IPH
5284
5285 path_error(path); 5286
 
5287 free_string_array(envlist, envc); 5288
5289 fail:
5290
msg311165 - (view) Author: Pablo Galindo Salgado (pablogsal) * (Python committer) Date: 2018-01-29 20:27
Thank Yury for the information!

This should be fixed now in PR418.
msg311166 - (view) Author: Gregory P. Smith (gregory.p.smith) * (Python committer) Date: 2018-01-29 20:34
New changeset 0cd6bca65519109a8a7862d38ba1b8924e432a16 by Gregory P. Smith (Pablo Galindo) in branch 'master':
bpo-20104: Fix leaks and errors in new os.posix_spawn (GH-5418)
https://github.com/python/cpython/commit/0cd6bca65519109a8a7862d38ba1b8924e432a16
msg314732 - (view) Author: Serhiy Storchaka (serhiy.storchaka) * (Python committer) Date: 2018-03-31 08:21
posix_spawn() is not documented, it contains reference leaks (issue33191) and can cause crashes. It's style doesn't conform PEP 7.

The simplest fix requires changing the undocumented interface.
msg314770 - (view) Author: Serhiy Storchaka (serhiy.storchaka) * (Python committer) Date: 2018-04-01 11:27
I'm trying to fix issues with os.posix_spawn(), but there are so much of them, that perhaps it would be better to remove this undocumented feature from 3.7.
msg314774 - (view) Author: Ned Deily (ned.deily) * (Python committer) Date: 2018-04-01 13:58
> perhaps it would be better to remove this undocumented feature from 3.7

We need to make a decision about this before the b4 cutoff in 4 weeks.  But it would not be fair to ask people to put in the effort to try to fix things if there is not a good chance for success and if people's time could be better spent on currently more important issues.  Greg, Pablo, others: opinions?
msg314775 - (view) Author: Pablo Galindo Salgado (pablogsal) * (Python committer) Date: 2018-04-01 15:05
In PR6331 I am trying to address all issues identified by Serhiy.
msg314777 - (view) Author: Pablo Galindo Salgado (pablogsal) * (Python committer) Date: 2018-04-01 15:06
Notice that some of the errors identified at the end of 5109 were already corrected by the other PRs in this issue.
msg314779 - (view) Author: Serhiy Storchaka (serhiy.storchaka) * (Python committer) Date: 2018-04-01 16:28
PR 6332 fixes issues found by me in os.posix_spawn(). It is not completed yet, needed documentation and tests. And I have doubts about API.
msg314781 - (view) Author: Gregory P. Smith (gregory.p.smith) * (Python committer) Date: 2018-04-01 19:01
New changeset 79760ed256987cead08d668b95675afba6b3760f by Gregory P. Smith in branch 'master':
bpo-20104: Add os.posix_spawn documentation. (#6334)
https://github.com/python/cpython/commit/79760ed256987cead08d668b95675afba6b3760f
msg314783 - (view) Author: Gregory P. Smith (gregory.p.smith) * (Python committer) Date: 2018-04-01 19:11
PR 6332 is overall a good cleanup.  I pushed a couple changes to that PR and left comments for some others.  It should go in.

There are basic tests for os.posix_spawn in test_posix.py.  More would be nice as we don't currently have any for the error paths.

The API is intentionally low level and simple as that is what the os (posix) module is all about.
msg314784 - (view) Author: miss-islington (miss-islington) Date: 2018-04-01 19:13
New changeset ab8457232121dfdfb1d4bfcf806a842fbe402722 by Miss Islington (bot) in branch '3.7':
bpo-20104: Add os.posix_spawn documentation. (GH-6334)
https://github.com/python/cpython/commit/ab8457232121dfdfb1d4bfcf806a842fbe402722
msg314826 - (view) Author: Ned Deily (ned.deily) * (Python committer) Date: 2018-04-02 20:24
Thanks for adding the doc updates.  There are a few rendering issues:  https://docs.python.org/3.7/library/os.html#os.posix_spawn
msg315795 - (view) Author: Serhiy Storchaka (serhiy.storchaka) * (Python committer) Date: 2018-04-26 13:43
What about posix_spawnp()? The title mentions both functions, but I don't see any mentions about posix_spawnp() in the discussion. Its omission looks unintentional to me.
msg315995 - (view) Author: Serhiy Storchaka (serhiy.storchaka) * (Python committer) Date: 2018-05-01 13:45
New changeset ef347535f289baad22c0601e12a36b2dcd155c3a by Serhiy Storchaka in branch 'master':
bpo-20104: Improve error handling and fix a reference leak in os.posix_spawn(). (#6332)
https://github.com/python/cpython/commit/ef347535f289baad22c0601e12a36b2dcd155c3a
msg315997 - (view) Author: miss-islington (miss-islington) Date: 2018-05-01 14:18
New changeset 77fa7835da0cb49d30ac5d4c32bf6eb71eae0742 by Miss Islington (bot) in branch '3.7':
bpo-20104: Improve error handling and fix a reference leak in os.posix_spawn(). (GH-6332)
https://github.com/python/cpython/commit/77fa7835da0cb49d30ac5d4c32bf6eb71eae0742
msg316014 - (view) Author: Ned Deily (ned.deily) * (Python committer) Date: 2018-05-01 20:15
Thanks for the latest fixes, Serhiy.

Now is there anything more that needs to be done for this issue?
msg316046 - (view) Author: Serhiy Storchaka (serhiy.storchaka) * (Python committer) Date: 2018-05-02 06:19
The current implementation looks half-baked to me. It doesn't implement the following features:

1. posix_spawnp(). It is like posix_spawn(), but searches an executable in PATH.

2. Passing various attributes of the created child process. POSIX_SPAWN_SETSIGMASK, POSIX_SPAWN_SETSCHEDPARAM, etc.

And I have doubts about introducing constants like POSIX_SPAWN_OPEN:

1. There is no need to make them integers. They could be strings (for readability) or opaque values.

2. Their names can conflict with future standard constants related to posix_path().

Implementing new features in 3.8 can conflict with the current design. Removing posix_spawn() in 3.7 and deferring the work to 3.8 still looks a better solution to me.
msg316063 - (view) Author: STINNER Victor (vstinner) * (Python committer) Date: 2018-05-02 10:04
> The current implementation looks half-baked to me. (...)

I would prefer to see a full implementation before Python 3.7 final, or to remove the feature from Python 3.7 and redo it in Python 3.8.

It seems doable to finish the implementation before Python 3.7 final.
msg316068 - (view) Author: Pablo Galindo Salgado (pablogsal) * (Python committer) Date: 2018-05-02 10:55
I have open https://github.com/python/cpython/pull/6693 to start iterating over the missing capabilities of posix_spawn (Passing various attributes of the created child process).

Please, review to make sure that the design is OK and I will proceed to add tests.

Thanks!
msg316088 - (view) Author: Serhiy Storchaka (serhiy.storchaka) * (Python committer) Date: 2018-05-02 20:42
As for PR 6693: I think using keyword arguments would be more Pythonic.
msg316150 - (view) Author: Gregory P. Smith (gregory.p.smith) * (Python committer) Date: 2018-05-04 00:47
Since this is a new API, the most important thing is to have decided how we want it to look in 3.7, it is okay for it to be missing features so long as the API doesn't prevent them from being added in the future.

ie: It is fine to ship it without posix_spawnattr_t attrp "flags" support and without the "p" variant supported in 3.7.

Regarding PR 6693, I agree, keyword only arguments for the flags instead of a sequence of tuples is a good idea.  Since 3.7b4 has shipped I'm not sure we should even consider adding new _features_ to this new API in 3.7 to appear in rc1 - though being entirely new it wouldn't break anything, it just doesn't give us any change to revise them.  I'd be conservative at this point.
msg316235 - (view) Author: Martin Panter (martin.panter) * (Python committer) Date: 2018-05-06 14:02
To wrap “posix_spawnattr_setschedparam” perhaps you could combine it with the scheduler policy:

# Inherit current policy and parameters:
posix_spawn(..., scheduler=None)

# Set new policy with parameters:
posix_spawn(..., scheduler=(policy, param))

# Inherit current policy but set new parameters:
posix_spawn(..., scheduler=(None, param))
msg316238 - (view) Author: Pablo Galindo Salgado (pablogsal) * (Python committer) Date: 2018-05-06 15:14
One open question is how to construct and pass through the struct "sched_param" that “posix_spawnattr_setschedparam” needs.
msg316246 - (view) Author: Martin Panter (martin.panter) * (Python committer) Date: 2018-05-07 00:15
Can you use the existing sched_param class?
https://docs.python.org/3/library/os.html#os.sched_param
msg316247 - (view) Author: Pablo Galindo Salgado (pablogsal) * (Python committer) Date: 2018-05-07 00:27
Thanks! I have updated the PR and added tests.
msg316268 - (view) Author: Serhiy Storchaka (serhiy.storchaka) * (Python committer) Date: 2018-05-07 12:56
I have doubts about combining setschedpolicy and setschedparam.

On one hand, they are set by independent functions in C and have separate flags. When pass arguments separately we don't need to invent a parameter name: just remove the common "posix_spawnattr_" prefix from corresponding C function names. Passing scheduler=(None, param) looks clumsy. And from implementation side, parsing separate arguments is easier that parsing a single argument as a 2-tuple.

On other hand, these arguments are not completely independent: the setschedparam argument is required if setschedpolicy is specified.

Passing as separate arguments still looks more preferable to me.
msg316282 - (view) Author: Serhiy Storchaka (serhiy.storchaka) * (Python committer) Date: 2018-05-08 09:53
I propose the following changes to the file_actions parameter in PR 6725. Currently it is the fourth positional-only parameter with the default value None.

1. Change its default value to an empty tuple. None will no longer be accepted as a special case. This will simplify a mental model.

2. Since the order of parameters already doesn't math the natural order of the corresponding C function, and many other keyword-only parameters will be added in PR 6693, it makes sense to make file_actions a keyword-only parameter too.
msg316489 - (view) Author: STINNER Victor (vstinner) * (Python committer) Date: 2018-05-14 03:30
For Python 3.7, I discussed with Pablo and he seems to be ok to remove the function from Python 3.7, to get more time to polish the API in the master branch (future Python 3.8).

Can someone please write a PR to remove the function from the 3.7 branch?

I dislike the idea of applying a significant change on 3.7 to change the API between a beta and a rc release. IMHO it's too late, and we might find some API bugs. I would prefer to get more feedback on the API before setting it in stone (once a 3.x.0 version is released, it's really hard to change the API).
msg316495 - (view) Author: Martin Panter (martin.panter) * (Python committer) Date: 2018-05-14 08:15
I suggested the “scheduler” tuple to bring the two related parameters (scheduling policy and sched_param) together, similar to how they are paired as the second and third parameters to “os.sched_setscheduler”, and because I thought it would imply that a scheduling policy without sched_param is not valid.

But separate keyword parameters would also work if you prefer. I might call them “schedpolicy” and “schedparam”, but if you prefer the naming scheme in the current proposal, you could call them “setscheduler” and “setschedparam”. Although in that case, setting the “setscheduler” parameter on its own would not be sufficient to correctly set the POSIX_SPAWN_SETSCHEDULER flag.
msg316503 - (view) Author: Pablo Galindo Salgado (pablogsal) * (Python committer) Date: 2018-05-14 13:16
I think that is the biggest argument towards using a tuple: that just setting the priority is not enough (and also is decontextualized as different policies have different priorities). On the other hand one could say that the API is a low level API and therefore the user should be aware of such a problem. Also, using a tuple makes the API a bit asymmetric as Serhiy mention but I think this can compensate for the conceptual map the will be established (as in each priority is associated with a policy) if the tuple API is used.
msg316522 - (view) Author: Serhiy Storchaka (serhiy.storchaka) * (Python committer) Date: 2018-05-14 15:44
As for the scheduler interface, yet one option is using two mutually exclusive parameters setschedparam and setscheduler. The first take a sched_param, the second takes a pair: int and sched_param. This will not simplify the implementation, but they directly correspond to functions os.sched_setparam() and os.sched_setscheduler(). I don't say that this interface is better than alternatives, I just mention yet one option.

posix_spawn() can be easily implemented via fork()/exec(). See the reference implementation: http://pubs.opengroup.org/onlinepubs/009695399/xrat/xsh_chap03.html . It can be directly translated to Python. Seems the only value of posix_spawn() is on systems that don't provide fork()/exec(), but provide posix_spawn(). Is posix_spawn() supported on Windows? What are other systems supported by Python that don't provide fork()/exec()?

I'm for removing os.posix_spawn() in 3.7. Even if we will accept the current interface and merge PR 6693 we can find other problems with the interface or the implementation. And os.posix_spawnp() still is not implemented. It is just too dangerous to add such complex feature between the last beta and RC.
msg316525 - (view) Author: Pablo Galindo Salgado (pablogsal) * (Python committer) Date: 2018-05-14 15:54
Notice that https://github.com/python/cpython/pull/6794 is already open to remove posix_spawn from 3.7.
msg316551 - (view) Author: Gregory P. Smith (gregory.p.smith) * (Python committer) Date: 2018-05-14 18:56
Pablo, Victor, Ned and I synced up at the pycon2018 sprints.  We're removing posix_spawn from 3.7 (https://github.com/python/cpython/pull/6794) while the API is worked on.
msg316588 - (view) Author: Gregory P. Smith (gregory.p.smith) * (Python committer) Date: 2018-05-14 21:52
New changeset 8e633a4035bcff458c45fa095f4b8eab2f158466 by Gregory P. Smith (Pablo Galindo) in branch '3.7':
bpo-20104: Remove posix_spawn from 3.7 (GH-6794)
https://github.com/python/cpython/commit/8e633a4035bcff458c45fa095f4b8eab2f158466
msg316592 - (view) Author: Pablo Galindo Salgado (pablogsal) * (Python committer) Date: 2018-05-14 22:09
Some interesting benchmarks of posix spawn:

https://github.com/rtomayko/posix-spawn/blob/master/README.md
msg316595 - (view) Author: Pablo Galindo Salgado (pablogsal) * (Python committer) Date: 2018-05-14 22:17
Also this reading may be relevant/interesting:

https://about.gitlab.com/2018/01/23/how-a-fix-in-go-19-sped-up-our-gitaly-service-by-30x/
msg317071 - (view) Author: STINNER Victor (vstinner) * (Python committer) Date: 2018-05-18 23:52
posix_spawn can still be found in Python 3.7:

configure:11243: posix_fallocate posix_fadvise posix_spawn pread preadv preadv2 \
configure.ac:3470: posix_fallocate posix_fadvise posix_spawn pread preadv preadv2 \
pyconfig.h.in:710:/* Define to 1 if you have the `posix_spawn' function. */

This check has been added by the commit 6c6ddf97c402709713d668d0ed53836a7749ba99 and should be removed from Python 3.7 as well, no? It's not a release blocker, but it may be good to remove posix_spawn from configure.ac (and related generated files).
msg317076 - (view) Author: Pablo Galindo Salgado (pablogsal) * (Python committer) Date: 2018-05-19 00:14
I originally removed it from the configure script in PR6794 but it was reintroduced in commit 57009526f6a405e0ffe8c16012cce509b62cb577. Check the PR for Greg's rationale.
msg317084 - (view) Author: STINNER Victor (vstinner) * (Python committer) Date: 2018-05-19 00:50
> I originally removed it from the configure script in PR6794 but it was reintroduced in commit 57009526f6a405e0ffe8c16012cce509b62cb577. Check the PR for Greg's rationale.

Oh ok.
msg317510 - (view) Author: STINNER Victor (vstinner) * (Python committer) Date: 2018-05-24 02:19
Tests fail on PPC64 Fedora 3.x: bpo-33630.
msg323639 - (view) Author: Serhiy Storchaka (serhiy.storchaka) * (Python committer) Date: 2018-08-17 09:11
Benjamin, Gregory, could you please look at PR 6693? Is it what you want?
msg324765 - (view) Author: Pablo Galindo Salgado (pablogsal) * (Python committer) Date: 2018-09-07 15:44
New changeset 254a4663d8c5970ae2928185c50ebaa6c7e62c80 by Pablo Galindo in branch 'master':
bpo-20104: Add flag capabilities to posix_spawn (GH-6693)
https://github.com/python/cpython/commit/254a4663d8c5970ae2928185c50ebaa6c7e62c80
msg324832 - (view) Author: Serhiy Storchaka (serhiy.storchaka) * (Python committer) Date: 2018-09-08 11:48
New changeset d700f97b627989d41cd4629dc02969f9a6b56d2f by Serhiy Storchaka in branch 'master':
bpo-20104: Change the file_actions parameter of os.posix_spawn(). (GH-6725)
https://github.com/python/cpython/commit/d700f97b627989d41cd4629dc02969f9a6b56d2f
msg324833 - (view) Author: Serhiy Storchaka (serhiy.storchaka) * (Python committer) Date: 2018-09-08 11:51
Thank you for your contribution Pablo. This issue have appeared much more complex and less obvious than it looked initially.
msg324881 - (view) Author: STINNER Victor (vstinner) * (Python committer) Date: 2018-09-09 11:30
> Thank you for your contribution Pablo. This issue have appeared much more
complex and less obvious than it looked initially.

Yeah, thanks Pablo for your tenacy! This function is interesting in term of
performance and correctness ("atomic" function, signal safe?).
msg324883 - (view) Author: STINNER Victor (vstinner) * (Python committer) Date: 2018-09-09 11:40
I close the issue, it's now done.

If someone wants to experiment in posix_spawn() in subprocess, please open a new issue.
msg333129 - (view) Author: STINNER Victor (vstinner) * (Python committer) Date: 2019-01-07 00:47
Serhiy:
> And os.posix_spawnp() still is not implemented.

I created bpo-35674 to see how to expose this feature.
History
Date User Action Args
2022-04-11 14:57:56adminsetgithub: 64303
2019-01-07 00:47:02vstinnersetmessages: + msg333129
2018-09-09 11:40:06vstinnersetstatus: open -> closed
resolution: fixed
messages: + msg324883

stage: patch review -> resolved
2018-09-09 11:30:35vstinnersetmessages: + msg324881
2018-09-08 11:51:33serhiy.storchakasetmessages: + msg324833
2018-09-08 11:48:22serhiy.storchakasetmessages: + msg324832
2018-09-07 15:44:27pablogsalsetmessages: + msg324765
2018-08-17 09:11:55serhiy.storchakasetmessages: + msg323639
2018-05-24 02:19:59vstinnersetmessages: + msg317510
2018-05-19 00:50:40vstinnersetmessages: + msg317084
2018-05-19 00:14:41pablogsalsetmessages: + msg317076
2018-05-18 23:52:31vstinnersetmessages: + msg317071
2018-05-14 22:40:23ned.deilysetpriority: release blocker ->
versions: - Python 3.7
2018-05-14 22:17:14pablogsalsetmessages: + msg316595
2018-05-14 22:09:59pablogsalsetpriority: normal -> release blocker

messages: + msg316592
components: - Library (Lib)
versions: + Python 3.7
2018-05-14 21:53:37gregory.p.smithsetcomponents: + Library (Lib)
2018-05-14 21:53:14gregory.p.smithsetpriority: release blocker -> normal
versions: - Python 3.7
2018-05-14 21:52:45gregory.p.smithsetmessages: + msg316588
2018-05-14 18:56:36gregory.p.smithsetmessages: + msg316551
2018-05-14 16:17:16pablogsalsetmessages: - msg316526
2018-05-14 15:58:40pablogsalsetmessages: + msg316526
2018-05-14 15:54:29pablogsalsetmessages: + msg316525
2018-05-14 15:44:58serhiy.storchakasetmessages: + msg316522
2018-05-14 14:08:10pablogsalsetpull_requests: + pull_request6479
2018-05-14 13:16:47pablogsalsetmessages: + msg316503
2018-05-14 08:15:45martin.pantersetmessages: + msg316495
2018-05-14 03:30:35vstinnersetpriority: deferred blocker -> release blocker

messages: + msg316489
2018-05-08 09:53:52serhiy.storchakasetmessages: + msg316282
2018-05-08 09:47:08serhiy.storchakasetpull_requests: + pull_request6418
2018-05-07 12:58:18serhiy.storchakasetdependencies: + Expose the sigset_t converter via private API
2018-05-07 12:56:52serhiy.storchakasetmessages: + msg316268
2018-05-07 00:27:52pablogsalsetmessages: + msg316247
2018-05-07 00:15:36martin.pantersetmessages: + msg316246
2018-05-06 15:14:51pablogsalsetmessages: + msg316238
2018-05-06 14:02:23martin.pantersetmessages: + msg316235
2018-05-04 00:47:15gregory.p.smithsetmessages: + msg316150
2018-05-02 20:42:26serhiy.storchakasetmessages: + msg316088
2018-05-02 10:55:45pablogsalsetmessages: + msg316068
2018-05-02 10:54:33pablogsalsetpull_requests: + pull_request6386
2018-05-02 10:04:42vstinnersetmessages: + msg316063
2018-05-02 06:19:38serhiy.storchakasetmessages: + msg316046
2018-05-01 20:15:52ned.deilysetpriority: release blocker -> deferred blocker

messages: + msg316014
versions: + Python 3.8
2018-05-01 20:08:48ned.deilylinkissue33191 superseder
2018-05-01 14:18:47miss-islingtonsetmessages: + msg315997
2018-05-01 13:45:22miss-islingtonsetpull_requests: + pull_request6368
2018-05-01 13:45:07serhiy.storchakasetmessages: + msg315995
2018-04-26 13:43:57serhiy.storchakasetmessages: + msg315795
2018-04-25 15:03:53pablogsallinkissue33357 superseder
2018-04-02 20:24:53ned.deilysetmessages: + msg314826
2018-04-01 19:13:40miss-islingtonsetnosy: + miss-islington
messages: + msg314784
2018-04-01 19:11:10gregory.p.smithsetmessages: + msg314783
2018-04-01 19:02:11miss-islingtonsetpull_requests: + pull_request6047
2018-04-01 19:01:50gregory.p.smithsetmessages: + msg314781
2018-04-01 18:55:46gregory.p.smithsetpull_requests: + pull_request6046
2018-04-01 16:28:02serhiy.storchakasetmessages: + msg314779
2018-04-01 16:23:37serhiy.storchakasetstage: commit review -> patch review
pull_requests: + pull_request6045
2018-04-01 15:06:41pablogsalsetmessages: + msg314777
2018-04-01 15:05:10pablogsalsetmessages: + msg314775
2018-04-01 13:58:03ned.deilysetmessages: + msg314774
2018-04-01 11:27:38serhiy.storchakasetmessages: + msg314770
2018-03-31 08:21:43serhiy.storchakasetstatus: closed -> open

nosy: + serhiy.storchaka
messages: + msg314732

dependencies: + Refleak in posix_spawn
resolution: fixed -> (no value)
2018-01-30 07:38:33gregory.p.smithsetassignee: gregory.p.smith
2018-01-30 07:38:25gregory.p.smithsetstatus: open -> closed
resolution: fixed
stage: patch review -> commit review
2018-01-29 20:34:47gregory.p.smithsetmessages: + msg311166
2018-01-29 20:27:57pablogsalsetmessages: + msg311165
2018-01-29 20:25:50yselivanovsetnosy: - yselivanov
2018-01-29 20:25:33yselivanovsetnosy: + yselivanov
messages: + msg311164
2018-01-29 16:43:56ned.deilysetpriority: normal -> release blocker
nosy: + ned.deily
2018-01-29 15:39:36vstinnersetnosy: + vstinner
messages: + msg311144
2018-01-29 12:03:01pablogsalsetnosy: - vstinner
messages: + msg311102
2018-01-29 11:48:17vstinnersetnosy: + vstinner
messages: + msg311100
2018-01-29 11:23:39martin.pantersetmessages: + msg311094
2018-01-29 10:56:21pablogsalsetmessages: + msg311089
2018-01-29 10:34:43martin.pantersetmessages: + msg311087
2018-01-29 10:03:27martin.pantersetmessages: + msg311086
2018-01-29 08:37:46pablogsalsetmessages: + msg311081
2018-01-29 08:36:46pablogsalsetkeywords: + patch
pull_requests: + pull_request5253
2018-01-29 03:54:28martin.pantersetnosy: + pablogsal, martin.panter
messages: + msg311051
2018-01-29 01:58:42gregory.p.smithsetpriority: low -> normal

messages: + msg311041
2018-01-29 01:56:13gregory.p.smithsetmessages: + msg311040
2018-01-08 00:25:28gregory.p.smithsetstatus: closed -> open
type: performance -> enhancement
resolution: rejected -> (no value)
stage: needs patch -> patch review
2018-01-06 01:17:42pablogsalsetpull_requests: + pull_request4977
2018-01-06 01:17:41pablogsalsetpull_requests: + pull_request4976
2017-03-18 02:34:28gregory.p.smithsetmessages: + msg289791
2017-03-17 18:49:37John Jonessetmessages: + msg289775
2017-03-17 16:33:15gregory.p.smithsetmessages: + msg289769
versions: + Python 3.7, - Python 3.5
2017-03-17 12:08:14John Jonessetnosy: + John Jones
messages: + msg289758
2016-07-01 19:02:47dhduvallsetmessages: + msg269684
2014-07-08 17:38:08dhduvallsetmessages: + msg222572
2014-07-08 17:26:20alexsetmessages: + msg222571
2014-07-08 17:23:56dhduvallsetnosy: + dhduvall
messages: + msg222570
2014-03-13 02:25:13benjamin.petersonsetstatus: open -> closed
resolution: rejected
messages: + msg213344
2014-01-01 21:17:39gregory.p.smithsetpriority: normal -> low

messages: + msg207139
2014-01-01 20:03:47pitrousetnosy: + gregory.p.smith, neologix
2014-01-01 18:36:01alexsetnosy: + alex
2014-01-01 18:22:17gennadsetnosy: + gennad
2014-01-01 18:13:20benjamin.petersoncreate