Rework CALL_FUNCTION* opcodes #71400

serhiy-storchaka · 2016-06-04T07:45:34Z

BPO	27213
Nosy	@vstinner, @rbtcollins, @markshannon, @berkerpeksag, @serhiy-storchaka, @1st1, @serprex
Dependencies	bpo-27140: Opcode for creating dict with constant keys
Files	callfunc.patch callfunc2.patch callfunc3.patch callfunc4.patch callfunc5.patch callfunc-6.patch callfunc-7.patch callfunc-8.patch callfunc-9.patch callfunc-10.patch callfunc-11.patch callfunc-12.patch

^{Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.}

Show more details

GitHub fields:

assignee = 'https://github.com/serhiy-storchaka'
closed_at = <Date 2016-09-22.16:54:56.652>
created_at = <Date 2016-06-04.07:45:33.985>
labels = ['interpreter-core', 'type-feature']
title = 'Rework CALL_FUNCTION* opcodes'
updated_at = <Date 2016-09-22.21:13:20.263>
user = 'https://github.com/serhiy-storchaka'

bugs.python.org fields:

activity = <Date 2016-09-22.21:13:20.263>
actor = 'vstinner'
assignee = 'serhiy.storchaka'
closed = True
closed_date = <Date 2016-09-22.16:54:56.652>
closer = 'serhiy.storchaka'
components = ['Interpreter Core']
creation = <Date 2016-06-04.07:45:33.985>
creator = 'serhiy.storchaka'
dependencies = ['27140']
files = ['43474', '43493', '43602', '43669', '43741', '44403', '44485', '44487', '44508', '44510', '44546', '44551']
hgrepos = ['354']
issue_num = 27213
keywords = ['patch']
message_count = 55.0
messages = ['267241', '267341', '268285', '268652', '268720', '268860', '268862', '268907', '268948', '268998', '269623', '269669', '269696', '270064', '270521', '273372', '273684', '273685', '274641', '275259', '275261', '275264', '275295', '275348', '275351', '275352', '275372', '275387', '275388', '275402', '275405', '275409', '275411', '275412', '275448', '275452', '275748', '275749', '275752', '275766', '275770', '275784', '275861', '275885', '276004', '276006', '276008', '276021', '276022', '276063', '276064', '276073', '276102', '277233', '277239']
nosy_count = 8.0
nosy_names = ['vstinner', 'rbcollins', 'Mark.Shannon', 'python-dev', 'berker.peksag', 'serhiy.storchaka', 'yselivanov', 'Demur Rumed']
pr_nums = []
priority = 'normal'
resolution = 'fixed'
stage = 'resolved'
status = 'closed'
superseder = None
type = 'enhancement'
url = 'https://bugs.python.org/issue27213'
versions = ['Python 3.6']

serhiy-storchaka · 2016-06-04T07:45:33Z

Currently there are 4 opcodes (CALL_FUNCTION, CALL_FUNCTION_VAR, CALL_FUNCTION_KW, CALL_FUNCTION_VAR_KW) for calling a function depending of presenting the var-positional and var-keyword arguments:

    func(arg1, ..., argN, name1=kwarg1, ..., nameM=kwargM)
    func(arg1, ..., argN, *args, name1=kwarg1, ..., nameM=kwargM)
    func(arg1, ..., argN, name1=kwarg1, ..., nameM=kwargM, **kwargs)
    func(arg1, ..., argN, *args, name1=kwarg1, ..., nameM=kwargM, **kwargs)

The number of positional and keyword arguments are packed in oparg, and both numbers are limited by 255. Thus the single keyword argument makes oparg not fitting in 8 bit and requires EXTENDED_ARG. The stack contains first positional arguments, then optional args tuple, then keyword arguments, then optional kwargs dict. For every keyword argument the two values are pushed on the stack: argument name (always constant string) and argument value.

I collected a statistic about opcodes in compiled code during running Python tests [1] (maybe it is biased, but this is large and multifarious assembly, and I hope it takes some representation about average Python code). According to it about 90% of compiled function calls are calls with the fixed number of positional arguments (CALL_FUNCTION with oparg < 256), the rest 10% are calls with the fixed number of positional and keyword arguments (CALL_FUNCTION with oparg >= 256), and only about 0.5% are calls with the var-positional or var-keyword arguments.

I propose to use the different sets of opcodes that corresponds to these cases:

    func(arg1, ..., argN)
    func(arg1, ..., argN, name1=kwarg1, ..., nameM=kwargM)
    func(*args, **kwargs)

CALL_FUNCTION takes the fixed number of positional arguments. oparg is the number of arguments. The stack contains positional arguments.
CALL_FUNCTION_KW takes the fixed number of positional and keyword arguments. oparg is the number of all arguments. The stack contains values of arguments (first positional, then keyword), then a tuple of keyword names (as in proposed new opcode BUILD_CONST_KEY_MAP in bpo-27140).
CALL_FUNCTION_EX takes the variable number of positional and keyword arguments. oparg is 0. The stack contains a tuple of positional arguments and a dict of keyword arguments (they are build in the bytecode with using BUILD_TUPLE, BUILD_CONST_KEY_MAP, BUILD_LIST_UNPACK and BUILD_MAP_UNPACK_WITH_CALL). This is the most general variant, others exist just for the optimization of common cases.

Benefits:

Calling a function with keyword arguments uses less stack and less LOAD_CONST instructions.
Calling a function with keyword arguments no longer needs EXTENDED_ARG.
The number of positional and keyword arguments is no longer limited by 255 (at least not in the bytecode).
The bytecode looks simpler, oparg always is just the number of arguments taken from the stack.

This proposition was discussed on Python-Ideas [2].

[1] http://permalink.gmane.org/gmane.comp.python.ideas/39993
[2] http://comments.gmane.org/gmane.comp.python.ideas/39961

serprex · 2016-06-04T23:12:07Z

I'd like to take on creating a patch for this proposal once bpo-27140 lands

serhiy-storchaka · 2016-06-11T21:51:33Z

I was going to write a patch myself, but since I'm sure in your skills, I yield it to you.

serhiy-storchaka · 2016-06-16T09:37:32Z

Do you take this for your Demur?

serprex · 2016-06-17T11:35:34Z

I've been working on this, may have the ceval portion mostly worked out but can't test until I finish the compile portion. Haven't had time this week, will have time to focus this weekend

serprex · 2016-06-19T17:38:12Z

Attaching first iteration. Very drafty. Still need to fix test_dis; will run test suite this evening. Perf on pybench went from 16.5s to 17.5s. It was 18.3s prior to reintroducing use of fast_function. Code still needs clean up besides investigation into how to optimize

Changes not described in the original patch concept: I made CALL_FUNCTION_EX oparg carry 2 flag bits on whether there's a kwdict or a vararg tuple. Code may be simpler to only have kwdict be optional. BUILD_MAP_UNPACK_WITH_CALL was modified to only use the highest bit of the 2nd byte to track where function_pos is. Most uses of BUILD_MAP_UNPACK_WITH_CALL is f(*va,**kw) so I chose to only set the highest bit when there isn't a positional unpacking. If we pushed an empty tuple then that flag bit could be removed

markshannon · 2016-06-19T18:42:49Z

I seemed to have been added to the nosy list. I guess that means that my opinions are wanted ;)

I have wanted to clean up the code around making calls for quite a while. I just haven't had time.

I agree that adding a simpler opcode for making calls with positional only args is worthwhile.
However creating the tuple and/or dict at every call site is going to lead to slow downs. It gets even worse for bound-methods, as yet another tuple will need to be created to accommodate self.

What I would suggest is a new internal calling convention which embodies the six-part form of CALL_FUNCTION_VAR_KW, that is (number-of-postionals, pointer-to-positionals, number-of-keywords, pointer-to-keywords, starargs-tuple, doublestarargs-dict).
As a first step, each of the four current CALL_FUNCTION variants could then call this function, filling in the relevant parameters.

The new function would look something like this:
PyEval_Call_Callable(
PyObject *callable,
int npos,
PyObject **positionals,
int nkws,
PyObject **keywords,
PyObject *starargs,
PyObject *dictargs) {
    if (Is_PyFunction(callable))
        return PyFunction_Call(callable, npos, ...);

if (Is_CFunction(callable))
    return CFunction_Call(callable, npos, ...);

// Otherwise general object...

// We might want PyType_Call as well, since object creation is
// another common form of call.

// Create tuple and dict from args.
return PyObject_Call(callable, tuple, dict);

}

This would mean that the logic for frame creation would be where it belongs (in the code for the Function object) and that special casing the various forms of CFunctions could be moved to where that belongs.

Given what Serhiy says about calls with just positional parameters being by far the most common, it sounds as if it would make sense to add
PyFunction_CallSimple(PyObject *callable, int npos, PyObject**args)
and
CFuntion_CallSimple(PyObject *callable, int npos, PyObject**args)
variants for the new CALL_FUNCTION opcode to use.

With those changes, it would then make sense to replace the four opcodes
CALL_FUNCTION, CALL_FUNCTION_VAR, CALL_FUNCTION_KW, CALL_FUNCTION_VAR_KW

with
CALL_FUNCTION -- As Serhiy suggests
CALL_FUNCTION_KW -- As Serhiy suggests
CALL_FUNCTION_VAR_KW -- As currently exists

serhiy-storchaka · 2016-06-20T15:45:41Z

Thank you Demur. I left few comments on Rietveld (this is only the quick glance).

Good news that this change allows to simplify BUILD_MAP_UNPACK_WITH_CALL. But using 15th bit means using EXTENDED_ARG. I think it would be simpler to just push an empty tuple with LOAD_CONST. The same number of opcodes, but simpler implementation and nicer disassemble output.

Mark's idea about using oparg of the general variant for specifying a number of positional arguments without packing them in a tuple looks interesting. But I don't bother about this case. According to the statistics of running CPython tests *args or *kwargs are used only in 0.4% of calls. This is hardly worth optimizing. I'm going to collect more detailed statistics for number of args and function/method distinguishing.

As about PyFunction_CallSimple() etc, see bpo-26814 and bpo-27128. _PyObject_FastCall() protocol and new calling protocol should cover the case of new CALL_FUNCTION and may be CALL_FUNCTION_KW (>99% of calls in sum).

serprex · 2016-06-20T23:45:46Z

callfunc2 fixes test_dis, addresses code review, currently implements a copy of _PyEval_EvalCodeWithName as _PyEval_EvalCodeWithName2 which changes a few lines to work with new keyword stack layout so that we can use fast_function with kwargs

CALL_FUNCTION_EX is benchmarking much slower (4x slower when using **kw), but unpatched hits similar perf when using multiple **kw. So most slowdown is due to BUILD_MAP_UNPACK_WITH_CALL being slow. So some patch which speeds up intersection check (eg optimize to not allocate intersection when disjoint) should greatly diminish the perf loss on this simpler implementation. I'll open a separate issue for this

serhiy-storchaka · 2016-06-21T13:50:27Z

bpo-27358 is too complex, more complex than this issue. But I think the simple fix the regression in case of the single **kw is checking wherever the sum dict is empty.

```
           if (with_call) {
```

+ if (with_call && PyDict_Size(sum)) {

serhiy-storchaka · 2016-06-30T19:47:32Z

Added comments on Rietveld.

serprex · 2016-07-01T14:10:49Z

callfunc3 addresses most feedback. Doesn't address _PyEval_EvalCodeWithName2 code bloat, & I disagree with mentioning BUILD_MAP_UNPACK_WITH_CALL change in magic number update as the ABI of BUILD_MAP_UNPACK_WITH_CALL is unchanged. ie if we were to implement bpo-27358 after this there would be no need to bump the magic number

One thing which looks odd to me is the INCREF/DECREF calls on function objects surrounding calls. Especially odd is CALL_FUNCTION_EX where we finish with two calls to Py_DECREF(func). It shows up in IMPORT_NAME too

serhiy-storchaka · 2016-07-01T21:11:47Z

The ABI of BUILD_MAP_UNPACK_WITH_CALL is changed. oparg=514 meant merging 2 dicts, but with the patch it will mean merging 514 dicts.

The INCREF/DECREF calls on function objects surrounding calls are not needed, because the refcount was increased when the function pushed on the stack. They were needed if use PyMethod optimization, since PyMethod_GET_FUNCTION() returns borrowed link.

Seems you have missed my comments on compile.c.

Added more comments on Rietveld.

What about CALL_FUNCTION_EX benchmarking? Did the optimization help?

serprex · 2016-07-09T20:02:18Z

Pybench is now only ~200ms slower instead of 1200ms slower. But the whole point of this patch is that CALL_FUNCTION_EX shouldn't be optimized for, so I'd much prefer real benchmarking results

serprex · 2016-07-15T23:22:09Z

Since the most common use of CALL_FUNCTION_EX is..

def f(*x,*kw):
    other_func(*x, **kw)

I've added some code to BUILD_MAP_UNPACK_WITH_CALL & BUILD_TUPLE_UNPACK to not allocate a new object if called with oparg of 1 & TOP() is correct type

vstinner · 2016-08-22T13:05:27Z

See also the issue bpo-27809 "_PyObject_FastCall(): add support for keyword arguments" where we discuss if we can completely avoid the creation of a temporary dictionary to pass keyword arguments.

vstinner · 2016-08-25T22:03:17Z

Pybench is now only ~200ms slower instead of 1200ms slower. But the whole point of this patch is that CALL_FUNCTION_EX shouldn't be optimized for, so I'd much prefer real benchmarking results

FYI I released a first version of the "new" Python benchmark suite: performance 0.1 (quickly fixed with performance 0.1.1):

The README file contains a few information how to run a stable benchmark. Please help me to complete this documentation :-)

I may help you to run these benchmarks.

vstinner · 2016-08-25T22:27:13Z

I'm working on a new calling convention: "fast call".

I already pushed changes implementing first functions:

_PyObject_FastCallDict(PyObject *func, PyObject **stack, Py_ssize_t nargs, PyObject *kwargs)

kwargs is a Python dict, but it can be NULL.

_PyObject_FastCallDict() avoids the creation a temporary tuple when you don't have to pass keyword arguments.

But I'm interested by an evolution to avoid also the creation of a temporary dictionary: issue bpo-27830, "Add _PyObject_FastCallKeywords(): avoid the creation of a temporary dictionary for keyword arguments". This function would be the fundation for a new calling convention for C functions: bpo-27810, "Add METH_FASTCALL: new calling convention for C functions".

I didn't read your patch carefully, but I would like to make sure that your patch would benefit of these APIs to avoid the creation of temporary tuple and/or dict.

"func(*args, key=value)" uses CALL_FUNCTION_EX and so requires to create a temporary dictionary. Maybe it's ok, I don't think that this syntax is commonly used.

"func(*args)" uses CALL_FUNCTION_EX: if args is a tuple, can you confirm that it would be possible to use _PyObject_FastCallDict() and/or _PyObject_FastCallKeywords() to avoid the creation a new temporary tuple?

Hum, I don't think that "func(*args)" can be optimized if args is a list because a list is mutable, we cannot use a direct access to the internal C array of a list.

vstinner · 2016-09-06T22:16:13Z

I uploaded callfunc-6.patch which is supposed to be a rebased version of callfunc5.patch. I made tiny coding style changes.

Python/ceval.c changed a lot since 1 month 1/2, so I'm not 100% sure that the rebase is correct.

Known issues:

test_traceback and test_extcall fail
_PyEval_EvalCodeWithName2() should be merged into _PyEval_EvalCodeWithName() to factorize the code

I didn't review the change yet.

vstinner · 2016-09-09T05:07:53Z

Rebased change.

1st1 · 2016-09-09T05:14:30Z

Serhiy, Victor and I want this to be merged tomorrow morning PST to land in 3.6 before the freeze. Would you be able to merge this yourself? If not, we'll do it ourselves in the morning.

vstinner · 2016-09-09T05:47:38Z

Known issues:

test_traceback and test_extcall fail

_PyEval_EvalCodeWithName2() should be merged into _PyEval_EvalCodeWithName() to factorize the code

I fixed test_extcall and removed _PyEval_EvalCodeWithName2(): callfunc-8.patch.

I don't know how to fix test_traceback.

@serhiy: I really need the patch tomorrow. Would you mind to push it? If you are unable to fix test_traceback, just skip the test with a FIXME and open a new issue (it should be easy to fix it later).

I really want to finish METH_FASTCALL, issue bpo-27810, before the first beta next monday and basically tomorrow is for me the last time when I can still push things.

If we really find a major design issue, you still have 3 to 4 months to revert the change. If we find a minor design flaw or another optimization, again, we will still have time to enhance the code during the beta stage. The most important part is to push the major change to reduce changes after the first beta.

serhiy-storchaka · 2016-09-09T09:39:41Z

I'm going to push the patch today. But changes to test_extcall look as a regression to me.

vstinner · 2016-09-09T16:56:23Z

I'm going to push the patch today.

I'm going to push the patch in ~10 min since I don't see it pushed yet.

But changes to test_extcall look as a regression to me.

I agree that it looks like a regression, but it might be tricky to
retrieve the function name from the stack. I'm not even sure that the
function name is always already on the stack.

Again, I have to rush before of the feature freeze, but I hope that we
will find a smart solution for this error message.

serhiy-storchaka · 2016-09-09T17:11:58Z

Please wait. I just have sat down at my computer.

vstinner · 2016-09-09T17:14:00Z

Please wait. I just have sat down at my computer.

Oh, you are here :-) Go ahead!

vstinner · 2016-09-09T18:23:52Z

Please wait. I just have sat down at my computer.

Can you please give me a status and/or an ETA? This change is blocking me for 2 more changes.

serhiy-storchaka · 2016-09-09T18:50:22Z

I just fixed test_extcall and test_traceback, and going to get rid of the
duplication in _PyEval_EvalCodeWithName. I need maybe a hour or two.

vstinner · 2016-09-09T18:52:34Z

I just fixed test_extcall and test_traceback, and going to get rid of the duplication in _PyEval_EvalCodeWithName. I need maybe a hour or two.

What do you mean? I already fixed test_extcall and code duplication?

"I fixed test_extcall and removed _PyEval_EvalCodeWithName2(): callfunc-8.patch."

I proposed to skip the failing test in test_traceback and fix it after the beta release, I don't think that it's a major issue, it can be fixed later.

Can you please push callfunc-8.patch right now (skipping failing tests in test_traceback).

python-dev · 2016-09-09T19:28:41Z

New changeset a77756e480c2 by Victor Stinner in branch 'default':
Rework CALL_FUNCTION* opcodes
https://hg.python.org/cpython/rev/a77756e480c2

vstinner · 2016-09-09T19:33:39Z

I'm really sorry Serhiy that I didn't give you more time to finish the review and push yourself the change :-( Feel free to enhance the code, I don't expect major changes.

For test_traceback, I opened the issue bpo-28050: if you already know how to fix it, please go ahead ;-)

serhiy-storchaka · 2016-09-09T19:48:21Z

You changed the test. I made the code passing unchanged test (in additional
the bytecode for f(*a) and f(**kw) becomes simpler). _PyEval_EvalCodeWithName
still contained large duplicated part. It can be removed.

Here is my current patch.

python-dev · 2016-09-09T19:50:30Z

New changeset 854b08acca97 by Victor Stinner in branch 'default':
Issue bpo-27213: document changes in Misc/NEWS
https://hg.python.org/cpython/rev/854b08acca97

vstinner · 2016-09-09T19:51:36Z

I just pushed my patch for the issue bpo-27830 (add _PyObject_FastCallKeywords()). I'm trying to even push the first parts of METH_FASTCALL (issue bpo-27810).

Could you please try to rebase callfunc-9.patch on top of the development branch? The code is moving soooo fast this week :-/

serhiy-storchaka · 2016-09-09T21:23:33Z

Here is rebased patch.

vstinner · 2016-09-09T21:36:49Z

Here is rebased patch.

callfunc-10.patch looks like an even better enhancement compared to Python 3.5, nice work. Would you be ok to wait after the beta1 release? The change doesn't seem to change the bytecode too much, nor the C API, whereas I'm trying to push a few other changes for FASTCALL.

Moreover, I didn't understand fully the change, I left some questions.

berkerpeksag · 2016-09-11T08:55:24Z

I'm getting segfaults with a77756e480c2 (thanks to 'hg bisect') on Ubuntu 12.04. I've checked buildbots, but they seem happy.

In test_sys:

test_recursionlimit_recovery (test.test_sys.SysModuleTest) ... Fatal Python error: Segmentation fault

In test_runpy:

test_main_recursion_error (test.test_runpy.RunPathTestCase) ... Fatal Python error: Segmentation fault

I'm also getting the following error with Serhiy's latest patch:

python: Objects/abstract.c:2385: _PyStack_AsDict: Assertion `PyDict_GetItem(kwdict, key) != ((void *)0)' failed.
Aborted (core dumped)
generate-posix-vars failed
make: *** [pybuilddir.txt] Error 1

vstinner · 2016-09-11T09:01:17Z

Author: Berker Peksag (berker.peksag) * (Python committer) Date: 2016-09-11 08:55

Berker: "I'm getting segfaults with a77756e480c2 (thanks to 'hg bisect') on Ubuntu 12.04 (...)
test_recursionlimit_recovery (...) test_main_recursion_error (...)"

We should check if the change counts correctly the number of frames.

In callfunc-10.patch, Serhiy modified a counter related to recursion limit/number of frames to fix test_traceback (issue bpo-28050). He wrote on the review:

"Seems the number 50 is arbitrary. Now we should increase it. Why? This issue needs further investigation. Maybe this is a sign of regression."

To me, it's suspicious that we have to modify an unit test. Changing the opcode or internals must not have an impact on unit tests on recursion error.

Note: It might also be related to my FASTCALL changes, but you wrote that "hg bisect" pointed you to this change.

Berker: "I'm also getting the following error with Serhiy's latest patch: python: Objects/abstract.c:2385: _PyStack_AsDict: Assertion `PyDict_GetItem(kwdict, key) != ((void *)0)' failed."

This one might be related to the compact dict change.

Note: Just to make be sure that it is not a local issue, please try a "make distclean" and make sure that all .pyc files are removed, and then recompile Python.

berkerpeksag · 2016-09-11T09:10:59Z

[...] please try a "make distclean" and make sure that all .pyc files are removed, and then recompile Python.

I forgot to mention that in my earlier comment. I usually run "make distclean" before "./configure --with-pydebug; make -s -j" so unfortunately running "make distclean" didn't help.

serhiy-storchaka · 2016-09-11T10:07:45Z

python: Objects/abstract.c:2385: _PyStack_AsDict: Assertion
`PyDict_GetItem(kwdict, key) != ((void *)0)' failed. Aborted (core dumped)
generate-posix-vars failed
make: *** [pybuilddir.txt] Error 1

Ah, it's my fault. I didn't test the latest patch with debug build. Here should be == instead of !=. Could you please test updated patch? Unfortunately I can's reproduce crashes in test_sys and test_runpy (on 32-bit Linux).

berkerpeksag · 2016-09-11T10:35:56Z

Thanks, Serhiy.

test_recursionlimit_recovery in test_sys passed, but now I'm getting the following failure:

FAIL: test_recursionlimit_fatalerror (test.test_sys.SysModuleTest)

I modified test_recursionlimit_fatalerror to use subTest() and it looks like the test passed when sys.setrecursionlimit(50). See i=1000 in the following output:

======================================================================
FAIL: test_recursionlimit_fatalerror (test.test_sys.SysModuleTest) (i=1000)
----------------------------------------------------------------------

Traceback (most recent call last):
  File "/home/berker/hacking/cpython/default/Lib/test/test_sys.py", line 283, in test_recursionlimit_fatalerror
    err)
AssertionError: b'Fatal Python error: Cannot recover from stack overflow' not found in b''

test_runpy still segfaults:

test_main_recursion_error (test.test_runpy.RunPathTestCase) ... Fatal Python error: Segmentation fault

I wrote a quick and dirty reproducer for this. I can attach here if you think it would be useful for debugging.

serhiy-storchaka · 2016-09-11T12:46:38Z

Please attach a reproducer Berker. I still can't reproduce crashes. Maybe it depends on platform or installed third-party module or Python command line options.

python-dev · 2016-09-11T21:53:26Z

New changeset 51b635e81958 by Serhiy Storchaka in branch 'default':
Issue bpo-27213: Fixed different issues with reworked CALL_FUNCTION* opcodes.
https://hg.python.org/cpython/rev/51b635e81958

rbtcollins · 2016-09-11T23:27:27Z

See http://bugs.python.org/issue28086 - this introduced a regression in the test suite.

serhiy-storchaka · 2016-09-12T08:52:06Z

I have committed the patch before beta 1 because it change bytecode generating and interpreting. The bytecode generated before the patch is compatible with the patched interpreter, but the bytecode generated by patched compiler can be not compatible with unpatched interpreter. After releasing beta 1 this would require changing bytecode magic number.

vstinner · 2016-09-12T09:04:10Z

The change 51b635e81958 introduced memory leaks. Example:

$ ./python -m test -R 3:3 test_extcall
(...)
test_extcall leaked [102, 102, 102] references, sum=306
test_extcall leaked [41, 41, 41] memory blocks, sum=123

I'm analyzing the change to try to identify leaks.

python-dev · 2016-09-12T09:17:34Z

New changeset f217419d08f0 by Victor Stinner in branch 'default':
Issue bpo-27213: Fix reference leaks
https://hg.python.org/cpython/rev/f217419d08f0

python-dev · 2016-09-12T11:04:31Z

New changeset f860b7a775c5 by Victor Stinner in branch 'default':
ssue bpo-27213: Reintroduce checks in _PyStack_AsDict()
https://hg.python.org/cpython/rev/f860b7a775c5

python-dev · 2016-09-12T11:38:51Z

New changeset e372c0ad32ce by Victor Stinner in branch 'default':
Revert change f860b7a775c5
https://hg.python.org/cpython/rev/e372c0ad32ce

New changeset 2558bc4a4ebf by Victor Stinner in branch 'default':
Document kwnames in _PyObject_FastCallKeywords() and _PyStack_AsDict()
https://hg.python.org/cpython/rev/2558bc4a4ebf

1st1 · 2016-09-12T15:59:36Z

After releasing beta 1 this would require changing bytecode magic number.

Why don't we change the magic number before b1 (if your bytecode change is already committed)?

vstinner · 2016-09-12T16:14:13Z

Why don't we change the magic number before b1 (if your bytecode change is already committed)?

The magic number changed many times between Python 3.5 and the future Python 3.6 beta 1. Maybe we skipped increasing this number for one bytecode change, but it should only impact the few developers who worked on the development version. In this case: just remove all ".pyc" files and it should be fine ;-)

In short: I think it's fine, don't do anything else ;-)

serhiy-storchaka · 2016-09-12T17:19:26Z

Objects/methodobject.c: In function ‘_PyCFunction_FastCallKeywords’:
Objects/methodobject.c:281:24: warning: ‘nkwargs’ may be used uninitialized in this function [-Wmaybe-uninitialized]
assert((nargs == 0 && nkwargs == 0) || stack != NULL);
^

vstinner · 2016-09-12T19:55:04Z

Thanks for the report, warning fixed in the issue bpo-28105.

serhiy-storchaka · 2016-09-22T16:54:57Z

I think that's all with this issue.

vstinner · 2016-09-22T21:13:20Z

I think that's all with this issue.

Thanks Demur and Serhiy for your great work!

serhiy-storchaka added interpreter-core (Objects, Python, Grammar, and Parser dirs) type-feature A feature request or enhancement labels Jun 4, 2016

serhiy-storchaka self-assigned this Sep 9, 2016

serhiy-storchaka closed this as completed Sep 22, 2016

ezio-melotti transferred this issue from another repository Apr 10, 2022

Rework CALL_FUNCTION* opcodes #71400

Rework CALL_FUNCTION* opcodes #71400

Comments

serhiy-storchaka commented Jun 4, 2016

serhiy-storchaka commented Jun 4, 2016

serprex mannequin commented Jun 4, 2016

serhiy-storchaka commented Jun 11, 2016

serhiy-storchaka commented Jun 16, 2016

serprex mannequin commented Jun 17, 2016

serprex mannequin commented Jun 19, 2016

markshannon commented Jun 19, 2016

serhiy-storchaka commented Jun 20, 2016

serprex mannequin commented Jun 20, 2016

serhiy-storchaka commented Jun 21, 2016

serhiy-storchaka commented Jun 30, 2016

serprex mannequin commented Jul 1, 2016

serhiy-storchaka commented Jul 1, 2016

serprex mannequin commented Jul 9, 2016

serprex mannequin commented Jul 15, 2016

vstinner commented Aug 22, 2016

vstinner commented Aug 25, 2016

vstinner commented Aug 25, 2016

vstinner commented Sep 6, 2016

vstinner commented Sep 9, 2016

1st1 commented Sep 9, 2016

vstinner commented Sep 9, 2016

serhiy-storchaka commented Sep 9, 2016

vstinner commented Sep 9, 2016

serhiy-storchaka commented Sep 9, 2016

vstinner commented Sep 9, 2016

vstinner commented Sep 9, 2016

serhiy-storchaka commented Sep 9, 2016

vstinner commented Sep 9, 2016

python-dev mannequin commented Sep 9, 2016

vstinner commented Sep 9, 2016

serhiy-storchaka commented Sep 9, 2016

python-dev mannequin commented Sep 9, 2016

vstinner commented Sep 9, 2016

serhiy-storchaka commented Sep 9, 2016

vstinner commented Sep 9, 2016

berkerpeksag commented Sep 11, 2016

vstinner commented Sep 11, 2016

berkerpeksag commented Sep 11, 2016

serhiy-storchaka commented Sep 11, 2016

berkerpeksag commented Sep 11, 2016

serhiy-storchaka commented Sep 11, 2016

python-dev mannequin commented Sep 11, 2016

rbtcollins commented Sep 11, 2016

serhiy-storchaka commented Sep 12, 2016

vstinner commented Sep 12, 2016

python-dev mannequin commented Sep 12, 2016

python-dev mannequin commented Sep 12, 2016

python-dev mannequin commented Sep 12, 2016

1st1 commented Sep 12, 2016

vstinner commented Sep 12, 2016

serhiy-storchaka commented Sep 12, 2016

vstinner commented Sep 12, 2016

serhiy-storchaka commented Sep 22, 2016

vstinner commented Sep 22, 2016