
Clean up and speed up dict iteration #72370

Closed

serhiy-storchaka opened this issue Sep 16, 2016 · 29 comments
Assignee: serhiy-storchaka
Labels: 3.7 (EOL) end of life · interpreter-core (Objects, Python, Grammar, and Parser dirs) · performance (Performance or resource usage)

Comments

@serhiy-storchaka
Member

BPO 28183
Nosy @rhettinger, @vstinner, @ned-deily, @methane, @serhiy-storchaka, @zhangyangyu
PRs
  • [Do Not Merge] Convert Misc/NEWS so that it is managed by towncrier #552
Files
  • dict_iter.patch
  • dict_iter2.patch
  • dict_iter3.patch
  • dict_iter4.patch: more PEP7 compliant code
  • dict_iter5.patch
  • dict_iter6.patch
  • dict_iter7.patch
  • dict_iter8.patch
Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.


    GitHub fields:

    assignee = 'https://github.com/serhiy-storchaka'
    closed_at = <Date 2016-10-09.20:17:19.798>
    created_at = <Date 2016-09-16.22:10:07.621>
    labels = ['interpreter-core', '3.7', 'performance']
    title = 'Clean up and speed up dict iteration'
    updated_at = <Date 2017-03-31.16:36:09.744>
    user = 'https://github.com/serhiy-storchaka'

    bugs.python.org fields:

    activity = <Date 2017-03-31.16:36:09.744>
    actor = 'dstufft'
    assignee = 'serhiy.storchaka'
    closed = True
    closed_date = <Date 2016-10-09.20:17:19.798>
    closer = 'serhiy.storchaka'
    components = ['Interpreter Core']
    creation = <Date 2016-09-16.22:10:07.621>
    creator = 'serhiy.storchaka'
    dependencies = []
    files = ['44697', '44719', '44857', '44864', '44874', '44876', '44879', '44921']
    hgrepos = []
    issue_num = 28183
    keywords = ['patch']
    message_count = 29.0
    messages = ['276755', '276789', '276811', '276821', '276823', '276830', '276868', '276873', '276889', '276899', '276902', '276903', '276915', '276919', '276922', '277501', '277502', '277596', '277612', '277631', '277687', '277699', '277701', '277713', '277718', '277752', '277854', '278387', '278388']
    nosy_count = 7.0
    nosy_names = ['rhettinger', 'vstinner', 'ned.deily', 'methane', 'python-dev', 'serhiy.storchaka', 'xiang.zhang']
    pr_nums = ['552']
    priority = 'normal'
    resolution = 'fixed'
    stage = 'resolved'
    status = 'closed'
    superseder = None
    type = 'performance'
    url = 'https://bugs.python.org/issue28183'
    versions = ['Python 3.6', 'Python 3.7']

    @serhiy-storchaka
    Member Author

    In some circumstances, iterating a dict under 3.6 can be 20% slower than under 3.5.

    $ ./python -m perf timeit -s "d = dict.fromkeys(range(10**6))" -- "list(d)"

    Python 3.5: Median +- std dev: 33.8 ms +- 0.7 ms
    Python 3.6: Median +- std dev: 37.8 ms +- 0.5 ms

    This seems to be compiler- and platform-specific; it is reproducible only with GCC on 32-bit builds.

    The proposed patch restores the 3.5 performance and simplifies the code.

    Python 3.6 patched: Median +- std dev: 33.7 ms +- 0.7 ms

    Other types of iteration:

    $ ./python -m perf timeit -s "d = dict.fromkeys(range(10**6)); v = d.values()" -- "list(v)"
    Python 3.5:            Median +- std dev: 26.2 ms +- 0.7 ms
    Python 3.6 unpatched:  Median +- std dev: 28.0 ms +- 0.6 ms
    Python 3.6 patched:    Median +- std dev: 26.3 ms +- 1.1 ms
    
    $ ./python -m perf timeit -s "d = dict.fromkeys(range(10**6)); v = d.items()" -- "list(v)"
    Python 3.5:            Median +- std dev: 232 ms +- 6 ms
    Python 3.6 unpatched:  Median +- std dev: 259 ms +- 6 ms
    Python 3.6 patched:    Median +- std dev: 243 ms +- 9 ms

    _PyDict_Next():
    $ ./python -m perf timeit -s "d = dict.fromkeys(range(10**6))" -- "set(d)"
    Python 3.5: Median +- std dev: 68.3 ms +- 1.8 ms
    Python 3.6 unpatched: Median +- std dev: 68.1 ms +- 2.5 ms
    Python 3.6 patched: Median +- std dev: 66.0 ms +- 1.2 ms

    PyDict_Next():
    $ ./python -m perf timeit -s "from _testcapi import test_dict_iteration as t" -- "t()"
    Python 3.5: Median +- std dev: 3.31 ms +- 0.10 ms
    Python 3.6 unpatched: Median +- std dev: 3.51 ms +- 0.09 ms
    Python 3.6 patched: Median +- std dev: 3.43 ms +- 0.09 ms

    @serhiy-storchaka serhiy-storchaka added the 3.7 (EOL) end of life, interpreter-core (Objects, Python, Grammar, and Parser dirs), and performance (Performance or resource usage) labels Sep 16, 2016
    @zhangyangyu
    Member

    Serhiy, I don't understand your patch. You delete the logic for split tables, and then iterating a split table actually fails:

    >>> class C:
    ...     pass
    ... 
    >>> a, b = C(), C()
    >>> a.a, a.b = 1, 2
    >>> list(b.__dict__)
    ['a', 'b']
    >>> 

    Even though b has no attributes of its own, it now also gets entries when iterating.
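
    For contrast, here is a minimal sketch of the expected behaviour (what an unpatched build does and what the fixed patch must preserve): instances of the same class share a split key table, but each instance's __dict__ must only report the keys for which that instance actually stores a value.

        class C:
            pass

        a, b = C(), C()
        a.a, a.b = 1, 2                        # 'a' and 'b' go into the shared key table
        assert list(a.__dict__) == ['a', 'b']
        assert list(b.__dict__) == []          # b stores no values, so nothing is iterated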

    @serhiy-storchaka
    Member Author

    Good catch, Xiang Zhang! I missed this point. Here is a fixed patch.

    $ ./python -m perf timeit -s "d = dict.fromkeys(range(10**6))" -- "list(d)"
    Python 3.5:            Median +- std dev: 33.8 ms +- 0.7 ms
    Python 3.6 unpatched:  Median +- std dev: 37.8 ms +- 0.5 ms
    Python 3.6 patched:    Median +- std dev: 34.0 ms +- 0.6 ms
    
    $ ./python -m perf timeit -s "d = dict.fromkeys(range(10**6)); v = d.values()" -- "list(v)"
    Python 3.5:            Median +- std dev: 26.2 ms +- 0.7 ms
    Python 3.6 unpatched:  Median +- std dev: 28.0 ms +- 0.6 ms
    Python 3.6 patched:    Median +- std dev: 26.1 ms +- 0.8 ms
    
    $ ./python -m perf timeit -s "d = dict.fromkeys(range(10**6)); v = d.items()" -- "list(v)"
    Python 3.5:            Median +- std dev: 232 ms +- 6 ms
    Python 3.6 unpatched:  Median +- std dev: 259 ms +- 6 ms
    Python 3.6 patched:    Median +- std dev: 249 ms +- 6 ms
    
    $ ./python -m perf timeit -s "d = dict.fromkeys(range(10**6))" -- "set(d)"
    Python 3.5:            Median +- std dev: 68.3 ms +- 1.8 ms
    Python 3.6 unpatched:  Median +- std dev: 68.1 ms +- 2.5 ms
    Python 3.6 patched:    Median +- std dev: 63.7 ms +- 1.5 ms
    
    $ ./python -m perf timeit -s "from _testcapi import test_dict_iteration as t" -- "t()"
    Python 3.5:            Median +- std dev: 3.31 ms +- 0.10 ms
    Python 3.6 unpatched:  Median +- std dev: 3.51 ms +- 0.09 ms
    Python 3.6 patched:    Median +- std dev: 3.43 ms +- 0.05 ms

    @rhettinger
    Contributor

    Also consider adding __length_hint__ to the various mapping views. That would help with the common case of listing a view.

    @serhiy-storchaka
    Member Author

    Dict iterators already have __length_hint__. Dict views have __len__.
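
    A quick sketch to confirm this (assuming CPython; operator.length_hint() uses __len__ when available and falls back to __length_hint__), showing that list() gets a size hint in both cases:

        import operator

        d = dict.fromkeys(range(1000))

        # Dict iterators expose __length_hint__ ...
        print(operator.length_hint(iter(d)))   # -> 1000
        # ... and dict views expose __len__, so list() can presize either way.
        print(len(d.values()))                 # -> 1000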

    @rhettinger
    Contributor

    Yes, I missed seeing that.

    @rhettinger
    Contributor

    Also, please take a look at resizing. It looks like it is doing way too much work. The original keys, values, and hashes shouldn't move at all; only the indices array needs to be updated.

    Here is the pure Python version from the original proof-of-concept at https://code.activestate.com/recipes/578375:

        def _resize(self, n):
            '''Reindex the existing hash/key/value entries.
               Entries do not get moved, they only get new indices.
               No calls are made to hash() or __eq__().
            '''
            n = 2 ** n.bit_length()             # round-up to power-of-two
            self.indices = self._make_index(n)
            for index, hashvalue in enumerate(self.hashlist):
                for i in Dict._gen_probes(hashvalue, n-1):
                    if self.indices[i] == FREE:
                        break
                self.indices[i] = index
            self.filled = self.used
    

    Likewise, it looks like there is room for improvement in dict_copy(). It can memcpy() all four arrays and then incref all the keys. That should be considerably faster than zeroing new arrays and applying re-insertion logic.
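
    As a hedged pure-Python analogue of that idea, in the style of the recipe linked above (the keylist/valuelist attribute names are assumptions for illustration; only indices, hashlist, used and filled appear in the snippet quoted earlier), a copy can duplicate the parallel arrays wholesale instead of re-inserting every item through the probing logic:

        def _copy(self):
            # Duplicate the index table and the parallel entry arrays directly;
            # no probing, hashing, or __eq__ calls are needed.  The C version
            # would memcpy() the arrays and then Py_INCREF each key and value.
            new = object.__new__(type(self))
            new.indices = self.indices[:]
            new.hashlist = self.hashlist[:]
            new.keylist = self.keylist[:]      # hypothetical attribute name
            new.valuelist = self.valuelist[:]  # hypothetical attribute name
            new.used = self.used
            new.filled = self.filled
            return new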

    @zhangyangyu
    Member

    Serhiy: Patch LGTM except two trivial comments on Rietveld.

    Raymond: With such an implementation, keys, values, and hashes are all organized together, and it seems no _resize operation can adjust only the hashes without breaking the entire layout.

    @methane
    Member

    methane commented Sep 18, 2016

    LGTM, thanks.

    @rhettinger
    Contributor

    Raymond: With such an implementation, keys, values, and hashes are all
    organized together, and it seems no _resize operation can adjust only
    the hashes without breaking the entire layout.

    None of those needs to change during a resize. Only the indices array needs to be rebuilt.

    @serhiy-storchaka serhiy-storchaka self-assigned this Sep 18, 2016
    @rhettinger
    Contributor

    We can still clean this up for Python 3.6. We're in feature freeze, not development freeze. The compact dict patch was very rough when it landed and is expected to continue to be polished so that the expected benefits are realized.

    @rhettinger rhettinger removed the 3.7 (EOL) end of life label Sep 18, 2016
    @serhiy-storchaka
    Member Author

    Actually, most of the optimization is not specific to the new dict implementation.

    @rhettinger
    Contributor

    Here is a rough (very rough and not compilable) sketch of the direction that resizing should take:

    static void
    insert_indices_clean(PyDictObject *mp, Py_hash_t hash)
    {
        size_t i, perturb;
        PyDictKeysObject *k = mp->ma_keys;
        size_t mask = (size_t)DK_SIZE(k)-1;

        i = hash & mask;
        for (perturb = hash; dk_get_index(k, i) != DKIX_EMPTY;
             perturb >>= PERTURB_SHIFT) {
            i = mask & ((i << 2) + i + perturb + 1);
        }
        dk_set_index(k, i, k->dk_nentries);
    }

    static int
    dictresize(PyDictObject *mp, Py_ssize_t minused)
    {
        Py_ssize_t i, newsize;
        PyDictKeyEntry *ep0;

        /* Find the smallest table size > minused. */
        for (newsize = PyDict_MINSIZE;
             newsize <= minused && newsize > 0;
             newsize <<= 1)
            ;
        if (newsize <= 0) {
            PyErr_NoMemory();
            return -1;
        }

        /* Grow only the indices array; the existing entries stay in place. */
        realloc(dk->dk_indicies, es * newsize);
        memset(&dk->dk_indices.as_1[0], 0xff, es * size);
        dk->dk_size = size;

        /* Re-insert an index for every live entry; no hash() or __eq__ calls. */
        for (i = 0; i < mp->dk_nentries; i++) {
            PyDictKeyEntry *ep = &ep0[i];
            if (ep->me_value != NULL) {
                insert_indices_clean(mp, ep->me_hash);
            }
        }
        return 0;
    }

    @serhiy-storchaka
    Member Author

    Raymond, I think this is a different issue.

    @rhettinger
    Contributor

    Moved dictresize() discussion to http://bugs.python.org/issue28199

    @methane
    Member

    methane commented Sep 27, 2016

    Serhiy, may I update your patch if you're busy this week?

    @serhiy-storchaka
    Member Author

    Feel free to do so. I'm not that busy; I'm experimenting with other variants.

    @methane
    Member

    methane commented Sep 28, 2016

    Updated with a small refactoring.
    Confirmed it passes quicktest with no warnings from clang on OS X.

    @methane methane added the 3.7 (EOL) end of life label Sep 28, 2016
    @methane
    Member

    methane commented Sep 28, 2016

    Draft Misc/NEWS entry (and commit message) is:

    Core and Builtins
    -----------------

    - Issue bpo-28183: Optimize dict iteration.

    @vstinner
    Member

    I reviewed dict_iter4.patch on Rietveld.

    @methane
    Member

    methane commented Sep 29, 2016

    Recreated the patch with a different option, since Rietveld doesn't accept dict_iter5.patch.

    @vstinner
    Member

    dict_iter7.patch LGTM, go ahead. Just don't forget to mention Serhiy Storchaka as the co-author ;-)

    @serhiy-storchaka
    Member Author

    Please wait until I merge my changes.

    @serhiy-storchaka
    Member Author

    It looks to me that the patch contains unneeded changes. Some changes don't affect performance, and others don't look well justified. Are there benchmarks that confirm the benefit of these changes?

    @vstinner
    Member

    Serhiy Storchaka added the comment:

    It looks to me that the patch contains unneeded changes. Some changes don't affect performance, and others don't look well justified. Are there benchmarks that confirm the benefit of these changes?

    Hum, since I expect disagreements on the changes, I propose to split
    the patch into two parts: a minimal change to optimize, and a second
    change (later, once the first one is merged) to further clean up the
    code.

    "Some changes don't affect performance"

    Well, a cleanup is supposed to not affect performance at all :-)

    @methane
    Member

    methane commented Sep 30, 2016

    The refactoring includes replacing PyDict_Next with _PyDict_Next, which
    can improve performance by skipping some checks, but the gain would be
    too small to show up in a benchmark.

    I'll reduce the diff size, hopefully this weekend.

    @methane
    Member

    methane commented Oct 2, 2016

    dict_iter8.patch is based on dict_iter3.patch. It adds some comments and fixes indentation.
    There are no changes to the _PyDict_Next API.

    @python-dev
    Mannequin

    python-dev mannequin commented Oct 9, 2016

    New changeset 112714f3745d by Serhiy Storchaka in branch '3.6':
    Issue bpo-28183: Optimize and cleanup dict iteration.
    https://hg.python.org/cpython/rev/112714f3745d

    @serhiy-storchaka
    Member Author

    For a few weeks I tried to rewrite the loops in a faster way, but I have not found anything significantly faster or clearer. Some forms were a few percent faster, but it would be difficult to explain this effect; most likely it was an artifact of the compilation.

    Removed unrelated changes from dict_iter8.patch and added an additional check in _PyDict_Next() before committing.

    @ezio-melotti ezio-melotti transferred this issue from another repository Apr 10, 2022