Message 336706 - Python tracker

➜

This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author	josh.r
Recipients	ammar2, josh.r, larry, serhiy.storchaka, vstinner, xtreak
Date	2019-02-26.19:12:26
SpamBayes Score	-1.0
Marked as misclassified	Yes
Message-id	<1551208347.19.0.0530809746155.issue36127@roundup.psfhosted.org>
In-reply-to

Content
How much bigger does the core interpreter + built-in extension modules get when you make this change? How much more memory is used by real world programs? I'm a little concerned when I see individual functions growing by 140 lines in the first file of the diff. Clearly the increase in memory usage wasn't enough to slow down the microbenchmarks (which can probably fit the entire hot code path in CPU on die cache), but I'd be worried about the aggregate effect on real world programs if we go from a handful of argument parsing code paths reused by every function (and therefore kept constantly hot) to hundreds (or thousands?) of unique argument parsing code paths, each one unique to a single function. It could easily look great when a single function is being called repeatedly, while looking much less great (possibly worse) when the many varied function calls are interleaved. It should be tested on a number of systems too; any losses to cache unfriendliness would be highly dependent on the size of the CPU cache. I'll grant, it doesn't seem like inlining positional argument parsing has caused problems, and it looks like we're still using _PyArg_UnpackKeywords, so argument parsing isn't completely devolved to each functions, but I think we need something more than microbenchmarks before we jump on this. If my worries end up being unfounded, I'll be happy. Looks very cool if we can really get that sort of speed up for all function calls, not just positional args only functions. :-)

How much bigger does the core interpreter + built-in extension modules get when you make this change? How much more memory is used by real world programs?

I'm a little concerned when I see individual functions growing by 140 lines in the first file of the diff. Clearly the increase in memory usage wasn't enough to slow down the microbenchmarks (which can probably fit the entire hot code path in CPU on die cache), but I'd be worried about the aggregate effect on real world programs if we go from a handful of argument parsing code paths reused by every function (and therefore kept constantly hot) to hundreds (or thousands?) of unique argument parsing code paths, each one unique to a single function. It could easily look great when a single function is being called repeatedly, while looking much less great (possibly worse) when the many varied function calls are interleaved.

It should be tested on a number of systems too; any losses to cache unfriendliness would be highly dependent on the size of the CPU cache.

I'll grant, it doesn't seem like inlining positional argument parsing has caused problems, and it looks like we're still using _PyArg_UnpackKeywords, so argument parsing isn't completely devolved to each functions, but I think we need something more than microbenchmarks before we jump on this.

If my worries end up being unfounded, I'll be happy. Looks very cool if we can really get that sort of speed up for all function calls, not just positional args only functions. :-)

History
Date	User	Action	Args
2019-02-26 19:12:27	josh.r	set	recipients: + josh.r, vstinner, larry, serhiy.storchaka, ammar2, xtreak
2019-02-26 19:12:27	josh.r	set	messageid: <1551208347.19.0.0530809746155.issue36127@roundup.psfhosted.org>
2019-02-26 19:12:27	josh.r	link	issue36127 messages
2019-02-26 19:12:26	josh.r	create