Issue 30140: Binary arithmetic does not always call subclasses first

➜

This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

This issue has been migrated to GitHub: https://github.com/python/cpython/issues/74326

classification

Title:	Binary arithmetic does not always call subclasses first
Type:	behavior	Stage:	resolved
Components:	Interpreter Core	Versions:	Python 3.8

process

Status:	closed	Resolution:	wont fix
Dependencies:		Superseder:
Assigned To:		Nosy List:	Eric.Wieser, Stephan Hoyer, belopolsky, gvanrossum, josh.r, leezu, mark.dickinson, ncoghlan, rhettinger, serhiy.storchaka, tim.peters, vstinner
Priority:	normal	Keywords:	needs review, patch

Created on 2017-04-23 04:21 by Stephan Hoyer, last changed 2022-04-11 14:58 by admin. This issue is now closed.

Pull Requests
URL	Status	Linked	Edit
PR 1325	closed	python-dev, 2017-04-27 15:58

Messages (13)
msg292149 - (view)	Author: Stephan Hoyer (Stephan Hoyer) *	Date: 2017-04-23 04:21
We are writing a system for overloading NumPy operations (see PR [1] and design doc [2]) that is designed to copy and extend Python's system for overloading binary operators. The reference documentation on binary arithmetic [3] states: > Note: If the right operand's type is a subclass of the left operand’s type and that subclass provides the reflected method for the operation, this method will be called before the left operand’s non-reflected method. This behavior allows subclasses to override their ancestors’ operations. However, this isn't actually done if the right operand merely inherits from the left operand's type. In practice, CPython requires that the right operand defines a different method before it defers to it. Note that the behavior is different for comparisons, which defer to subclasses regardless of whether they implement a new method [4]. I think this behavior is a mistake and should be corrected. It is just as useful to write generic binary arithmetic methods that are well defined on subclasses as generic comparison operations. In fact, this is exactly the design pattern we propose for objects implementing special operators like NumPy arrays (see NDArrayOperatorsMixin in [1] and [2]). Here is a simple example, of a well-behaved that implements addition by wrapping its value and returns NotImplemented when the other operand has the wrong type: class A: def __init__(self, value): self.value = value def __add__(self, other): if not isinstance(other, A): return NotImplemented return type(self)(self.value + other.value) __radd__ = __add__ def __repr__(self): return f'{type(self).__name__}({self.value!r})' class B(A): pass class C(A): def __add__(self, other): if not isinstance(other, A): return NotImplemented return type(self)(self.value + other.value) __radd__ = __add__ A does not defer to B: >>> A(1) + B(1) A(2) But it does defer to C, which defines new methods (literally copied/pasted) for __add__/__radd__: >>> A(1) + C(1) C(2) With the current behavior, special operator implementations need to explicitly account for the possibility that they are being called from a subclass by returning NotImplemented. My guess is that this is rarely done, which means that most of these methods are broken when used with subclasses, or subclasses needlessly reimplement these methods. Can we fix this logic for Python 3.7? [1] https://github.com/numpy/numpy/pull/8247 [2] https://github.com/charris/numpy/blob/406bbc652424fff332f49b0d2f2e5aedd8191d33/doc/neps/ufunc-overrides.rst [3] https://docs.python.org/3/reference/datamodel.html#object.__ror__ [4] http://bugs.python.org/issue22052
msg292157 - (view)	Author: Mark Dickinson (mark.dickinson) *	Date: 2017-04-23 08:45
This is probably worth bringing up on the python-dev or python-ideas mailing lists for greater visibility. I can't think of any plausible non-historical reason why it makes sense for comparisons to behave one way and arithmetic operators another. Altering this might be a PEP-level change, though. The "Coercion rules" section[1] of the Python 2.7 docs is a bit more explicit about the intent: """ Exception to the previous item: if the left operand is an instance of a built-in type or a new-style class, and the right operand is an instance of a proper subclass of that type or class and overrides the base’s __rop__() method, the right operand’s __rop__() method is tried before the left operand’s __op__() method. """ so the check for an override was clearly intentional, rather than an implementation convenience or accident. (It's also clearly intentional in the source and comments.) The 3.x docs don't have the "and overrides" language; I haven't figured out why and when that language changed. [1] https://docs.python.org/release/2.7.6/reference/datamodel.html#coercion-rules
msg292188 - (view)	Author: Stephan Hoyer (Stephan Hoyer) *	Date: 2017-04-24 01:26
Posted to python-ideas: https://mail.python.org/pipermail/python-ideas/2017-April/045451.html Mark -- just out of curiosity, could you point me to where this logic is implemented in CPython's source? This feels like something that could once been called a bug but that by now may have become a feature, by virtue of how long it's lasted out in the world.
msg292198 - (view)	Author: Mark Dickinson (mark.dickinson) *	Date: 2017-04-24 07:21
> could you point me to where this logic is implemented in CPython's source? Most of the relevant code is in Objects/abstract.c and Objects/typeobject.c. A BINARY_ADD opcode (for example) ends up calling PyNumber_Add: https://github.com/python/cpython/blob/v3.6.1/Objects/abstract.c#L913 which in turn calls binary_op1, where you see an explicit check for the nb_add slots of the two operands being identical: https://github.com/python/cpython/blob/v3.6.1/Objects/abstract.c#L769-L770 For a user-defined class, the slots themselves are defined in typeobject.c. Here's where nb_add is defined: https://github.com/python/cpython/blob/v3.6.1/Objects/typeobject.c#L5952 and here's the explicit check for overloading in the SLOT1BIN macro definition: https://github.com/python/cpython/blob/v3.6.1/Objects/typeobject.c#L5796 There's also an explicit test for the arithmetic operation behaviour in Lib/test/test_descr.py. In short, I doubt this was ever a bug: everything points to this being a deliberate design decision. I hope someone on python-ideas can elaborate on the rationale behind that design decision (and also on why that rationale doesn't apply to comparisons). In contrast, it does seem plausible to me that the comparison failure to check for an explicit override may have been accidental.
msg292278 - (view)	Author: Josh Rosenberg (josh.r) *	Date: 2017-04-25 20:15
I'd assume the preference for __rop__ only on subclass overload is because __rop__ method are usually fallback methods, and differ behaviorally from the __op__ methods in type strictness. In particular, the __rop__ fallbacks are often so non-strict that they return a completely different type; fractions.Fraction.__rop__ is willing to coerce itself and the other operand to float and produce a float result if the other operand is a numbers.Real (and not a Rational). They also tend to be slower (checking against ABCs and doing more type coercion) than the __op__ path. If you jump straight to __rop__ because the right hand side is a subclass, but the subclass didn't overload it, you end up going through that fallback, assumed extra liberal and slow, code path. It doesn't work this way with comparison operators because those are 100% reflexive; there is no expectation that comparing in one direction will be more or less type permissive than comparing in the other direction (stuff like __rcmp__ has been gone for ages after all), so unconditionally comparing using the child class comparator first is fine, and more likely to get correct results. The design pattern that has problems here is a bit unorthodox to start with. It assumes that the child class constructor will work exactly the same as the parent (no additional mandatory arguments for instance), and that it's always correct for parent + child to produce the type of child. Usually, in an OO design, the parent is not supposed to have any specific knowledge of children; it's the job of the children to work with instances of the parent, if necessary. If delegation to the child is desired, implement __op__ with stricter type checking (to preclude subclasses) and __rop__ with relaxed type checking (to allow them); when the __op__ executes, it will return NotImplemented for the child class, then delegate to __rop__, which will use the child's type.
msg292279 - (view)	Author: Stephan Hoyer (Stephan Hoyer) *	Date: 2017-04-25 20:55
> The design pattern that has problems here is a bit unorthodox to start with. I agree. This was meant strictly as a simple example for illustrative purposes. Steven D'Aprano's example from python-ideas may be a better one: https://mail.python.org/pipermail/python-ideas/2017-April/045455.html class A: def __add__(self, other): self.log() ... __radd__ = __add__ class B(A): def log(self): ... Our actual use case for NumPy involved writing a mixin that look more like this, that expects a specified method to implement arithmetic, i.e., class NDArrayOperatorsMixin: def __add__(self, other): return self._calculate(np.add, self, other) def __radd__(self, other): return self._calculate(np.add, other, self) ... # repeat for all special methods class A(NDArrayOperatorsMixin): def _calculate(self, op, args): if not all(isinstance(arg, A) for arg in args): return NotImplemented ... # implement calculation class B(A): def _calculate(self, op, args): ... # something different In A() + B(), B never gets the chance to override A's implementation of __add__ via _calculate, because it overrode a different method (_calculate) which happens to contain the implementation for __radd__, but not __radd__ itself. Anyways, if you have serious concerns about changing this, it is probably best respond to Guido on python-ideas: https://mail.python.org/pipermail/python-ideas/2017-April/045468.html
msg292460 - (view)	Author: Serhiy Storchaka (serhiy.storchaka) *	Date: 2017-04-27 16:14
binary_op1() is wrong place for fixing this issue. You need to change SLOT1BINFULL in Objects/typeobject.c.
msg301382 - (view)	Author: Stephan Hoyer (Stephan Hoyer) *	Date: 2017-09-05 22:01
Serhiy: thanks for the tip. I've updated my PR, which I think is now ready for review.
msg304597 - (view)	Author: Stephan Hoyer (Stephan Hoyer) *	Date: 2017-10-18 23:19
Ping -- it would be great if someone could take a look at my PR. (I suspect it needs more documentation, tips on where to put that would be appreciated.)
msg325553 - (view)	Author: Guido van Rossum (gvanrossum) *	Date: 2018-09-17 16:53
If anyone cares, here's what I recall of the history of this feature. Originally my design was just 1. try left arg's __op__ 2. if that returns NotImplemented, try right arg's __rop__ That's simple to explain and usually works. But when thinking through edge cases, we found that if the right class is a subclass of the left class, this gives unexpected results, so we added the exception 0. if the right class subclasses the left class and it overrides __rop__, try that first I honestly believe that I didn't realize that it could ever matter if the right class subclasses the left class without overriding __rop__. And nobody else realized it either. I think we decided on the "right class must override __rop__" condition in part to preserve the original, simpler rules for cases where we thought they would suffice, in part because for most users trying __rop__ before __op__ is unexpected, and in part because we didn't realize that it mattered. And it only matters if the base class uses something like type(self) or self.__class__, which as a design pattern is sometimes dubious and other times useful. Also note that the language Stephan quoted is somewhat different: it uses "provides" (__rop__) instead of "overrides". I could imagine that this, again, was a difference too subtle for most folks to notice. If I had to do it over, I agree that if the right class subclasses the left class, we should always try __rop__ first. But now that this has been an ingrained behavior for so long I worry that it will break existing code, esp. in cases where there is an asymmetry between __op__ and __rop__ and the class author did not expect __rop__ ever to be called first. Whether that's enough to stop this PR I think this is a question for the next BDFL to decide, alas. (I don't think it requires a PEP, it just requires someone to decide.)
msg325876 - (view)	Author: Nick Coghlan (ncoghlan) *	Date: 2018-09-20 13:48
Note that we've left a similar operand precedence handling issue languishing for a long time over compatibility concerns: https://bugs.python.org/issue11477 In that case, NumPy is actually benefiting from the discrepancy with the documentation though, as the lower-than-expected precedence of sq_concat and sq_repeat is what allows NumPy arrays to always take precedence over builtin container types, even when the array is the right hand operand.
msg347963 - (view)	Author: Guido van Rossum (gvanrossum) *	Date: 2019-07-15 14:36
Are there any core devs who care enough to make a decision here?
msg366186 - (view)	Author: Guido van Rossum (gvanrossum) *	Date: 2020-04-11 03:38
Let's give up on this one.

History
Date	User	Action	Args
2022-04-11 14:58:45	admin	set	github: 74326
2020-04-11 03:38:55	gvanrossum	set	status: open -> closed resolution: wont fix messages: + msg366186 stage: patch review -> resolved
2019-07-15 14:38:03	vstinner	set	nosy: + vstinner
2019-07-15 14:36:56	gvanrossum	set	messages: + msg347963
2019-07-15 09:35:23	leezu	set	nosy: + leezu
2018-09-20 13:48:36	ncoghlan	set	nosy: + ncoghlan messages: + msg325876
2018-09-17 19:41:18	rhettinger	set	nosy: + tim.peters
2018-09-17 16:53:37	gvanrossum	set	nosy: + gvanrossum messages: + msg325553 versions: + Python 3.8, - Python 3.7
2018-09-15 15:26:52	belopolsky	set	nosy: + belopolsky
2017-12-09 22:06:15	cheryl.sabella	set	keywords: + patch, needs review stage: patch review components: + Interpreter Core versions: + Python 3.7
2017-10-18 23:19:15	Stephan Hoyer	set	messages: + msg304597
2017-09-05 22:01:12	Stephan Hoyer	set	messages: + msg301382
2017-04-27 16:14:17	serhiy.storchaka	set	nosy: + serhiy.storchaka messages: + msg292460
2017-04-27 15:58:42	python-dev	set	pull_requests: + pull_request1436
2017-04-25 20:55:08	Stephan Hoyer	set	messages: + msg292279
2017-04-25 20:15:25	josh.r	set	nosy: + josh.r messages: + msg292278
2017-04-24 07:21:50	mark.dickinson	set	messages: + msg292198
2017-04-24 03:04:16	rhettinger	set	nosy: + rhettinger
2017-04-24 01:26:47	Stephan Hoyer	set	messages: + msg292188
2017-04-23 13:28:47	Eric.Wieser	set	nosy: + Eric.Wieser
2017-04-23 08:45:07	mark.dickinson	set	messages: + msg292157
2017-04-23 07:56:37	mark.dickinson	set	nosy: + mark.dickinson
2017-04-23 04:21:44	Stephan Hoyer	create