Issue 30571: Add integer formatting code for fixed-width signed arithmetic (2's complement)

➜

This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

This issue has been migrated to GitHub: https://github.com/python/cpython/issues/74756

classification

Title:	Add integer formatting code for fixed-width signed arithmetic (2's complement)
Type:	enhancement	Stage:
Components:	Interpreter Core	Versions:	Python 3.7

process

Status:	open	Resolution:
Dependencies:		Superseder:
Assigned To:	lisroach	Nosy List:	cheryl.sabella, eric.smith, lisroach, pitrou, rhettinger, serhiy.storchaka, skrah, talin
Priority:	low	Keywords:

Created on 2017-06-05 04:03 by rhettinger, last changed 2022-04-11 14:58 by admin.

Messages (8)
msg295162 - (view)	Author: Raymond Hettinger (rhettinger) *	Date: 2017-06-05 04:03
The current 'b' formatting code in inconvenient for emulating, demonstrating, and teaching two's complement arithmetic. The problem is that negative inputs are always formatted with a minus sign. I propose that some formatting code be provided for fixed-width display where the leading bit is a sign bit. For example, if code were a capital 'B' and the total width were 8-bits: >>> x = -12 >>> format(12, '08B') '11110100' Currently, to achieve the same effect, one of the following is used: >>> format(x if x >= 0 else x + 28, '08b') '11110100' or >>> format(x & (28 - 1), '08b') '11110100' For values outside the valid range, perhaps a ValueError could be raised: >>> format(-200, '08B') Traceback (most recent call last): ... ValueError: Expected value in range -128 <= x < 128, not -200 I'm not sure what the right code should be. The idea of capital 'B' is attractive, but we already have a different relationship between 'X' and 'x'. There could also be a modifier symbol such as '!' in '!8b'.
msg295749 - (view)	Author: Mark Dickinson (mark.dickinson) *	Date: 2017-06-12 07:01
-1 from me. Using `format(x % 2**8, '08b')` seems both short enough to be easy to use, and long enough to remind one that there's something a little bit unnatural going on here, given that two's complement isn't something that makes sense for arbitrary-sized integers.
msg295752 - (view)	Author: Serhiy Storchaka (serhiy.storchaka) *	Date: 2017-06-12 07:39
Concur with Mark. Similar issues already were raised multiple times on mailing lists and the conclusion is that explicit wrapping integers to specific range is better. Different behavior for integers out of range is needed in different applications.
msg295753 - (view)	Author: Raymond Hettinger (rhettinger) *	Date: 2017-06-12 08:00
This is a recurring need in my teaching of Python to hardware engineers. The whole point of having a binary format code is to show which bits are set. For negative values, that need is not being served by the current option. And it makes it awkward when modeling the effects of bitwise operations on signed numbers. I can understand a -1 if you think this is fundamentally broken, but if you're just saying that you've never personally needed this or casually dismissing the awkwardness of the usual workarounds, then it seems like a meaningless vote that blows-off my proposal which is based on real-world classroom experiences. The idea is that modifier (such as "!" would require a width argument (the signed representation only make sense in fixed width concepts as every assembly language programmer knows). Its presence would also imply the "0". Serhiy, I believe you've either read different maillist posts than I have or that you're misinterpreting this as a proposal for a fixed width integer type with automatic wrap-around and truncation (full emulation of a register). This proposal is for display formatting only. IMO, it remedies a deficiency where the current option is more for our convenience (not wanting to deal with the width) rather than for the user's convenience where the user wants to see which bits are set rather than seeing a minus sign (which is both obvious and useless). Also, when a senior coredev presents a proposal, I expect that it will be greeted with a little more open mindedness and not instantly shot down as if I have no idea what I'm talking about.
msg295754 - (view)	Author: Serhiy Storchaka (serhiy.storchaka) *	Date: 2017-06-12 08:20
> The idea is that modifier (such as "!" would require a width argument (the signed representation only make sense in fixed width concepts as every assembly language programmer knows). Its presence would also imply the "0". I would use the precision for this. Truncating the number of digits in integer representation is similar to truncating the number of characters for strings. But actually I don't think we should add such feature in the core. It has very limited application. This can be implemented as a third-part library. Actually I think the stdlib needs something like bitarray and/or bitset. The specially purposed types that allow testing and modifying separate bits and bit ranges.
msg295793 - (view)	Author: Stefan Krah (skrah) *	Date: 2017-06-12 14:54
To expand on what Mark said: If the proposed format code implicitly gives these results ... >>> format(12 % 28, '08b') '00001100' >>> format(-12 % 28, '08b') '11110100' ..., would students not expect that these two operations have the same result? >>> 0b11110100 \| 0b00001100 252 >>> >>> -12 \| 12 -4
msg295805 - (view)	Author: Antoine Pitrou (pitrou) *	Date: 2017-06-12 17:43
The format code space is already crowded, and it isn't immediately obvious that '08B' will format the 2's complement representation of the number, as opposed to '08b' which prepends a sign bit. Not to mention that, as Raymond remarked, we cannot use the same convention for hex formatting where 'X' already has a different meaning. Instead, how about adding an optional argument to bin()? And then we can also do the same for hex(). By the way, I do find annoying the default behaviour of prepending a sign bit on hexadecimal and binary displays (but I never use binary displays, so my annoyance is really with hexadecimal displays).
msg295843 - (view)	Author: Lisa Roach (lisroach) *	Date: 2017-06-13 03:25
I can't see the drawback of being able to format two's complement easily, isn't it more common to represent negative binary numbers in two's complement form? I find the - sign a little odd myself. I agree with the idea of using "!" with the precision to represent width.

History
Date	User	Action	Args
2022-04-11 14:58:47	admin	set	github: 74756
2017-06-13 03:25:21	lisroach	set	messages: + msg295843
2017-06-12 17:43:08	pitrou	set	nosy: + pitrou messages: + msg295805
2017-06-12 14:54:44	skrah	set	nosy: + skrah messages: + msg295793
2017-06-12 14:41:40	mark.dickinson	set	nosy: - mark.dickinson
2017-06-12 08:20:55	serhiy.storchaka	set	messages: + msg295754
2017-06-12 08:00:04	rhettinger	set	messages: + msg295753
2017-06-12 07:39:54	serhiy.storchaka	set	nosy: + serhiy.storchaka messages: + msg295752
2017-06-12 07:01:52	mark.dickinson	set	nosy: + eric.smith messages: + msg295749
2017-06-12 00:52:34	rhettinger	set	assignee: lisroach nosy: + lisroach
2017-06-06 09:09:59	cheryl.sabella	set	nosy: + cheryl.sabella
2017-06-05 04:03:26	rhettinger	create