Issue 5118: '%.2f' % 2.545 doesn't round correctly

This issue has been migrated to GitHub: https://github.com/python/cpython/issues/49368

classification

Title:	'%.2f' % 2.545 doesn't round correctly
Type:	behavior
Stage:
Components:
Versions:	Python 2.7

process

Status:	closed
Resolution:	not a bug
Dependencies:
Superseder:
Assigned To:	mark.dickinson
Nosy List:	Ultrasick, Zeev.Rotshtein, mark.dickinson
Priority:	normal
Keywords:

Created on 2009-01-31 12:57 by Ultrasick, last changed 2022-04-11 14:56 by admin. This issue is now closed.

Messages (12)
msg80868	Author: (Ultrasick)	Date: 2009-01-31 12:57
print '%.2f' % 2.544  # returns 2.54
print '%.2f' % 2.545  # returns 2.54 but should return 2.55
print '%.2f' % 2.546  # returns 2.55

msg80869	Author: Mark Dickinson (mark.dickinson) *	Date: 2009-01-31 14:12
This is not a bug; it's a consequence of the finite accuracy of floating-point arithmetic. If you look at the actual value that's stored for '2.545', you'll see that it's actually slightly less than 2.545, so rounding it down is the correct thing to do.

>>> 2.545
2.5449999999999999

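The stored value can also be inspected exactly rather than through a 17-digit repr. The following is a minimal sketch, assuming Python 3 and IEEE 754 double-precision floats; the variable names are illustrative only.

```python
from decimal import Decimal

# Decimal(float) converts the binary double exactly, with no extra rounding.
stored = Decimal(2.545)

# Decimal(str) holds the decimal value exactly as written.
intended = Decimal("2.545")

print(stored)             # the full decimal expansion of the stored double
print(stored < intended)  # True: the stored value sits just below 2.545
```

Because the stored value is below the halfway point between 2.54 and 2.55, any correct round-to-nearest formatting has to produce 2.54.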
msg80870	Author: (Ultrasick)	Date: 2009-01-31 14:18
print round(2.545, 2)  # returns 2.55

msg80871	Author: Mark Dickinson (mark.dickinson) *	Date: 2009-01-31 15:14
> print round(2.545, 2)  # returns 2.55

Aha! Yes, that one is a bug (see issue #1869), though it's not one that I regard as terribly serious, and not one that can be easily solved in all cases.

Here's why I don't see it as particularly serious: you're rounding a value that's just on the boundary: 2.545+tiny_error should round up, while 2.545-tiny_error should round down. But tiny (or not-so-tiny) errors are an almost unavoidable part of working with binary floating-point arithmetic. Additionally, whether the binary approximation stored for 2.545 is less than or greater than the true value depends on many things (format of a C double, system C library function used for string-to-double conversion, etc.), so in a sense either 2.55 or 2.54 can be defended as a valid result, and a good numeric programmer won't write code that depends on getting one or the other.

Having said that, if you're interested in providing a patch for issue #1869 I'd certainly take a look.

If you care about exact representations of numbers with a finite number of places after the decimal point, you may be interested in Python's 'decimal' module.

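As an illustration of the 'decimal' suggestion, here is a minimal sketch, assuming Python 3. It builds the value from a string so that it is exactly 2.545, then rounds to two places under two of the rounding modes the module provides.

```python
from decimal import Decimal, ROUND_HALF_EVEN, ROUND_HALF_UP

value = Decimal("2.545")   # built from a string, so exactly 2.545
cent = Decimal("0.01")     # quantize() rounds to this exponent

half_up = value.quantize(cent, rounding=ROUND_HALF_UP)
half_even = value.quantize(cent, rounding=ROUND_HALF_EVEN)

print(half_up)    # 2.55 (ties go away from zero)
print(half_even)  # 2.54 (ties go to the even digit)
```

With an exact decimal value the tie is real, so the result depends only on the rounding mode chosen and not on a binary approximation.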
msg80873	Author: (Ultrasick)	Date: 2009-01-31 15:23
I am sorry, but I am not a C programmer. I cannot provide any patches.

As far as I understand, this issue and issue #1869 have a common problem, but this issue wouldn't be solved if issue #1869 were solved. "print '%.2f' % 2.545" doesn't seem to use the built-in round() function. Otherwise the result would already be 2.55, as the result of round(2.545, 2) is. So you might want to reopen the bug. But either way, I don't consider this bug really serious either.

msg80874	Author: Mark Dickinson (mark.dickinson) *	Date: 2009-01-31 15:36
> So you might want to reopen the bug. But either way I don't consider
> this bug as really serious either.

I don't understand. As far as I can see, '%.2f' % 2.545 is returning the correct result: there is no bug here, so no need to reopen. '%.2f' should not return 2.55; it should return 2.54, which is exactly what it does.

round(2.545, 2) should also return 2.54, but returns 2.55 instead; issue 1869 is already open for this.

You're correct that the float formatting doesn't use round: it does whatever the platform C library's sprintf does.

msg80875	Author: (Ultrasick)	Date: 2009-01-31 15:51
Well, that's not how I learned that rounding works. I think this is the more common way:

0.4 -> 0
0.5 -> 1
0.6 -> 1

I hope you don't try to spread the misbehaviour of Python's way of rounding

print '%.2f' % 2.545  # returns 2.54

to the built-in round() function, so that round() would also return 2.54. The result of rounding 2.545 is 2.55 no matter how Python temporarily stores "2.545" and independent of how Python does the rounding. The result is 2.55 and not 2.54. If Python doesn't deliver "2.55" as the result of its rounding algorithm then it's doing it wrong. And if Python does stuff wrong then it has a bug, in my opinion.

msg80876	Author: Mark Dickinson (mark.dickinson) *	Date: 2009-01-31 16:05
> result is 2.55 and not 2.54. If python doesn't deliver "2.55" as the
> result of it's rounding algorithm then it's doing it wrong. And if

Sorry, but that's just not true. I suggest that you (a) read the section on floating-point[1] in the Python tutorial, and/or (b) ask about this on comp.lang.python if you feel inclined; there are plenty of people there who would be glad to explain what's going on here.

[1] http://docs.python.org/tutorial/floatingpoint.html

msg158716	Author: Zeev Rotshtein (Zeev.Rotshtein)	Date: 2012-04-19 11:19
Well, this IS a bug. There is a certain globally accepted manner in which rounding works, and Python does something else.

P.S.: A bug is when something doesn't do what it's supposed to do the way it's supposed to do it. This definition does not depend on "internal representation" or any such things.

msg158717	Author: Mark Dickinson (mark.dickinson) *	Date: 2012-04-19 11:54
> Well this IS a bug.

I assume that you're referring to behaviour like this:

Python 2.7.2 (default, Jan 13 2012, 17:11:09)
[GCC 4.2.1 (Apple Inc. build 5666) (dot 3)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> x = 2.545
>>> round(x, 2)
2.54

To explain again, what happens here is:

(1) After the assignment 'x = 2.545', what's stored for x is not the precise decimal value 2.545, but a binary approximation to it. That binary approximation just happens to be very slightly less than 2.545.

(2) Now when rounding, the usual rules are applied (values less than half get rounded down), to give 2.54.

Which part(s) of the above do you think should be changed? Should the 'round' function incorrectly round some numbers up even though they fall below the halfway case?

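Steps (1) and (2) above can be checked directly. This is a minimal sketch, assuming Python 3 and IEEE 754 doubles; it uses fractions.Fraction to get the exact rational value of the stored float and compares it with the true halfway point.

```python
from fractions import Fraction

x = 2.545

# Step (1): the exact rational value of the binary approximation stored for x.
exact = Fraction(x)
halfway = Fraction(2545, 1000)  # the true midpoint between 2.54 and 2.55

print(exact < halfway)  # True: x falls just below the halfway case

# Step (2): values below the halfway case are rounded down.
print(round(x, 2))      # 2.54
print("%.2f" % x)       # 2.54
```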
msg159853	Author: (Ultrasick)	Date: 2012-05-03 11:44
Ok, let's sum that up: There is a rounding problem. That's because Python uses floating point registers of the x86-CPU-architecture to store point values. This method is inaccurate and causes such problems. So Python inherits this bug from this value storing method. Even though the origin of this bug is in the method which is being used, Python has inherited this bug and can't round correctly.

If we said that Python does not support point values but only floating point values of the x86-CPU-architecture (so not even floating point values in general), then we could admit that round(2.545, 2) has a bug because it "incorrectly" shows 2.55 as the result. But that wouldn't help us any further.

One possible solution would be to use a different method to store point values. For example, 2 integers could be used to store a point value losslessly. One integer stores whatever is left of the point and the other integer stores whatever is right of the point. Meaning:

25.0:
-> integer #1: 0,000,000,025
-> integer #2: 0,000,000,000

25.99997:
-> integer #1: 0,000,000,025
-> integer #2: 0,999,970,000

25.00001:
-> integer #1: 0,000,000,025
-> integer #2: 0,000,010,000

As you can see, this method is lossless, as long as you don't try to store more than 32 significant bits in a register which is 32 bits in size. To be more accurate: you can't even use all 32 bits because the most significant digit can only be between 0 and 4 (4,294,967,295 barrier).

Using this value storing method would mean quite some effort for the developers. But then Python would be able to round correctly. So that's why I call it a "possible solution".

I am not the one who is going to make the decision whether a different value storing method is going to be implemented, regardless of what this value storing method may look like. But I am one of those who suffered from the method which is currently implemented.

@Mark: And I am also one of those who lost a lot of interest in helping out in the further development of Python. It was because of your haughtiness. You tried to show how perfect Python is and that there would be no bugs. But your last comment was a little more productive, even though you still haven't shown much interest in finding a solution to the problem.

@Zeev: I already gave up. But you had more endurance. Thanks :-)

Gary

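The two-integer scheme described in this message could be sketched roughly as follows. This is only an illustration under stated assumptions: Python 3, non-negative inputs given as decimal strings, and nine fractional digits to match the examples; the names SCALE, parse_fixed and round_half_up are hypothetical and not part of Python.

```python
SCALE = 10**9  # nine fractional digits, matching the examples in the message


def parse_fixed(text):
    """Split a non-negative decimal string into (whole, fraction * SCALE)."""
    whole, _, frac = text.partition(".")
    if len(frac) > 9:
        raise ValueError("more than nine fractional digits")
    # Pad on the right so that "99997" becomes 999970000.
    return int(whole), int(frac.ljust(9, "0"))


def round_half_up(value, places):
    """Round a (whole, fraction) pair half up to `places` digits (0 to 9)."""
    whole, frac = value
    step = 10 ** (9 - places)
    frac = (frac + step // 2) // step * step
    if frac >= SCALE:  # carry into the whole part, e.g. 0.999 -> 1.00
        whole, frac = whole + 1, frac - SCALE
    return whole, frac


print(parse_fixed("25.99997"))                  # (25, 999970000)
print(round_half_up(parse_fixed("2.545"), 2))   # (2, 550000000), i.e. 2.55
```

Python integers are arbitrary precision, so the 32-bit limit mentioned in the message does not apply to this sketch. Arithmetic on such pairs would still have to be written by hand, and the decimal module already provides that.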
msg159868	Author: Mark Dickinson (mark.dickinson) *	Date: 2012-05-03 17:34
> That's because Python uses floating point registers of the x86-CPU-architecture
> to store point values. This method is inaccurate and causes such problems.

Yes, exactly; this is the root cause. And as you suggest, Python could use a different numeric storage format that doesn't suffer from loss of information when initializing a number from a decimal string. There's an obvious candidate for that storage format, and that's the decimal.Decimal type. There are some issues, though:

(1) decimal.Decimal operations are implemented in software (in pure Python for versions <= 3.2, and now in C in Python 3.3) and so are orders of magnitude slower than hardware-supported floats. That's one of the reasons that almost every mainstream programming language uses the binary-represented hardware floats as the main way of representing non-integral numbers. The need for those fast floats isn't going to go away in a hurry. The obvious solution here would be for Python to support both binary floats and decimal floats, and perhaps to make numeric literals default to being decimal.Decimal instances.

(2) Getting to the point where the Decimal type could be used for numeric literals will be a long road, full of backwards compatibility concerns, PEPs, and long and probably contentious python-dev discussions. Python's just taken the first step along that road by reimplementing the decimal module in C for Python 3.3; this improves the speed significantly (though floats are still significantly more efficient in both time and space, and likely will be for a long time), and also makes it easier to start using decimal more widely from within the core of Python.

Reaching that point of having the Decimal type more fully integrated into Python is something that I know a good few of the Python developers are interested in (including me). But it's not going to be an easy or quick change.

> @Mark: And I am also one of those who lost a lot of interest in helping out
> in the further development of Python. It was because of your haughtiness.

I see how my earlier messages came across badly. I apologise for that, and I hope you won't let the poorly chosen words of just one Python developer out of many put you off future involvement in Python.
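The difference between the two storage formats shows up in ordinary arithmetic as well as in rounding. The following is a minimal sketch, assuming Python 3 and IEEE 754 doubles; the variable names are illustrative only.

```python
from decimal import Decimal

# Binary floats: each literal is approximated, and the errors do not cancel.
float_sum_equal = (0.1 + 0.2 == 0.3)

# Decimal built from strings: each value is stored exactly as written.
decimal_sum_equal = (Decimal("0.1") + Decimal("0.2") == Decimal("0.3"))

print(float_sum_equal)    # False
print(decimal_sum_equal)  # True
```

The exactness only holds when the Decimal is created from a string or an integer; passing a float to Decimal carries the binary approximation along.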

History
Date	User	Action	Args
2022-04-11 14:56:45	admin	set	github: 49368
2012-05-03 17:34:18	mark.dickinson	set	messages: + msg159868
2012-05-03 11:44:32	Ultrasick	set	messages: + msg159853
2012-04-19 11:54:12	mark.dickinson	set	assignee: mark.dickinson messages: + msg158717 versions: + Python 2.7, - Python 2.6
2012-04-19 11:19:02	Zeev.Rotshtein	set	nosy: + Zeev.Rotshtein messages: + msg158716
2009-01-31 16:05:46	mark.dickinson	set	messages: + msg80876
2009-01-31 15:51:42	Ultrasick	set	messages: + msg80875
2009-01-31 15:36:00	mark.dickinson	set	messages: + msg80874
2009-01-31 15:23:52	Ultrasick	set	messages: + msg80873
2009-01-31 15:14:08	mark.dickinson	set	messages: + msg80871
2009-01-31 14:18:07	Ultrasick	set	messages: + msg80870
2009-01-31 14:13:07	mark.dickinson	set	status: open -> closed
2009-01-31 14:12:37	mark.dickinson	set	resolution: not a bug messages: + msg80869 nosy: + mark.dickinson
2009-01-31 12:57:50	Ultrasick	create