Created on 2010-11-09 23:33 by barry, last changed 2013-04-02 15:38 by barry.
|msg120905 - (view)||Author: Barry A. Warsaw (barry) *||Date: 2010-11-09 23:33|
@mission[~:1001]% python2.7 -c "import locale; print locale.format('%.0f KB', 100)" Traceback (most recent call last): File "<string>", line 1, in <module> File "/usr/lib/python2.7/locale.py", line 189, in format "format specifier, %s not valid") % repr(percent)) ValueError: format() must be given exactly one %char format specifier, '%.0f KB' not valid @mission[~:1002]% python2.6 -c "import locale; print locale.format('%.0f KB', 100)" 100 KB
|msg120907 - (view)||Author: Amaury Forgeot d'Arc (amaury.forgeotdarc) *||Date: 2010-11-09 23:56|
This was changed by issue2522 on purpose; no suffix is allowed in locale.format().
|msg120908 - (view)||Author: Barry A. Warsaw (barry) *||Date: 2010-11-09 23:58|
Okay, so line 187 of locale.py has this test: if not match or len(match.group())!= len(percent): the problematic part is the len test. When format string is '%.0f KB' match.group() is '%.0f' but of course percent is the full string. This seems like a bogus test, since clearly the given input is a valid format string. I'm not sure what the intent of this test is. The Python 2.6 test is: if percent != '%': which is perhaps too naive. I guess I don't understand why this test is here. Wouldn't it make more sense to either just let any TypeError from _format() to percolate up, or to catch that TypeError and transform it into the ValueError? Why try to replicate the logic of str.__mod__()?
|msg120909 - (view)||Author: Barry A. Warsaw (barry) *||Date: 2010-11-10 00:01|
Hmm. So I guess the answer is to use locale.format_string() instead. But the documentation for locale.format() is not entirely clear about the prohibition on trailing text.
|msg120912 - (view)||Author: Eric V. Smith (eric.smith) *||Date: 2010-11-10 00:08|
I agree the documentation isn't terribly clear on what a "%char specifier" or "whole format string" is. FWIW, this is also a 3.1 and greater issue.
|msg120921 - (view)||Author: R. David Murray (r.david.murray) *||Date: 2010-11-10 15:24|
Yeah, obviously that language can be improved. 'exactly' was meant to imply 'nothing but', but clearly it doesn't. If we want to restore more stringent backward compatibility and allow trailing text, it would be possible to make format an alias for format_string. I'm not sure this is a good idea, but it is the most sensible way to restore backward compatibility while still fixing the original bug that I can think of. Or...perhaps there is little need of both 'format' and 'format_string' as public APIs, and we could deprecate (without removing) one of them. On the other hand, I believe the original bug affects the Ubuntu code that triggered this report...in other words, absent this fix chances are there would eventually have been a bug report against that code that would have necessitated that it change to use format_string anyway in order to get the correct locale-specific number formatting.
|msg120922 - (view)||Author: Łukasz Langa (lukasz.langa)||Date: 2010-11-10 16:07|
Please use the deprecation process when possible. That would mean creating an alias for the function you want to remove somewhat like this (taken from configparser): def readfp(self, fp, filename=None): """Deprecated, use read_file instead.""" warnings.warn( "This method will be removed in future versions. " "Use 'parser.read_file()' instead.", PendingDeprecationWarning, stacklevel=2 ) self.read_file(fp, source=filename)
|msg120978 - (view)||Author: Barry A. Warsaw (barry) *||Date: 2010-11-11 22:42|
The bug has been fixed upstream by replacing .format() with .format_string(). I'm not sure I understand why there are two different methods - .format() seems kind of pointless to me, but then I don't use the locale module enough to say what's useful. For Python 2.7 I think the only thing we can do is to update the docs so that the distinction and restrictions are clear.
|msg120989 - (view)||Author: R. David Murray (r.david.murray) *||Date: 2010-11-12 00:15|
Well, the distinction is that, before the bug fix that caused your issue, the 'format_string' method would use a regex to extract the % specifiers from the input string, and call 'format' to replace that % specifier with a properly localized result string. That is, 'format' was designed to handle a single % specifier with no extra text, basically as a helper method for format_string. The fact that it didn't reject extra text was, according to an internal comment, a defect of the implementation. (Passing any extra text would cause the implementation to fail to do the internationalization that was the entire reason for calling it.) When I fixed the bug I extracted the 'replace a single % specifier' code into an internal method, and made the format method live up to what I perceived to be its documented interface by rejecting extra input characters so that it could safely call the new internal substitution routine. Now, from the perspective of a *user* of the locale module, I fail to see the point in having both 'format' and 'format_string' exposed. If you want to format a single % specifier, just pass it to format_string. Thus my suggestion to make them both do the same thing (to cater to other code that may be calling format incorrectly) and then deprecate one of them (presumably format). To bad I didn't think of that when I fixed the original bug.
|msg120994 - (view)||Author: Barry A. Warsaw (barry) *||Date: 2010-11-12 00:25|
On Nov 12, 2010, at 12:15 AM, R. David Murray wrote: >Now, from the perspective of a *user* of the locale module, I fail to see the >point in having both 'format' and 'format_string' exposed. If you want to >format a single % specifier, just pass it to format_string. Thus my >suggestion to make them both do the same thing (to cater to other code that >may be calling format incorrectly) and then deprecate one of them (presumably >format). +1 >To bad I didn't think of that when I fixed the original bug. Dang.
|msg185792 - (view)||Author: Mark Lawrence (BreamoreBoy)||Date: 2013-04-02 02:12|
msg120978 "The bug has been fixed upstream...". Have I missed something as on Windows Vista...? c:\Users\Mark\MyPython>python Python 3.3.1rc1 (v3.3.1rc1:92c2cfb92405, Mar 25 2013, 22:39:19) [MSC v.1600 32 bit (Intel)] on win32 Type "help", "copyright", "credits" or "license" for more information. >>> import locale; print(locale.format('%.0f KB', 100)) Traceback (most recent call last): File "<stdin>", line 1, in <module> File "c:\python33\lib\locale.py", line 193, in format "format specifier, %s not valid") % repr(percent)) ValueError: format() must be given exactly one %char format specifier, '%.0f KB' not valid
|msg185828 - (view)||Author: Eric V. Smith (eric.smith) *||Date: 2013-04-02 11:32|
Barry meant that the upstream program that triggered this error has been changed to call format_string() instead of format(). The bug still exists in format(). My suggestion is to have format() be an alias for format_string(). Deprecating format() is an optional step, but may not be worth the hassle.
|msg185838 - (view)||Author: Barry A. Warsaw (barry) *||Date: 2013-04-02 13:35|
On Apr 02, 2013, at 11:32 AM, Eric V. Smith wrote: >My suggestion is to have format() be an alias for >format_string(). Deprecating format() is an optional step, but may not be >worth the hassle. Agreed on both counts.
|msg185840 - (view)||Author: Eric V. Smith (eric.smith) *||Date: 2013-04-02 15:04|
So I guess the question is: would this be a bug fix and applied to 2.7 and 3.3, or just an enhancement for 3.4? I think it would be a bug fix and thus should be backported. It's not like we'd be breaking any working code, unless it was expecting the exception.
|msg185841 - (view)||Author: Barry A. Warsaw (barry) *||Date: 2013-04-02 15:38|
On Apr 02, 2013, at 03:04 PM, Eric V. Smith wrote: >I think it would be a bug fix and thus should be backported. It's not like >we'd be breaking any working code, unless it was expecting the exception. That would be my preference.
|2013-04-02 15:38:06||barry||set||messages: + msg185841|
versions: + Python 3.3, Python 3.4, - Python 3.1, Python 3.2
|2013-04-02 13:35:48||barry||set||messages: + msg185838|
|2013-04-02 11:32:54||eric.smith||set||messages: + msg185828|
messages: + msg185792
|2010-11-12 00:25:35||barry||set||messages: + msg120994|
|2010-11-12 00:15:25||r.david.murray||set||messages: + msg120989|
|2010-11-11 22:42:51||barry||set||messages: + msg120978|
messages: + msg120922
messages: + msg120921
|2010-11-10 00:08:19||eric.smith||set||assignee: docs@python|
components: + Documentation
nosy: + docs@python
versions: + Python 3.1, Python 3.2
|2010-11-10 00:01:55||barry||set||messages: + msg120909|
|2010-11-09 23:58:52||barry||set||messages: + msg120908|
+ amaury.forgeotdarc, r.david.murray|
messages: + msg120907