os.chown: OverflowError: Python int too large to convert to C long #59506

do1 · 2012-07-09T02:15:39Z

BPO	15301
Nosy	@loewis, @birkenfeld, @terryjreedy, @mdickinson, @pitrou, @vstinner, @larryhastings, @benjaminp, @ned-deily, @skrah, @markshannon, @serhiy-storchaka
Files	larry.chown.unsigned.uid.gid.1.diff posix_uid_gid_conv_tests.patch: Extended tests for chown() posix_uid_gid_conv_index.patch: Use index* for uid and gid conversion larry.chown.unsigned.uid.gid.2.diff: Second cut at a patch, building on Serhiy's second patch. larry.chown.unsigned.uid.gid.3.diff: Updated patch with Serhiy's changes from Rietveld. larry.chown.unsigned.uid.gid.4.diff: Latest patch incorporating Serihy's comments. larry.chown.unsigned.uid.gid.5.diff: New patch which might even be good enough for trunk. larry.chown.unsigned.uid.gid.6.diff: Another day, another patch for uid_converter.

^{Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.}

Show more details

GitHub fields:

assignee = 'https://github.com/larryhastings'
closed_at = <Date 2013-08-11.01:03:15.841>
created_at = <Date 2012-07-09.02:15:39.422>
labels = ['type-bug']
title = 'os.chown: OverflowError: Python int too large to convert to C long'
updated_at = <Date 2013-08-11.06:18:31.608>
user = 'https://bugs.python.org/do1'

bugs.python.org fields:

activity = <Date 2013-08-11.06:18:31.608>
actor = 'serhiy.storchaka'
assignee = 'larry'
closed = True
closed_date = <Date 2013-08-11.01:03:15.841>
closer = 'terry.reedy'
components = ['None']
creation = <Date 2012-07-09.02:15:39.422>
creator = 'do1'
dependencies = []
files = ['26384', '29072', '29073', '29860', '29883', '29977', '29978', '29996']
hgrepos = []
issue_num = 15301
keywords = ['patch']
message_count = 42.0
messages = ['165053', '165054', '165425', '165427', '165428', '165429', '165430', '165431', '165433', '165477', '165481', '182088', '182089', '182533', '186862', '186909', '186913', '186970', '186971', '186974', '187019', '187075', '187576', '187579', '187580', '187581', '187677', '187689', '192193', '194534', '194645', '194646', '194647', '194649', '194650', '194654', '194655', '194763', '194857', '194858', '194860', '194865']
nosy_count = 16.0
nosy_names = ['loewis', 'georg.brandl', 'terry.reedy', 'mark.dickinson', 'pitrou', 'vstinner', 'larry', 'benjamin.peterson', 'ned.deily', 'Arfrever', 'skrah', 'Mark.Shannon', 'python-dev', 'Scott.Leerssen', 'serhiy.storchaka', 'do1']
pr_nums = []
priority = 'normal'
resolution = 'fixed'
stage = 'resolved'
status = 'closed'
superseder = None
type = 'behavior'
url = 'https://bugs.python.org/issue15301'
versions = ['Python 2.7', 'Python 3.2', 'Python 3.3', 'Python 3.4']

do1 · 2012-07-09T02:15:39Z

os.chown() can not change uid/gid to valid but high number.

# chown 4294967294.4294967294 a
# ls -l a
-rw-r--r-- 1 4294967294 4294967294 0 Jul 9 05:22 a

# python
Python 2.7.3 (default, Jun 24 2012, 06:19:44)
[GCC 4.1.2 20080704 (Red Hat 4.1.2-52)] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> import os
>>> os.chown("a", 4294967294, 4294967294)
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
OverflowError: Python int too large to convert to C long

do1 · 2012-07-09T02:26:36Z

I can add that system is x86_32. Same code in x86_64 result in no error even on older Python version 2.6.6

markshannon · 2012-07-13T23:22:08Z

Larry, you're the expert on this.

Looks to me like the format character in PyArg_ParseTuple in posix_fchown and kin should be 'k' (unsigned long) rather than 'l' (signed long).

ned-deily · 2012-07-13T23:34:22Z

Possible duplicate of bpo-4591?

larryhastings · 2012-07-14T00:15:47Z

I don't think "k" is the right answer.

POSIX defines the user and group parameters as uid_t and gid_t respectively; in turn, uid_t and gid_t are defined as "integers", pointedly omitting the either of the words "signed" or "unsigned". So POSIX makes no guarantee that uid_t and gid_t are unsigned.

On the other hand: chown takes special values defined as "(uid_t)-1" and "(gid_t)-1" to indicate "don't change this value". If we change chown to use "k" for uid and gid, then in order to use this value you'd have to pass in the equivalent unsigned value. And that'd require you to know the size of an int on the local platform. Yuck!

If this is a genuine problem, then I think we have to take "O" (or "O!") and convert the long by hand. If it's -1, use "(uid_t)-1" (etc), otherwise convert as unsigned. On the other hand, if this is only a theoretical problem then I'd prefer to wontfix and keep the code simple.

p.s. find uid_t / gid_t here: http://pubs.opengroup.org/onlinepubs/009695399/basedefs/sys/types.h.html
chown here:
http://pubs.opengroup.org/onlinepubs/009695399/functions/chown.html

do1 · 2012-07-14T00:22:00Z

Problem emerged when well-known backup software (rdiff-backup) is 'crashed' (as in no backups). So I think this is not that theoretical.

larryhastings · 2012-07-14T00:37:53Z

Thinking about it some more, maybe we need to be smart about whether uid_t/gid_t are signed or unsigned.

I'm no good with configure hackery--could anyone here enhance the configure script so it could tell us whether or not uid_t and gid_t were signed or unsigned?

ned-deily · 2012-07-14T00:40:00Z

There is a proposed patch by haypo with bpo-4591.

larryhastings · 2012-07-14T06:17:23Z

That patch from Victor looks pretty good. I'll try to get it in for beta 2, however I'm on the road right now. I'm back Sunday night and will try to do it then.

If someone else wants to jump in that's fine with me.

larryhastings · 2012-07-15T00:48:09Z

It looks like bpo-1747858 is another duplicate. Though arguably all these "dups" aren't really dups because I rewrote chown's argument parsing for 3.3.

larryhastings · 2012-07-15T01:14:39Z

Here's a first cut at a patch. It's really just an update of Victor's patch to bpo-4591 (done with his blessing). I tweaked it slightly; the parsing functions are now O& converter functions.

I also added a reasonable unit test. Note however that the unit test is only run if you run the test as root.

I don't think this needs any documentation, just a Misc/NEWS entry.

If someone can give me quick turnaround on a review (or two) maybe we can get this in before beta 2 tomorrow.

I'm adding Antoine and MvL as they were in on the earliest known bug of this class.

serhiy-storchaka · 2013-02-14T11:24:36Z

Similar test was committed for bpo-4591 and it fixes this issue. There are only two differences with Larry's patch: tests and handling of non-integers.

Here is a patch based on Larry's patch which extends tests for *chown().

serhiy-storchaka · 2013-02-14T11:33:08Z

And here is a patch which uses __index__ for uid and gid conversion. decimal.Decimal should be added to the list of wrong types in the test.

I'm not sure about the last patch, whether to consider it as a bugfix or as a new feature?

python-dev · 2013-02-20T17:53:44Z

New changeset 9b37e53838eb by Serhiy Storchaka in branch '2.7':
Issue bpo-15301: Enhance os.*chown() testing. Based on patch by Larry Hastings.
http://hg.python.org/cpython/rev/9b37e53838eb

New changeset a0baf5347cd1 by Serhiy Storchaka in branch '3.2':
Issue bpo-15301: Enhance os.*chown() testing. Based on patch by Larry Hastings.
http://hg.python.org/cpython/rev/a0baf5347cd1

New changeset e97b6394848b by Serhiy Storchaka in branch '3.3':
Issue bpo-15301: Enhance os.*chown() testing. Based on patch by Larry Hastings.
http://hg.python.org/cpython/rev/e97b6394848b

New changeset d4bf997a34e9 by Serhiy Storchaka in branch 'default':
Issue bpo-15301: Enhance os.*chown() testing. Based on patch by Larry Hastings.
http://hg.python.org/cpython/rev/d4bf997a34e9

mdickinson · 2013-04-13T21:58:30Z

+1 on using PyNumber_Index.

serhiy-storchaka · 2013-04-14T10:54:38Z

Thank you, Mark. There is only one question. For what version is it appropriate? Only for 3.4 or for all maintained?

larryhastings · 2013-04-14T11:30:27Z

Oh...! Serhiy, I thought you already checked in the AsIndex stuff. Guess I was asleep at the switch.

Certainly the patch should go in for trunk. I'd be comfortable with it going in for 3.3 as a bugfix but that's ultimately Georg's call.

I'll give you a comment or two on the diff.

larryhastings · 2013-04-15T10:01:45Z

Okay, I got inspired and (in the words of Barry Warsaw) JFDI. Attached is my revised patch. I took Serhiy's patch and reworked it quite a bit:

I think it's now easier to follow. In particular:
The most common case (no overflow) is now first. In Serhiy's patch
the most common case is buried in the middle of the second "if".
I removed some extraneous tests.
I changed the error messages to call the values "uid" and "gid", to match the names of the parameters.
I noticed that _fd_converter had the same bad-idea PyFloat_Check, so I changed it to use PyNumber_Index instead.

In the process I also noticed that Serhiy's approach had a resource leak: it never decref'd the result of PyNumber_Index() when successful.

To make sure I duplicated Serhiy's semantics, I had a test harness that ran both his and mine and ensured they returned the same thing (or both threw an error). Obviously I removed all that before cutting the patch.

mdickinson · 2013-04-15T10:53:50Z

To answer Serhiy's question: I'd say that this level of cleanup is probably only appropriate for 3.4. Larry?

larryhastings · 2013-04-15T11:17:47Z

See my comment above (dated 2013-04-14 04:30). I'm passing the buck.

serhiy-storchaka · 2013-04-15T19:26:25Z

The most common case (no overflow) is now first. In Serhiy's patch
the most common case is buried in the middle of the second "if".

It's because my implementation is a simplified version of more complicated patch for bpo-2005, which supports signed uid_t and sizeof(uid_t) > sizeof(long). And this support can be added if necessary.

larryhastings · 2013-04-16T12:36:13Z

Whoops, forgot to write something here. Updated patch (in previous edit to the issue) incorporating Serhiy's suggestions from Rietveld.

larryhastings · 2013-04-22T18:05:49Z

Can I get an LGTM?

serhiy-storchaka · 2013-04-22T18:33:40Z

New tests needed (with Decimal and Fraction). And what is purpose of changing Py_TYPE(o) to o->ob_type?

larryhastings · 2013-04-22T19:07:19Z

Latest patch incorporating Serihy's comments.

larryhastings · 2013-04-22T19:10:17Z

Whoops, meant to throw in a Fraction too. Added that. Also hoisted the imports out of the function call, just to be a good guy.

larryhastings · 2013-04-23T23:32:46Z

Serhiy: Oh, I get it, your critique was about the "Note:" part at the bottom. It would have been helpful if you had inserted your feedback there, instead of at the top. Yeah, you're right, the note was incorrect.

I considered adding a note saying that the code was wrong if sizeof(uid_t) > sizeof(long) but thought better of it ;-)

New patch attached.

serhiy-storchaka · 2013-04-24T07:03:23Z

Yes, it is what I had in mean. Sorry for indistinctness.

As far as you change _fd_converter() it will be good to add also a test for float/decimal/fractional fd. Otherwise the patch LGTM.

ScottLeerssen · 2013-07-02T12:50:35Z

Regarding whether or not to include the fix in 2.7, I'd like to suggest that it be included there as well. It will be quite some time before the project on which I work moves to Python 3, and I just hit the same issue.

serhiy-storchaka · 2013-08-06T11:56:52Z

Let's push.

larryhastings · 2013-08-08T06:44:55Z

Benjamin, do you want my elaborate fix in for 2.7? It means adding two new converters, one for pid and one for gid, and switching everything that takes pid/gid arguments to use those.

python-dev · 2013-08-08T07:23:02Z

New changeset f871f8662509 by Larry Hastings in branch 'default':
Issue bpo-15301: Parsing fd, uid, and gid parameters for builtins
http://hg.python.org/cpython/rev/f871f8662509

larryhastings · 2013-08-08T07:23:59Z

Now fixed in trunk. I am waiting to hear from Georg and the only-recently-pinged Benjamin to see if they want these fixes in 3.3 or 2.7.

benjaminp · 2013-08-08T07:32:30Z

I think we should leave 2.7 at rest for the moment.

serhiy-storchaka · 2013-08-08T07:35:04Z

The original issue was fixed in bpo-4591. Larry's patch only adds support of PyNumber_Index() and refactors already existing uid/gid converters.

birkenfeld · 2013-08-08T08:39:59Z

Looks like an unnecessary change for the maintenance releases then.

larryhastings · 2013-08-08T09:34:01Z

Okay then, closing.

pitrou · 2013-08-09T20:15:18Z

The Windows buildbots are broken:

======================================================================
ERROR: test_chown_uid_gid_arguments_must_be_index (test.test_os.MakedirTests)
----------------------------------------------------------------------

Traceback (most recent call last):
  File "E:\Data\buildslave\cpython\3.x.snakebite-win2k3r2sp2-x86\build\lib\test\test_os.py", line 875, in test_chown_uid_gid_arguments_must_be_index
    self.assertRaises(TypeError, os.chown, support.TESTFN, value, gid)
AttributeError: 'module' object has no attribute 'chown'

python-dev · 2013-08-11T00:59:19Z

New changeset 9b0e9e9812f8 by Terry Jan Reedy in branch 'default':
Issue bpo-15301: skip new test method so Windows builtbots stop failing.
http://hg.python.org/cpython/rev/9b0e9e9812f8

terryjreedy · 2013-08-11T01:03:15Z

This was the only failure on 2 of 4 buildbots when I last looked.

test_os has several methods like this:

    def test_dup2(self):
        if hasattr(os, "dup2"):
            self.check(os.dup2, 20)

Should the hasattr be moved to skipUnless for all of these?
They now 'pass' without the needed os method.

benjaminp · 2013-08-11T01:55:05Z

Likely they were written before the invention of unittest test
skpping. They could be converted.

2013/8/10 Terry J. Reedy <report@bugs.python.org>:

Terry J. Reedy added the comment:

This was the only failure on 2 of 4 buildbots when I last looked.

test_os has several methods like this:
def test_dup2(self):
    if hasattr(os, "dup2"):
        self.check(os.dup2, 20)
Should the hasattr be moved to skipUnless for all of these?
They now 'pass' without the needed os method.

----------
nosy: +terry.reedy
status: open -> closed

Python tracker <report@bugs.python.org>
<http://bugs.python.org/issue15301\>

serhiy-storchaka · 2013-08-11T06:18:32Z

Likely they were written before the invention of unittest test
skpping. They could be converted.

See bpo-18702.

markshannon added the type-bug An unexpected behavior, bug, or error label Jul 13, 2012

larryhastings self-assigned this Jul 14, 2012

larryhastings closed this as completed Aug 8, 2013

pitrou reopened this Aug 9, 2013

terryjreedy closed this as completed Aug 11, 2013

ezio-melotti transferred this issue from another repository Apr 10, 2022

Navigation Menu

os.chown: OverflowError: Python int too large to convert to C long #59506

os.chown: OverflowError: Python int too large to convert to C long #59506

Comments

do1 mannequin commented Jul 9, 2012

do1 mannequin commented Jul 9, 2012

do1 mannequin commented Jul 9, 2012

markshannon commented Jul 13, 2012

ned-deily commented Jul 13, 2012

larryhastings commented Jul 14, 2012

do1 mannequin commented Jul 14, 2012

larryhastings commented Jul 14, 2012

ned-deily commented Jul 14, 2012

larryhastings commented Jul 14, 2012

larryhastings commented Jul 15, 2012

larryhastings commented Jul 15, 2012

serhiy-storchaka commented Feb 14, 2013

serhiy-storchaka commented Feb 14, 2013

python-dev mannequin commented Feb 20, 2013

mdickinson commented Apr 13, 2013

serhiy-storchaka commented Apr 14, 2013

larryhastings commented Apr 14, 2013

larryhastings commented Apr 15, 2013

mdickinson commented Apr 15, 2013

larryhastings commented Apr 15, 2013

serhiy-storchaka commented Apr 15, 2013

larryhastings commented Apr 16, 2013

larryhastings commented Apr 22, 2013

serhiy-storchaka commented Apr 22, 2013

larryhastings commented Apr 22, 2013

larryhastings commented Apr 22, 2013

larryhastings commented Apr 23, 2013

serhiy-storchaka commented Apr 24, 2013

ScottLeerssen mannequin commented Jul 2, 2013

serhiy-storchaka commented Aug 6, 2013

larryhastings commented Aug 8, 2013

python-dev mannequin commented Aug 8, 2013

larryhastings commented Aug 8, 2013

benjaminp commented Aug 8, 2013

serhiy-storchaka commented Aug 8, 2013

birkenfeld commented Aug 8, 2013

larryhastings commented Aug 8, 2013

pitrou commented Aug 9, 2013

python-dev mannequin commented Aug 11, 2013

terryjreedy commented Aug 11, 2013

benjaminp commented Aug 11, 2013

serhiy-storchaka commented Aug 11, 2013