Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

shutil rmtree fails on readonly files in Windows #63842

Closed
ivanradic mannequin opened this issue Nov 18, 2013 · 27 comments
Closed

shutil rmtree fails on readonly files in Windows #63842

ivanradic mannequin opened this issue Nov 18, 2013 · 27 comments
Assignees
Labels
OS-windows stdlib Python modules in the Lib dir type-feature A feature request or enhancement

Comments

@ivanradic
Copy link
Mannequin

ivanradic mannequin commented Nov 18, 2013

BPO 19643
Nosy @pfmoore, @tjguk, @bitdancer, @zware
Files
  • issue19643-doc.patch
  • issue19643-doc.2.patch
  • Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.

    Show more details

    GitHub fields:

    assignee = 'https://github.com/tjguk'
    closed_at = <Date 2014-05-07.17:34:58.251>
    created_at = <Date 2013-11-18.12:11:11.635>
    labels = ['type-feature', 'library', 'OS-windows']
    title = 'shutil rmtree fails on readonly files in Windows'
    updated_at = <Date 2014-10-16.17:37:23.941>
    user = 'https://bugs.python.org/ivanradic'

    bugs.python.org fields:

    activity = <Date 2014-10-16.17:37:23.941>
    actor = 'paul.moore'
    assignee = 'tim.golden'
    closed = True
    closed_date = <Date 2014-05-07.17:34:58.251>
    closer = 'tim.golden'
    components = ['Library (Lib)', 'Windows']
    creation = <Date 2013-11-18.12:11:11.635>
    creator = 'ivan.radic'
    dependencies = []
    files = ['35168', '35170']
    hgrepos = []
    issue_num = 19643
    keywords = ['patch']
    message_count = 27.0
    messages = ['203285', '203297', '203298', '203301', '203308', '203313', '203314', '203315', '203316', '208647', '208651', '208654', '208657', '208662', '213867', '217997', '218021', '218023', '218044', '218051', '218052', '218053', '218054', '218063', '229538', '229541', '229542']
    nosy_count = 8.0
    nosy_names = ['paul.moore', 'tim.golden', 'r.david.murray', 'python-dev', 'zach.ware', 'Jovik', 'ivan.radic', 'sushisource']
    pr_nums = []
    priority = 'normal'
    resolution = 'fixed'
    stage = 'resolved'
    status = 'closed'
    superseder = None
    type = 'enhancement'
    url = 'https://bugs.python.org/issue19643'
    versions = ['Python 3.5']

    @ivanradic
    Copy link
    Mannequin Author

    ivanradic mannequin commented Nov 18, 2013

    shutil.rmtree works nice on Windows until it hits file with read only attribute set. Workaround is to provide a onerror parameter as a function that checks and removes file attribute before attempting to delete it. Can option to delete read_only files be integrated in shutil.rmtree?

    Example output in In Python 2.7:
    shutil.rmtree("C:\\2")

    Traceback (most recent call last):
      File "<pyshell#60>", line 1, in <module>
        shutil.rmtree("C:\\2")
      File "C:\Program Files (x86)\Python.2.7.3\lib\shutil.py", line 250, in rmtree
        onerror(os.remove, fullname, sys.exc_info())
      File "C:\Program Files (x86)\Python.2.7.3\lib\shutil.py", line 248, in rmtree
        os.remove(fullname)
    WindowsError: [Error 5] Access is denied: 'C:\\2\\read_only_file.txt'
    
    Example output in In Python 3.3:
    shutil.rmtree("C:\\2")
    Traceback (most recent call last):
      File "<pyshell#1>", line 1, in <module>
        shutil.rmtree("C:\\2")
      File "C:\Program Files (x86)\Python.3.3.0\lib\shutil.py", line 460, in rmtree
        return _rmtree_unsafe(path, onerror)
      File "C:\Program Files (x86)\Python.3.3.0\lib\shutil.py", line 367, in _rmtree_unsafe
        onerror(os.unlink, fullname, sys.exc_info())
      File "C:\Program Files (x86)\Python.3.3.0\lib\shutil.py", line 365, in _rmtree_unsafe
        os.unlink(fullname)
    PermissionError: [WinError 5] Access is denied: 'C:\\2\\read_only_file.txt'

    @ivanradic ivanradic mannequin added type-bug An unexpected behavior, bug, or error topic-IO labels Nov 18, 2013
    @bitdancer
    Copy link
    Member

    You are essentially asking that we have an option to make the windows behavior mirror the posix behavior? (A read only file in a writable directory can be deleted in posix, since only the directory entry, not the file, is being deleted.)

    That makes some sense to me. I wonder what the windows devs think?

    @bitdancer bitdancer added stdlib Python modules in the Lib dir OS-windows type-feature A feature request or enhancement and removed topic-IO type-bug An unexpected behavior, bug, or error labels Nov 18, 2013
    @tjguk
    Copy link
    Member

    tjguk commented Nov 18, 2013

    This, unfortunately, is the classic edge-case where intra-platform consistency and inter-platform consistency clash.

    I (on Windows) would certainly be surprised if a tree delete removed read-only files without my specifying some kind of override. I understand why cross-platform behaviour might be preferred, and I'm open to being convinced, but I start at -0.

    @bitdancer
    Copy link
    Member

    Well, it would *definitely* need to be a new explicit option whose default value was the current behavior.

    @ivanradic
    Copy link
    Mannequin Author

    ivanradic mannequin commented Nov 18, 2013

    You are essentially asking that we have an option to make the windows behavior mirror the posix behavior? (A read only file in a writable directory can be deleted in posix, since only the directory entry, not the file, is being deleted.)

    Actually, your explanation is perfect.
    I want to be able to remove some directory after I am done using it. When similar operation is done through file manager, dialog pops up asking for confirmation, I would like to have function parameter equivalent of "yes to all" dialog that file manager gives me.

    The thing is, anyone working with files is used to think in "rm -rf" kind of way, and on Windows read_only files break this workflow. I discovered this problem few days ago when I was working on custom backup script that needs to work both on Linux (at home) and Windows (at work). Currently, I need to have some extra *windows only* code just to be able to successfully remove a directory.

    Quick Google search discovered the workaround (http://stackoverflow.com/questions/1889597/deleting-directory-in-python), so I am set, but the original
    question: "Why oh why is this such a pain?"
    and the comment: "Maybe nobody has taken the five minutes to file a bug at bugs.python.org"
    resonated in my head long enough to give it a try.

    For me it makes sense to have this option configurable. And it make a ton of sense to support one line equivalent of "rm -rf".

    @zware
    Copy link
    Member

    zware commented Nov 18, 2013

    I like the idea of a remove_readonly flag. I was going to say that I'm a bit worried about the fact that shutil.rmtree already has a couple of keyword arguments, but it's nowhere near what, say, copytree has. Call me +0.75.

    @bitdancer
    Copy link
    Member

    That's not a good name for the flag. The problem is that 'read-only' means different things on Windows than it does on Unix. Which is why I suggested that the flag control whether or not it acts "posix like" on Windows. So perhaps 'posix_compat'? That feels a bit odd, though, since on posix it does nothing...so it's really about behavioral consistency across platforms....

    Hmm. It's really hard to think of a name that conveys succinctly what we are talking about here.

    A more radical notion would be something like 'delete_control' and have it be tri-valued: 'unixlike', 'windowslike', and 'native', with the default being native. Bad names, most likely, but you get the idea. The disadvantage is that it would be even more code to implement ;)

    @tjguk
    Copy link
    Member

    tjguk commented Nov 18, 2013

    TBH I'm still fairly -0 and edging towards -0.5. If we didn't already
    have two keyword args I might be convinced towards a jfdi=True flag.
    But, as the OP described, the current params already allow for a
    workaround of sorts and another param of the semantics we're discussing
    would surely just be confusing?

    (Or you could just "preprocess" on Windows, eg attrib -r /s)

    @zware
    Copy link
    Member

    zware commented Nov 18, 2013

    I make no claims of being good at naming things :)

    @pfmoore
    Copy link
    Member

    pfmoore commented Jan 21, 2014

    The most obvious solution would be if the onerror argument allowed for retries. At the moment, all it can do is report issues, not recover. Suppose that returning True from onerror meant "retry the operation". Then you could do

        def set_rw(operation, name, exc):
            os.chmod(name, stat.S_IWRITE)
            return True
    
        shutil.rmtree('path', onerror=set_rw)

    @bitdancer
    Copy link
    Member

    See bpo-8523 for a discussion of changing the way onerror behaves. I think it is addressing this same use case, but I didn't reread it in detail.

    @pfmoore
    Copy link
    Member

    pfmoore commented Jan 21, 2014

    It's similar. But the problem is that it only returns a list of errors, it doesn't let you address the error *while the rmtree is in progress*.

    The key thing is that if you can fix the problem in onerror, you can avoid needing to restart the whole tree walk, which is the key aspect of rmtree.

    As things stand, you can use the set_rw function I showed above, and run the rmtree twice:

        shutil.rmtree('path', onerror=set_rw)
        shutil.rmtree('path')

    The first run fixes the error and then the second one deletes the remaining files. But this is clearly inefficient, and makes the limitations of "report errors to the user who can then address them" fairly obvious.

    @bitdancer
    Copy link
    Member

    OK, rereading that issue, I disagree with Tarek and I think that the patch on that issue is ill-advised as it looks like it changes behavior in a non-backward-compatible way.

    If you changed your set_rw onerror handler to a rm_ro_file error handler, would things work? That is, have the handler delete the file?

    Adding a 'retry' capability is interesting, but would be a non-trivial change in logic, and should be addressed in a new issue, not this one.

    @pfmoore
    Copy link
    Member

    pfmoore commented Jan 21, 2014

    Looks like that works. At least in my case - I just did

        def del_rw(action, name, exc):
            os.chmod(name, stat.S_IWRITE)
            os.remove(name)
        shutil.rmtree(path, onerror=del_rw)

    Something more robust might check if name is a directory and os.rmdir that - I didn't need it for my case though.

    Thanks.

    @Jovik
    Copy link
    Mannequin

    Jovik mannequin commented Mar 17, 2014

    This could be at least part of docs; I found that people tend to avoid shutil.rmtree(...) on Windows because of such issues. Some of them call subprocess("rmdir /S /Q <path>") to get desired behavior.

    @tjguk
    Copy link
    Member

    tjguk commented May 6, 2014

    Ok, so to move this forward we have essentially two proposals:

    1. Add a remove_readonly flag

    2. Add a doc example which shows how to use the onerror handler to remove a recalcitrant file.

    I'm -0.5 on (1) because it feels like Windows-specific clutter; and +0 on (2).

    @zware
    Copy link
    Member

    zware commented May 6, 2014

    I'm good with just adding an example to the docs, along the lines of Paul's del_rw. I think it would be better to use a more conservative example though, something like:

       def readonly_handler(rm_func, path, exc_info):
           if issubclass(exc_info[0], PermissionError) and exc_info[1].winerror == 5:
               os.chmod(path, stat.S_IWRITE)
               return rm_func(path)
           raise exc_info[1]

    @pfmoore
    Copy link
    Member

    pfmoore commented May 6, 2014

    Checking the exact error could be a bit fragile. I have a feeling I recently saw an issue somewhere with code that stopped working on Python 3.4 because the precise error raised for a read-only file changed. I don't recall where the issue was, unfortunately.

    It's also worth noting that trapping too broad a set of errors won't actually matter much, because the retry will simply fail again if the actual problem wasn't a read-only file...

    @tjguk
    Copy link
    Member

    tjguk commented May 7, 2014

    The attached patch adds an example to the shutil documentation showing how to use an onerror handler to reattempt the removal of a read-only file. It's deliberately low-tech and simply removes the attribute and retries. If there's some other obstacle, it will continue to fail as it would have done in any case.

    @zware
    Copy link
    Member

    zware commented May 7, 2014

    Fair point, Paul.

    Patch looks good to me, Tim, barring a couple of nits pointed out on Rietveld.

    @tjguk
    Copy link
    Member

    tjguk commented May 7, 2014

    Thanks, Zach. Updated patch.

    @tjguk tjguk self-assigned this May 7, 2014
    @zware
    Copy link
    Member

    zware commented May 7, 2014

    LGTM!

    @tjguk
    Copy link
    Member

    tjguk commented May 7, 2014

    Thanks. I'll hold off pushing until I've had a chance to run it on a
    Unix system. I'm not 100% whether it will operate in the same way there.

    @python-dev
    Copy link
    Mannequin

    python-dev mannequin commented May 7, 2014

    New changeset 31d63ea5dffa by Tim Golden in branch 'default':
    bpo-19643 Add an example of shutil.rmtree which shows how to cope with readonly files on Windows
    http://hg.python.org/cpython/rev/31d63ea5dffa

    New changeset a7560c8f38ee by Tim Golden in branch 'default':
    bpo-19643 Fix whitespace
    http://hg.python.org/cpython/rev/a7560c8f38ee

    @tjguk tjguk closed this as completed May 7, 2014
    @sushisource
    Copy link
    Mannequin

    sushisource mannequin commented Oct 16, 2014

    Although this is closed, I stumbled across it while looking to see if this behavior had changed at all recently, and I have a suggestion I think might work.

    How about we take Tim's example error function which was added to the docs, and it's bound to something like shutil.REMOVE_WINDOWS_READONLY, so it can be used in the following way (probably with a better name):

    shutil.rmtree("C:/readonlyfilesinhere", onerror=shutil.REMOVE_WINDOWS_READONLY)

    Good idea? Too weird?

    @bitdancer
    Copy link
    Member

    I think it is an interesting idea. Probably worth opening a new enhancement request with the suggestion.

    @pfmoore
    Copy link
    Member

    pfmoore commented Oct 16, 2014

    Not a bad idea. I would want the implementation to remain in the documentation as well, as code that wants to be portable back to earlier versions of Python will need it. But for code that targets Python 3.5+ only, having it available "out of the box" is a nice addition.

    @ezio-melotti ezio-melotti transferred this issue from another repository Apr 10, 2022
    Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
    Labels
    OS-windows stdlib Python modules in the Lib dir type-feature A feature request or enhancement
    Projects
    None yet
    Development

    No branches or pull requests

    4 participants