classification
Title: ctime: I don't think that word means what you think it means.
Type: Stage:
Components: Library (Lib) Versions:
process
Status: open Resolution:
Dependencies: Superseder:
Assigned To: Nosy List: pitrou, zooko
Priority: normal Keywords:

Created on 2009-04-07 21:57 by zooko, last changed 2009-06-13 17:44 by zooko.

Messages (3)
msg85750 - (view) Author: Zooko O'Whielacronx (zooko) Date: 2009-04-07 21:57
The stat module currently uses the "st_ctime" slot to hold two kinds
values which are semantically different but which are frequently
confused with one another.  It chooses which kind of value to put in
there based on platform -- Windows gets the file creation time and all
other platforms get the "ctime".  The only sane way to use this API is
then to switch on platform:

if platform.system() == "Windows":
    metadata["creation time"] = s.st_ctime
else:
    metadata["unix ctime"] = s.st_ctime

(That is an actual code snippet from the Allmydata-Tahoe project.)

Many or even most programmers incorrectly think that unix ctime is file
creation time, so instead of using the sane idiom above, they write the
following:

metadata["ctime"] = s.st_ctime

thus passing on the confusion to the users of their metadata, who may
not know on which platform this metadata was created.  This is the
situation we have found ourselves in for the Allmydata-Tahoe project --
we now have a bunch of "ctime" values stored in our filesystem and no
way to tell which kind they were.

More and more filesystems such as ZFS and Macintosh apparently offer
creation time nowadays.

I propose the following changes:

1.  Add a "st_crtime" field which gets populated on filesystems
(Windows, ZFS, Mac) which can do so.

That is hopefully not too controversial and we could proceed to do so
even if the next proposal gets bogged down:

2.  Add a "st_unixctime" field which gets populated *only* by the unix
ctime and never by any other value (even on Windows, where the unix
ctime *is* available even though nobody cares about it), and deprecate
the hopelessly ambiguous "st_ctime" field.

You may be interested in http://allmydata.org/trac/tahoe/ticket/628
("mtime" and "ctime": I don't think that word means what you think it
means.) where the Allmydata-Tahoe project is carefully unpicking the
mess we made for ourselves by confusing ctime with file-creation time.
msg85987 - (view) Author: Antoine Pitrou (pitrou) * (Python committer) Date: 2009-04-15 10:18
Sounds like a good idea, perhaps you could launch a discussion on
python-dev?
msg89336 - (view) Author: Zooko O'Whielacronx (zooko) Date: 2009-06-13 17:44
Okay, I posted to python-dev:

http://mail.python.org/pipermail/python-dev/2009-June/090021.html
History
Date User Action Args
2009-06-13 17:44:27zookosetmessages: + msg89336
2009-04-15 10:18:45pitrousetnosy: + pitrou
messages: + msg85987
2009-04-07 21:57:48zookocreate