This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

classification
Title: fileobject.c can switch between fread and fwrite without an intervening flush or seek, invoking undefined behaviour
Type: security Stage:
Components: IO Versions: Python 2.7
process
Status: closed Resolution: not a bug
Dependencies: Superseder:
Assigned To: pitrou Nosy List: davidsarah, georg.brandl, pitrou
Priority: normal Keywords:

Created on 2010-02-18 00:21 by davidsarah, last changed 2022-04-11 14:56 by admin. This issue is now closed.

Messages (4)
msg99483 - (view) Author: David-Sarah Hopwood (davidsarah) Date: 2010-02-18 00:21
The C standard (any version, or POSIX), says in the description of fopen that:
{{{
When a file is opened with update mode ( '+' as the second or third character in the mode argument), both input and output may be performed on the associated stream. However, the application shall ensure that output is not directly followed by input without an intervening call to fflush() or to a file positioning function ( fseek(), fsetpos(), or rewind()), and input is not directly followed by output without an intervening call to a file positioning function, unless the input operation encounters end-of-file.
}}}

Objects/fileobject.c makes calls to fread and fwrite without taking this into account. So calls from Python to read or write methods of a file object opened in any "rw" mode, may invoke undefined behaviour. It isn't reasonable to rely on Python code to avoid this situation, even if were considered acceptable in C. (Arguably this is a bug in the C standard, but it is unlikely to be fixed there or in POSIX, because of differences in philosophy about language safety.)

To fix this, fileobject.c should keep track of whether the last I/O operation was an input or output, and perform a call to fflush whenever an input follows an output or vice versa. This should not significantly affect performance in any case where the behaviour was previously defined (in cases where it wasn't, correctness trumps performance). fflush does not affect the file position and should have no other negative effect, because the stdio implementation is free to flush buffered data at any time (and certainly on I/O operations).

Despite the undefined behaviour, I don't currently know of a platform where this would lead to an exploitable security bug. I'm marking this issue as security-relevant anyway, because it may prevent analysing whether Python applications behave securely only on the basis of documented behaviour.
msg99492 - (view) Author: David-Sarah Hopwood (davidsarah) Date: 2010-02-18 01:36
Correction: when input is followed by output, the call needed to avoid undefined behaviour has to be to a "file positioning function" (fseek, fsetpos, or rewind, but not fflush). Since fileobject.c does not use wide I/O operations, it should be sufficient to use _portable_fseek(fp, 0, SEEK_SET).

(_portable_fseek may call some function that is not strictly defined to be a "file positioning function", e.g. fseeko() or fseek64(). However, it would be insane for a stdio implementation not to treat those as being file positioning functions as far as the intent of the C or POSIX standards is concerned.)
msg99493 - (view) Author: David-Sarah Hopwood (davidsarah) Date: 2010-02-18 01:39
"... it should be sufficient to use _portable_fseek(fp, 0, SEEK_SET)."

I meant SEEK_CUR here.
msg112320 - (view) Author: Antoine Pitrou (pitrou) * (Python committer) Date: 2010-08-01 12:23
The 2.x file object mostly mirrors abilities of the standard C buffered IO functions (including, for example, special behaviour of text files under Windows). Therefore, Python code should call flush() manually if needed. It should be noted that the 2.x file object has been existing for ages and this issue is, to my knowledge, very rarely brought up.

The 3.x IO library should not have this problem since we wrote our own buffering layer, and we have dedicated tests for mixed reads and writes.
History
Date User Action Args
2022-04-11 14:56:57adminsetgithub: 52200
2010-08-01 12:23:13pitrousetstatus: open -> closed

nosy: + georg.brandl
messages: + msg112320

resolution: not a bug
2010-08-01 07:59:40georg.brandlsetassignee: pitrou

nosy: + pitrou
2010-02-18 01:39:38davidsarahsetmessages: + msg99493
2010-02-18 01:36:38davidsarahsetmessages: + msg99492
2010-02-18 00:21:53davidsarahcreate