This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in Python's Developer Guide.

classification
Title: close() behavior on non-blocking BufferedIO objects with sockets
Type: behavior
Stage: resolved
Components: IO
Versions: Python 2.7, Python 3.4, Python 3.5, Python 3.6

process
Status: closed
Resolution: rejected
Dependencies:
Superseder:
Assigned To:
Nosy List: dabeaz, eryksun, martin.panter
Priority: normal
Keywords:

Created on 2015-10-25 21:46 by dabeaz, last changed 2022-04-11 14:58 by admin. This issue is now closed.

Messages (6)
msg253438 - (view) Author: David Beazley (dabeaz) Date: 2015-10-25 21:46
In the I/O library, there is documented behavior for how things work in the presence of non-blocking I/O. For example, read/write methods on raw file objects return None when an operation would block, and methods on BufferedIO instances raise BlockingIOError for operations that can't complete.
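
For illustration, a minimal sketch of that contrast (assuming Python 3 and a socketpair whose peer never reads, so the kernel send buffer eventually fills; this setup is not part of the original report):

    import socket

    # Hypothetical setup: a connected pair, so no remote host is needed.
    a, b = socket.socketpair()
    a.setblocking(False)
    b.setblocking(False)

    raw = a.makefile('wb', buffering=0)    # raw SocketIO object
    # Raw contract: write() returns None (or a short count) once the
    # socket would block, rather than raising.
    while raw.write(b'x' * 65536) is not None:
        pass

    buf = b.makefile('wb')                 # BufferedWriter on top
    # Buffered contract: write() raises BlockingIOError once neither the
    # internal buffer nor the socket can absorb more data.
    try:
        while True:
            buf.write(b'y' * 65536)
    except BlockingIOError as e:
        print('buffered %d bytes before blocking' % e.characters_written)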

However, the implementation of close() is currently broken. If buffered I/O is being used and a file is closed, the close can fail with a BlockingIOError as buffered data is flushed to the output. In that case, the file is closed anyway and there is no possibility to retry. Here is an example to illustrate:

>>> from socket import *
>>> s = socket(AF_INET, SOCK_STREAM)
>>> s.connect(('somehost', port))
>>> s.setblocking(False)
>>> f = s.makefile('wb', buffering=10000000)   # Large buffer
>>> f.write(b'x'*1000000)
>>>

Now, watch carefully:

>>> f
<_io.BufferedWriter name=4>
>>> f.closed
False
>>> f.close()
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
BlockingIOError: [Errno 35] write could not complete without blocking
>>> f
<_io.BufferedWriter name=-1>
>>> f.closed
True
>>>

I believe this can be fixed by changing a single line in Modules/_io/bufferedio.c:

--- bufferedio_orig.c	2015-10-25 16:40:22.000000000 -0500
+++ bufferedio.c	2015-10-25 16:40:35.000000000 -0500
@@ -530,10 +530,10 @@
     res = PyObject_CallMethodObjArgs((PyObject *)self, _PyIO_str_flush, NULL);
     if (!ENTER_BUFFERED(self))
         return NULL;
-    if (res == NULL)
-        PyErr_Fetch(&exc, &val, &tb);
-    else
-        Py_DECREF(res);
+    if (res == NULL) 
+      goto end;
+    else 
+      Py_DECREF(res);
 
     res = PyObject_CallMethodObjArgs(self->raw, _PyIO_str_close, NULL);

With this patch, the close() method can be retried as appropriate until all buffered data is successfully written.
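
For illustration, the retry pattern this would enable might look like the following sketch (assuming the patched behavior and continuing with the f from the example above; select is just one way to wait for writability):

    import select

    # Assumes the patched close(): on BlockingIOError the file stays
    # open, so we can wait until the socket is writable and try again.
    while True:
        try:
            f.close()          # flushes the remaining buffered data
            break
        except BlockingIOError:
            select.select([], [f.fileno()], [])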
msg253456 - (view) Author: Eryk Sun (eryksun) * (Python triager) Date: 2015-10-26 07:46
Per issue 16597, when an exception occurs in flush(), the file is closed anyway. You'd have to check the exception and only skip to the end for EWOULDBLOCK or EAGAIN. That way you're not introducing a regression for ENOSPC and other exceptions for which retrying will just repeatedly fail. 

I'm adding 2.7 to keep the io module consistent; however, socket.makefile in 2.7 doesn't use the io module.
msg253464 - (view) Author: David Beazley (dabeaz) Date: 2015-10-26 08:48
Please don't make flush() close the file on a BlockingIOError.   That would be an unfortunate mistake and make it impossible to implement non-blocking I/O correctly with buffered I/O.
msg253465 - (view) Author: Eryk Sun (eryksun) * (Python triager) Date: 2015-10-26 10:04
I meant that you need a check in buffered_close such as the following:

    if (res == NULL) {
        if (PyErr_ExceptionMatches(PyExc_BlockingIOError))
            goto end;
        PyErr_Fetch(&exc, &val, &tb);
    } else
        Py_DECREF(res);

For 2.7 you could create a function similar to _PyIO_trap_eintr, but trap the errors that Python 3 maps to BlockingIOError: EAGAIN/EWOULDBLOCK, EALREADY, and EINPROGRESS.
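
For reference, a sketch of such a trap (is_blocking_error is a hypothetical helper name; 2.7 has no BlockingIOError, so the check is by errno):

    import errno

    # Errno values that Python 3 maps to BlockingIOError (PEP 3151).
    BLOCKING_ERRNOS = frozenset([errno.EAGAIN, errno.EWOULDBLOCK,
                                 errno.EALREADY, errno.EINPROGRESS])

    def is_blocking_error(exc):
        # Hypothetical helper: on 2.7, match IOError/socket.error by errno.
        return getattr(exc, 'errno', None) in BLOCKING_ERRNOS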
msg253528 - (view) Author: Martin Panter (martin.panter) * (Python committer) Date: 2015-10-27 11:19
David, what’s your use case for doing non-blocking buffered writing “correctly”? Would you be able to use the context manager functionality? I would have thought you would explicitly call flush() as many times as necessary, but only call close() once when you are done.
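
For example, something like this sketch (drain_and_close is a hypothetical name, and select is just one way to wait for writability):

    import select

    def drain_and_close(f):
        # Retry flush() until the buffer drains, then close() exactly once.
        while True:
            try:
                f.flush()
                break
            except BlockingIOError:
                select.select([], [f.fileno()], [])   # wait until writable
        f.close()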

At least in blocking mode, close() is meant to do as much as possible, despite any intermediate exceptions. It doesn’t seem wise to break this rule in non-blocking mode.
msg268412 - (view) Author: Martin Panter (martin.panter) * (Python committer) Date: 2016-06-13 02:14
I propose to reject this. Close() should always close the underlying file descriptor or socket, even if there is a blocking error or other exception.
History
Date                 User           Action  Args
2022-04-11 14:58:23  admin          set     github: 69662
2019-03-15 08:11:52  martin.panter  set     status: pending -> closed; stage: resolved
2016-06-13 02:14:53  martin.panter  set     status: open -> pending; resolution: rejected; messages: + msg268412
2015-10-27 11:19:55  martin.panter  set     nosy: + martin.panter; messages: + msg253528
2015-10-26 10:04:59  eryksun        set     messages: + msg253465
2015-10-26 08:48:20  dabeaz         set     messages: + msg253464
2015-10-26 07:46:54  eryksun        set     nosy: + eryksun; messages: + msg253456; versions: + Python 2.7, Python 3.4, Python 3.6
2015-10-25 21:46:32  dabeaz         create