
OverflowError: signed integer is greater than maximum in ssl.py for files larger than 2GB #87019

Closed
amacd31 mannequin opened this issue Jan 7, 2021 · 21 comments
Labels: 3.9 only security fixes, stdlib (Python modules in the Lib dir), type-bug (An unexpected behavior, bug, or error)


amacd31 mannequin commented Jan 7, 2021

BPO 42853
Nosy @ronaldoussoren, @tiran, @methane, @ambv, @bmerry, @jakirkham, @matan1008, @amacd31, @jan-xyz
PRs
  • [3.9] bpo-42853: Fix http.client fails to download >2GiB data over TLS #27405
  • Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.


    GitHub fields:

    assignee = 'https://github.com/methane'
    closed_at = <Date 2021-07-28.13:28:20.739>
    created_at = <Date 2021-01-07.08:03:09.551>
    labels = ['type-bug', 'library', '3.9']
    title = '`OverflowError: signed integer is greater than maximum` in ssl.py for files larger than 2GB'
    updated_at = <Date 2021-07-29.16:08:26.159>
    user = 'https://github.com/amacd31'

    bugs.python.org fields:

    activity = <Date 2021-07-29.16:08:26.159>
    actor = 'lukasz.langa'
    assignee = 'methane'
    closed = True
    closed_date = <Date 2021-07-28.13:28:20.739>
    closer = 'lukasz.langa'
    components = ['Library (Lib)']
    creation = <Date 2021-01-07.08:03:09.551>
    creator = 'amacd31'
    dependencies = []
    files = []
    hgrepos = []
    issue_num = 42853
    keywords = ['patch']
    message_count = 21.0
    messages = ['384565', '384566', '384649', '384658', '391359', '394947', '396454', '396458', '396459', '396460', '396461', '396463', '396509', '396510', '398377', '398388', '398408', '398412', '398436', '398464', '398494']
    nosy_count = 9.0
    nosy_names = ['ronaldoussoren', 'christian.heimes', 'methane', 'lukasz.langa', 'bmerry', 'jakirkham', 'matan1008', 'amacd31', 'jan-xyz']
    pr_nums = ['27405']
    priority = 'normal'
    resolution = 'fixed'
    stage = 'resolved'
    status = 'closed'
    superseder = None
    type = 'behavior'
    url = 'https://bugs.python.org/issue42853'
    versions = ['Python 3.9']


    amacd31 mannequin commented Jan 7, 2021

    When attempting to read a large file (> 2GB) over HTTPS the read fails with "OverflowError: signed integer is greater than maximum".

This occurs with Python >= 3.8 and I've been able to reproduce the problem with the snippet of code below on Linux, Mac OS X, and Windows. (The remote file can be any HTTPS-hosted file larger than 2GB; e.g. an empty file generated with dd if=/dev/zero of=2g.img bs=1 count=0 seek=2G will also do the job.)

    import http.client
    connection = http.client.HTTPSConnection("mirror.aarnet.edu.au")
    connection.request("GET", "/pub/centos/8/isos/x86_64/CentOS-8.3.2011-x86_64-dvd1.iso")
    response = connection.getresponse()
    data = response.read()
    

Doing a git bisect, it looks like this is the result of a change in commit d6bf6f2. Looking over the associated issue and commit message, it seems this was not an intended outcome of the change.
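
A minimal workaround sketch for affected code, assuming the same host and path as the snippet above: streaming the response in bounded chunks keeps each underlying SSL read far below the 2 GiB limit.

    import http.client

    connection = http.client.HTTPSConnection("mirror.aarnet.edu.au")
    connection.request("GET", "/pub/centos/8/isos/x86_64/CentOS-8.3.2011-x86_64-dvd1.iso")
    response = connection.getresponse()

    chunks = []
    while True:
        # read() with a bounded amount avoids handing the whole >2GB
        # length to a single underlying SSL read.
        chunk = response.read(1024 * 1024)
        if not chunk:
            break
        chunks.append(chunk)
    data = b"".join(chunks)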

@amacd31 amacd31 mannequin assigned tiran Jan 7, 2021
@amacd31 amacd31 mannequin added topic-SSL, 3.10 only security fixes, 3.8 only security fixes, 3.9 only security fixes, and type-bug An unexpected behavior, bug, or error labels Jan 7, 2021
    @tiran tiran added stdlib Python modules in the Lib dir and removed topic-SSL labels Jan 7, 2021
    @tiran tiran assigned methane and unassigned tiran Jan 7, 2021

    tiran commented Jan 7, 2021

I cannot lift the overflow restriction until we drop support for OpenSSL 1.0.2. The functions SSL_write() and SSL_read() are limited to a signed 32-bit int. OpenSSL 1.1.1 has new SSL_write_ex() and SSL_read_ex() functions that support size_t. Even then, size_t limits the maximum value to an unsigned 32-bit integer (~4 GB) on 32-bit systems.

ronaldoussoren commented:

The API documentation already implies that write might not write the entire buffer, because it returns the number of bytes actually written (just like os.write).

A possible workaround at the SSL layer is hence to clamp the number of bytes written per call to INT_MAX (or, later, SSIZE_T_MAX).

That said, this does require checking that users of the SSL layer's write method in the stdlib actually check the number of bytes written; otherwise we'd exchange the exception for a silent error.
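
A minimal sketch of the caller-side discipline this implies, assuming a connected ssl.SSLSocket named sock (illustrative only, not stdlib code):

    def send_all(sock, data):
        # With a clamped send(), fewer bytes than len(data) may be
        # written per call, so loop until the buffer is drained,
        # mirroring what socket.sendall() does.
        view = memoryview(data)
        while view:
            sent = sock.send(view)
            view = view[sent:]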


    tiran commented Jan 8, 2021

    That's a good idea, Ronald! socket.c:sock_send_impl() already clamps the input length on Windows:

    #ifdef MS_WINDOWS
        if (ctx->len > INT_MAX)
            ctx->len = INT_MAX;
        ctx->result = send(s->sock_fd, ctx->buf, (int)ctx->len, ctx->flags);
    #else
        ctx->result = send(s->sock_fd, ctx->buf, ctx->len, ctx->flags);
    #endif

I could implement similar logic for SSLSocket. Applications have to check the return value of send() anyway, or use sendall(). The socket.send() method / the send(2) libc function may also write fewer bytes than requested.


    tiran commented Apr 19, 2021

    Python 3.10 will use SSL_write_ex() and SSL_read_ex(), which support > 2 GB data.

    @tiran tiran removed the 3.10 only security fixes label Apr 19, 2021

    jakirkham mannequin commented Jun 2, 2021

Would it be possible to check for these newer OpenSSL symbols during the builds of Python 3.8 & 3.9 (using them when available and falling back to the older API otherwise)? This would allow people to build Python 3.8 & 3.9 with a newer OpenSSL, benefiting from the fix.

That said, I'm not sure if there are other obstacles to using OpenSSL 1.1.1 with Python 3.8 & 3.9.
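
As a rough illustration of the kind of gate being discussed, the linked OpenSSL version is already visible from Python at runtime (the actual proposal would be a build-time check in C):

    import ssl

    # ssl.OPENSSL_VERSION_INFO is a 5-tuple (major, minor, fix, patch, status).
    # SSL_read_ex()/SSL_write_ex() first appeared in OpenSSL 1.1.1; note that
    # LibreSSL would need separate handling, since it lacks these functions.
    has_read_ex = ssl.OPENSSL_VERSION_INFO[:3] >= (1, 1, 1)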


    matan1008 mannequin commented Jun 24, 2021

    A bit of extra context to save clicking through:
    the PR which introduced the regression: #12698

    and the bug: https://bugs.python.org/issue36050

    Maybe some people have context about why we couldn't just roll back that PR and increase MAXAMOUNT to something > 1 MiB and < 2 GiB?


    methane commented Jun 24, 2021

Because this is an SSL issue, and HTTPS is not the only user of SSL.
So we should try to fix the SSL issue before reverting #56907.


    tiran commented Jun 24, 2021

The ssl module supports sending and receiving buffers of more than 2 GB in Python 3.10. It cannot be reliably fixed in Python 3.9 and earlier.


    methane commented Jun 24, 2021

I see. But Python 3.8 is now in security-fix mode.
Let's revert the optimization in the 3.9 branch.


    jakirkham mannequin commented Jun 24, 2021

Right, with this change (#25468). Thanks for adding that, Christian :)

I guess what I'm wondering is whether, in older Python versions, we could do an #ifdef check to use SSL_read_ex & SSL_write_ex if the symbols are found at build time. This would give package maintainers the option to build with a newer OpenSSL to fix this issue (even on older Pythons).


    tiran commented Jun 24, 2021

    No, it would be a backwards incompatible change and introduce inconsistent behavior. SSL_read_ex() is not available in LibreSSL, OpenSSL 1.0.2, and OpenSSL 1.1.0. Your code would work on CentOS 8, Debian 10, and Ubuntu 20.04, but break on CentOS 7, Debian 9, Ubuntu 18.04, and OpenBSD.


    jakirkham mannequin commented Jun 24, 2021

Not following. Why would it break? Presumably once someone builds Python for a particular OS it stays there (as with system package managers, for example).

Or alternatively they build on a much older OS and then deploy to newer ones. The latter case is what we do in conda-forge (Anaconda does the same thing). We also build our own OpenSSL, so we have control of that knob too. I've seen application developers do similar things.

Do you have an example where this wouldn't work? Maybe that would help, as we could define a better solution there.


    tiran commented Jun 24, 2021

Right now, sending or receiving buffers >2 GB over TLS consistently fails on all platforms with Python 3.9. A backport of bf62403 to Python 3.9 would make the behavior inconsistent. Your code would work on your laptop with a recent Fedora or Ubuntu version that has OpenSSL 1.1.1, but it would suddenly fail on your production system with Debian 10, because that has OpenSSL 1.1.0 and Python would have to fall back to SSL_read().

    In my experience these kinds of inconsistencies cause headaches and frustrations. A consistent error gives you a chance to notice a problem early and to implement a workaround.

    @tiran tiran removed the 3.8 only security fixes label Jun 24, 2021

    ambv commented Jul 28, 2021

New changeset 153365d by Inada Naoki in branch '3.9':
[3.9] bpo-42853: Fix http.client fails to download >2GiB data over TLS (GH-27405)

    @ambv ambv closed this as completed Jul 28, 2021

    bmerry mannequin commented Jul 28, 2021

This fix is going to cause a regression of bpo-36050. Would it not be possible to fix this in _ssl.c (by breaking a large read into multiple smaller calls to SSL_read)? It seems like fixing this at the SSL layer is more appropriate than working around it at the HTTP layer and thereby impacting the performance of all HTTP fetches (whether or not they use TLS, and whether or not they exceed 2 GB).
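
A Python-level sketch of that chunking idea, with a hypothetical helper name (the actual proposal targets the C code in _ssl.c):

    MAX_CHUNK = 2**31 - 1  # SSL_read() length limit: signed 32-bit int

    def recv_into_chunked(ssl_sock, buffer):
        # Fill `buffer` using reads that each stay below SSL_read()'s
        # limit, so no single call can overflow.
        view = memoryview(buffer)
        total = 0
        while total < len(view):
            n = ssl_sock.recv_into(view[total:total + MAX_CHUNK])
            if n == 0:  # connection closed before buffer was filled
                break
            total += n
        return total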


    jakirkham mannequin commented Jul 28, 2021

Agree with Bruce. It seems like we could support OpenSSL 1.1.1 at that level, with a compile-time fallback for previous OpenSSL versions that breaks up the work. I'd hope this solution also yields something we can backport more easily.


    bmerry mannequin commented Jul 28, 2021

It seems like we could support OpenSSL 1.1.1 at that level, with a compile-time fallback for previous OpenSSL versions that breaks up the work. I'd hope this solution also yields something we can backport more easily.

I'd have to look at exactly how the SSL_read API works, but I think once we're in C land and can read into regions of a buffer, reading in 2 GB chunks is unlikely to cause a performance hit (unlike the original bpo-36050, where Python had to read a bunch of separate buffers and then join them together). So trying to have 3.9 support both SSL_read_ex AND a fallback sounds like it's adding complexity and risking inconsistency (if the fallback doesn't perfectly mimic the SSL_read_ex path) for very little gain.

If no one else steps up sooner I can probably work on a patch, but before sinking time into it I'd like to hear whether there is agreement that this is a reasonable approach, and ideally have a volunteer to review it (hopefully someone who is familiar with OpenSSL, since I've only briefly dealt with it years ago, and crypto isn't somewhere you want to make mistakes).


    tiran commented Jul 28, 2021

A patch would not land in Python 3.9, since this would be a new feature and out of scope for a released version.

Do you really want to store gigabytes of downloads in RAM instead of doing chunked reads and storing them on disk? If you cannot or don't want to write to disk, then there are easier and better ways to deal with large buffers. You could either mmap() a file or use a memoryview of a bytearray buffer:

    buf = bytearray(4 * 1024**3)   # preallocate 4 GiB up front
    view = memoryview(buf)         # zero-copy window into buf
    pos = 0
    while True:
        # Read at most 1 MiB per call, directly into the buffer.
        read = conn.recv_into(view[pos:pos+1048576])
        if not read:
            break
        pos += read


    bmerry mannequin commented Jul 29, 2021

A patch would not land in Python 3.9, since this would be a new feature and out of scope for a released version.

    I see it as a fix for this bug. While there is already a fix, it regresses another bug (bpo-36050), so this would be a better fix.

Do you really want to store gigabytes of downloads in RAM instead of doing chunked reads and storing them on disk?

I work on HPC applications where large quantities of data are stored in an S3-compatible object store and fetched over HTTP at 25 Gb/s for processing. The data access layer tries very hard to avoid even making extra copies in memory (which is what caused me to file bpo-36050 in the first place), as it makes a significant difference at those speeds. Buffering to disk would be right out.

    then there are easier and better ways to deal with large buffers

Your example code is probably fine if one is working directly on an SSLSocket, but http.client wraps it in a buffered reader (via socket.makefile), and that implements readinto by reading into a temporary and copying it (memcpy(buffer->buf, PyBytes_AS_STRING(data), len);), which would add overhead.

I appreciate that what I'm proposing is a relatively complex change for a released version. A less intrusive option would be to change MAXAMOUNT in http.client from 1 MiB to 2 GiB - 1 byte (as suggested by @matan1008). That would still leave 3.9 slower than 3.8 when reading >2 GiB responses over plain HTTP, but at least everything in the range [1 MiB, 2 GiB) would operate at full speed (which is the region I actually care about).


    ambv commented Jul 29, 2021

This fix is going to cause a regression of bpo-36050. Would it not be possible to fix this in _ssl.c (by breaking a large read into multiple smaller calls to SSL_read)? It seems like fixing this at the SSL layer is more appropriate than working around it at the HTTP layer and thereby impacting the performance of all HTTP fetches (whether or not they use TLS, and whether or not they exceed 2 GB).

Bruce, you're discussing the *revert* of the bpo-36050 change as if it were a new change. It isn't; it is literally returning to a state that was previously tested and had worked fine since Python 3.5. By reverting, we're choosing correctness over performance here.

Introducing a new implementation for bpo-36050 is now out of scope for 3.9, as it's a performance-oriented change which we cannot accept in a bugfix release.

    @ezio-melotti ezio-melotti transferred this issue from another repository Apr 10, 2022