UBSAN: test_ctypes, test_faulthandler and test_hashlib are failing #91069

vstinner · 2022-03-03T16:19:36Z

BPO	46913
Nosy	@gpshead, @vstinner, @tiran, @vadmium, @pablogsal, @miss-islington
PRs	bpo-46913: Fix test_faulthandler.test_sigfpe() on UBSAN #31662 [3.10] bpo-46913: Fix test_faulthandler.test_sigfpe() on UBSAN (GH-31662) #31665 [3.9] bpo-46913: Fix test_faulthandler.test_sigfpe() on UBSAN (GH-31662) #31666 bpo-46913: Fix test_faulthandler.test_read_null() on UBSAN #31672 bpo-46913: test_hashlib skips _sha3 tests on UBSan #31673 bpo-46913: Skip test_ctypes.test_shorts() on UBSan #31674 [3.10] bpo-46913: Fix test_ctypes, test_hashlib, test_faulthandler on UBSan #31675 [3.9] bpo-46913: Fix test_ctypes, test_hashlib, test_faulthandler on UBSan (GH-31675) #31676

^{Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.}

Show more details

GitHub fields:

assignee = None
closed_at = None
created_at = <Date 2022-03-03.16:19:36.361>
labels = ['tests', '3.11']
title = 'UBSAN: test_ctypes, test_faulthandler and test_hashlib are failing'
updated_at = <Date 2022-03-10.17:29:40.327>
user = 'https://github.com/vstinner'

bugs.python.org fields:

activity = <Date 2022-03-10.17:29:40.327>
actor = 'vstinner'
assignee = 'none'
closed = False
closed_date = None
closer = None
components = ['Tests']
creation = <Date 2022-03-03.16:19:36.361>
creator = 'vstinner'
dependencies = []
files = []
hgrepos = []
issue_num = 46913
keywords = ['patch']
message_count = 20.0
messages = ['414455', '414456', '414457', '414458', '414471', '414477', '414486', '414489', '414491', '414494', '414495', '414497', '414499', '414501', '414507', '414512', '414694', '414867', '414870', '414872']
nosy_count = 6.0
nosy_names = ['gregory.p.smith', 'vstinner', 'christian.heimes', 'martin.panter', 'pablogsal', 'miss-islington']
pr_nums = ['31662', '31665', '31666', '31672', '31673', '31674', '31675', '31676']
priority = 'normal'
resolution = None
stage = 'patch review'
status = 'open'
superseder = None
type = None
url = 'https://bugs.python.org/issue46913'
versions = ['Python 3.11']

vstinner · 2022-03-03T16:19:36Z

I recently changed how tests are skipped in ASAN, MSAN and UBSAN CIs (buildbot workers and GitHub Action jobs):

3 tests are now failing on "AMD64 Arch Linux Usan 3.x":

test_ctypes
test_faulthandler
test_hashlib

First failed build:
https://buildbot.python.org/all/#/builders/719/builds/632

vstinner · 2022-03-03T16:20:36Z

test_ctypes: test_shorts() of ctypes.test.test_bitfields.C_Test is failing with:

---
test_shorts (ctypes.test.test_bitfields.C_Test) ... /home/vstinner/python/main/Modules/_ctypes/cfield.c:554:5: runtime error: shift exponent 18446744073709551614 is too large for 16-bit type 'short'
SUMMARY: UndefinedBehaviorSanitizer: undefined-behavior /home/vstinner/python/main/Modules/_ctypes/cfield.c:554:5 in
---

It's a test on the "h" format code:
----

#define LOW_BIT(x)  ((x) & 0xFFFF)
#define NUM_BITS(x) ((x) >> 16)

#define GET_BITFIELD(v, size)                                           \
    if (NUM_BITS(size)) {                                               \
        v <<= (sizeof(v)*8 - LOW_BIT(size) - NUM_BITS(size));           \
        v >>= (sizeof(v)*8 - NUM_BITS(size));                           \

static PyObject *
h_get(void *ptr, Py_ssize_t size)
{
    short val;
    memcpy(&val, ptr, sizeof(val));
    GET_BITFIELD(val, size); // <==== HERE
    return PyLong_FromLong((long)val);
}

static struct fielddesc formattable[] = {
    ...
    { 'h', h_set, h_get, NULL, h_set_sw, h_get_sw},
    ...
};

vstinner · 2022-03-03T16:21:12Z

test_hashlib: test_gil() fails with:
---
test_gil (test.test_hashlib.HashLibTestCase) ... /home/vstinner/python/main/Modules/_sha3/kcp/KeccakP-1600-opt64.c:467:9: runtime error: load of misaligned address 0x000002daafd7 for type 'UINT64' (aka 'unsigned long'), which requires 8 byte alignment
0x000002daafd7: note: pointer points here
23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23
^
SUMMARY: UndefinedBehaviorSanitizer: undefined-behavior /home/vstinner/python/main/Modules/_sha3/kcp/KeccakP-1600-opt64.c:467:9 in
---

vstinner · 2022-03-03T16:22:11Z

test_faulthandler: test_sigfpe() fails with:

======================================================================
FAIL: test_sigfpe (test.test_faulthandler.FaultHandlerTests)
----------------------------------------------------------------------

Traceback (most recent call last):
  File "/home/vstinner/python/main/Lib/test/test_faulthandler.py", line 228, in test_sigfpe
    self.check_fatal_error("""
    ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/vstinner/python/main/Lib/test/test_faulthandler.py", line 129, in check_fatal_error
    self.check_error(code, line_number, fatal_error, **kw)
    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/vstinner/python/main/Lib/test/test_faulthandler.py", line 122, in check_error
    self.assertRegex(output, regex)
    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
AssertionError: Regex didn't match: '(?m)^Fatal Python error: Floating point exception\n\nCurrent thread 0x[0-9a-f]+ \\(most recent call first\\):\n  File "<string>", line 3 in <module>' not found in 'Modules/faulthandler.c:1112:11: runtime error: division by zero\nSUMMARY: UndefinedBehaviorSanitizer: undefined-behavior Modules/faulthandler.c:1112:11 in '

vstinner · 2022-03-03T20:45:08Z

New changeset 4173d67 by Victor Stinner in branch 'main':
bpo-46913: Fix test_faulthandler.test_sigfpe() on UBSAN (GH-31662)
4173d67

vstinner · 2022-03-03T22:05:14Z

Reproducer of test_ctypes undefined behavior:
---

from ctypes import *

class BITS(Structure):
    _fields_ = [("A", c_int, 1),
                ("B", c_int, 2),
                ("C", c_int, 3),
                ("D", c_int, 4),
                ("E", c_int, 5),
                ("F", c_int, 6),
                ("G", c_int, 7),
                ("H", c_int, 8),
                ("I", c_int, 9),

            ("M", c_short, 1),
            ("N", c_short, 2),
            ("O", c_short, 3),
            ("P", c_short, 4),
            ("Q", c_short, 5),
            ("R", c_short, 6),
            ("S", c_short, 7)]

b = BITS()
a = getattr(b, "M")

GET_BITFIELD(val, size); // \<==== HERE

I expanded the macro:
---

    if (NUM_BITS(size)) {
        size_t s = sizeof(val)*8;
        Py_ssize_t low = LOW_BIT(size);
        Py_ssize_t nbits = NUM_BITS(size);
        // val=0, size=65553 = (1 << 16) + 17
        // s=16, low=17, nbits=1
        // s - low - nbits = (size_t)-2
        val <<= (s - low - nbits);
        val >>= (s - nbits);
    }

The problem is that (s - low - nbits) is negative (-2), but becomes a very large number since it's unsigned (s is unsigned: "sizeof(v)*8" in the macro).

C code:
---

#include <stdio.h>

int main()
{
    short s = 0x7fff;
    size_t z = (size_t)-2;
    s <<= z;
    printf("s = %hi\n", s);
    return 0;
}

GCC and clang disagree :-D
---

$ gcc x.c -o x -O3 -Wall -Wextra -Wconversion && ./x
s = 0

$ clang x.c -o x -O3 -Wall -Wextra -Wconversion && ./x
s = -5784

Moreover, runtime the binary built by clang produces a different result at each run...
---

$ ./x
s = -4824
$ ./x
s = 4120
$ ./x
s = -22344
$ ./x
s = -26744
$ ./x
s = -18184

vstinner · 2022-03-03T22:45:34Z

python/buildmaster-config@0fd1e3e

Previously, 4 tests were skipped on the UBSAN buildbot

test_faulthandler
test_hashlib
test_concurrent_futures
test_ctypes

It's not a regression, it's just that I made the buildbot stricter. I'm planning to attempt to fix the 3 failing tests, or at least skip them in Python, rather than skipping them in the buildbot configuration.

vstinner · 2022-03-03T23:25:36Z

New changeset 65b92cc by Victor Stinner in branch 'main':
bpo-46913: Fix test_faulthandler.test_read_null() on UBSan (GH31672)
65b92cc

vstinner · 2022-03-03T23:36:53Z

See also bpo-46887: ./Programs/_freeze_module fails with MSAN: Uninitialized value was created by an allocation of 'stat.i'.

vstinner · 2022-03-03T23:41:43Z

New changeset 6d0d7d2 by Victor Stinner in branch 'main':
bpo-46913: test_hashlib skips _sha3 tests on UBSan (GH-31673)
6d0d7d2

vstinner · 2022-03-03T23:42:01Z

New changeset ad1b044 by Victor Stinner in branch 'main':
bpo-46913: Skip test_ctypes.test_shorts() on UBSan (GH-31674)
ad1b044

vstinner · 2022-03-03T23:54:34Z

I pushed changes just to turn back the buildbot back to green, but undefined behaviors in test_ctypes.test_shorts() and _sha3 (tested by test_hashlib) must be fixed.

Previously, test_ctypes, test_hashlib and test_faulthandler were fully skipped on UBSan. Now only 1 test_ctypes test and a few test_hashlib tests are skipped on the UBSan buildbot (test_faulthandler now pass on UBSan).

vstinner · 2022-03-04T00:12:17Z

New changeset 7b5b429 by Victor Stinner in branch '3.10':
[3.10] bpo-46913: Fix test_ctypes, test_hashlib, test_faulthandler on UBSan (GH-31675)
7b5b429

vstinner · 2022-03-04T00:32:04Z

New changeset 6a14330 by Victor Stinner in branch '3.9':
bpo-46913: Fix test_ctypes, test_hashlib, test_faulthandler on UBSan (GH-31675) (GH-31676)
6a14330

tiran · 2022-03-04T07:49:45Z

I don't agree with #75854. Did you try defining NO_MISALIGNED_ACCESSES instead?

vstinner · 2022-03-04T09:42:55Z

I don't agree with #75854. Did you try defining NO_MISALIGNED_ACCESSES instead?

Did you read the commit message? The change is not about skipping the test, but fixing the CI. Previously, test_hashlib was not run at all on the UBSan buildbot, now most test_hashlib tests are run. The intent is to make sure that we don't add *new* undefined behavior.

As I wrote in the commit, the UD must be fixed in _sha3.

No, I didn't try NO_MISALIGNED_ACCESSES, I don't know this macro. If you have an idea on how _sha3 can be fixed on the UBSan buildbot, please go head!

vadmium · 2022-03-07T19:24:56Z

The ctypes overflow is probably the same as described in bpo-28169 and bpo-15119

vstinner · 2022-03-10T17:01:28Z

Current status of tests skipped on ASAN, MSAN and UBSAN.

Only ASAN (1):

_test_multiprocessing.py:76:if support.check_sanitizer(address=True):

ASAN and MSAN (10):

test___all__.py:14:if support.check_sanitizer(address=True, memory=True):
test_concurrent_futures.py:35:if support.check_sanitizer(address=True, memory=True):
test_crypt.py:7: if check_sanitizer(address=True, memory=True):
test_decimal.py:5510: @unittest.skipIf(check_sanitizer(address=True, memory=True),
test_idle.py:5:if check_sanitizer(address=True, memory=True):
test_peg_generator/init.py:7:if support.check_sanitizer(address=True, memory=True):
test_tix.py:7:if check_sanitizer(address=True, memory=True):
test_tk.py:6:if check_sanitizer(address=True, memory=True):
test_tools/init.py:10:if support.check_sanitizer(address=True, memory=True):
test_ttk_guionly.py:6:if check_sanitizer(address=True, memory=True):

Only UB (1):

test_hashlib.py:68:SKIP_SHA3 = support.check_sanitizer(ub=True)

vstinner · 2022-03-10T17:10:29Z

I enabled the test on ASAN on test_crypt and I confirm that I get a crash on calling a NULL function.

The Python crypt.crypt() function calls the C function crypt_r() which is intercepted by libasan, but the libasan implementation calls a NULL function. I don't know why.

$ ./configure --with-address-sanitizer --with-pydebug
$ make clean
$ ASAN_OPTIONS="detect_leaks=0:allocator_may_return_null=1:handle_segv=0" make
$ ASAN_OPTIONS="detect_leaks=0:allocator_may_return_null=1:handle_segv=0" gdb -args ./python -m test -v test_crypt 
(gdb) run
(...)
0:00:00 load avg: 0.53 Run tests sequentially
0:00:00 load avg: 0.53 [1/1] test_crypt

Program received signal SIGSEGV, Segmentation fault.
0x0000000000000000 in ?? ()

(gdb) where
#0 0x0000000000000000 in ?? ()
#1 0x00007ffff761189f in __interceptor_crypt_r.part.0 () from /lib64/libasan.so.6
#2 0x00007fffe6a40821 in crypt_crypt_impl (module=<module at remote 0x6080004ae7c0>, word=0xfcb050 <importlib.bootstrap_external_toplevel_consts_7+48> "", salt=0x6080004bc660 "$6$d8Imx7a5WbE12iK4")
at /home/vstinner/python/main/Modules/_cryptmodule.c:44
#3 0x00007fffe6a40695 in crypt_crypt (module=<module at remote 0x6080004ae7c0>, args=0x629000001368, nargs=2) at /home/vstinner/python/main/Modules/clinic/_cryptmodule.c.h:58
(...)

$ ASAN_OPTIONS="detect_leaks=0:allocator_may_return_null=1:handle_segv=0" ./python -m test -v test_crypt 
(...)
0:00:00 load avg: 0.56 Run tests sequentially
0:00:00 load avg: 0.56 [1/1] test_crypt
Fatal Python error: Segmentation fault

Current thread 0x00007f367c6c77c0 (most recent call first):
File "/home/vstinner/python/main/Lib/crypt.py", line 82 in crypt
File "/home/vstinner/python/main/Lib/crypt.py", line 94 in _add_method
File "/home/vstinner/python/main/Lib/crypt.py", line 105 in <module>
File "<frozen importlib._bootstrap>", line 241 in _call_with_frames_removed
File "<frozen importlib._bootstrap_external>", line 931 in exec_module
File "<frozen importlib._bootstrap>", line 690 in _load_unlocked
File "<frozen importlib._bootstrap>", line 1149 in _find_and_load_unlocked
File "<frozen importlib._bootstrap>", line 1178 in _find_and_load
File "/home/vstinner/python/main/Lib/test/test_crypt.py", line 6 in <module>
(...)

vstinner · 2022-03-10T17:29:40Z

The crypt_r() interceptor issue was reported in January 2021 to libasan:
google/sanitizers#1365

I enabled the test on ASAN on test_crypt and I confirm that I get a crash on calling a NULL function.

Note: I built Python with GCC ASAN (-fsanitize=address).

vstinner · 2022-11-03T17:43:21Z

Well, IMO I enhanced the situation enough. I close the issue :-) Open new issues if you want to fix a specific test on UBSan buildbots ;-)

vstinner added 3.11 only security fixes tests Tests in the Lib/test dir labels Mar 3, 2022

ezio-melotti transferred this issue from another repository Apr 10, 2022

serhiy-storchaka added the topic-ctypes label May 21, 2022

vstinner closed this as completed Nov 3, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

UBSAN: test_ctypes, test_faulthandler and test_hashlib are failing #91069

UBSAN: test_ctypes, test_faulthandler and test_hashlib are failing #91069

vstinner commented Mar 3, 2022

vstinner commented Mar 3, 2022

vstinner commented Mar 3, 2022

vstinner commented Mar 3, 2022

vstinner commented Mar 3, 2022

vstinner commented Mar 3, 2022

vstinner commented Mar 3, 2022

vstinner commented Mar 3, 2022

vstinner commented Mar 3, 2022

vstinner commented Mar 3, 2022

vstinner commented Mar 3, 2022

vstinner commented Mar 3, 2022

vstinner commented Mar 3, 2022

vstinner commented Mar 4, 2022

vstinner commented Mar 4, 2022

tiran commented Mar 4, 2022

vstinner commented Mar 4, 2022

vadmium commented Mar 7, 2022

vstinner commented Mar 10, 2022

vstinner commented Mar 10, 2022

vstinner commented Mar 10, 2022

vstinner commented Nov 3, 2022

UBSAN: test_ctypes, test_faulthandler and test_hashlib are failing #91069

UBSAN: test_ctypes, test_faulthandler and test_hashlib are failing #91069

Comments

vstinner commented Mar 3, 2022

vstinner commented Mar 3, 2022

vstinner commented Mar 3, 2022

vstinner commented Mar 3, 2022

vstinner commented Mar 3, 2022

vstinner commented Mar 3, 2022

vstinner commented Mar 3, 2022

vstinner commented Mar 3, 2022

vstinner commented Mar 3, 2022

vstinner commented Mar 3, 2022

vstinner commented Mar 3, 2022

vstinner commented Mar 3, 2022

vstinner commented Mar 3, 2022

vstinner commented Mar 4, 2022

vstinner commented Mar 4, 2022

tiran commented Mar 4, 2022

vstinner commented Mar 4, 2022

vadmium commented Mar 7, 2022

vstinner commented Mar 10, 2022

vstinner commented Mar 10, 2022

vstinner commented Mar 10, 2022

vstinner commented Nov 3, 2022