Title: Unicode identifiers not necessarily unique
Diego Argueta (da) * Date: 2022-01-29 17:06
The way Python 3 handles identifiers containing mathematical characters appears to be broken. I didn't test the entire range of U+1D400 through U+1D59F but I spot-checked them and the bug manifests itself there:

    Python 3.9.7 (default, Sep 10 2021, 14:59:43) 
    [GCC 11.2.0] on linux
    Type "help", "copyright", "credits" or "license" for more information.

    >>> foo = 1234567890
    >>> bar = 1234567890
    >>> foo is bar
    False
    >>> 𝖇𝖆𝖗 = 1234567890

    >>> foo is 𝖇𝖆𝖗
    False
    >>> bar is 𝖇𝖆𝖗
    True

    >>> 𝖇𝖆𝖗 = 0
    >>> bar
    0
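
For anyone who wants to check the whole range rather than spot-check it, here is a minimal sketch. It assumes the range quoted above (U+1D400 through U+1D59F) and relies only on `unicodedata.normalize`; the helper name `ascii_lookalikes` is made up for illustration.

```python
import unicodedata

def ascii_lookalikes(start=0x1D400, end=0x1D59F):
    """Map each code point in [start, end] to its NFKC form when that form is ASCII."""
    found = {}
    for cp in range(start, end + 1):
        ch = chr(cp)
        folded = unicodedata.normalize("NFKC", ch)
        # Unassigned code points normalize to themselves and are skipped here.
        if folded != ch and folded.isascii():
            found[ch] = folded
    return found

lookalikes = ascii_lookalikes()
print(len(lookalikes), "code points in the range fold to ASCII")
print(lookalikes["\U0001D587"])  # MATHEMATICAL BOLD FRAKTUR SMALL B
```

Any character that shows up in this mapping will be treated as its ASCII counterpart when used in an identifier.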

This differs from the behavior with other non-ASCII characters. For example, ASCII 'a' and Cyrillic 'a' are properly treated as different identifiers:

    >>> а = 987654321    # Cyrillic lowercase 'a', U+0430
    >>> a = 123456789    # ASCII 'a'
    >>> а        # Cyrillic
    987654321
    >>> a        # ASCII
    123456789

While a bit of a pathological case, it is a nasty surprise. It's possible this is a symptom of a larger bug in the way identifiers are resolved.

This is similar but not identical to

Note: I did not find this myself; I give credit to Cooper Stimson for finding this bug. I merely reported it.
Pablo Galindo Salgado (pablogsal) * Date: 2022-01-29 17:37
This seems coherent to me. The parser converts all identifiers into the normal form NFKC while parsing, and comparison of identifiers is based on NFKC.
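
One way to see the effect of that normalization, as a small sketch: execute source text that assigns to the Fraktur spelling and look at which key ends up in the namespace. The variable names `fraktur_bar` and `namespace` are only for illustration.

```python
# Source text that assigns to the Fraktur spelling of "bar".
fraktur_bar = "\U0001D587\U0001D586\U0001D597"
namespace = {}
exec(f"{fraktur_bar} = 0", namespace)

# exec() also adds __builtins__, so filter out dunder entries.
names = [name for name in namespace if not name.startswith("__")]
print(names)                     # only the normalized spelling is stored
print(fraktur_bar in namespace)  # plain string keys are not normalized
```

Note the asymmetry: identifiers in source code are normalized, but strings used for dictionary lookups (or passed to `getattr`) are not, so the Fraktur spelling is not a key in the namespace.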
Eryk Sun (eryksun) * Date: 2022-01-29 19:24
Please read "Identifiers and keywords" [1] in the documentation. For example:

    >>> import unicodedata as ud
    >>> ud.normalize('NFKC', '𝖇𝖆𝖗') == 'bar'
    True

    >>> c = '\N{CYRILLIC SMALL LETTER A}'
    >>> ud.name(ud.normalize('NFKC', c))
    'CYRILLIC SMALL LETTER A'
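
If the concern is catching such spellings before they cause surprises, a rough lint-style sketch could tokenize the source and flag any name that NFKC would rewrite. This is an illustration only, not an existing tool; `non_normalized_names` is a hypothetical helper, and it assumes the `tokenize` module reports names as written in the source, without normalizing them.

```python
import io
import tokenize
import unicodedata

def non_normalized_names(source: str):
    """Yield (line, name, normalized) for NAME tokens that NFKC would rewrite."""
    for tok in tokenize.generate_tokens(io.StringIO(source).readline):
        if tok.type == tokenize.NAME:
            normalized = unicodedata.normalize("NFKC", tok.string)
            if normalized != tok.string:
                yield tok.start[0], tok.string, normalized

# Line 1: ASCII "bar"; line 2: Fraktur "bar"; line 3: Cyrillic "a" (U+0430).
sample = "bar = 1\n\U0001D587\U0001D586\U0001D597 = 0\n\u0430 = 2\n"
flagged = list(non_normalized_names(sample))
for line, name, normalized in flagged:
    print(f"line {line}: {name!r} is read as {normalized!r}")
```

The Cyrillic name is not flagged, because NFKC leaves it unchanged; only the Fraktur spelling collides with the ASCII one.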

Diego Argueta (da) * Date: 2022-01-30 00:06
I did read PEP-3131 before posting this but I still thought the behavior was counterintuitive.
