This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author jooon
Recipients jooon
Date 2022-03-25.10:06:05
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1648202765.72.0.11778383238.issue47117@roundup.psfhosted.org>
In-reply-to
Content
Some bytes that are non utf-8 segfaults python repl in 3.10 and later on linux. Example:

$ python3.10
Python 3.10.4 (main, Mar 24 2022, 14:20:44) [GCC 9.4.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> �
Segmentation fault (core dumped)

It is treated correctly in Python 3.9 and earlier

$ python3.9
Python 3.9.12 (main, Mar 24 2022, 14:21:53) 
[GCC 9.4.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> �
  File "<stdin>", line 0
    
SyntaxError: (unicode error) 'utf-8' codec can't decode byte 0xb6 in position 0: invalid start byte

How to reproduce:

In Gnome on Ubuntu 20.04 with the Swedish keyboard layout, holding left alt and pressing the ö key enters the byte 0xb6 into the terminal.

I have only been able to make it crash the repl. I can't make it crash the parser. For instance trying to eval the byte.
History
Date User Action Args
2022-03-25 10:06:05jooonsetrecipients: + jooon
2022-03-25 10:06:05jooonsetmessageid: <1648202765.72.0.11778383238.issue47117@roundup.psfhosted.org>
2022-03-25 10:06:05jooonlinkissue47117 messages
2022-03-25 10:06:05joooncreate