
Author vstinner
Recipients RalfM, ezio.melotti, methane, roufique7, serhiy.storchaka, vstinner, xtreak
Date 2019-06-21.21:06:40
Message-id <1561151200.65.0.931341254007.issue24214@roundup.psfhosted.org>
Content
> UnicodeDecodeError: 'utf-8' codec can't decode byte 0xf1 in position 1: invalid continuation byte

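Python is right: b'f\xf1\xf6rd' is not valid UTF-8. The byte 0xf1 starts a four-byte sequence, so the next byte must be a continuation byte in the range 0x80-0xbf, but it is 0xf6: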

$ python3
Python 3.7.3 (default, May 11 2019, 00:38:04) 
>>> b'f\xf1\xf6rd'.decode()
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xf1 in position 1: invalid continuation byte

This change is deliberate: it makes the UTF-8 incremental decoder correct, so that it respects the UTF-8 standard. I'm closing the issue.
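
For anyone who wants to check this against the incremental decoder rather than bytes.decode(), here is a minimal sketch. The helper name decode_incrementally is hypothetical (it is not part of the codecs module), and the Latin-1 line at the end only illustrates what the bytes would mean if they had been written in that encoding, which is an assumption and not something established in this issue.

```python
import codecs


def decode_incrementally(chunks, encoding="utf-8"):
    """Feed byte chunks to an incremental decoder and return the joined text.

    Hypothetical helper for illustration; only the last chunk is passed
    with final=True so that truncated input is reported as an error.
    """
    decoder = codecs.getincrementaldecoder(encoding)()
    parts = []
    for index, chunk in enumerate(chunks):
        is_last = index == len(chunks) - 1
        parts.append(decoder.decode(chunk, final=is_last))
    return "".join(parts)


# A valid two-byte sequence split across chunks is buffered and decoded.
print(ascii(decode_incrementally([b"\xc3", b"\xa9"])))

# The bytes from this issue are rejected, as with bytes.decode().
try:
    decode_incrementally([b"f\xf1\xf6rd"])
except UnicodeDecodeError as exc:
    print(exc.reason)

# Assumption: if the data were really Latin-1, decode it as such.
print(ascii(b"f\xf1\xf6rd".decode("latin-1")))
```

If the input is known to be in a legacy 8-bit encoding, naming that encoding explicitly is the usual fix; the UTF-8 decoder should not be expected to accept it.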