Message 346248 - Python tracker

➜

This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author	vstinner
Recipients	RalfM, ezio.melotti, methane, roufique7, serhiy.storchaka, vstinner, xtreak
Date	2019-06-21.21:06:40
SpamBayes Score	-1.0
Marked as misclassified	Yes
Message-id	<1561151200.65.0.931341254007.issue24214@roundup.psfhosted.org>
In-reply-to

Content
> UnicodeDecodeError: 'utf-8' codec can't decode byte 0xf1 in position 1: invalid continuation byte Python is right: b'f\xf1\xf6rd' is not a valid UTF-8 string: $ python3 Python 3.7.3 (default, May 11 2019, 00:38:04) >>> b'f\xf1\xf6rd'.decode() Traceback (most recent call last): File "<stdin>", line 1, in <module> UnicodeDecodeError: 'utf-8' codec can't decode byte 0xf1 in position 1: invalid continuation byte This change is deliberate: it makes UTF-8 incremental decoder correct (respect the UTF-8 standard). I close the issue.

> UnicodeDecodeError: 'utf-8' codec can't decode byte 0xf1 in position 1: invalid continuation byte

Python is right: b'f\xf1\xf6rd' is not a valid UTF-8 string:

$ python3
Python 3.7.3 (default, May 11 2019, 00:38:04) 
>>> b'f\xf1\xf6rd'.decode()
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xf1 in position 1: invalid continuation byte

This change is deliberate: it makes UTF-8 incremental decoder correct (respect the UTF-8 standard). I close the issue.

History
Date	User	Action	Args
2019-06-21 21:06:40	vstinner	set	recipients: + vstinner, ezio.melotti, methane, serhiy.storchaka, RalfM, xtreak, roufique7
2019-06-21 21:06:40	vstinner	set	messageid: <1561151200.65.0.931341254007.issue24214@roundup.psfhosted.org>
2019-06-21 21:06:40	vstinner	link	issue24214 messages
2019-06-21 21:06:40	vstinner	create