Issue 34979: Python throws “SyntaxError: Non-UTF-8 code start with \xe8...” when parse source file

➜

This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

This issue has been migrated to GitHub: https://github.com/python/cpython/issues/79160

classification

Title:	Python throws “SyntaxError: Non-UTF-8 code start with \xe8...” when parse source file
Type:	behavior	Stage:	resolved
Components:	Interpreter Core	Versions:	Python 3.6

process

Status:	closed	Resolution:	duplicate
Dependencies:		Superseder:
Assigned To:	serhiy.storchaka	Nosy List:	ausaki, ezio.melotti, serhiy.storchaka, terry.reedy, xiang.zhang, xtreak
Priority:	normal	Keywords:	patch

Created on 2018-10-14 02:01 by ausaki, last changed 2022-04-11 14:59 by admin. This issue is now closed.

Pull Requests
URL	Status	Linked	Edit
PR 9923	closed	ausaki, 2018-10-17 08:49

Messages (11)
msg327686 - (view)	Author: susaki (ausaki) *	Date: 2018-10-14 02:01
``` # demo.py s = '测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试' ``` The file on above is for testing, it's encoding is utf-8, the length of `s` is 1020 bytes(3 * 340). When execute `python3 demo.py` on terminal, Python will throws the following error: ``` $ python3 -V Python 3.6.4 $ python3 demo.py File "demo.py", line 2 SyntaxError: Non-UTF-8 code starting with '\xe8' in file demo.py on line 2, but no encoding declared; see http://python.org/dev/peps/pep-0263/ for details ``` I've found this error occurred on about line 630(the bottom of the function `decoding_fgets`) of the file `cpython/Parser/tokenizer.c` after I read Python-3.6.6's source code. When Python execute xxx.py, Python will call the function `decoding_fgets` to read one line of raw bytes from file and save the raw bytes to a buffer, the initial length of the buffer is 1024 bytes, `decoding_fgets` will use the function `valid_utf8` to check raw bytes's encoding. If the lenght of raw bytes is too long(like greater than 1023 bytes), then Python will call `decoding_fgets` multiple times and increase buffer's size by 1024 bytes every time.so raw bytes read by `decoding_fgets` is maybe incomplete, for example, raw bytes contains a part of bytes of a character, that will cause `valide_utf8` failed. I suggest that we should always use `fp_readl` to read source coe from file.
msg327689 - (view)	Author: Karthikeyan Singaravelan (xtreak) *	Date: 2018-10-14 05:06
Thanks for the report. Is this a case of encoding not being declared at the top of the file or am I missing something? ➜ cpython git:(master) cat ../backups/bpo34979.py s = '测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试' print("str len : ", len(s)) print("bytes len : ", len(s.encode('utf-8'))) ➜ cpython git:(master) ./python.exe ../backups/bpo34979.py File "../backups/bpo34979.py", line 1 SyntaxError: Non-UTF-8 code starting with '\xe8' in file ../backups/bpo34979.py on line 1, but no encoding declared; see http://python.org/dev/peps/pep-0263/ for details # With encoding declared ➜ cpython git:(master) cat ../backups/bpo34979.py # -- coding: utf-8 -- s = '测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试' print("str len : ", len(s)) print("bytes len : ", len(s.encode('utf-8'))) ➜ cpython git:(master) ./python.exe ../backups/bpo34979.py str len : 340 bytes len : 1020 # Double the original string ➜ cpython git:(master) cat ../backups/bpo34979.py # -- coding: utf-8 -- s = '测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试' print("str len : ", len(s)) print("bytes len : ", len(s.encode('utf-8'))) ➜ cpython git:(master) ./python.exe ../backups/bpo34979.py str len : 680 bytes len : 2040 Thanks
msg327697 - (view)	Author: susaki (ausaki) *	Date: 2018-10-14 08:22
If you declare the encoding at the top of the file, then everything is fine, because in this case Python will use `io.open` to open the file and use `stream.readline` to read one line of code, please see function `fp_setreadl` in `cpython/Parser/tokenizer.c` for detail. But if you did not declare the encoding, then Python will use `Py_UniversalNewlineFgets` to read one line of raw bytes and check these raw bytes's encoding by `valid_utf8`. In my opinion, when the encoding of the file is utf-8, and because the default file encoding of Python3 is utf-8, so whether we declare encoding or did not is ok. Karthikeyan Singaravelan <report@bugs.python.org> 于2018年10月14日周日下午1:06写道： > > Karthikeyan Singaravelan <tir.karthi@gmail.com> added the comment: > > Thanks for the report. Is this a case of encoding not being declared at > the top of the file or am I missing something? > > ➜ cpython git:(master) cat ../backups/bpo34979.py > s = > '测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试' > > print("str len : ", len(s)) > print("bytes len : ", len(s.encode('utf-8'))) > ➜ cpython git:(master) ./python.exe ../backups/bpo34979.py > File "../backups/bpo34979.py", line 1 > SyntaxError: Non-UTF-8 code starting with '\xe8' in file > ../backups/bpo34979.py on line 1, but no encoding declared; see > http://python.org/dev/peps/pep-0263/ for details > > # With encoding declared > > ➜ cpython git:(master) cat ../backups/bpo34979.py > # -- coding: utf-8 -- > > s = > '测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试' > > print("str len : ", len(s)) > print("bytes len : ", len(s.encode('utf-8'))) > ➜ cpython git:(master) ./python.exe ../backups/bpo34979.py > str len : 340 > bytes len : 1020 > > # Double the original string > > ➜ cpython git:(master) cat ../backups/bpo34979.py > # -- coding: utf-8 -- > > s = > '测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试' > > print("str len : ", len(s)) > print("bytes len : ", len(s.encode('utf-8'))) > ➜ cpython git:(master) ./python.exe ../backups/bpo34979.py > str len : 680 > bytes len : 2040 > > > Thanks > > ---------- > nosy: +xtreak > > _______________________________________ > Python tracker <report@bugs.python.org> > <https://bugs.python.org/issue34979> > _______________________________________ >
msg327699 - (view)	Author: Karthikeyan Singaravelan (xtreak) *	Date: 2018-10-14 09:10
Got it. Thanks for the details and patience. I tested with less number of characters and it seems to work fine so using the encoding at the top is not a good way to test the original issue as you have mentioned. Then I searched around and found issue14811 with test. This seems to be a very similar issue and there is a patch to detect this scenario to throw SyntaxError that the line is longer than the internal buffer instead of an encoding related error. I applied the patch to master and it throws an error about the internal buffer length as expected. But the patch was not applied and it seems Victor had another solution in mind as per msg167154. I tested with the patch as below : # master ➜ cpython git:(master) cat ../backups/bpo34979.py s = '测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试' print("str len : ", len(s)) print("bytes len : ", len(s.encode('utf-8'))) ➜ cpython git:(master) ./python.exe ../backups/bpo34979.py File "../backups/bpo34979.py", line 2 SyntaxError: Non-UTF-8 code starting with '\xe8' in file ../backups/bpo34979.py on line 2, but no encoding declared; see http://python.org/dev/peps/pep-0263/ for details # Applying the patch file from issue14811 ➜ cpython git:(master) ✗ ./python.exe ../backups/bpo34979.py File "../backups/bpo34979.py", line 2 SyntaxError: Line 2 of file ../backups/bpo34979.py is longer than the internal buffer (1024) # Patch on master diff --git a/Parser/tokenizer.c b/Parser/tokenizer.c index fc75bae537..48b3ac0ee9 100644 --- a/Parser/tokenizer.c +++ b/Parser/tokenizer.c @@ -586,6 +586,7 @@ static char * decoding_fgets(char s, int size, struct tok_state tok) { char line = NULL; + size_t len; int badchar = 0; for (;;) { if (tok->decoding_state == STATE_NORMAL) { @@ -597,6 +598,15 @@ decoding_fgets(char s, int size, struct tok_state tok) / We want a 'raw' read. / line = Py_UniversalNewlineFgets(s, size, tok->fp, NULL); + if (line != NULL) { + len = strlen(line); + if (1 < len && line[len-1] != '\n') { + PyErr_Format(PyExc_SyntaxError, + "Line %i of file %U is longer than the internal buffer (%i)", + tok->lineno + 1, tok->filename, size); + return error_ret(tok); + } + } break; } else { / We have not yet determined the encoding. If it's the same issue then I think closing this issue and discussing there will be good since the issue has a patch with test and relevant discussion. Also it seems BUFSIZ is platform dependent so adding your platform details would also help. TIL about difference Python 2 and 3 on handling unicode related files. Thanks again!
msg327702 - (view)	Author: susaki (ausaki) *	Date: 2018-10-14 11:12
I think these two issue is the same issue, and the following is a patch write by me, hope this patch will help. ``` diff --git a/Parser/tokenizer.c b/Parser/tokenizer.c index 1af27bf..ba6fb3a 100644 --- a/Parser/tokenizer.c +++ b/Parser/tokenizer.c @@ -617,32 +617,21 @@ decoding_fgets(char s, int size, struct tok_state tok) if (!check_coding_spec(line, strlen(line), tok, fp_setreadl)) { return error_ret(tok); } - } -#ifndef PGEN - /* The default encoding is UTF-8, so make sure we don't have any - non-UTF-8 sequences in it. / - if (line && !tok->encoding) { - unsigned char c; - int length; - printf("[DEBUG] - [decoding_fgets]: line = %s\n", line); - for (c = (unsigned char )line; c; c += length) - if (!(length = valid_utf8(c))) { - badchar = c; - break; + if(!tok->encoding){ + char cs = new_string("utf-8", 5, tok); + int r = fp_setreadl(tok, cs); + if (r) { + tok->encoding = cs; + tok->decoding_state = STATE_NORMAL; + } else { + PyErr_Format(PyExc_SyntaxError, + "You did not decalre the file encoding at the top of the file, " + "and we found that the file is not encoding by utf-8," + "see http://python.org/dev/peps/pep-0263/ for details."); + PyMem_FREE(cs); } + } } - if (badchar) { - /* Need to add 1 to the line number, since this line - has not been counted, yet. / - PyErr_Format(PyExc_SyntaxError, - "Non-UTF-8 code starting with '\\x%.2x' " - "in file %U on line %i, " - "but no encoding declared; " - "see http://python.org/dev/peps/pep-0263/ for details", - badchar, tok->filename, tok->lineno + 1); - return error_ret(tok); - } -#endif return line; } ``` by the way, my platform is macOS Mojave Version 10.14 Karthikeyan Singaravelan <report@bugs.python.org> 于2018年10月14日周日下午5:10写道： > > Karthikeyan Singaravelan <tir.karthi@gmail.com> added the comment: > > Got it. Thanks for the details and patience. I tested with less number of > characters and it seems to work fine so using the encoding at the top is > not a good way to test the original issue as you have mentioned. Then I > searched around and found issue14811 with test. This seems to be a very > similar issue and there is a patch to detect this scenario to throw > SyntaxError that the line is longer than the internal buffer instead of an > encoding related error. I applied the patch to master and it throws an > error about the internal buffer length as expected. But the patch was not > applied and it seems Victor had another solution in mind as per msg167154. > I tested with the patch as below : > > # master > > ➜ cpython git:(master) cat ../backups/bpo34979.py > > s = > '测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试' > > print("str len : ", len(s)) > print("bytes len : ", len(s.encode('utf-8'))) > ➜ cpython git:(master) ./python.exe ../backups/bpo34979.py > File "../backups/bpo34979.py", line 2 > SyntaxError: Non-UTF-8 code starting with '\xe8' in file > ../backups/bpo34979.py on line 2, but no encoding declared; see > http://python.org/dev/peps/pep-0263/ for details > > > # Applying the patch file from issue14811 > > ➜ cpython git:(master) ✗ ./python.exe ../backups/bpo34979.py > File "../backups/bpo34979.py", line 2 > SyntaxError: Line 2 of file ../backups/bpo34979.py is longer than the > internal buffer (1024) > > # Patch on master > > diff --git a/Parser/tokenizer.c b/Parser/tokenizer.c > index fc75bae537..48b3ac0ee9 100644 > --- a/Parser/tokenizer.c > +++ b/Parser/tokenizer.c > @@ -586,6 +586,7 @@ static char > decoding_fgets(char s, int size, struct tok_state tok) > { > char line = NULL; > + size_t len; > int badchar = 0; > for (;;) { > if (tok->decoding_state == STATE_NORMAL) { > @@ -597,6 +598,15 @@ decoding_fgets(char s, int size, struct tok_state > tok) > / We want a 'raw' read. / > line = Py_UniversalNewlineFgets(s, size, > tok->fp, NULL); > + if (line != NULL) { > + len = strlen(line); > + if (1 < len && line[len-1] != '\n') { > + PyErr_Format(PyExc_SyntaxError, > + "Line %i of file %U is longer than the > internal buffer (%i)", > + tok->lineno + 1, tok->filename, size); > + return error_ret(tok); > + } > + } > break; > } else { > / We have not yet determined the encoding. > > > If it's the same issue then I think closing this issue and discussing > there will be good since the issue has a patch with test and relevant > discussion. Also it seems BUFSIZ is platform dependent so adding your > platform details would also help. > > TIL about difference Python 2 and 3 on handling unicode related files. > Thanks again! > > ---------- > > _______________________________________ > Python tracker <report@bugs.python.org> > <https://bugs.python.org/issue34979> > _______________________________________ >
msg327706 - (view)	Author: Karthikeyan Singaravelan (xtreak) *	Date: 2018-10-14 13:25
Thanks for the confirmation. I think the expected solution is to use a buffer that can be resized. CPython accepts GitHub PRs so if you have time then I would suggest raising a PR against the linked issue since a lot of people have subscribed there and would get a good feedback. As a suggestion when you reply from email please remove the quoted content since it makes the message very long and hard to read in the bug tracker.
msg327709 - (view)	Author: susaki (ausaki) *	Date: 2018-10-14 13:53
Thanks for your suggestions. I will make a PR on github. The buffer is resizeable now, please see cpython/Parser/tokenizer.c#L1043 <https://github.com/python/cpython/blob/master/Parser/tokenizer.c#L1043> for details.
msg327878 - (view)	Author: Serhiy Storchaka (serhiy.storchaka) *	Date: 2018-10-17 09:14
This is a part of more general issue25643. I'll try to revive that issue.
msg356711 - (view)	Author: Terry J. Reedy (terry.reedy) *	Date: 2019-11-15 19:57
On Windows, with 3.7, 3.8.0, and master, none of the demo.py statement here and the examples in #38755 raise an error. I tried 'python -m module', running from IDLE editor, and interactive IDLE and REPL. Even the following worked. >>> s = (b'\xe2\x96\x91'*1111111).decode() >>> s[-10:] '░░░░░░░░░░' susaki, what OS, and do you have the same problem with current Python (at least 3.8)? Also, susuki, when replying by email, please delete the quoted message. When your message is added to the web page, the quoted message is redundant and distracting noise. If this issue effectively duplicates (part of) #14811 and/or #25643, it should be closed as a duplicate of one of them.
msg356741 - (view)	Author: susaki (ausaki) *	Date: 2019-11-16 04:34
I think this issue is duplicated with #14811, I will close it. The key point of this issue is that the size of `tok->buf` is fixed and equals to `BUFSIZ`(defined in stdio.h, have different value depends on OS). one line of code will be truncated If it’s size exceeds `BUFSIZ`, then the function `valid_utf8` will failed. You can increase the size of `s` to reproduce this issue. ✦ ➜ cat demo.py s = '测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试测试' ✦ ➜ ./python -V Python 3.7.4 ✦ ➜ ./python demo.py File "demo.py", line 1 SyntaxError: Non-UTF-8 code starting with '\xe6' in file demo.py on line 1, but no encoding declared; see http://python.org/dev/peps/pep-0263/ for details
msg356760 - (view)	Author: susaki (ausaki) *	Date: 2019-11-16 17:37
duplicated with #14811

History
Date	User	Action	Args
2022-04-11 14:59:07	admin	set	github: 79160
2019-11-16 17:37:53	ausaki	set	status: open -> closed resolution: duplicate messages: + msg356760 stage: patch review -> resolved
2019-11-16 04:34:09	ausaki	set	messages: + msg356741
2019-11-15 19:59:59	ezio.melotti	set	nosy: + ezio.melotti
2019-11-15 19:57:17	terry.reedy	set	nosy: + terry.reedy messages: + msg356711
2018-10-17 09:14:15	serhiy.storchaka	set	assignee: serhiy.storchaka messages: + msg327878 nosy: + serhiy.storchaka
2018-10-17 08:49:51	ausaki	set	keywords: + patch stage: patch review pull_requests: + pull_request9276
2018-10-14 13:53:15	ausaki	set	messages: + msg327709
2018-10-14 13:25:05	xtreak	set	messages: + msg327706
2018-10-14 11:12:29	ausaki	set	messages: + msg327702
2018-10-14 09:10:44	xtreak	set	messages: + msg327699
2018-10-14 08:22:55	ausaki	set	messages: + msg327697
2018-10-14 05:06:19	xtreak	set	nosy: + xtreak messages: + msg327689
2018-10-14 03:45:49	xiang.zhang	set	nosy: + xiang.zhang
2018-10-14 02:01:39	ausaki	create