Title: utf8 codec fails to parse a character
Created on 2010-05-20 22:01 by Roman.Gershman, last changed 2022-04-11 14:57 by admin. This issue is now closed.

Attachment: 1.txt (uploaded by Roman.Gershman, 2010-05-20 22:01): an input file which cannot be read in Python
msg106195 - Author: Roman Gershman (Roman.Gershman) Date: 2010-05-20 22:01
The following code fails to parse the attached file:


if __name__ == '__main__':
    f = open("c:\\1.txt", mode='r', encoding='utf-8')
    for line in f:
        print(line)
msg106196 - Author: STINNER Victor (vstinner) * (Python committer) Date: 2010-05-20 22:06
$ hexdump  -C 1.txt 
00000000  ec 0d 0a                                          |...|

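This file is *not* encoded in UTF-8. The byte 0xEC starts a three-byte UTF-8 sequence and must be followed by two continuation bytes in the range 0x80-0xBF, but here it is followed by 0x0D 0x0A (CR LF).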
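The diagnosis can be reproduced without the attachment by decoding the three bytes from the hexdump directly. This is a minimal sketch: the helper name `try_decode` is hypothetical, and Latin-1 is only an assumed candidate encoding, since the thread does not say which encoding the file was actually written in.

```python
# The three bytes shown in the hexdump output: 0xEC, CR, LF.
data = b"\xec\r\n"


def try_decode(raw, encoding):
    """Return the decoded text, or None if raw is not valid in that encoding."""
    try:
        return raw.decode(encoding)
    except UnicodeDecodeError:
        return None


# Strict UTF-8 decoding rejects the lone 0xEC lead byte.
utf8_result = try_decode(data, "utf-8")

# Latin-1 maps every byte to a code point, so it always succeeds.
# ASSUMPTION: Latin-1 is a guess at the real encoding, not confirmed by the report.
latin1_result = try_decode(data, "latin-1")

# errors="replace" substitutes U+FFFD for the invalid byte instead of raising.
lossy_result = data.decode("utf-8", errors="replace")

print(utf8_result)
print(repr(latin1_result))
print(repr(lossy_result))
```

If the file really is in a single-byte encoding, passing that encoding to `open()` instead of `'utf-8'` would let the original loop run; `errors="replace"` is a lossy fallback when the true encoding is unknown.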
