Message327362
OK, I see, http://www.unicode.org/versions/Unicode5.2.0/ch03.pdf (Table 3-7 on page 93) states that the only valid 3-bytes UTF-8 sequences starting with the byte 0xED have a value for the second byte in the range 0x80 to 0x9F. 0xA0 is just beyond that range (as that would result in an encoded surrogate). Python handles all invalid sequences according to that table with the same error message. I think this issue can be closed. |
|
Date |
User |
Action |
Args |
2018-10-08 16:48:24 | doerwalter | set | recipients:
+ doerwalter, vstinner, ezio.melotti, serhiy.storchaka |
2018-10-08 16:48:24 | doerwalter | set | messageid: <1539017304.94.0.545547206417.issue34935@psf.upfronthosting.co.za> |
2018-10-08 16:48:24 | doerwalter | link | issue34935 messages |
2018-10-08 16:48:24 | doerwalter | create | |
|