Message364114
sock_decode_hostname() in socketmodule.c currently uses PyUnicode_DecodeFSDefault() on Windows. Since PEP 529, PyUnicode_DecodeFSDefault() decodes from UTF-8 by default on Windows.
I understand that the ANSI code page should be used instead of UTF-8.
Would it work to use PyUnicode_DecodeLocale(name, "surrogatepass")? It's implemented with mbstowcs(), but I don't recall which encoding it uses on Windows.
Or can we use PyUnicode_DecodeMBCS(name, strlen(name), "surrogatepass")?
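To illustrate the mismatch in pure Python, here is a minimal sketch. It assumes a hypothetical Cyrillic hostname and uses cp1251 as a stand-in for the ANSI code page (the real code page depends on the system locale). It also assumes the UTF-8 decode uses the "surrogatepass" error handler, as PEP 529 describes for the Windows filesystem encoding. The helper name try_decode is made up for this example.

```python
# Hypothetical hostname; cp1251 stands in for the machine's ANSI code page.
hostname = "Сервер"
raw = hostname.encode("cp1251")  # bytes as an ANSI API might return them


def try_decode(data, encoding, errors):
    """Return the decoded string, or None if the bytes are invalid."""
    try:
        return data.decode(encoding, errors)
    except UnicodeDecodeError:
        return None


# UTF-8 with surrogatepass rejects these bytes: they are not valid UTF-8
# and do not encode surrogate code points either.
utf8_result = try_decode(raw, "utf-8", "surrogatepass")

# Decoding with the code page that produced the bytes round-trips.
ansi_result = try_decode(raw, "cp1251", "strict")
```

In this sketch utf8_result is None while ansi_result equals the original name, which is the kind of failure the ANSI code page suggestion is meant to avoid.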
--
I understand that setting the PYTHONLEGACYWINDOWSFSENCODING environment variable to 1 should work around the issue.
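One way to check whether the workaround took effect is to ask the interpreter which filesystem encoding it is using. This is only a sketch, and the helper name describe_fs_encoding is made up for this example. According to the Python documentation, with the variable set on Windows the pair should revert to the pre-3.6 values "mbcs" and "replace"; on other platforms the variable is ignored.

```python
import sys


def describe_fs_encoding():
    """Return the (encoding, error handler) pair used for filesystem names."""
    return sys.getfilesystemencoding(), sys.getfilesystemencodeerrors()


encoding, errors = describe_fs_encoding()
print(f"filesystem encoding: {encoding}, error handler: {errors}")
```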
Date | User | Action | Args
2020-03-13 17:31:04 | vstinner | set | recipients: + vstinner, paul.moore, tim.golden, ezio.melotti, python-dev, zach.ware, serhiy.storchaka, eryksun, steve.dower, abarry, williamdias, Владимир Мартьянов
2020-03-13 17:31:04 | vstinner | set | messageid: <1584120664.43.0.733522038798.issue26227@roundup.psfhosted.org>
2020-03-13 17:31:04 | vstinner | link | issue26227 messages
2020-03-13 17:31:04 | vstinner | create |