Message254778
Here is preliminary patch that refactors the lowest level of Python tokenizer, reading and decoding. It splits the code on smaller simpler functions, decreases the source size by 37 lines, and fixes bugs: issue14811, issue18961, and a number of others. Added tests for most of fixed bugs (except leaks and others hardly reproducible). But the fix for other bugs can be harder, especially for issues with null byte (issue1105770, issue20115).
Many bug easily can be fixed if read all Python file in memory instead of reading it line by line. I don't know if it is acceptable. |
|
Date |
User |
Action |
Args |
2015-11-17 01:27:33 | serhiy.storchaka | set | recipients:
+ serhiy.storchaka |
2015-11-17 01:27:33 | serhiy.storchaka | set | messageid: <1447723653.14.0.424824913135.issue25643@psf.upfronthosting.co.za> |
2015-11-17 01:27:32 | serhiy.storchaka | link | issue25643 messages |
2015-11-17 01:27:32 | serhiy.storchaka | create | |
|