Message327112
The Unicode .splitlines() splits strings on what Unicode defines as linebreak characters (all code points with character properties Zl or bidirectional property B).
This is different than what typical CSV file parsers or other parsers built for the ASCII text files treat as newline. They usually only break on CR, CRLF, LF, so the use of .splitlines() in this context is wrong, not the method itself.
It may make sense extending .splitlines() to pass in a list of linebreak characters to break on, but that would make it a lot slower and the same can already be had by using re.split() on Unicode strings.
Closing this as won't fix. |
|
Date |
User |
Action |
Args |
2018-10-05 08:07:02 | lemburg | set | recipients:
+ lemburg, doerwalter, nascheme, belopolsky, vstinner, ezio.melotti, r.david.murray, serhiy.storchaka, wpk, xtreak |
2018-10-05 08:07:02 | lemburg | set | messageid: <1538726822.65.0.545547206417.issue18291@psf.upfronthosting.co.za> |
2018-10-05 08:07:02 | lemburg | link | issue18291 messages |
2018-10-05 08:07:02 | lemburg | create | |
|