Author vstinner
Recipients brett.cannon, eric.araujo, lemburg, loewis, meatballhat, pitrou, vstinner
Date 2011-05-17.23:56:45
SpamBayes Score 8.85871e-06
Marked as misclassified No
Message-id <1305676605.78.0.639804793971.issue8796@psf.upfronthosting.co.za>
In-reply-to
Content
Python 3.2 has been published. Can we start deprecating StreamWriter and StreamReader in Python 3.3 (to remove them from Python 3.4)? The doc should explain how to convert code using codecs into code using the io module (it should be simple), and using a StreamReader/StreamWriter should emit a warning.

--

codecs.StreamWriter writes twice the BOM of UTF-8-SIG, UTF-16, UTF-32 encodings if the file is opened in append mode or after a seek(0). Bug fixed in io.TextIOWrapper (issue #5006). io.TextIOWrapper calls also encoder.setstate(0) on a seek different than seek(0), whereas codecs.StreamWriter doesn't (it is not an incremental encoder, it doesn't have the setstate method).

codecs.StreamReader doesn't ignore the BOM of UTF-8-SIG, UTF-16 or UTF-32 encodings after seek(0). Bug fixed in io.TextIOWrapper (issue #4862).

These bugs should maybe be mentioned in the codecs doc, with a pointer to the io module saying that the io module handles these encodings correctly.
History
Date User Action Args
2011-05-17 23:56:46vstinnersetrecipients: + vstinner, lemburg, loewis, brett.cannon, pitrou, eric.araujo, meatballhat
2011-05-17 23:56:45vstinnersetmessageid: <1305676605.78.0.639804793971.issue8796@psf.upfronthosting.co.za>
2011-05-17 23:56:45vstinnerlinkissue8796 messages
2011-05-17 23:56:45vstinnercreate