This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author vstinner
Recipients lars.gustaebel, lemburg, loewis, vstinner
Date 2010-06-10.17:18:58
SpamBayes Score 3.4055134e-05
Marked as misclassified No
Message-id <1276190344.13.0.674254471941.issue8784@psf.upfronthosting.co.za>
In-reply-to
Content
My tests with 7-zip and WinRAR conviced me that it's not a good idea to use utf-8 *by default* on Windows. But since mbcs doesn't support surrogateescape error handler, we should restore the previous behaviour just for this encoding.

tarfile_mbcs_errors.patch creates a function choose_errors() which determine the best error handler depending on the encoding and the mode (read or write):
 - "strict" to write with mbcs
 - "replace" to read with mbcs
 - "surrogateescape" otherwise

Please, review my changes on the documentation :-)

On Windows, patched tarfile works exactly as Python 3.1.
History
Date User Action Args
2010-06-10 17:19:04vstinnersetrecipients: + vstinner, lemburg, loewis, lars.gustaebel
2010-06-10 17:19:04vstinnersetmessageid: <1276190344.13.0.674254471941.issue8784@psf.upfronthosting.co.za>
2010-06-10 17:19:02vstinnerlinkissue8784 messages
2010-06-10 17:19:01vstinnercreate