This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author Kristian.Benoit
Recipients Kristian.Benoit, cvrebert, serhiy.storchaka, vstinner
Date 2014-05-17.15:06:59
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1400339220.28.0.730315077335.issue21509@psf.upfronthosting.co.za>
In-reply-to
Content
I added code to skip the bom if present when encoding is either None or "utf-8". The problem I have with Victor's solution is that users don't know these files are not plain UTF-8. Most text editor says it's utf-8 encoded, how can a user figure out there 3 hidden bytes at the start of the file ?

Kristian
History
Date User Action Args
2014-05-17 15:07:00Kristian.Benoitsetrecipients: + Kristian.Benoit, vstinner, cvrebert, serhiy.storchaka
2014-05-17 15:07:00Kristian.Benoitsetmessageid: <1400339220.28.0.730315077335.issue21509@psf.upfronthosting.co.za>
2014-05-17 15:07:00Kristian.Benoitlinkissue21509 messages
2014-05-17 15:06:59Kristian.Benoitcreate