Author vstinner
Recipients eric.smith, ezio.melotti, lemburg, mark.dickinson, ron_adam, vstinner, ysj.ray
Date 2011-02-18.15:05:33
SpamBayes Score 3.22901e-05
Marked as misclassified No
Message-id <>
> Oh, what if the trunked char* cannot be decoded correctly?
> e.g. a tow-bytes character is divided in the middle? 

Yes, but PyUnicode_FromFormatV() uses UTF-8 decoder with replace error handler, and so the incomplete byte sequence will be replaced by � (it doesn't fail with an error). Example:

>>> "abc€".encode("utf-8")[:-1].decode("utf-8", "replace")
Date User Action Args
2011-02-18 15:05:33vstinnersetrecipients: + vstinner, lemburg, mark.dickinson, eric.smith, ron_adam, ezio.melotti, ysj.ray
2011-02-18 15:05:33vstinnersetmessageid: <>
2011-02-18 15:05:33vstinnerlinkissue7330 messages
2011-02-18 15:05:33vstinnercreate