Author vvro
Recipients vvro
Date 2008-05-23.03:22:42
Message-id <1211512970.98.0.439490076144.issue2948@psf.upfronthosting.co.za>
In-reply-to
Content
The hashing algorithms don't support Unicode. Any Unicode text passed to
them is first converted to ASCII and then hashed, but not all strings
can be converted to ASCII.
Unicode is becoming the default for strings, especially on the web side
of Python, where a lot of these hashing algorithms are used, so there
should be some kind of Unicode support in them.

Example:
>>> from hashlib import md5
>>> md5(u'joão')
UnicodeEncodeError: 'ascii' codec can't encode character u'\xe3' in
position 2: ordinal not in range(128)
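
A possible workaround, as a minimal sketch: encode the text to bytes
yourself before hashing, so that no implicit ASCII conversion takes place.
The helper name md5_hex_of_text and the UTF-8 default are assumptions made
for illustration and are not part of hashlib. The digest depends on the
encoding chosen, so every party comparing hashes has to agree on it.

```python
import hashlib


def md5_hex_of_text(text, encoding="utf-8"):
    """Hypothetical helper: encode text explicitly, then hash the bytes."""
    # Hash functions operate on bytes, so the caller picks the encoding
    # instead of relying on an implicit ASCII conversion.
    return hashlib.md5(text.encode(encoding)).hexdigest()


name = u"jo\xe3o"  # the string u'joão' from the example above

utf8_digest = md5_hex_of_text(name)                 # hashes the UTF-8 bytes
latin1_digest = md5_hex_of_text(name, "latin-1")    # hashes the Latin-1 bytes

# Same text, different encodings, different digests.
print(utf8_digest != latin1_digest)
```

Since the same text produces a different digest under each encoding, any
built-in Unicode support would also have to settle which encoding is used.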