Author hyeshik.chang
Recipients
Date 2007-06-12.11:12:55
SpamBayes Score
Marked as misclassified
Message-id
In-reply-to
Content
I attached a working fix for the problem. The patch encodes non-BMP characters as a surrogate pair in the lookup function.

The surrogate pair encoding can be thought as something to be included in the standard unicode API.  How about to provide UTF-32 codecs in the Python C-API to help this kind of use?
File Added: unicodedata-lookup-ucs2fix.diff
History
Date User Action Args
2007-08-23 14:53:16adminlinkissue1704793 messages
2007-08-23 14:53:16admincreate