Author zwol
Recipients zwol
Date 2016-07-12.13:10:48
Message-id <1468329049.33.0.201400438769.issue27496@psf.upfronthosting.co.za>
Content
unicodedata.name() doesn't have name information for the C0 and C1 control characters.  To see this, run

import pprint
import unicodedata

pprint.pprint(["U+{:04X} {}".format(n, unicodedata.name(chr(n), "<missing>")) for n in range(256)])

and you will observe <missing> printed for U+0000 through U+001F and U+007F through U+009F.  These characters do have official Unicode names and they should be known to name().
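The same observation can be checked programmatically. The block below is a minimal sketch, not a proposed patch: missing_names() and name_with_fallback() are hypothetical helpers, and CONTROL_ALIASES is a small illustrative subset of the control-code aliases listed in the Unicode NameAliases.txt data file, assumed here to be an acceptable source of names.

```python
import unicodedata

# Hypothetical fallback table (illustrative subset only), using control-code
# aliases from the Unicode NameAliases.txt data file.
CONTROL_ALIASES = {
    0x00: "NULL",
    0x09: "CHARACTER TABULATION",
    0x0A: "LINE FEED",
    0x0D: "CARRIAGE RETURN",
    0x7F: "DELETE",
}


def missing_names(limit=256):
    """Return the code points below `limit` for which name() has no entry."""
    return [n for n in range(limit) if unicodedata.name(chr(n), None) is None]


def name_with_fallback(ch, default="<missing>"):
    """Hypothetical helper: try name(), then the fallback table, then default."""
    found = unicodedata.name(ch, None)
    if found is not None:
        return found
    return CONTROL_ALIASES.get(ord(ch), default)
```

Calling missing_names() should list exactly the code points in the two ranges described above, and name_with_fallback("\n") returns the alias from the table while characters outside the table still fall through to the default.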

I may see if I can come up with a patch for this one, in my copious free time.
History
Date                 User  Action  Args
2016-07-12 13:10:49  zwol  set     recipients: + zwol
2016-07-12 13:10:49  zwol  set     messageid: <1468329049.33.0.201400438769.issue27496@psf.upfronthosting.co.za>
2016-07-12 13:10:49  zwol  link    issue27496 messages
2016-07-12 13:10:48  zwol  create