classification
Title: Some numeric characters are still not recognized
Type: feature request Stage: test needed
Components: Unicode Versions: Python 3.1, Python 2.7
process
Status: open Resolution:
Dependencies: Generate numeric/space/linebreak from Unicode database.
View: 1571184
Superseder:
Assigned To: lemburg Nosy List: andersch, lemburg (2)
Priority: normal Keywords

Created on 2006-10-05 07:31 by andersch, last changed 2009-03-30 02:04 by ajaksu2.

Messages (3)
msg61013 - (view) Author: Anders Chrigström (andersch) Date: 2006-10-05 07:31
Looking into the documentation of the unicode database
i found that there are some numeric characters that are
not listed in the UnicodeData.txt file. They are intead
listen in the Unihan.txt file. (See
http://www.unicode.org/Public/5.0.0/ucd/UCD.html#Numeric_Type_Han).

I have a patch for this in the works.

msg61014 - (view) Author: Anders Chrigström (andersch) Date: 2006-10-05 08:05
Logged In: YES 
user_id=621306

I have uploaded patch #1571184 that fixes this.
msg61015 - (view) Author: Marc-Andre Lemburg (lemburg) Date: 2006-10-05 10:44
Logged In: YES 
user_id=38388

For quick reference, here's the patch URL:

https://sourceforge.net/tracker/index.php?func=detail&aid=1571184&group_id=5470&atid=305470

I'll comment there.

It's really sad that the Unicode Consortium is separating
out all kinds of properties into separate files... makes
following the standard a lot harder.
History
Date User Action Args
2009-03-30 02:04:52ajaksu2setdependencies: + Generate numeric/space/linebreak from Unicode database.
type: feature request
stage: test needed
versions: + Python 3.1, Python 2.7
2006-10-05 07:31:51anderschcreate