That can be done programmatically using the unicodedata module. The regex module (which will hopefully be included in 3.3) can also match characters that belong to specific categories.
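
As a rough illustration of the unicodedata approach, here is a minimal sketch that filters characters by their Unicode general category. The helper name chars_in_category and the sample string are made up for this example; only unicodedata.category() comes from the standard library.

```python
import unicodedata


def chars_in_category(text, prefix):
    """Return the characters of `text` whose general category starts with `prefix`.

    `prefix` can be a full two-letter category such as "Nd" (decimal digit)
    or a one-letter major class such as "L" (any letter) or "P" (punctuation).
    This helper is hypothetical, written only for illustration.
    """
    return [ch for ch in text if unicodedata.category(ch).startswith(prefix)]


sample = "Año 2024: ¡hola, mundo!"  # arbitrary sample text

letters = chars_in_category(sample, "L")       # all letters, including "ñ"
digits = chars_in_category(sample, "Nd")       # decimal digits
punctuation = chars_in_category(sample, "P")   # punctuation of any kind
```

With the third-party regex module, I believe the same kind of test can be written as a pattern using property escapes such as \p{L} or \p{Nd}, but the sketch above sticks to the standard library.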
