Message204629
There is no ligature for "lff", just "ffl". Ligatures are treated as one char. I guess Python would have to grow a str.reverse() method to handle ligatures and combining chars correctly.
At work I ran into the issue with ligatures and combining chars multiple times in medieval and early modern age scripts. Eventually I started to normalize all incoming data to NFKC. That solves most of the issues.
s = b'ba\xef\xac\x84e'.decode('utf-8')
>>> print("".join(reversed(s)))
efflab
>>> print("".join(reversed(unicodedata.normalize("NFKC", s))))
elffab |
|
Date |
User |
Action |
Args |
2013-11-28 00:07:37 | christian.heimes | set | recipients:
+ christian.heimes, larry |
2013-11-28 00:07:37 | christian.heimes | set | messageid: <1385597257.94.0.868204336657.issue19819@psf.upfronthosting.co.za> |
2013-11-28 00:07:37 | christian.heimes | link | issue19819 messages |
2013-11-28 00:07:37 | christian.heimes | create | |
|