Message382827
In addition, you are probably hitting normalization issues. There are two ways to get the Cyrillic character 'й' into your string: as a single precomposed code point, or as two code points (a base letter followed by a combining mark). Both render identically, but they do not compare equal:
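>>> import unicodedata
>>> a = '\u0439'        # precomposed 'й', one code point
>>> b = '\u0438\u0306'  # 'и' plus a combining breve, also renders as 'й'
>>> len(a), unicodedata.name(a)
(1, 'CYRILLIC SMALL LETTER SHORT I')
>>> len(b), unicodedata.name(b[0]), unicodedata.name(b[1])
(2, 'CYRILLIC SMALL LETTER I', 'COMBINING BREVE')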
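If that is what is happening, normalizing both strings to the same form before comparing them should make them match. Here is a minimal sketch, assuming NFC is an acceptable form for your data; `same_text` is a hypothetical helper name used only for illustration, not something from your code or the standard library:

```python
import unicodedata

a = '\u0439'        # precomposed: CYRILLIC SMALL LETTER SHORT I
b = '\u0438\u0306'  # decomposed: CYRILLIC SMALL LETTER I + COMBINING BREVE


def same_text(x, y, form='NFC'):
    """Compare two strings after normalizing both to the same Unicode form.

    Hypothetical helper for illustration only.
    """
    return unicodedata.normalize(form, x) == unicodedata.normalize(form, y)


print(a == b)           # False: the code point sequences differ
print(same_text(a, b))  # True: both normalize to the same sequence

# NFC composes the pair into one code point; NFD splits it into two.
print(len(unicodedata.normalize('NFC', b)))  # 1
print(len(unicodedata.normalize('NFD', a)))  # 2
```

Which form you choose matters less than applying the same one consistently to every string you compare or use as a key.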
Date | User | Action | Args
2020-12-10 12:15:13 | steven.daprano | set | recipients: + steven.daprano, ronaldoussoren, vstinner, ezio.melotti, hidr0.frbg
2020-12-10 12:15:13 | steven.daprano | set | messageid: <1607602513.39.0.743587122132.issue42614@roundup.psfhosted.org>
2020-12-10 12:15:13 | steven.daprano | link | issue42614 messages
2020-12-10 12:15:13 | steven.daprano | create |