This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author serhiy.storchaka
Recipients pitrou, serhiy.storchaka, vstinner
Date 2012-04-23.21:06:57
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1335215364.3101.289.camel@raxxla>
In-reply-to <1335215047.86.0.0172027590975.issue14654@psf.upfronthosting.co.za>
Content
Here are the results of benchmarking (numbers in MB/s).

On 32-bit Linux, AMD Athlon 64 X2 4600+ @ 2.4GHz:

                                          Py2.7         Py3.2         Py3.3        patch

utf-8     'A'*10000                       191 (+790%)   1170 (+45%)   1664 (+2%)    1700
utf-8     '\x80'*10000                    187 (+4%)     219 (-11%)    172 (+13%)    194
utf-8       '\x80'+'A'*9999               191 (+98%)    1152 (-67%)   376 (+1%)     378
utf-8     '\u0100'*10000                  188 (+15%)    221 (-2%)     164 (+32%)    217
utf-8       '\u0100'+'A'*9999             191 (+103%)   1150 (-66%)   382 (+1%)     387
utf-8       '\u0100'+'\x80'*9999          188 (+15%)    221 (-2%)     164 (+32%)    217
utf-8     '\u8000'*10000                  244 (-12%)    263 (-18%)    191 (+13%)    215
utf-8       '\u8000'+'A'*9999             191 (+102%)   1174 (-67%)   382 (+1%)     386
utf-8       '\u8000'+'\x80'*9999          188 (+15%)    216 (+0%)     164 (+32%)    217
utf-8       '\u8000'+'\u0100'*9999        188 (+15%)    216 (+0%)     164 (+32%)    217
utf-8     '\U00010000'*10000              251 (-15%)    248 (-14%)    199 (+7%)     213
utf-8       '\U00010000'+'A'*9999         191 (+97%)    1173 (-68%)   372 (+1%)     376
utf-8       '\U00010000'+'\x80'*9999      188 (+21%)    221 (+3%)     180 (+26%)    227
utf-8       '\U00010000'+'\u0100'*9999    188 (+21%)    221 (+3%)     180 (+26%)    227
utf-8       '\U00010000'+'\u8000'*9999    244 (-9%)     263 (-16%)    201 (+10%)    221

On 32-bit Linux, Intel Atom N570 @ 1.66GHz:

                                          Py2.7         Py3.2         Py3.3        patch

utf-8     'A'*10000                       117 (+414%)   349 (+72%)    597 (+1%)     601
utf-8     '\x80'*10000                    86 (-5%)      89 (-8%)      67 (+22%)     82
utf-8       '\x80'+'A'*9999               117 (+6%)     340 (-64%)    126 (-2%)     124
utf-8     '\u0100'*10000                  86 (-2%)      89 (-6%)      66 (+27%)     84
utf-8       '\u0100'+'A'*9999             117 (+5%)     339 (-64%)    78 (+58%)     123
utf-8       '\u0100'+'\x80'*9999          86 (-2%)      89 (-6%)      66 (+27%)     84
utf-8     '\u8000'*10000                  109 (-26%)    98 (-17%)     71 (+14%)     81
utf-8       '\u8000'+'A'*9999             116 (+7%)     339 (-63%)    78 (+59%)     124
utf-8       '\u8000'+'\x80'*9999          86 (-3%)      89 (-7%)      66 (+26%)     83
utf-8       '\u8000'+'\u0100'*9999        86 (-3%)      89 (-7%)      66 (+26%)     83
utf-8     '\U00010000'*10000              106 (-14%)    105 (-13%)    81 (+12%)     91
utf-8       '\U00010000'+'A'*9999         116 (+12%)    338 (-62%)    127 (+2%)     130
utf-8       '\U00010000'+'\x80'*9999      86 (+6%)      88 (+3%)      69 (+32%)     91
utf-8       '\U00010000'+'\u0100'*9999    86 (+6%)      88 (+3%)      69 (+32%)     91
utf-8       '\U00010000'+'\u8000'*9999    109 (-24%)    98 (-15%)     74 (+12%)     83

The results were ambiguous (everywhere plus, but in different ways). I
would like to see the results for 64-bit platforms. For scripts see
issue14624.
History
Date User Action Args
2012-04-23 21:06:57serhiy.storchakasetrecipients: + serhiy.storchaka, pitrou, vstinner
2012-04-23 21:06:57serhiy.storchakalinkissue14654 messages
2012-04-23 21:06:57serhiy.storchakacreate