Message 159132 - Python tracker

➜

This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author	loewis
Recipients	Arfrever, jcea, loewis, pitrou, serhiy.storchaka, vstinner
Date	2012-04-24.11:22:05
SpamBayes Score	-1.0
Marked as misclassified	Yes
Message-id	<1335266526.6.0.822959979186.issue14654@psf.upfronthosting.co.za>
In-reply-to

Content
I'm -1 on using signed char in the implementation. If this gives any advantage, it's because the compiler is not able to generate as efficient code for unsigned char as it does for signed char. So the performance results may again change if you switch compilers, or use the next compiler version. The code should do what is logically correct; IMO, UTF-8 is really a sequence of unsigned bytes, conceptually. So if you want to demonstrate any performance improvements, you need to do so with unsigned chars.

I'm -1 on using signed char in the implementation. If this gives any advantage, it's because the compiler is not able to generate as efficient code for unsigned char as it does for signed char. So the performance results may again change if you switch compilers, or use the next compiler version.

The code should do what is *logically* correct; IMO, UTF-8 is really a sequence of unsigned bytes, conceptually.

So if you want to demonstrate any performance improvements, you need to do so with unsigned chars.

History
Date	User	Action	Args
2012-04-24 11:22:06	loewis	set	recipients: + loewis, jcea, pitrou, vstinner, Arfrever, serhiy.storchaka
2012-04-24 11:22:06	loewis	set	messageid: <1335266526.6.0.822959979186.issue14654@psf.upfronthosting.co.za>
2012-04-24 11:22:06	loewis	link	issue14654 messages
2012-04-24 11:22:05	loewis	create