This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author mark.dickinson
Recipients mark.dickinson, serhiy.storchaka
Date 2012-09-20.20:16:58
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1348172219.51.0.84674051024.issue15992@psf.upfronthosting.co.za>
In-reply-to
Content
[Broken out of the discussion in issue 15144]

Some of the newly-optimized code in Objects/unicodeobject.c contains strict aliasing violations;  under the C standards, this is undefined behaviour (C99 6.5p7).

An example occurs in ascii_decode:

    unsigned long value = *(const unsigned long *) _p;

Here the pointer dereference violates the strict aliasing rule.

I think these portions of Objects/unicodeobject.c should be rewritten to avoid the undefined behaviour.

This is not a purely theoretical problem: compilers are known to make optimizations based on the assumption that strict aliasing is not violated.  Early versions of David Gay's dtoa.c gave incorrect results as a result of strict aliasing violations, for example; see [1].

[2] gives a stackoverflow reference explaining strict aliasing.

[1] http://patrakov.blogspot.co.uk/2009/03/dont-use-old-dtoac.html
[2] http://stackoverflow.com/questions/98650/what-is-the-strict-aliasing-rule
History
Date User Action Args
2012-09-20 20:16:59mark.dickinsonsetrecipients: + mark.dickinson, serhiy.storchaka
2012-09-20 20:16:59mark.dickinsonsetmessageid: <1348172219.51.0.84674051024.issue15992@psf.upfronthosting.co.za>
2012-09-20 20:16:59mark.dickinsonlinkissue15992 messages
2012-09-20 20:16:58mark.dickinsoncreate