This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author brett.cannon
Recipients brett.cannon
Date 2008-12-11.06:19:13
SpamBayes Score 6.4257943e-06
Marked as misclassified No
Message-id <1228976357.63.0.133847904092.issue4626@psf.upfronthosting.co.za>
In-reply-to
Content
When compile() is called with a string it is a reasonable assumption
that it has already been decoded. But this is not in fact the case and
leads to errors when trying to use non-ASCII identifiers::

 >>> source = "# coding=latin-1\n\u00c6 = '\u00c6'"
 >>> compile(source, '<test>', 'exec')
 Traceback (most recent call last):
   File "<stdin>", line 1, in <module>
   File "<test>", line 2
     Æ = 'Æ'
        ^
 SyntaxError: invalid character in identifier
 >>> compile(source.encode('latin-1'), '<test>', 'exec')
 <code object <module> at 0x389cc8, file "<test>", line 2>
History
Date User Action Args
2008-12-11 06:19:18brett.cannonsetrecipients: + brett.cannon
2008-12-11 06:19:17brett.cannonsetmessageid: <1228976357.63.0.133847904092.issue4626@psf.upfronthosting.co.za>
2008-12-11 06:19:16brett.cannonlinkissue4626 messages
2008-12-11 06:19:14brett.cannoncreate