This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author georg.brandl
Recipients Jean-Michel.Fauth, benjamin.peterson, ezio.melotti, georg.brandl, jmfauth, loewis, r.david.murray, terry.reedy
Date 2012-03-03.13:03:37
SpamBayes Score 4.268463e-06
Marked as misclassified No
Message-id <1330779818.84.0.165608980151.issue14176@psf.upfronthosting.co.za>
In-reply-to
Content
I'd like to encourage you to not try this sort of thing out from an interactive interpreter (incidentally, where does "<psi last command>" come from? It doesn't look like Python's REPL).

As David and Terry noted, interactions with such a console, be it Windows' "cmd" or IDLE, have their very own idiosyncrasies and bugs.

That said, in Python 2.x *source files* the following two expressions are identical:

* u'abcœé€'
* unicode('abcœé€', 'encoding the file is in')

Both result in a Unicode string with the six characters/codepoints you mentioned.  There won't be any code that works with one but not the other.

Of course there are libraries that do not handle Unicode strings in general (nothing to do with literals!) correctly, but as you yourself said, that is a problem with the libraries.

Lastly, please read PEP 414 if you are not completely sure what it is proposing.  You will see that it merely affects the available syntax for Unicode literals and allows the "u" again.
History
Date User Action Args
2012-03-03 13:03:38georg.brandlsetrecipients: + georg.brandl, loewis, terry.reedy, benjamin.peterson, ezio.melotti, jmfauth, r.david.murray, Jean-Michel.Fauth
2012-03-03 13:03:38georg.brandlsetmessageid: <1330779818.84.0.165608980151.issue14176@psf.upfronthosting.co.za>
2012-03-03 13:03:38georg.brandllinkissue14176 messages
2012-03-03 13:03:37georg.brandlcreate