Author dcjim
Recipients
Date 2002-08-23.19:16:04
SpamBayes Score
Marked as misclassified
Message-id
In-reply-to
Content
For Python 2.2.1 or the CVS head, as of this posting, 
with Python configured for 4-byte unicode
(--enable-unicode=ucs4)
searches against unicode regular expressions that use 
characters above \xff don't seem to work.

Here's an example:

  invalid_xml_char = re.compile(u'[\ud800-\udfff]')
  invalid_xml_char.search(u'\ud800')

returns None, rather than a match.
History
Date User Action Args
2007-08-23 14:05:16adminlinkissue599377 messages
2007-08-23 14:05:16admincreate