This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author lemburg
Recipients barry, ezio.melotti, lemburg, r.david.murray
Date 2013-08-03.15:33:58
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <51FD22DE.6030807@egenix.com>
In-reply-to <1375454262.69.0.534466060897.issue18624@psf.upfronthosting.co.za>
Content
On 02.08.2013 16:37, R. David Murray wrote:
> 
> I got the impression from what I read that -e included additional control sequences, but perhaps I misunderstood and that only meant that the data stream was expected to *use* additional control sequences but the control codes themselves are part of the base codec?
> 
> I'm specifically thinking of this statement from the linked reference:
> 
> "Because HTML uses the Unicode bidirectionality algorithm, conforming documents encoded using ISO 8859-8 must be labeled as "ISO-8859-8-i". Explicit directional control is also possible with HTML, but cannot be expressed with ISO 8859-8, so "ISO-8859-8-e" should not be used."
> 
> The "cannot be expressed" seems to imply there are differences in the codec.

No, not really. After some more research, I found that the -i and
-e suffixes are defined in RFC 1556:

http://tools.ietf.org/html/rfc1556

At the codec level, these encodings are all the same. The suffixes
define whether or not to interpret some of their control characters
with respect to bidi text when visualizing the text.
History
Date User Action Args
2013-08-03 15:33:59lemburgsetrecipients: + lemburg, barry, ezio.melotti, r.david.murray
2013-08-03 15:33:58lemburglinkissue18624 messages
2013-08-03 15:33:58lemburgcreate