Author janssen
Recipients gvanrossum, janssen, jimjjewett, lemburg, loewis, mgiuca, orsenthil, pitrou, thomaspinckney3
Date 2008-08-12.03:43:36
SpamBayes Score 0.0328205
Marked as misclassified No
Message-id <1218512618.41.0.823335524111.issue3300@psf.upfronthosting.co.za>
In-reply-to
Content
Some interesting notes here (from Erik van der Poel at Google; Guido,
you might want to stroll over to his location and talk with him):

http://lists.w3.org/Archives/Public/www-international/2007JanMar/0004.html

and more particularly

http://lists.w3.org/Archives/Public/www-international/2008AprJun/0092.html,
which says, in part,

``Within the context of HTML and HTTP, queries
[that is, the query part of a URL] don't have to say which
charset they are using, because there is already an agreement in
place: the major browsers and servers use the charset of the HTML.''

So, there's still a sizable number of Latin-1 pages out there, and
queries against these pages will use that encoding in the URL's they send.

And then there's this:

http://lists.w3.org/Archives/Public/www-international/2008AprJun/0014.html
History
Date User Action Args
2008-08-12 03:43:38janssensetrecipients: + janssen, lemburg, gvanrossum, loewis, jimjjewett, orsenthil, pitrou, thomaspinckney3, mgiuca
2008-08-12 03:43:38janssensetmessageid: <1218512618.41.0.823335524111.issue3300@psf.upfronthosting.co.za>
2008-08-12 03:43:37janssenlinkissue3300 messages
2008-08-12 03:43:36janssencreate