FWIW, I recently gave a talk at PyCon Finland called "Understanding Encodings" that goes through the things you mentioned in your last message.

I could turn that into a patch for the Unicode Howto.
