I agree that the documentation should be updated. Would you mind creating a pull request, mbiggs?

There are UTF-8 variants that guarantee the encoded text contains no zero bytes (see Modified UTF-8), but Python only provides standard UTF-8 ("utf-8") and UTF-8 with BOM ("utf-8-sig").
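To illustrate the difference, here is a rough sketch. The built-in codec calls are real; `encode_modified_utf8` is a hypothetical helper written for this comment, not something Python ships. It assumes the usual description of Modified UTF-8: U+0000 is written as the two bytes C0 80, and characters above U+FFFF are written as a surrogate pair with each half encoded as three bytes.

```python
# Standard codecs shipped with Python: U+0000 becomes a literal zero byte.
standard = "a\x00b".encode("utf-8")          # b"a\x00b"
with_bom = "a\x00b".encode("utf-8-sig")      # b"\xef\xbb\xbfa\x00b"


def encode_modified_utf8(text: str) -> bytes:
    """Hypothetical helper: encode text as Modified UTF-8 (assumed rules)."""
    out = bytearray()
    for ch in text:
        cp = ord(ch)
        if cp == 0:
            # Overlong two-byte form of U+0000, so no zero byte is emitted.
            out += b"\xc0\x80"
        elif cp > 0xFFFF:
            # Split into a UTF-16 surrogate pair; encode each half separately.
            cp -= 0x10000
            high = chr(0xD800 + (cp >> 10))
            low = chr(0xDC00 + (cp & 0x3FF))
            out += (high + low).encode("utf-8", "surrogatepass")
        else:
            # Everything else matches standard UTF-8.
            out += ch.encode("utf-8", "surrogatepass")
    return bytes(out)


modified = encode_modified_utf8("a\x00b")    # b"a\xc0\x80b"
```

Note that the standard decoder rejects the C0 80 sequence, so `modified.decode("utf-8")` raises UnicodeDecodeError; a matching decoder would be needed as well.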
