Message118758
Windows 7, Python 2.7
Some windows applications (QuickTime) add content-types to Windows registry with non-ascii names. mimetypes in unaware of that and fails with UnicodeDecodeError:
>>> mimetypes.guess_type('test.js')
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "c:\Python27\lib\mimetypes.py", line 294, in guess_type
init()
File "c:\Python27\lib\mimetypes.py", line 355, in init
db.read_windows_registry()
File "c:\Python27\lib\mimetypes.py", line 260, in read_windows_registry
for ctype in enum_types(mimedb):
File "c:\Python27\lib\mimetypes.py", line 250, in enum_types
ctype = ctype.encode(default_encoding) # omit in 3.x!
UnicodeDecodeError: 'ascii' codec can't decode byte 0xe0 in position 0: ordinal not in range(128)
Example registry leaf is attached to previous message.
I believe the correct behavior would be either to wrap UnicodeDecodeError exception and skip those content-typer or use .decode() method for registry keys and get encoding using locale.getdefaultlocale() |
|
Date |
User |
Action |
Args |
2010-10-15 11:05:06 | vldmit | set | recipients:
+ vldmit |
2010-10-15 11:05:05 | vldmit | set | messageid: <1287140705.94.0.131601408046.issue10113@psf.upfronthosting.co.za> |
2010-10-15 11:05:04 | vldmit | link | issue10113 messages |
2010-10-15 11:05:04 | vldmit | create | |
|