Message 32611 - Python tracker

➜

This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author	zaex
Recipients
Date	2007-08-09.01:34:16
SpamBayes Score
Marked as misclassified
Message-id
In-reply-to

Content
Here is a list of chinese characters that can be read from a file [in GB18030 encoding], but unable to encode to GB18030 encoding detailed: used codecs.open(r'file name', encoding='GB18030') to read the characters from a file, and try to encode them word by word into GB18030 with word.encode('GB18030'). The action caused an exception with 'illegal multibyte sequence' the attachment is also the list. list: ä¬ä±ä…ŸäŒ·ä¦Ÿä¦·ä² ã§ãã˜šã˜ã±®ä´”ä´–ä´—ä¦†ã§Ÿä™¡ä™Œä´•ä–ä¬ä´™ä¥½ä¼ää“–ä²¡ä¥‡ä¦‚ä¦…ä´“ã©³ã§ã³ ä²¢ä´˜ã–äœ£ä¥ºä¶®äœ©ä¥ºä²Ÿä²£ä¦›ä¦¶ã‘³ã‘‡ã¥®ã¤˜ää¦ƒ

Here is a list of chinese characters that can be read from a file [in GB18030 encoding], but unable to encode to GB18030 encoding

detailed:
used codecs.open(r'file name', encoding='GB18030') to read the characters from a file, and try to encode them word by word into GB18030 with word.encode('GB18030'). The action caused an exception with 'illegal multibyte sequence'

the attachment is also the list.

list:
ä¬ä±ä…ŸäŒ·ä¦Ÿä¦·ä² ã§ãã˜šã˜ã±®ä´”ä´–ä´—ä¦†ã§Ÿä™¡ä™Œä´•ä–ä¬ä´™ä¥½ä¼ää“–ä²¡ä¥‡ä¦‚ä¦…ä´“ã©³ã§ã³ ä²¢ä´˜ã–äœ£ä¥ºä¶®äœ©ä¥ºä²Ÿä²£ä¦›ä¦¶ã‘³ã‘‡ã¥®ã¤˜ää¦ƒ

History
Date	User	Action	Args
2007-08-23 14:59:08	admin	link	issue1770551 messages
2007-08-23 14:59:08	admin	create