This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author jvanpraag
Recipients jvanpraag
Date 2010-06-18.19:43:47
SpamBayes Score 0.0034563763
Marked as misclassified No
Message-id <1276890229.73.0.493277821026.issue9029@psf.upfronthosting.co.za>
In-reply-to
Content
Platforms:
Windows Vista Ultimate 64 bit
Python v3.1.2

Goal is to read a tab-separated CSV file. Some records have garbage characters in them.

The following code fragment works when the program is run in IDLE, but fails when run from the Windows command line.

...
	fh_read = open(company_info, encoding='utf_8', errors='replace')
	fh_write = open(quotes_file, 'w')
	count = 0
	try:
		rdr = csv.reader(fh_read, delimiter='\t')
		wrt = csv.writer(fh_write, delimiter='\t', quotechar='"', lineterminator='\n')
		for row in rdr:
...

Within IDLE the code works with or without the encoding='utf_8' declaration; and it fails with or without the declaration when run from the Windows command line.

The error message from the Windows command line is as follows:

'charmap' codec can't encode character '\u2019' in position 207: character maps to <undefined>
History
Date User Action Args
2010-06-18 19:43:49jvanpraagsetrecipients: + jvanpraag
2010-06-18 19:43:49jvanpraagsetmessageid: <1276890229.73.0.493277821026.issue9029@psf.upfronthosting.co.za>
2010-06-18 19:43:48jvanpraaglinkissue9029 messages
2010-06-18 19:43:47jvanpraagcreate