Message48499
Sometimes csv files contain comment rows, for
temporarily commenting out data or occasionally for
documentation. The current csv module has no built-in
ability to skip rows; in order to skip all lines
beginning with '#', the programmer writes something like:
csv_reader = csv.reader(fp)
for row in csv_reader:
if row[0][0] != '#': #assuming no blank lines
print row
I propose adding a "commentchar" parameter to the csv
parser, so that the above code could be written (more
elegantly, in my opinion):
csv_reader = csv.reader(fp, commentchar='#')
for row in csv_reader:
print row
This requires only relatively minor changes to the
module, and by defaulting to using no comment
character, existing code will behave as before. If you
are interested, the patch (diffs against current cvs)
required for the second example to run are attached.
Note that that implementation adds SKIPPED_RECORD as a
pseudonym for START_RECORD, because setting status to
START_RECORD after skipping a record would cause a
blank record to be returned. Altering that behaviour
would cause more changes and the patch would be harder
to review. I've also held back on updating tests and
documentation to reflect this change, pending any
support for it.
It shoud be irrelevant, but this has been developed on
Debian testing against the cvs head of Python. |
|
Date |
User |
Action |
Args |
2007-08-23 15:43:21 | admin | link | issue1225769 messages |
2007-08-23 15:43:21 | admin | create | |
|