Message83356
Hi all,
This patch assumes UTF-8 encoding for files opened in 'rb' mode.
That is:
1. Check whether each line is a Unicode or a bytes object.
2. If it is bytes, get a char* reference to its internal buffer.
3. Use PyUnicode_FromString to create a new Unicode object from the
char* - this step assumes UTF-8.
4. Get a Py_UNICODE reference to the Unicode object's internal buffer and
continue as before.
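
For illustration only, the steps above can be sketched at the Python level. This is not the patch itself, which works in C inside the csv module; the helper name `decode_lines` and the sample input are hypothetical, and `bytes.decode("utf-8")` is used here as a rough stand-in for PyUnicode_FromString (which additionally expects a NUL-terminated buffer).

```python
import csv


def decode_lines(lines, encoding="utf-8"):
    """Yield str lines, decoding any bytes line (hypothetical helper)."""
    for line in lines:
        # Step 1: check whether the line is bytes or already Unicode.
        if isinstance(line, bytes):
            # Steps 2-3: build a Unicode object from the raw bytes,
            # assuming they are UTF-8 encoded.
            line = line.decode(encoding)
        # Step 4: hand the Unicode line on and continue as before.
        yield line


# Mixed input: two bytes lines (as read from an 'rb' file) and one str line.
sample = [b"a,b\n", "c,d\n", b"caf\xc3\xa9,x\n"]
rows = list(csv.reader(decode_lines(sample)))
```

In this sketch a line that is not valid UTF-8 raises UnicodeDecodeError, which is the kind of failure the UTF-8 assumption in step 3 would have to deal with.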
Is this in the right direction at all?
Cheers,
Jervis

Date | User | Action | Args
2009-03-09 05:11:33 | jdwhitley | set | recipients: + jdwhitley, georg.brandl, sjmachin, pitrou, vstinner, jaywalker
2009-03-09 05:11:32 | jdwhitley | set | messageid: <1236575492.47.0.0523080771295.issue4847@psf.upfronthosting.co.za>
2009-03-09 05:11:31 | jdwhitley | link | issue4847 messages
2009-03-09 05:11:30 | jdwhitley | create |