Message 83712 - Python tracker

➜

This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author	vstinner
Recipients	LambertDW, amaury.forgeotdarc, ocean-city, vstinner
Date	2009-03-17.23:23:34
SpamBayes Score	3.801719e-09
Marked as misclassified	No
Message-id	<1237332216.73.0.0574011104955.issue2382@psf.upfronthosting.co.za>
In-reply-to

Content
For an easier review, I splitted my patch in multiple small patches: - unicode_utf8size.patch: create _PyUnicode_UTF8Size() function: Number of bytes needed to encode the unicode character as UTF-8 - unicode_width.patch: create PyUnicode_Width(): Number of column needed to represent the string in the current locale. -1 is returned in case of an error. - adjust_offset.patch: Change unit of SyntaxError.offset, convert utf8 offset to unicode offset - print_exception.patch: process error text as an unicode string (instead of a byte string), convert offset from characters to "columns" Dependencies: - adjust_offset.patch depends on unicode_utf8size.patch - print_exception.patch depends on unicode_width.patch Changes since issue2382.patch: - PyUnicode_Width() doesn't change the locale - PyUnicode_Width() uses WideCharToMultiByte() on MS_WINDOWS, and wcswidth() otherwise (before: do nothing if HAVE_WCSWIDTH is not definied) - the offset was converted from utf8 index to unicode index only in print_error_text(), not on SyntaxError creation - _PyUnicode_UTF8Size() and PyUnicode_Width() are public

For an easier review, I splitted my patch in multiple small patches:
 - unicode_utf8size.patch: create _PyUnicode_UTF8Size() function: 
Number of bytes needed to encode the unicode character as UTF-8
 - unicode_width.patch: create PyUnicode_Width(): Number of column 
needed to represent the string in the current locale. -1 is returned 
in case of an error.
 - adjust_offset.patch: Change unit of SyntaxError.offset, convert 
utf8 offset to unicode offset
 - print_exception.patch: process error text as an unicode string 
(instead of a byte string), convert offset from characters 
to "columns"

Dependencies:
 - adjust_offset.patch depends on unicode_utf8size.patch
 - print_exception.patch depends on unicode_width.patch

Changes since issue2382.patch:
 - PyUnicode_Width() doesn't change the locale
 - PyUnicode_Width() uses WideCharToMultiByte() on MS_WINDOWS, and 
wcswidth() otherwise (before: do nothing if HAVE_WCSWIDTH is not 
definied)
 - the offset was converted from utf8 index to unicode index only in 
print_error_text(), not on SyntaxError creation
 - _PyUnicode_UTF8Size() and PyUnicode_Width() are public

History
Date	User	Action	Args
2009-03-17 23:23:37	vstinner	set	recipients: + vstinner, amaury.forgeotdarc, ocean-city, LambertDW
2009-03-17 23:23:36	vstinner	set	messageid: <1237332216.73.0.0574011104955.issue2382@psf.upfronthosting.co.za>
2009-03-17 23:23:35	vstinner	link	issue2382 messages
2009-03-17 23:23:35	vstinner	create