Author hyeshik.chang
Recipients hyeshik.chang
Date 2007-08-27.12:43:24
Message-id <1188218605.0.0.993381507884.issue1037@psf.upfronthosting.co.za>
In-reply-to
Content
An illegal identifier crashes Python when it appears in UTF-8 source code or is entered in a UTF-8 interpreter session.

Python 3.0x (py3k:57555M, Aug 27 2007, 21:23:47) 
[GCC 3.4.6 [FreeBSD] 20060305] on freebsd6
>>> compile(b'#coding:utf-8\n\xfc', '', 'exec')
zsh: segmentation fault (core dumped)  ./python

The problem is that tokenizer.c:verify_identifier doesn't check the
return value of PyUnicode_DecodeUTF8, which is NULL when the input
contains an invalid UTF-8 sequence such as the lone byte 0xFC above.
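
The missing check can be illustrated in Python rather than C. This is only a sketch of the idea, not the tokenizer code or a proposed patch: the helper name decodes_as_utf8 is hypothetical, and it simply shows that the byte from the reproduction is rejected by the UTF-8 decoder, so the caller has to handle the failure before using the result.

```python
# Hypothetical helper, not part of CPython: it mirrors the kind of check
# the tokenizer needs to make before treating bytes as an identifier.
def decodes_as_utf8(data: bytes) -> bool:
    """Return True if data is a valid UTF-8 sequence, False otherwise."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        # The decoder signals failure; the caller must not use a result.
        return False
    return True


if __name__ == "__main__":
    # The lone byte from the reproduction is not valid UTF-8.
    print(decodes_as_utf8(b"\xfc"))
    # The same character, properly encoded, decodes fine.
    print(decodes_as_utf8("\u00fc".encode("utf-8")))
```

In the C code the equivalent step would be testing the pointer returned by PyUnicode_DecodeUTF8 for NULL and reporting an error instead of continuing.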