Message 144722 - Python tracker

➜

This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author	loewis
Recipients	Arfrever, ezio.melotti, gvanrossum, loewis, tchrist, terry.reedy, vstinner
Date	2011-10-01.10:59:47
SpamBayes Score	0.001251407
Marked as misclassified	No
Message-id	<4E86F2A2.9020107@v.loewis.de>
In-reply-to	<26418.1317386261@chthon>

Content
> * Word characters are Alphabetic + Mn+Mc+Me + Nd + Pc. Where did you get that definition from? UTS#18 defines "<word_character>", which is Alphabetic + U+200C + U+200D (i.e. not including marks, but including those > I think you are looking for here are Word characters without > Nd + Pc, so just Alphabetic + Mn+Mc+Me. > > Is that right? With your definition of "Word character" above, yes, that's right. Marks won't start a word, though. As for terminology: I think the documentation should continue to speak about "words" and "letters", and then define what is meant in this context. It's not that the Unicode consortium invented the term "letter", so we should use it more liberally than just referring to the L* categories.

>  * Word characters are Alphabetic + Mn+Mc+Me + Nd + Pc.

Where did you get that definition from? UTS#18 defines
"<word_character>", which is Alphabetic + U+200C + U+200D
(i.e. not including marks, but including those

> I think you are looking for here are Word characters without 
> Nd + Pc, so just Alphabetic + Mn+Mc+Me.  
> 
> Is that right?

With your definition of "Word character" above, yes, that's right.
Marks won't start a word, though.

As for terminology: I think the documentation should continue to
speak about "words" and "letters", and then define what is meant
in this context. It's not that the Unicode consortium invented
the term "letter", so we should use it more liberally than just
referring to the L* categories.

History
Date	User	Action	Args
2011-10-01 10:59:48	loewis	set	recipients: + loewis, gvanrossum, terry.reedy, vstinner, ezio.melotti, Arfrever, tchrist
2011-10-01 10:59:47	loewis	link	issue12737 messages
2011-10-01 10:59:47	loewis	create