
Author serhiy.storchaka
Recipients Guillaume Sanchez, Socob, benjamin.peterson, ezio.melotti, lemburg, loewis, mrabarnett, r.david.murray, serhiy.storchaka, steven.daprano, terry.reedy, vstinner
Date 2017-08-03.11:21:37
Message-id <1501759297.91.0.0682923592822.issue30717@psf.upfronthosting.co.za>
In-reply-to
Content
Issue18406 was closed as a duplicate of this issue, but it contains useful links. In particular, see the proto-PEP for a Unicode Indexing Helper Module:

http://mail.python.org/pipermail/python-dev/2001-July/015938.html

I agree that providing a grapheme iterator would be useful, but it would also be useful to provide word and sentence iterators.

Should the iterators yield just substrings, or also their positions? I think emitting a pair (pos, substring) would be more useful: it is easier to build an iterator of substrings from an iterator of pairs than the other way around.
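
To illustrate the asymmetry, here is a minimal sketch. The names iter_graphemes, substrings_only and with_positions are hypothetical, not a proposed API, and the segmentation is a deliberate simplification (a character plus any following combining marks), not the Unicode UAX #29 rules.

```python
import unicodedata


def iter_graphemes(text):
    """Yield (pos, substring) pairs for crude grapheme clusters.

    Assumption for illustration only: a cluster is one character plus
    any following combining marks. This is NOT full UAX #29 segmentation.
    """
    start = 0
    for i in range(1, len(text) + 1):
        # Close the current cluster at the end of the text or when the
        # next character is not a combining mark.
        if i == len(text) or unicodedata.combining(text[i]) == 0:
            yield start, text[start:i]
            start = i


def substrings_only(pairs):
    """Pairs -> substrings: simply drop the position."""
    return (sub for _, sub in pairs)


def with_positions(substrings):
    """Substrings -> pairs: positions must be recomputed by counting.

    This only works if the substrings are contiguous and cover the
    whole string, which would not hold for an iterator that skips
    text between items.
    """
    pos = 0
    for sub in substrings:
        yield pos, sub
        pos += len(sub)
```

Going from pairs to substrings is a one-line generator expression, while the reverse needs extra state and an assumption about contiguity.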

Alternatively, an iterator could emit slice objects, or special objects similar to re match objects.
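
A rough sketch of the slice-based variant, built on re.finditer. The name iter_word_slices is hypothetical, and the pattern \w+ is only a crude stand-in for real Unicode word-boundary rules.

```python
import re


def iter_word_slices(text):
    """Yield a slice object for each run of word characters.

    Assumption for illustration only: r"\w+" approximates a word.
    A real word iterator would follow the Unicode segmentation rules.
    """
    for match in re.finditer(r"\w+", text):
        # A match object already carries both the position and the
        # text; a slice keeps just the span.
        yield slice(match.start(), match.end())


text = "Hello, wide world"
slices = list(iter_word_slices(text))
words = [text[sl] for sl in slices]    # substring via indexing
starts = [sl.start for sl in slices]   # position via the slice attribute
```

A slice gives both pieces of information, but the caller needs the original string to recover the substring, whereas a match-like object could carry the substring itself.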
History
Date | User | Action | Args
2017-08-03 11:21:37 | serhiy.storchaka | set | recipients: + serhiy.storchaka, lemburg, loewis, terry.reedy, vstinner, benjamin.peterson, ezio.melotti, mrabarnett, steven.daprano, r.david.murray, Socob, Guillaume Sanchez
2017-08-03 11:21:37 | serhiy.storchaka | set | messageid: <1501759297.91.0.0682923592822.issue30717@psf.upfronthosting.co.za>
2017-08-03 11:21:37 | serhiy.storchaka | link | issue30717 messages
2017-08-03 11:21:37 | serhiy.storchaka | create |