This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author josh.r
Recipients abarry, josh.r, rhettinger, seblin, socketpair
Date 2015-12-18.06:18:39
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1450419519.93.0.229300760177.issue25898@psf.upfronthosting.co.za>
In-reply-to
Content
Aho Corasick doesn't seem likely to be useful here; it's good if the haystack is huge (or you have many haystacks to search) and you have many needles to look for (and the needles never change), but it pays a fairly steep setup cost; for a utility that searches for a single subsequence once, with no history, Aho-Corasick wouldn't help much.

A variant on Boyer-Moore (which involves less preprocessing work on the needle) might help, but I'm not sure the feature is compelling enough to warrant acceleration in the first place.
History
Date User Action Args
2015-12-18 06:18:39josh.rsetrecipients: + josh.r, rhettinger, socketpair, abarry, seblin
2015-12-18 06:18:39josh.rsetmessageid: <1450419519.93.0.229300760177.issue25898@psf.upfronthosting.co.za>
2015-12-18 06:18:39josh.rlinkissue25898 messages
2015-12-18 06:18:39josh.rcreate