This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author nalza001
Recipients nalza001, tim.peters
Date 2021-09-16.04:33:03
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1631766784.51.0.868277931091.issue45180@roundup.psfhosted.org>
In-reply-to
Content
Here are the steps that I used to calculate 0.2 for the last example:

I used class difflib.HtmlDiff to find the number of changed chars (addedChars, deletedChars, and changedChars) which is 1172 (let us call it delta)

The size of both strings a and b in this example is 1470

I calculated the similality ratio using 1-(delta/totalSize) = 1-(1172/1470)=0.2

I am assuming both classes difflib.SequenceMatcher and difflib.HtmlDiff are both using the same algorithms and arguments and if so they should produce the same ratio. Is that right?
History
Date User Action Args
2021-09-16 04:33:04nalza001setrecipients: + nalza001, tim.peters
2021-09-16 04:33:04nalza001setmessageid: <1631766784.51.0.868277931091.issue45180@roundup.psfhosted.org>
2021-09-16 04:33:04nalza001linkissue45180 messages
2021-09-16 04:33:03nalza001create