You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
All below cases return 0.
fuzz.token_set_ratio("","")
fuzz.token_set_ratio(" "," ")
fuzz.token_set_ratio("%","%")
fuzz.token_set_ratio("%","%%")
As far as I understand all this characters are considered blank, but by my mind it would be natural to consider them same and return 100, as rapidfuzz does in this case.
The text was updated successfully, but these errors were encountered:
rapidfuzz just doesn't preprocess strings by default, while thefuzz runs a preprocessing function removing these characters. You can disable this behavior in thefuzz:
On Mon, Dec 25, 2023, 6:30 PM Max Bachmann ***@***.***> wrote:
rapidfuzz just doesn't preprocess strings by default, while thefuzz runs a
preprocessing function removing these characters. You can disable this
behavior in thefuzz:
>>> thefuzz.fuzz.token_set_ratio("%","%")0>>> thefuzz.fuzz.token_set_ratio("%","%", full_process=False)100
or enable it in rapidfuzz
>>> rapidfuzz.fuzz.token_set_ratio("%","%")100.0>>> rapidfuzz.fuzz.token_set_ratio("%","%", processor=rapidfuzz.utils.default_process)0.0
—
Reply to this email directly, view it on GitHub
<#70 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AFVD6C7UL7PBE2SULWBM6WTYLGLZJAVCNFSM6AAAAABBCM62PGVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTQNRZGAZDOOBYGQ>
.
You are receiving this because you authored the thread.Message ID:
***@***.***>
All below cases return 0.
fuzz.token_set_ratio("","")
fuzz.token_set_ratio(" "," ")
fuzz.token_set_ratio("%","%")
fuzz.token_set_ratio("%","%%")
As far as I understand all this characters are considered blank, but by my mind it would be natural to consider them same and return 100, as rapidfuzz does in this case.
The text was updated successfully, but these errors were encountered: