Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

process.dedupe() gives IndexError: list index out of range because of bug in process.extractWithoutOrder() #307

Open
Thijsvandepoll opened this issue Apr 2, 2021 · 0 comments

Comments

@Thijsvandepoll
Copy link

Hi all,

I found a bug in process.extractWithoutOrder() which causes process.dedupe() to fail unexpectedly. The example:

process.dedupe(["BRITT JEFFREY S", "BRITT JEFFREY S.", "WIEDEMAN SCOTT", "WIEDERMANN SCOTT", "斯科特·维德曼", "杰弗里·S·布里特"])

which results in:

IndexError: list index out of range

The expected result here is:

dict_keys(['BRITT JEFFREY S.', 'WIEDERMANN SCOTT', '斯科特·维德曼', '杰弗里·S·布里特'])

I looked into the source code and I believe I found a bug in process.extractWithoutOrder() which sets the used (pre)processor different for the query then for the choices. I will create a merge request to fix this issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant