You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
ImplicitCF raises an IndexError if the user appears in the test dataset but not on the training dataset.
How do we replicate the issue?
Split a dataset using a method like TimeSeriesSplit or python_chrono_split. I.e: len(ImplicitCF.interact_status) < len(ImplicitCF.user_idx)
Expected behavior (i.e. solution)
Raisign a meaningful error if the dataset needs to be stratified, or assuming that if the user is not on the ImplicitCF.interact_status table, it should have the empty set of items.
I'm not certain under what circumstances 'user in test' should be used in 'interact_status.' Based on my understanding, 'interact_status' should only be applied to the training dataset. Could you provide me with more information on this?
I've become a bit unclear about the logic here, but I don't believe it's necessary for 'interact_status' to include users from
the test set. @loomlike Do you have time to further investigate this issue?
Description
ImplicitCF raises an
IndexError
if the user appears in the test dataset but not on the training dataset.How do we replicate the issue?
Split a dataset using a method like TimeSeriesSplit or python_chrono_split. I.e:
len(ImplicitCF.interact_status) < len(ImplicitCF.user_idx)
Expected behavior (i.e. solution)
Raisign a meaningful error if the dataset needs to be stratified, or assuming that if the user is not on the
ImplicitCF.interact_status
table, it should have the empty set of items.Other Comments
Meanwhile, I solved it by using:
This will create a the remaining "empty" users
Or just deleting items in test that don't appear in train
The text was updated successfully, but these errors were encountered: