Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The problem of answers out of context #16

Open
GMarzinotto opened this issue Apr 6, 2020 · 0 comments
Open

The problem of answers out of context #16

GMarzinotto opened this issue Apr 6, 2020 · 0 comments

Comments

@GMarzinotto
Copy link

Hello,

First I wanted to thank you for this great resource !
As a way to try to further improve it I wanted to point out some cases that I observed on the dataset

Some questions point to a part of a token (often a character from a word) as part of an answer.
I find this weird, especially because many times the word that contains the answer is out of context.

Some Exemples from train-v2.json

Question ID |||  Question ||| Answer ||| Word that contains the answer.

56cf609aaab44d1400b89187 |||| At what age did Chopin start playing publicly? ||| 7 |||| 1817 
56ce750daab44d1400b887b4 |||| In how many colors is the current iPod Touch available? ||| 5 |||| 2015
56d0f47a17492d1400aab69d |||| How many total CDs has Kanye West released in his career so far? ||| 7 |||| 2007 
56d1042317492d1400aab72f |||| How many times was The College Dropout's release put off? ||| 3 |||| 2003 

Here the annotators take dates and extract numbers from them and mark them as answers.
I gave only 4 examples but there are many more.
I can give you the entire list, or help you fix it, if you are interested in correcting this issue !

Moreover, I believe this raises a question on the possibility of out of context answers.
If I ask How many total CDs has Kanye West released in his career so far?
and the paragraph says Kanye has 7 cars, I believe marking 7 as correct is wrong despite there is an exact match in the strings.
I was wondering, how do you currently address this issues ?

Thank you very much,

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant