Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Natural Language Inference datasets #1186

Open
wants to merge 11 commits into
base: main
Choose a base branch
from

Conversation

zhangguanheng66
Copy link
Contributor

@zhangguanheng66 zhangguanheng66 commented Feb 18, 2021

The following three datasets have been retired in the legacy.datasets folder. We are re-writing these by yielding the raw texts:

  • SNLI
  • MatchedMultiNLI (link)
  • MismatchedMultiNLI (link)

Unfortunately, The original link (http://www.nyu.edu/projects/bowman/xnli/XNLI-1.0.zip) for XNLI host is not available.

@zhangguanheng66 zhangguanheng66 changed the title [WIP] Add Natural Language Inference dataset [WIP] Add Natural Language Inference datasets Feb 18, 2021
@zhangguanheng66 zhangguanheng66 changed the title [WIP] Add Natural Language Inference datasets Add Natural Language Inference datasets Feb 19, 2021
@bentrevett
Copy link
Contributor

bentrevett commented Feb 19, 2021

Just FYI the XNLI dataset is available from: https://dl.fbaipublicfiles.com/XNLI/XNLI-1.0.zip, which I found in this repo.

That repo also has links to the XNLI-15way dataset (https://dl.fbaipublicfiles.com/XNLI/XNLI-15way.zip) and the XNLI-MT dataset (https://dl.fbaipublicfiles.com/XNLI/XNLI-MT-1.0.zip).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants