[BENCHMARK DATASET REQUEST] dutch-cola #419

BramVanroy · 2024-04-24T13:08:22Z

Dataset name

GroNLP/dutch-cola

Dataset link

https://huggingface.co/datasets/GroNLP/dutch-cola

Dataset languages

Describe the dataset

Dutch CoLA is a corpus of linguistic acceptability for Dutch: a dataset consisting of sentences in Dutch, each marked as either acceptable (class 1) or unacceptable (class 0). These sentences are collected from existing descriptions of Dutch grammar (see sources below) with expert-annotated acceptability labels.

I might add it through a PR when I find the time.

BramVanroy added the benchmark dataset request Request to add a new benchmark dataset label Apr 24, 2024

BramVanroy linked a pull request Apr 24, 2024 that will close this issue

Add Dutch CoLA #421

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BENCHMARK DATASET REQUEST] dutch-cola #419

[BENCHMARK DATASET REQUEST] dutch-cola #419

BramVanroy commented Apr 24, 2024

[BENCHMARK DATASET REQUEST] dutch-cola #419

[BENCHMARK DATASET REQUEST] dutch-cola #419

Comments

BramVanroy commented Apr 24, 2024

Dataset name

Dataset link

Dataset languages

Describe the dataset