Transrelation - CogALex VI Submission

This is a description of our system submitted to the CogaLex VI shared task. We use a single multilingual model in order to annotate the four different test sets. The labeled test sets can be found in the folder “annotated test sets”.

Our full paper can be found here: https://www.aclweb.org/anthology/2020.cogalex-1.7/ [1]

The task website is: https://sites.google.com/site/cogalexvisharedtask/

XLM-RoBERTa

Our system makes use of the multilingual language model XLM-RoBERTa [2]. We use the implementation provided by the transformers library, which offers the XLM-RoBERTa model pretained on 100 different languages using CommonCrawl data [3]. A linear layer is added on top of the pooled output in order to allow classification. The model was trained on the training data provided for the CogALex VI shared task, with the training being performed on all three languages at the same time. Hyperparameters were finetuned on the given validation set. The best results were achieved with the following Hyperparameters

Optimizer: AdamW
Learning rate: 2e-5
Epsilon: 1e-8
Epochs: 7
Batch size: 32
Warm up steps: 0
Weight Decay: 0
Model size: Base

Results on Validation Set

With XLMR Model

Chinese

Class	Precision	Recall	F1	Support
SYN	0.862	0.822	0.841	129
HYP	0.938	0.841	0.887	145
ANT	0.877	0.838	0.857	136
Overall	0.894	0.834	0.863

English

Class	Precision	Recall	F1	Support
SYN	0.492	0.463	0.477	259
HYP	0.597	0.473	0.528	292
ANT	0.801	0.497	0.613	308
Overall	0.639	0.478	0.543

German

Class	Precision	Recall	F1	Support
SYN	0.507	0.419	0.459	272
HYP	0.595	0.459	0.518	294
ANT	0.673	0.411	0.510	275
Overall	0.592	0.430	0.496

References

[1] Wachowiak, L., Lang, C., Heinisch, B., & Gromann, D. (2020, December). CogALex-VI Shared Task: Transrelation-A Robust Multilingual Language Model for Multilingual Relation Identification. In Proceedings of the Workshop on the Cognitive Aspects of the Lexicon (pp. 59-64).

[2] Conneau, A., Khandelwal, K., Goyal, N., Chaudhary, V., Wenzek, G., Guzmán, F., ... & Stoyanov, V. (2019). Unsupervised cross-lingual representation learning at scale. arXiv preprint arXiv:1911.02116.

[3] Wolf, T., Debut, L., Sanh, V., Chaumond, J., Delangue, C., Moi, A., ... & Brew, J. (2019). HuggingFace's Transformers: State-of-the-art Natural Language Processing. ArXiv, arXiv-1910.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
annotated test sets		annotated test sets
data		data
CogALex_XLM_RoBERTa_Updated_Task_Version.ipynb		CogALex_XLM_RoBERTa_Updated_Task_Version.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

annotated test sets

annotated test sets

data

data

CogALex_XLM_RoBERTa_Updated_Task_Version.ipynb

CogALex_XLM_RoBERTa_Updated_Task_Version.ipynb

README.md

README.md

Repository files navigation

Transrelation - CogALex VI Submission

XLM-RoBERTa

Results on Validation Set

References

About

Releases

Packages

Contributors 3

Languages

Text2TCS/Transrelation

Folders and files

Latest commit

History

Repository files navigation

Transrelation - CogALex VI Submission

XLM-RoBERTa

Results on Validation Set

References

About

Topics

Resources

Stars

Watchers

Forks

Languages