Skip to content

The winning system of the Text2TCS project team submitted to the CogaLex VI shared task.

Notifications You must be signed in to change notification settings

Text2TCS/Transrelation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Transrelation - CogALex VI Submission

This is a description of our system submitted to the CogaLex VI shared task. We use a single multilingual model in order to annotate the four different test sets. The labeled test sets can be found in the folder “annotated test sets”.

Our full paper can be found here: https://www.aclweb.org/anthology/2020.cogalex-1.7/ [1]

The task website is: https://sites.google.com/site/cogalexvisharedtask/

XLM-RoBERTa

Our system makes use of the multilingual language model XLM-RoBERTa [2]. We use the implementation provided by the transformers library, which offers the XLM-RoBERTa model pretained on 100 different languages using CommonCrawl data [3]. A linear layer is added on top of the pooled output in order to allow classification. The model was trained on the training data provided for the CogALex VI shared task, with the training being performed on all three languages at the same time. Hyperparameters were finetuned on the given validation set. The best results were achieved with the following Hyperparameters

  • Optimizer: AdamW
  • Learning rate: 2e-5
  • Epsilon: 1e-8
  • Epochs: 7
  • Batch size: 32
  • Warm up steps: 0
  • Weight Decay: 0
  • Model size: Base

Results on Validation Set

With XLMR Model

Chinese

Class Precision Recall F1 Support
SYN 0.862 0.822 0.841 129
HYP 0.938 0.841 0.887 145
ANT 0.877 0.838 0.857 136
Overall 0.894 0.834 0.863

English

Class Precision Recall F1 Support
SYN 0.492 0.463 0.477 259
HYP 0.597 0.473 0.528 292
ANT 0.801 0.497 0.613 308
Overall 0.639 0.478 0.543

German

Class Precision Recall F1 Support
SYN 0.507 0.419 0.459 272
HYP 0.595 0.459 0.518 294
ANT 0.673 0.411 0.510 275
Overall 0.592 0.430 0.496

References

[1] Wachowiak, L., Lang, C., Heinisch, B., & Gromann, D. (2020, December). CogALex-VI Shared Task: Transrelation-A Robust Multilingual Language Model for Multilingual Relation Identification. In Proceedings of the Workshop on the Cognitive Aspects of the Lexicon (pp. 59-64).

[2] Conneau, A., Khandelwal, K., Goyal, N., Chaudhary, V., Wenzek, G., Guzmán, F., ... & Stoyanov, V. (2019). Unsupervised cross-lingual representation learning at scale. arXiv preprint arXiv:1911.02116.

[3] Wolf, T., Debut, L., Sanh, V., Chaumond, J., Delangue, C., Moi, A., ... & Brew, J. (2019). HuggingFace's Transformers: State-of-the-art Natural Language Processing. ArXiv, arXiv-1910.

About

The winning system of the Text2TCS project team submitted to the CogaLex VI shared task.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published