Skip to content

t-systems-on-site-services-gmbh/fasttext-on-wikipedia

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 

Repository files navigation

fastText on Wikipedia

In this repository we publish several fastText embeddings trained on Wikipedia data. Used software and data:

commands

  • fasttext skipgram -input data/dewiki-20220201-clean.txt -output de-wikipedia-skipgram-64 -dim 64
  • fasttext skipgram -input data/ft-train-de/train.txt -output de-wikipedia-skipgram-64 -dim 64 -autotune-validation data/ft-train-de/val.txt -autotune-duration 172800
  • fasttext skipgram -input data/ft-train-en/train.txt -output en-wikipedia-skipgram-64 -dim 64 -autotune-validation data/ft-train-en/val.txt -autotune-duration 345600

About

fastText trained on Wikipedia text corpus

Resources

Stars

Watchers

Forks

Packages

No packages published