Skip to content

62mkv/estonian-forms

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

42 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Things to consider

  1. Some lexemes have multiple paradigms. This is defined as combination of "declination type" + "option_number" in the fmsynth output
  2. Currently, option number is ignored during import, which is bad. it has to be added, with a caveat:
    1. For some lexemes, fmsynth will report same paradigm twice (see "saks"). Duplicated paradigms must be ignored while imported.
  3. From the article, "initial forms" could be parsed to compare with fmsynth-provided paradigms, and non-compliant paradigms must be ignored.
  4. This should provide for much safer articleForm sets.

About

Simple one-off script to publish Estonian language lexemes at Wikidata

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages