nlprocby/Translator

 
 

Translator

Russian-Belarusian neural translator

The data is part of my bachelor thesis on neural machine translation for the Russian-Belarusian language pair.

Repository

The repo consists of:

  • 429k aligned sentence pairs (under Data/AlignedData), split into 10 batches

  • chunks to align (under Data/ChunksToAlign)

  • Data/TabbedCorpusMiddleSent.txt, a sample of 65966 sentences of at most 80 characters each, handy for training a model on only a subset of the data

  • neural network code
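To get started with the sample file, here is a minimal sketch of a loader. It rests on assumptions this README does not confirm: that each line of Data/TabbedCorpusMiddleSent.txt holds one sentence pair separated by a single tab (suggested only by the file name), and that the file is UTF-8 encoded. The function names `parse_tabbed_lines` and `load_sample` are hypothetical and are not part of the repository code.

```python
from pathlib import Path
from typing import Iterable, List, Tuple

MAX_CHARS = 80  # sentence length limit stated in the README


def parse_tabbed_lines(
    lines: Iterable[str], max_chars: int = MAX_CHARS
) -> List[Tuple[str, str]]:
    """Turn 'left<TAB>right' lines into (left, right) sentence pairs.

    Assumption: one pair per line, two tab-separated columns.
    Which column is Russian and which is Belarusian is not stated
    in the README, so check a few lines before training.
    """
    pairs = []
    for line in lines:
        parts = line.rstrip("\r\n").split("\t")
        if len(parts) != 2:
            continue  # skip lines that do not have exactly two columns
        left, right = parts[0].strip(), parts[1].strip()
        if not left or not right:
            continue  # skip pairs with an empty side
        if len(left) > max_chars or len(right) > max_chars:
            continue  # enforce the length limit on both sides
        pairs.append((left, right))
    return pairs


def load_sample(
    path: str = "Data/TabbedCorpusMiddleSent.txt",
) -> List[Tuple[str, str]]:
    """Read the sample corpus from disk (UTF-8 is an assumption)."""
    with Path(path).open(encoding="utf-8") as handle:
        return parse_tabbed_lines(handle)
```

Calling `load_sample()` from the repository root should return a list of sentence pairs ready to be split into training and validation sets. If the file uses a different layout, only `parse_tabbed_lines` needs to change.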

Data source

? The main source of the data (web-pages,..)

Collection

? How the data was collected

This is an open-source project, and the data can be used freely. Any reviews are more than welcome.


Author: Tsimafei Prakapenka
