Skip to content
#

character-ngrams

Here are 5 public repositories matching this topic...

Language: All
Filter by language

This repository presents an approach to predict the language in which a document is written. In particular, the proposed approach transforms a text into character n-gram features and uses them to support the predictive power of a machine-learned classifier. Experimental results show that it is capable of identifying 14 languages with high accura…

  • Updated Aug 12, 2020
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the character-ngrams topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the character-ngrams topic, visit your repo's landing page and select "manage topics."

Learn more