Skip to content

A java project using MapReduce, Jtidy and Lucene to transform HTML to usable text for Hadoop

Notifications You must be signed in to change notification settings

cberez/Hadoop-html-to-text

Repository files navigation

About

A java project using MapReduce, Jtidy and Lucene to transform HTML to usable text for Hadoop

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published