Skip to content

nikhilkumawat03/Extracting-Relevant-Document

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Big-Data

Projects based on Big Data.

This project is based on Document analysis where large number of corpus is given and the relevant document should be extracted with the help of Term Frequency- Inverse Document Frequency(TF-IDF) which is known as feature extration.

This project uses Hadoop Map Reduce, Spark RDD and Spark SQL.

All these programs are available in these file with explanation.

Releases

No releases published

Packages

No packages published

Languages