Skip to content

Using Spark In Python For Movie Similarities With Jaccard Index

License

Notifications You must be signed in to change notification settings

nikoshet/pyspark-movie-similarities

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Use Of PySpark For Movie Similarities With Jaccard Index

Dataset

The dataset is the MovieLens 100K Dataset that can be found here. It includes 100,000 ratings from 1000 users on 1700 movies and was released 4/1998. The needed files for the app are uploaded with changed name.

Requirements

  • PySpark

Example Usage

To find similar movies with 'Star Wars (1977)' movie:

spark-submit movie-similarites.py 50

About

Using Spark In Python For Movie Similarities With Jaccard Index

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages