Skip to content
This repository has been archived by the owner on Mar 21, 2022. It is now read-only.

andorfermichael/imdb-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

46 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

IMDb Scrapper

IMDb Scrapper stores a given number of movies from the Internet Movie Database as JSON.

Requirements

  • python 2.7

Installation & Setup

Download and install required libs and data:

pip install imdbpie

Usage API-Scrapper

Collect and store the first 10,000 movies from the IMDb:

python imdb-api-scrapper.py 10000 save

Collect and store the movies 15,000 to 25,000 from the IMDb:

python imdb-api-scrapper.py --start 15000 10000 save

Collect and store the movies 30,000 to 35,000 from the IMDb and append them to the existing data in the movies.json:

python imdb-api-scrapper.py --start 30000 5000 --overwrite no save

Collect and store the movies and episodes 1 to 2,000 from the IMDb:

python imdb-api-scrapper.py 2000 --episodes yes save

Usage Page-Scrapper

Collect and store the most voted feature movies from 1950 to the current year from the IMDb:

python imdb-page-scrapper.py save

Collect and store the first 80,000 most voted feature movies from 1950 to the current year from the IMDb:

python imdb-page-scrapper.py 80000 save