Goodreads User Scraper

Scrape Goodreads User Data: Profile, Book Shelves, Books, Authors

Usage

Using pip:

pip install goodreads-user-scraper
goodreads-user-scraper --user_id <your id> --output_dir goodreads-data

Using pipx:

pipx run goodreads-user-scraper --user_id <your id> --output_dir goodreads-data

Arguments

`--user_id`

Description: The user whose data should be scraped. Find your user id using these directions.
Required: Yes

`--output_dir`

Description: The directory where all scraped data will be output.
Required: No
Default: goodreads-data

`--skip_user_info`

Description: Whether the script should skip scraping user information.
Required: No
Default: False

`--skip_shelves`

Description: Whether the script should skip scraping shelves.
Required: No
Default: False

`--skip_authors`

Description: Whether the script should skip scraping authors.
Required: No
Default: False

Troubleshooting

Ensure that your profile is viewable by anyone:

Navigate to the Goodreads Account Settings page
Click on the Settings tab
In the Privacy section, under the Who Can View My Profile question, select "anyone"

Development

Clone the GitHub repository

git clone https://github.com/YashTotale/goodreads-user-scraper.git

Run the install script
```
sh scripts/install.sh
```
Make changes
Run the test script
```
sh scripts/test.sh
```

Publishing

Create .env

TWINE_USERNAME=<foo>
TWINE_PASSWORD=<bar>

Run the publish script

sh scripts/publish.sh <patch|minor|major>

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
.github		.github
.idea		.idea
.vscode		.vscode
scraper		scraper
scripts		scripts
static		static
.bumpversion.cfg		.bumpversion.cfg
.editorconfig		.editorconfig
.env.example		.env.example
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
AUTHORS		AUTHORS
CODEOWNERS		CODEOWNERS
LICENSE.md		LICENSE.md
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

License

YashTotale/goodreads-user-scraper

Folders and files

Latest commit

History

Repository files navigation

Goodreads User Scraper

Contents

Usage

Arguments

--user_id

--output_dir

--skip_user_info

--skip_shelves

--skip_authors

Troubleshooting

Development

Publishing

About

Topics

Resources

License

Stars

Watchers

Forks

Languages

`--user_id`

`--output_dir`

`--skip_user_info`

`--skip_shelves`

`--skip_authors`