An application consisting of two services, Scraper and Aggregator, with MongoDB as the document store.
Note: The two services are containerized into separate images.
- Scraper Service - Scrapes an Amazon product page given its URL.
- Fetches details such as Name, Image URL, Description, and Price.
- Uses the Colly framework for scraping.
- Calls the Aggregator Service to persist the scraped data in the document store.
- Aggregator Service - Takes in the payload from the Scraper Service and updates the database.
- Writes/updates the payload in the database, which is MongoDB in our case.
- Sends back a status with details such as the URL and ID.
Note: Developed on Windows 10 x64 + WSL2 (Ubuntu 20.04) using Docker Desktop v4.9.1.
Software | Version |
---|---|
Go | 1.13 |
Docker | 20.10.16, build aa7e414 |
MongoDB | 4.4.2 |
S.No. | Port | Method | URL | REQ BODY | Info |
---|---|---|---|---|---|
1 | 8080 | POST | localhost:8080/scraper | Amazon page URL | Colly visits the given URL and scrapes the required data. |
2 | 8081 | POST | localhost:8081/aggregator | Product details in JSON format | Inserts or updates the record in the database. |
3 | 8081 | GET | localhost:8081/aggregator | NA | Returns all records from the collection. |
```shell
git clone https://github.com/jerinthomas1404/Amazon-Scraper-with-GoColly.git
docker-compose build
docker-compose up -d
```
- Using Postman or another HTTP client, send a POST request to the scraper API with the page URL in the body as JSON.
- Sample URLs:
https://www.amazon.com/Controller-Compatible-Programming-Vibration-PlayStation-4/dp/B08L7T1VC7/ref=sr_1_2_sspa?th=1