Scrapy-News

Description

This crawler will crawl bbc.com/news website and store the details in mongodb hosted using compose.io
WebServices is included and hosted using amazon ec2

Usage

To crawl

scrapy crawl bbcspider

List all news

curl http://ec2-52-221-187-243.ap-southeast-1.compute.amazonaws.com/news

Search specific news

curl http://ec2-52-221-187-243.ap-southeast-1.compute.amazonaws.com/news/<keyword>

TODO

Limitation

Time!

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
bbcspider		bbcspider
web		web
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
scrapy.cfg		scrapy.cfg

Provide feedback