Skip to content

Newscrawler is a python console tool designed to independently crawl multiple domains, recognise news-articles and download their html-source.

License

Notifications You must be signed in to change notification settings

JBH168/Newscrawler

Repository files navigation

Newscrawler

Newscrawler is a software developed by the CColon-team in the context of the lecture "Softwareprojekt" by the University of Konstanz in the summer term 2016.

The team consisted of Jonathan Hassler (@JBH168), Franziska Schlor (@franziscl), Matt Sharinghousen (@msharing), Claudio Spener (@claudeeee) and Moritz Bock (@movabo).

Its goal is to independently crawl multiple domains, recognise news-articles and download their html-source. Furthermore, it saves meta data to a database and is able to keep a downloaded collection of news-articles up to date.

For further information check out the wiki!

About

Newscrawler is a python console tool designed to independently crawl multiple domains, recognise news-articles and download their html-source.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages