josep11/scraping-uoc-with-apify

Scraping UOC courses with Apify

Intro

This project fetches and scrapes the syllabus of any degree at UOC. Behind the scenes it uses Apify with Node.js and comprises two processes:

  1. Spawn the PuppeteerCrawler to scrape all the desired data about the courses and store it in datasets.
  2. Read the datasets and transform them from JSON into xlsx format.
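
The first half of step 2 (reading the datasets and shaping them into spreadsheet rows) could look roughly like the sketch below. It is not the project's actual code: it assumes the datasets sit in the Apify SDK's local storage layout, with one JSON file per item in a directory such as apify_storage/datasets/default, and the function names `readDatasetItems` and `toRows` are hypothetical. Writing the rows to an .xlsx file needs a spreadsheet library and is left out.

```javascript
const fs = require("fs");
const path = require("path");

// Read every *.json item file in a dataset directory, in file-name order.
// Assumption: one JSON object per file, as in Apify's local dataset storage.
function readDatasetItems(datasetDir) {
  return fs
    .readdirSync(datasetDir)
    .filter((name) => name.endsWith(".json"))
    .sort()
    .map((name) =>
      JSON.parse(fs.readFileSync(path.join(datasetDir, name), "utf8"))
    );
}

// Turn a list of objects into a header row followed by one row per item.
// Columns are the union of all keys, in order of first appearance;
// missing values become empty strings so every row has the same width.
function toRows(items) {
  const columns = [...new Set(items.flatMap((item) => Object.keys(item)))];
  const rows = items.map((item) => columns.map((col) => item[col] ?? ""));
  return [columns, ...rows];
}
```

Calling `toRows(readDatasetItems("apify_storage/datasets/default"))` would then give an array of arrays that most spreadsheet libraries can write as a sheet. Taking the union of keys keeps the columns aligned even when some courses lack a field.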

Goal

The goal is to extract the subjects of the syllabus and their evaluation mode into an xlsx file, so that the process does not have to be done manually.

Execute

rm -rf apify_storage/request_queues/*
node index.js
node transform-dataset-to-xlsx.js
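
The first command empties the request queue directory before crawling, presumably so that URLs handled in an earlier run are not skipped. For environments without `rm` (for example a plain Windows shell), the same cleanup could be done from Node.js. This is a minimal sketch: the helper name `clearDirectory` is hypothetical, and `fs.rmSync` requires Node.js 14.14 or later.

```javascript
const fs = require("fs");
const path = require("path");

// Delete everything inside `dir` but keep the directory itself.
// Similar to `rm -rf dir/*`, except that hidden entries are removed too.
// Returns the number of top-level entries that were removed.
function clearDirectory(dir) {
  if (!fs.existsSync(dir)) return 0; // nothing to clean on a first run
  const entries = fs.readdirSync(dir);
  for (const entry of entries) {
    fs.rmSync(path.join(dir, entry), { recursive: true, force: true });
  }
  return entries.length;
}
```

Running `clearDirectory("apify_storage/request_queues")` before `node index.js` would replace the `rm -rf` line.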

TODO
