A small scraping script written in Python that helps you collect and merge all questions for a subject on sanfoundry.com into an HTML document with additional data.

raprocks/sanfoundry-scraper

sanfoundry-scraper

Extracts all the MCQs of a subject, given the link to that subject's page.

Get the code using the Download ZIP button or by following this link, then run main.py with the following command:

python main.py

or

python main.py --url {url-of-quiz}

This scraper also supports multithreading:

python main.py --url {url-of-quiz} --thread --workers 15

Here, workers is the number of threads. The default is 5 workers.

You can change it by passing --workers {thread-choice}.

Using more workers can cause instability.
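As a rough illustration of what the --workers setting controls, here is a minimal sketch of fetching several pages with a thread pool. It is not the repository's actual code: `fetch_all` and `fake_fetch` are hypothetical names, and `fake_fetch` stands in for a real HTTP request so the example runs offline.

```python
from concurrent.futures import ThreadPoolExecutor


def fetch_all(urls, fetch, workers=5):
    """Call fetch(url) for every URL on a pool of `workers` threads.

    pool.map returns results in the same order as the input URLs,
    even though the calls themselves run concurrently.
    """
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(fetch, urls))


def fake_fetch(url):
    # Hypothetical stand-in for downloading a page (assumption, not from the repo).
    return f"<html>{url}</html>"


pages = fetch_all(["page-1", "page-2", "page-3"], fake_fetch, workers=15)
```

Each worker is one thread, so a larger value means more requests in flight at the same time, which is one plausible reason a high worker count can become unstable.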

Enter the URL of the subject (for example, "https://www.sanfoundry.com/1000-object-oriented-programming-oops-questions-answers/") and run the script. It saves all the MCQs of that subject to a file in a folder named Saved_MCQs.

If you are using the program for the first time, install the requirements first:

pip install -r requirements.txt

Then run

python main.py --help

to list the available options.
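For reference, the flags used in the commands above (--url, --thread, --workers) could be parsed with argparse roughly as follows. This is a hypothetical sketch and not the repository's actual main.py: the `build_parser` name and the help strings are assumptions, and only the flag names and the default of 5 workers come from this README.

```python
import argparse


def build_parser():
    """Build a parser for the flags this README mentions (illustrative only)."""
    parser = argparse.ArgumentParser(
        description="Collect the MCQs of a Sanfoundry subject."
    )
    # Assumption: when --url is omitted, the script asks for the URL instead.
    parser.add_argument("--url", help="URL of the subject page")
    parser.add_argument(
        "--thread", action="store_true", help="fetch pages with multiple threads"
    )
    parser.add_argument(
        "--workers", type=int, default=5, help="number of threads (default: 5)"
    )
    return parser


args = build_parser().parse_args(
    ["--url", "https://www.sanfoundry.com/example/", "--thread", "--workers", "15"]
)
```

argparse adds the --help flag automatically, so a parser built this way prints a usage summary without any extra code.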

LOG: This is getting traction hehe so gonna make it better to use and better at output

  1. PDF Output instead of HTML or maybe optional idk
  2. good docs 💀
  3. plain text output format with just questions and answers in txt. easier to "study" xD.
