A simple tool created to make life easier for people applying for GSOC. It extracts GSOC data from previous years and lets you search for the organisations best suited to you.

ronak66/GSOC_Data_Extractor

GSOC_Data_Extractor

It is a simple tool created to make life easier for people applying for GSOC.
Check out the website here: https://gsoc-data-extractor.herokuapp.com

Features

1. Compares years and lists the GSOC organizations common to all of them
2. Lists the GSOC organizations that use a given technology
3. Lists the organizations that use a given technology and appear almost every year  
   (first and second features combined)  
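The three features can be sketched roughly as below. This is an illustration only, not the repository's code: the `orgs_by_year` sample data, the function names, and the 0.8 similarity threshold are all hypothetical, and `difflib` from the standard library stands in for FuzzyWuzzy so the sketch runs without extra installs. The third function requires an organization to appear in every listed year, which is stricter than "almost every year".

```python
from difflib import SequenceMatcher

# Hypothetical sample data: year -> {organization name: [technologies]}
orgs_by_year = {
    2016: {"Org A": ["python", "django"], "Org B": ["c++"], "Org C": ["java"]},
    2017: {"Org A": ["python"], "Org B": ["c++", "qt"]},
    2018: {"Org A": ["python", "flask"], "Org C": ["javascript"]},
}


def common_orgs(years):
    """Feature 1: organizations present in every one of the given years."""
    name_sets = [set(orgs_by_year[y]) for y in years]
    return sorted(set.intersection(*name_sets))


def uses_technology(year, tech, threshold=0.8):
    """Feature 2: organizations in `year` listing a technology close to `tech`."""
    def similar(a, b):
        # Case-insensitive similarity ratio in [0, 1]; threshold is an assumption.
        return SequenceMatcher(None, a.lower(), b.lower()).ratio() >= threshold

    return sorted(
        name
        for name, techs in orgs_by_year[year].items()
        if any(similar(tech, t) for t in techs)
    )


def recurring_orgs_using(tech, years):
    """Feature 3: organizations in all `years` that match `tech` in any of them."""
    users = set()
    for y in years:
        users.update(uses_technology(y, tech))
    return sorted(users.intersection(common_orgs(years)))
```

Fuzzy matching lets a query such as `Python` match an entry stored as `python`, and tolerates small spelling differences between years.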

Installation

  1. FuzzyWuzzy Python library (run):
    * $ pip install fuzzywuzzy
    * $ pip install python-Levenshtein
    
  2. BeautifulSoup library (run):
    * $ sudo apt-get install python3-bs4
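A quick way to confirm the installation worked is to check that the modules can be found. The helper below is a hypothetical sketch and not part of the repository. It relies on the import names `fuzzywuzzy`, `Levenshtein` (provided by python-Levenshtein), and `bs4` (provided by python3-bs4).

```python
import importlib.util

# Import names of the dependencies listed above.
REQUIRED = ["fuzzywuzzy", "Levenshtein", "bs4"]


def missing_modules(names):
    """Return the names that cannot be located by the import system."""
    return [n for n in names if importlib.util.find_spec(n) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED)
    if missing:
        print("Missing: " + ", ".join(missing))
    else:
        print("All dependencies found.")
```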
    

How to use

  1. Run cd Scraping\ Code/

  2. Run python gsoc_yearly_data_generator.py

    • Enter the GSOC year you want to search for
    • Wait until the countdown reaches zero
  3. Run python query.py

    • To list the organizations that use a specific technology, enter a query of the form:
      gt <gsoc year you want to search>
      Example: gt 2016

    • To list the organizations common to several years, enter a query of the form:
      co <gsoc year1>,<gsoc year2>,...
      Example: co 2016,2017,2018
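
The two query forms above could be parsed along these lines. This is a minimal sketch under assumptions: `parse_query` is a hypothetical helper, and how `query.py` actually reads its input is not shown in this README.

```python
def parse_query(line):
    """Split a query line into (command, list of years).

    Accepts the two forms described above:
      gt <year>
      co <year1>,<year2>,...
    """
    parts = line.strip().split(maxsplit=1)
    if len(parts) != 2:
        raise ValueError("expected '<command> <years>'")
    command, arg = parts
    if command == "gt":
        years = [int(arg)]
    elif command == "co":
        # Ignore empty items so a trailing comma does not break parsing.
        years = [int(y) for y in arg.split(",") if y.strip()]
    else:
        raise ValueError(f"unknown command: {command!r}")
    return command, years
```

For example, `parse_query("co 2016,2017,2018")` yields the command `co` with the three years as integers, and an unrecognized command raises `ValueError`.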

Screen Shot

Note: Do not manually delete any GSOC_<'year'>_Data.ods file. If you do, also remove the line containing that year from ExtractedData.txt.
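
If a data file has already been deleted by hand, the matching entry can be removed with a small script like the one below. This is a sketch under assumptions: `forget_year` is a hypothetical helper, and the layout of ExtractedData.txt is not documented here, so the code simply drops every line that contains the year as text.

```python
from pathlib import Path


def forget_year(year, index_path="ExtractedData.txt"):
    """Remove every line mentioning `year` from the index file.

    Returns the number of lines removed. Assumes a plain-text file
    in which the year appears literally on its line.
    """
    path = Path(index_path)
    lines = path.read_text().splitlines(keepends=True)
    kept = [ln for ln in lines if str(year) not in ln]
    path.write_text("".join(kept))
    return len(lines) - len(kept)
```

Running `forget_year(2017)` from the directory that holds ExtractedData.txt would drop the 2017 entry, after which the generator script can be run again for that year.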
