smasud98/Practice-Search-Engine


Author: Shatil Masud

This project is a search engine website built with Express.js and MongoDB that lets users search for fruits on a sample dummy website and on all outgoing links connected to it. The search engine uses a web crawler to scrape the data from each page, including its outgoing links, and recursively repeats the process according to a specified selection policy. By combining the elasticlunr library with an implementation of Google's PageRank algorithm, a user can search for a word or a series of words and receive a specified number of ranked results pointing to the web pages where their search terms occur.
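
The crawl-and-index step might look roughly like the sketch below. This is a minimal illustration only: it assumes axios and cheerio for fetching and parsing pages (the project may use a different crawler library), uses a placeholder start URL, and leaves out the MongoDB persistence and PageRank computation that the real crawler performs.

```js
// Minimal sketch of a recursive crawl feeding an elasticlunr index.
// axios/cheerio and the start URL are illustrative assumptions.
const axios = require('axios');
const cheerio = require('cheerio');
const elasticlunr = require('elasticlunr');

const index = elasticlunr(function () {
  this.addField('title');
  this.addField('body');
  this.setRef('url');
});

const visited = new Set();

async function crawl(url, depth) {
  // Selection policy: stop at a depth limit and never revisit a page.
  if (depth === 0 || visited.has(url)) return;
  visited.add(url);

  const { data: html } = await axios.get(url);
  const $ = cheerio.load(html);

  // Index the page text so it can be searched later.
  index.addDoc({
    url,
    title: $('title').text(),
    body: $('body').text(),
  });

  // Follow every outgoing link and repeat the process recursively.
  const links = $('a[href]')
    .map((i, a) => new URL($(a).attr('href'), url).href)
    .get();
  for (const link of links) {
    await crawl(link, depth - 1);
  }
}

// Placeholder start URL and depth; the real crawl targets the fruit website.
crawl('https://example.com/fruits/start.html', 3)
  .then(() => console.log('Done'));
```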

  1. Install packages with 'npm install'

  2. To crawl the fruit website, uncomment line 29 in src/app.js, run 'npm start', and wait until 'Done' is logged to the console. (There is a known issue with crawling the personal website.) Once the crawl has finished, comment out line 29 again if you do not want to re-crawl on every start, then search away!

  3. Go to /fruits and search using the UI, or query directly via the URL by entering /query/numberofPages/boost (see the route sketch after this list)

  4. To see the details for a specific page, enter its id in the URL as /fruits/id

  5. To see more details for a specific page, click the button beside the result
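
As a rough illustration of how the search and detail routes in steps 3-5 might be wired up, the sketch below shows an Express server that ranks elasticlunr hits and, when boost is enabled, multiplies in a precomputed PageRank score. The route shapes, parameter names (q, limit, boost), and the pageRanks map are assumptions for illustration, not the project's actual API.

```js
// Rough sketch of the search and detail routes; parameter names and the
// boost formula are illustrative assumptions, not the project's exact API.
const express = require('express');
const elasticlunr = require('elasticlunr');

const app = express();

// In the real project the index and PageRank scores are built by the crawler
// and stored in MongoDB; here they are empty stand-ins.
const index = elasticlunr(function () {
  this.addField('title');
  this.addField('body');
  this.setRef('url');
});
const pageRanks = new Map(); // url -> precomputed PageRank score

// e.g. GET /fruits?q=banana&limit=10&boost=true
app.get('/fruits', (req, res) => {
  const { q, limit = 10, boost = 'false' } = req.query;

  const results = index.search(q, {})
    .map((hit) => ({
      url: hit.ref,
      // When boost is on, weight the text-match score by the page's PageRank.
      score: boost === 'true'
        ? hit.score * (pageRanks.get(hit.ref) || 0)
        : hit.score,
    }))
    .sort((a, b) => b.score - a.score)
    .slice(0, Number(limit));

  res.json(results);
});

// e.g. GET /fruits/some-page-id for the details of a single crawled page
app.get('/fruits/:id', (req, res) => {
  // Look up the page document (title, links, PageRank, ...) by id in MongoDB
  // and render or return it; echoed back here as a placeholder.
  res.json({ id: req.params.id });
});

app.listen(3000, () => console.log('Search engine listening on port 3000'));
```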
