Crawl entire site from sitemap when available (faster than recursive) #346

Open
antoniancu opened this issue Oct 6, 2021 · 2 comments
Labels
enhancement New feature or request

Comments

@antoniancu

I suspect checking an entire site would be faster if the link checker could run on the set of pages listed in the sitemap, when one is available, instead of crawling recursively.

One complexity I anticipate is very large sites with paginated sitemaps: sitemap.xml?page=1, sitemap.xml?page=2, etc.
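
To make the pagination concern concrete, here is a minimal Python sketch of walking `sitemap.xml?page=1`, `sitemap.xml?page=2`, and so on until a page is missing or empty. The names `collect_paginated`, `page_urls`, and `fetch`, the `https://example.com` base URL, and the `max_pages` cap are all assumptions for illustration; `fetch` stands in for whatever HTTP client is used and is expected to return the XML text, or `None` when the page does not exist. The `?page=N` scheme is taken from the example above; sites that split their sitemap with a standard sitemap index file would need a different walk.

```python
import xml.etree.ElementTree as ET
from typing import Callable, List, Optional

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def page_urls(xml_text: str) -> List[str]:
    """Return the text of every <loc> element in one sitemap document."""
    root = ET.fromstring(xml_text)
    return [loc.text.strip() for loc in root.iter(f"{SITEMAP_NS}loc") if loc.text]


def collect_paginated(
    fetch: Callable[[str], Optional[str]],  # hypothetical fetcher: URL -> XML text or None
    base: str = "https://example.com/sitemap.xml",  # placeholder URL
    max_pages: int = 1000,  # assumed safety cap against endless pagination
) -> List[str]:
    """Walk ?page=1, ?page=2, ... until a page is missing or lists no URLs."""
    collected: List[str] = []
    for page in range(1, max_pages + 1):
        xml_text = fetch(f"{base}?page={page}")
        if xml_text is None:
            break
        urls = page_urls(xml_text)
        if not urls:
            break
        collected.extend(urls)
    return collected
```

Injecting `fetch` keeps the loop independent of any HTTP library and lets it be exercised against canned XML.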

@antoniancu antoniancu changed the title Crawl entire site from sitemap (faster than recursive) Crawl entire site from sitemap when available (faster than recursive ) Oct 7, 2021
@antoniancu
Author

I solved this by passing an array of pages from sitemapper. Still, it would be neat to have this logic built in.
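
The general shape of that workaround can be sketched in Python as follows. This is not the sitemapper API or the link checker's own interface: `sitemap_pages`, `check_pages`, and `check_page` are hypothetical names, and `check_page` stands in for whatever single-page check the link checker exposes with recursion turned off.

```python
import xml.etree.ElementTree as ET
from typing import Callable, Dict, Iterable, List

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def sitemap_pages(xml_text: str) -> List[str]:
    """List the <loc> URLs of a sitemap, de-duplicated, in document order."""
    root = ET.fromstring(xml_text)
    seen = set()
    pages: List[str] = []
    for loc in root.iter(f"{SITEMAP_NS}loc"):
        url = (loc.text or "").strip()
        if url and url not in seen:
            seen.add(url)
            pages.append(url)
    return pages


def check_pages(
    pages: Iterable[str],
    check_page: Callable[[str], bool],  # hypothetical stand-in for the link checker
) -> Dict[str, bool]:
    """Check each listed page exactly once, with no recursive crawl."""
    return {page: check_page(page) for page in pages}
```

Because the page list comes from the sitemap, the checker never has to discover pages by following links, which is where the expected speedup would come from.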

@JustinBeckwith JustinBeckwith added the enhancement New feature or request label Oct 7, 2021
@JustinBeckwith
Owner

Interesting! I have to go do some learnin' about sitemaps again.
