Sitemap for Large sites with filter date/year #649

Open
advwebin opened this issue Oct 29, 2023 · 1 comment
Labels
[Component] Sitemap · [Type] Feature (Something new we need to write from the ground up.)

Comments

@advwebin

Hello. The sitemap feature could be optimized further: it currently takes over 30 seconds to generate.

For large websites, such as news sites that host a lot of old content, sitemap generation can consume a lot of processing power and waste resources.

I have seen a date-based approach at metro.co.uk/sitemap.xml

They developed https://github.com/Automattic/msm-sitemap in collaboration with Automattic.

This approach addresses two issues.

  1. Your crawl budget is optimized, because crawlers only need to fetch the newer or updated sitemaps.
  2. Generating new sitemaps is faster, because only the sitemap for the newest date, month, or year (depending on your setup) is added.
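
To make the two points above concrete, here is a minimal sketch of date-partitioned sitemaps. It is not how msm-sitemap or TSF is implemented; the post fields (`url`, `published`, `modified`) and both function names are assumptions made up for illustration. Each publication day gets its own sitemap entry with a last-modified time, so a crawler that last visited at `since` only needs the days that changed afterwards.

```python
from collections import defaultdict
from datetime import date, datetime


def build_daily_index(posts):
    """Group posts by publication day and track each day's latest modification."""
    days = defaultdict(list)
    for post in posts:
        days[post["published"].date()].append(post)

    index = {}
    for day, day_posts in days.items():
        index[day] = {
            # URLs that belong in this day's sitemap.
            "urls": sorted(p["url"] for p in day_posts),
            # The day's sitemap counts as changed when any of its posts changed.
            "lastmod": max(p["modified"] for p in day_posts),
        }
    return index


def sitemaps_to_recrawl(index, since):
    """Return the days whose sitemap changed after the crawler's last visit."""
    return sorted(day for day, entry in index.items() if entry["lastmod"] > since)


# Hypothetical sample data: one old post was edited recently.
posts = [
    {"url": "https://example.com/a", "published": datetime(2020, 1, 5, 9),
     "modified": datetime(2020, 1, 5, 9)},
    {"url": "https://example.com/b", "published": datetime(2020, 1, 5, 15),
     "modified": datetime(2023, 10, 29, 8)},
    {"url": "https://example.com/c", "published": datetime(2023, 10, 28, 12),
     "modified": datetime(2023, 10, 28, 12)},
]
index = build_daily_index(posts)
changed = sitemaps_to_recrawl(index, datetime(2023, 10, 29, 0))
```

Adding a new post only touches the entry for its own day, which is why generation stays cheap no matter how large the archive grows.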
advwebin changed the title from "Sitemap for Large sites by year" to "Sitemap for Large sites with filter date/year" on Oct 29, 2023
@sybrew
Owner

sybrew commented Nov 5, 2023

Thank you for the info and insights.

I see this as the only viable alternative to TSF's "optimized" sitemap, and it's well-suited for news sites.

Because the pagination is logically indexed by publication date, we can quickly find, clear, and ping old sitemaps individually when an old post is updated.
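
A minimal sketch of that lookup, assuming rendered sitemap pages are cached under their publication date. The class and method names are hypothetical and do not reflect TSF's actual API. Because the key is the date itself, an update to an old post clears exactly one cached page and queues one ping, without scanning the rest of the archive.

```python
from datetime import date


class DatedSitemapCache:
    """Hypothetical cache of rendered sitemap pages, keyed by publication date."""

    def __init__(self):
        self._pages = {}          # date -> rendered XML string
        self.pending_pings = []   # dates whose sitemap should be re-announced

    def store(self, day, xml):
        self._pages[day] = xml

    def get(self, day):
        return self._pages.get(day)

    def on_post_updated(self, published_day):
        """Clear only the page for the post's publication date and queue a ping.

        Returns True if a cached page was actually removed.
        """
        removed = self._pages.pop(published_day, None) is not None
        self.pending_pings.append(published_day)
        return removed


cache = DatedSitemapCache()
cache.store(date(2015, 3, 1), "<urlset>old</urlset>")
cache.store(date(2023, 11, 5), "<urlset>new</urlset>")

# An old post published on 2015-03-01 is edited: only that page is dropped.
was_cached = cache.on_post_updated(date(2015, 3, 1))
```

The cleared page would be regenerated on its next request, while every other date's page stays cached.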

Of course, this serves a niche sector, but I'm sure many people will appreciate it even if they don't need it.

sybrew added the [Type] Feature and [Component] Sitemap labels on Nov 5, 2023