robots.txt relies purely on voluntary compliance, so there's no realistic guarantee that most bots, let alone all of them, will adhere to it. That leaves blocking their requests on our end as the main option. In a perfect world this responsibility would fall on the organizations running said bots, but I digress.
Requirements
Blocks requests from most known bots, and where possible from unknown but misbehaving ones.
Does not block requests from bots that behave themselves, e.g. the Internet Archive's crawler, community tools to capture page snapshots for gameplay purposes, etc.
Solutions
A combination of methods may be needed, such as Cloudflare rules, robots.txt rules (for those that obey them), and Drupal/PHP blocking via contrib and/or custom Drupal modules.
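As a rough illustration of the application-level piece, here is a minimal sketch of the allow/block decision, written in Python for brevity rather than as an actual Drupal module. The function name `should_block` and both token lists are hypothetical; the real lists would need to come from our own logs and from whichever bots we decide to trust. Note that this only catches bots that identify themselves honestly in the `User-Agent` header.

```python
import re

# Hypothetical example lists: replace with tokens seen in our own logs.
ALLOWED_BOTS = ("archive.org_bot",)               # well-behaved, never block
BLOCKED_BOTS = ("GPTBot", "CCBot", "Bytespider")  # illustrative only


def _compile(tokens):
    """Build one case-insensitive pattern matching any of the tokens."""
    return re.compile("|".join(re.escape(t) for t in tokens), re.IGNORECASE)


_ALLOWED = _compile(ALLOWED_BOTS)
_BLOCKED = _compile(BLOCKED_BOTS)


def should_block(user_agent: str) -> bool:
    """Return True if a request with this User-Agent should be refused."""
    if not user_agent:
        # Assumption: empty user agents are handled by some other rule.
        return False
    if _ALLOWED.search(user_agent):
        # The allowlist wins so that trusted crawlers are never caught.
        return False
    return bool(_BLOCKED.search(user_agent))
```

Checking the allowlist first is a deliberate choice here: it keeps a broad block pattern from accidentally catching a crawler we want to keep. The same ordering could be expressed as a Cloudflare rule or in a Drupal request subscriber.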