Verified bots

How Botect recognizes legitimate search and AI crawlers and allows them by default — so good bots are never blocked for looking automated.

Not every bot is hostile. Search engines, AI crawlers, uptime monitors, and link previewers are bots you want visiting your site. Botect maintains a verified-bot allowlist so these are recognized and allowed — never caught by enforcement meant for abusive automation.

How verification works

A session is marked verified_bot only on actual verification — never on a user-agent claim alone (anyone can send Googlebot in their UA). When a session matches the allowlist, the combiner short-circuits:

  • verified_bot → true
  • verified_bot_category → the bot's category (e.g. Search, AI Crawler, AI Search)
  • band → verified
  • bot banding is skipped entirely
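
The short-circuit above can be sketched roughly as follows. This is an illustrative model only, not Botect's implementation: the `Signals` class, the `combine` function, the `likely_bot` / `likely_human` band names, and the threshold of 30 are all hypothetical placeholders. Only the three field names and the `verified` band come from this page.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Signals:
    verified_bot: bool = False
    verified_bot_category: Optional[str] = None
    band: Optional[str] = None


def combine(allowlist_category: Optional[str], score: int) -> Signals:
    """Hypothetical combiner.

    allowlist_category is the category of a *verified* allowlist match
    (for example "Search"), or None when the session did not verify.
    """
    if allowlist_category is not None:
        # Short-circuit: set the verified fields and skip bot banding.
        return Signals(
            verified_bot=True,
            verified_bot_category=allowlist_category,
            band="verified",
        )
    # Placeholder banding for unverified sessions (names and threshold assumed).
    band = "likely_bot" if score < 30 else "likely_human"
    return Signals(verified_bot=False, verified_bot_category=None, band=band)
```

Note that the score plays no part in the verified branch: a verified crawler with a very low score still ends up in the `verified` band.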

Allowing verified bots

The allow_verified toggle is on by default. With it on, any verified bot resolves to allow as the very first step of verdict resolution — before rules and other toggles. This is almost always what you want.

{ "allow_verified": true }

Verified-bot handling runs first. A block_definite toggle or a low-score rule won't catch a verified crawler while allow_verified is on.
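
To make the ordering concrete, here is a minimal sketch of a verdict resolver that checks verified bots before anything else. The function name `resolve_verdict`, the dict-based session, the callable rule shape, and the fallback verdict are assumptions made for illustration. Other toggles such as block_definite are left out because their exact position after the first step is not described here.

```python
from typing import Callable, Dict, List, Optional

Session = Dict[str, object]
# A hypothetical rule: returns a verdict string, or None if it does not match.
Rule = Callable[[Session], Optional[str]]


def resolve_verdict(
    session: Session,
    config: Dict[str, bool],
    rules: List[Rule],
    default: str = "allow",  # assumed fallback when nothing matches
) -> str:
    # Step 1: verified-bot handling runs first (allow_verified defaults to on).
    if config.get("allow_verified", True) and session.get("verified_bot"):
        return "allow"
    # Step 2: only then are rules consulted, in order.
    for rule in rules:
        verdict = rule(session)
        if verdict is not None:
            return verdict
    return default


def low_score_rule(session: Session) -> Optional[str]:
    # A broad rule with no verified-bot guard: score < 30 → block
    return "block" if session["score"] < 30 else None


crawler = {"verified_bot": True, "score": 5}
print(resolve_verdict(crawler, {"allow_verified": True}, [low_score_rule]))
print(resolve_verdict(crawler, {"allow_verified": False}, [low_score_rule]))
```

In this sketch the first call resolves to allow because the verified check returns before the rule runs. The second call reaches the rule and resolves to block, which is why unguarded rules need care once the toggle is off.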

Acting on verified bots

If you do want to treat some verified bots differently — say, challenge AI crawlers on a paid section — you have two levers:

  • Turn allow_verified off to stop the blanket short-circuit, then let rules and toggles decide.
  • Write a rule that matches on verified_bot / verified_bot_category:
verified_bot AND verified_bot_category == "AI Crawler" AND path == "/premium"
→ block

When allow_verified is off, verified bots reach your rules like any other session, so your enforcement rules should include AND NOT verified_bot. That way a broad rule never accidentally targets a crawler you rely on for indexing.

Using it in rules

verified_bot (bool) and verified_bot_category (string) are both rule-usable signal fields:

NOT verified_bot AND score < 30        → block        # unverified sessions only, never verified crawlers
verified_bot_category in ["Search"]    → allow         # always let search engines through
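
One way to read the two rules above is as plain predicates over a session's signal fields. The Python below is only a reading aid, not Botect's rule engine: the function names and the dict-based session are hypothetical, and the `score` field is taken from the rule example.

```python
def blocks_low_score_unverified(session: dict) -> bool:
    # NOT verified_bot AND score < 30
    return (not session["verified_bot"]) and session["score"] < 30


def allows_search(session: dict) -> bool:
    # verified_bot_category in ["Search"]
    return session.get("verified_bot_category") in ["Search"]


scraper = {"verified_bot": False, "verified_bot_category": None, "score": 10}
search_crawler = {"verified_bot": True, "verified_bot_category": "Search", "score": 10}

for name, session in [("scraper", scraper), ("search_crawler", search_crawler)]:
    print(name, blocks_low_score_unverified(session), allows_search(session))
```

Both sample sessions have the same low score, yet only the unverified one matches the block rule. The NOT verified_bot guard is what exempts the crawler.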

The verified-bot list is maintained from public crawler registries and refreshed periodically, so newly-announced legitimate crawlers are recognized without any change on your side.