Crawl control check

Free Robots.txt Tester

Fetch any site's robots.txt or paste your own, then test any URL path against Googlebot, Bingbot, GPTBot or a custom crawler. You get an Allowed or Blocked verdict plus the exact rule and line number that decided it.

100% Free

No Registration

Instant Results

How robots.txt rules actually get applied

Google retired its standalone robots.txt tester in December 2023 and folded a reduced version into Search Console, where it only covers properties you have verified. That leaves a gap when you want to test a rule change before deploying it, debug a competitor's setup, or check a client site you have no Search Console access to. This tool fills that gap with the same matching logic crawlers use, defined in RFC 9309.

The precedence rules surprise most people. First, a crawler picks exactly one group: the one whose user-agent token is the longest match for its own name, falling back to User-agent: * only when nothing else matches. Rules in other groups are ignored entirely. Second, within that group, the rule with the longest path pattern wins, regardless of the order rules appear in the file. When an Allow and a Disallow of equal length both match, Allow wins. That is why Allow: /blog/ beats Disallow: / for blog URLs, and why rule order almost never matters.

The classic footguns are worth testing for explicitly. A Disallow: / left over from a staging environment silently removes the whole site from crawling. Paths are case sensitive, so Disallow: /Admin/ does nothing for /admin/. Wildcards are greedy, so Disallow: /*? blocks every URL with a query string, including tracked campaign links. And robots.txt controls crawling only; a blocked page can still appear in search results if other sites link to it.

One newer reason to audit this file: AI assistants now fetch and cite pages through their own crawlers, and overly broad rules written years ago often block them. Run the AI crawler access checker to see how GPTBot, ClaudeBot, and PerplexityBot read your file. While you are auditing crawlability, verify your structured data with the schema validator and your titles with the title tag checker.

More free AI search tools

  • Schema Markup Validator Validate every JSON-LD block on a page and catch the mistakes that block rich results.
  • Title Tag Checker Preview your title and description in Google and as an AI citation, with length checks.
  • AI Crawler Checker See which AI crawlers your robots.txt blocks and what that costs you in AI search.
  • llms.txt Generator Generate a spec-compliant llms.txt file that gives AI assistants a clean map of your site.
  • Meta Description Generator Get three SEO-ready meta descriptions under 160 characters, with live length checks.
  • AI Visibility Checker Run one prompt across ChatGPT, Claude, and Gemini and see whether your brand is mentioned or cited.
  • All free tools Local rank audit, Google Business Profile audit, Maps rank checker, schema validator, and more.

Frequently asked.

Did Google remove its robots.txt tester?
Yes. Google retired the standalone robots.txt tester in December 2023. Its replacement is the robots.txt report inside Search Console, which shows the file Google last fetched and any parse errors, but only for properties you have verified, and it no longer offers the interactive test-a-URL feature the old tool had. This tester restores that workflow for any site and any user-agent.
Which rule wins when both an Allow and a Disallow match?
The rule with the longest path pattern wins, counted by characters in the pattern. If an Allow and a Disallow of exactly equal length both match, Allow wins the tie. Order in the file does not matter. Example: with Disallow: /shop/ and Allow: /shop/sale/, the URL /shop/sale/item is allowed because the Allow pattern is longer.
Does Disallow remove a page from Google's index?
No. Disallow stops crawling, and that is all it does. A page Google cannot crawl can still be indexed and shown in results if other pages link to it, usually with a bare title and no snippet. To keep a page out of the index, allow crawling and add a noindex robots meta tag or X-Robots-Tag header. Disallowing a noindexed page is a common mistake because it prevents Google from ever seeing the noindex.
Are robots.txt paths case sensitive?
Yes. Path matching is byte for byte, so Disallow: /Private/ does not block /private/. User-agent tokens are the opposite: they match case-insensitively, so user-agent: googlebot and User-agent: Googlebot are equivalent. If your site serves the same content at mixed-case URLs, you need a rule for each variant or a wildcard pattern that covers both.
Where does robots.txt have to live?
At the root of the exact host, so https://www.example.com/robots.txt. It is scoped per protocol, host, and port: the file on www.example.com does not cover shop.example.com or the http version, and a robots.txt placed in a subdirectory is ignored completely. If you serve traffic on multiple subdomains, each one needs its own file.

Ready to Improve

Your Rankings?

Use our free tools to get instant insights into your SEO performance and discover opportunities to rank higher