
robots.txt Inspector

Parse robots.txt files. See rules grouped by user-agent and flagged issues.

What is robots.txt Inspector?

robots.txt is a plain-text file, defined by the voluntary Robots Exclusion Protocol, that tells web crawlers which paths they may fetch. Major search engines respect it; malicious bots ignore it, so it is not an access control mechanism. Common mistakes include blocking critical resources such as CSS and JavaScript, using wildcards incorrectly, and forgetting to include a Sitemap directive. This inspector parses any robots.txt, groups its rules by user-agent, and flags issues it finds.
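As a rough illustration of what grouping rules by user-agent involves, here is a minimal Python sketch. It is not the inspector's actual implementation: the function names `parse_robots` and `flag_issues` are hypothetical, only `User-agent`, `Allow`, `Disallow`, and `Sitemap` lines are handled, and the two checks shown are assumptions about what a flagged issue might look like.

```python
def parse_robots(text):
    """Return (groups, sitemaps) for a robots.txt body (hypothetical helper)."""
    groups = {}              # lowercased user-agent -> list of (directive, path)
    sitemaps = []
    current_agents = []      # agents that the following rules apply to
    collecting_agents = False
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()   # drop comments and whitespace
        if ":" not in line:
            continue                          # blank or malformed line
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            # Consecutive User-agent lines share one group of rules.
            if not collecting_agents:
                current_agents = []
            agent = value.lower()
            current_agents.append(agent)
            groups.setdefault(agent, [])
            collecting_agents = True
        elif field in ("allow", "disallow"):
            for agent in current_agents:
                groups[agent].append((field, value))
            collecting_agents = False
        elif field == "sitemap":
            sitemaps.append(value)            # not tied to any user-agent
    return groups, sitemaps


def flag_issues(groups, sitemaps):
    """Two illustrative checks; a real inspector would likely do more."""
    issues = []
    if not sitemaps:
        issues.append("no Sitemap directive")
    if ("disallow", "/") in groups.get("*", []):
        issues.append("all crawlers are blocked from the whole site")
    return issues
```

Calling `parse_robots` on a file's text and passing both results to `flag_issues` gives a per-agent rule listing plus a list of warnings. User-agent names are lowercased here because crawlers match them case-insensitively; paths are left untouched because they are case-sensitive.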

Frequently asked questions

Where does robots.txt live?
Always at the root of the host, for example example.com/robots.txt. Crawlers ignore a robots.txt placed in a subdirectory, and each subdomain needs its own file.
What's the difference between Disallow and noindex?
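Disallow prevents crawling; noindex (in a robots meta tag or an X-Robots-Tag HTTP header) prevents indexing. They're not interchangeable: a disallowed URL can still be indexed if other pages link to it, and a crawler that is blocked from a page never sees that page's noindex.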
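Both answers above can be checked with Python's standard library. The sketch below assumes a hypothetical helper `robots_url` and a made-up crawler name `ExampleBot`; `RobotFileParser` is the standard `urllib.robotparser` class, and the rules are passed in directly so that no network request is made.

```python
from urllib.parse import urlsplit, urlunsplit
from urllib.robotparser import RobotFileParser


def robots_url(page_url):
    """Build the robots.txt URL for the host that serves page_url."""
    parts = urlsplit(page_url)
    # Keep scheme and host, replace the path, drop query and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


# Illustrative rules, supplied as lines instead of being fetched.
parser = RobotFileParser()
parser.parse(["User-agent: *", "Disallow: /private/"])

blocked = parser.can_fetch("ExampleBot", "https://example.com/private/page.html")
allowed = parser.can_fetch("ExampleBot", "https://example.com/index.html")
```

Here `robots_url("https://example.com/blog/post?id=1")` yields `https://example.com/robots.txt`, `blocked` is `False`, and `allowed` is `True`. Note that `can_fetch` only answers whether a compliant crawler may request the URL; it says nothing about whether the URL can appear in a search index.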
