2 Nov, 2008 in SEO by admin

How to Check robots.txt file

The robots.txt analysis tool reads the robots.txt file in the same way Googlebot does. If the tool interprets a line as a , Googlebot doesn’t understand that line. If the tool shows that a URL is allowed, Googlebot interprets that URL as allowed.

This tool provides results only for Google user-agents (such as Googlebot). Other bots may not interpret the robots.txt file in the same way. For instance, Googlebot supports an extended definition of the standard. It understands Allow: directives, as well as some pattern matching. So while the tool shows lines that include these extensions as understood, remember that this applies only to Googlebot and not necessarily to other bots that may crawl your site.

If a robots.txt file exists in the root directory of the host, this tool lists the information that Google has about it, including:

  • A link to the current robots.txt file on your site.
  • When Google last downloaded the file – if you’ve made changes to the file after this date and time, our cached version won’t reflect the changes
  • The status of the file – the HTTP response we received when we tried to downloaded it. (Learn more about status codes.) If we receive a 404 (File Not Found) error, this doesn’t present a problem. You don’t have to have a robots.txt file – but if you don’t, bots will be able to crawl all the pages on your site.
  • The MIME type – if the file is a type other than text, we can’t process it.
  • Whether the robots.txt is blocking access to your home page or to any Sitemaps you’ve submitted.
  • If we had trouble parsing lines in the file.

Bookmark This

No Responses so far | Have Your Say!

Leave a Feedback

XHTML: You can use these tags: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>