User-agent: * Disallow: # The following example "/robots.txt" file specifies that no robots should visit any URL starting with # "/cyberworld/map/" or "/tmp/", or / foo.html: # robots.txt for http://www.example.com/ # User-agent: * # Disallow: /cyberworld/map/ # This is an infinite virtual URL space # Disallow: /tmp/ # these will soon disappear # Disallow: /foo.html # This example indicates that no robots should visit this site further: # go away # User-agent: * # Disallow: / # /robots.txt file for http://webcrawler.com/ # mail webmaster@webcrawler.com for constructive criticism # User-agent: webcrawler # Disallow: # User-agent: lycra # Disallow: / # User-agent: * # Disallow: /tmp # Disallow: /logs # The first two lines, starting with '#', specify a comment # The first paragraph specifies that the robot called 'webcrawler' has nothing disallowed: it may go # anywhere. # The second paragraph indicates that the robot called 'lycra' has all relative URLs starting with '/' # disallowed. Because all relative URL's on a server start with '/', this means the entire site is # closed off. # The third paragraph indicates that all other robots should not visit URLs starting with /tmp or /log. # Note the '*' is a special token, meaning "any other User-agent"; you cannot use wildcard patterns or # regular expressions in either User-agent or Disallow lines. #Two common errors: # Wildcards are _not_ supported: instead of 'Disallow: /tmp/*' just say 'Disallow: /tmp'. # You shouldn't put more than one path on a Disallow line (this may change in a future version of the # spec)