Robots.txt

paso obligado a ver una web cuando alguien te cae mal "robots.txt" :P

Get the status of your domain,including AlexaRank,Age,indexation in DMOZ & Archive.org,robots.txt&WHOIS records. http://bit.ly/4iDqHf #seo

google's own meta tag to disable google crawling. <meta name="robots" content="noindex"> also robots.txt works well.

Create Robots.txt file: syntax, example, generator tools http://bit.ly/bRg4hE

Is Robots.txt the key your Search Engine Rankings? http://bit.ly/dnkeu2

@BrandonSheley @betatime They handle 1k+ active users but it went insane. They slowed the SE down via robots.txt to bring it back.

#FB allow pas grand chose en fait : http://www.facebook.com/robots.txt on se croirait presque protégé #e-reputation

The effect of Robots.txt on your Search Engine Future? http://bit.ly/dnkeu2

Corolário: disallow no robots.txt + meta noindex no html = seu site ainda pode aparecer no índice do Google! (via @ArielL)

Will Robots.txt Prevent your Search Engine Future? http://bit.ly/dnkeu2

@DianeBourque ...y compris les moteurs de recherche.." WP écrit dans le fichier robots.txt et ton blog est de nouveau accessible (je pense).

@DianeBourque N'oublie pas le S au robots.txt comme un de mes clients qui a eu une mauvaise surprise

@DianeBourque J'utilise un robot.txt pour ça : http://fr.wikipedia.org/wiki/Robots.txt

@DianeBourque Oui dans fichier robots.txt ou page par page avec la commande meta noindex www.robotstxt.org & www.robotstxt.org/meta.html

Evil Demo - Robots.txt User Agent Generator Googlebot: http://bit.ly/8f4E0f via @addthis
The /robots.txt checker can check your site's /robots.txt file and meta tags. The IP Lookup can help find out more about what robots are visiting you.
The Robot Exclusion Standard, also known as the Robots Exclusion Protocol or robots.txt protocol, is a convention to prevent cooperating web spiders and other web robots from ...
# robots.txt for http://www.example.com/ User-agent: * Disallow: /cyberworld/map/ # This is an infinite virtual URL space # Cybermapper knows where to go.
robots.txt generator designed by an SEO for public use. Includes tutorial.
Information on using the robots.txt file to keep web crawlers, spiders and robots from indexing certain sections of a site.
.