# Serve relevant ads on any page: User-agent: Mediapartners-Google User-agent: Adsbot-Google Disallow: # Big, public search engines can access most of the site: User-agent: ArchitextSpider User-agent: Baiduspider User-agent: DuckDuckBot User-agent: Googlebot User-agent: Googlebot-Image User-agent: Googlebot-Mobile User-agent: ia_archiver User-agent: MJ12bot User-agent: MSNBot User-agent: MSNBot-Media User-agent: Robozilla User-agent: Scooter User-agent: Slurp User-agent: Teoma User-agent: Yahoo-MMCrawler User-agent: Yahoo-Blogs User-agent: Yandex Disallow: /include/ Disallow: /icons/ Disallow: /this-is-a-bad-url/ # Everyone else is not welcome to crawl: User-agent: * Disallow: / # And here's where to find everything: Sitemap: http://linux.die.net/sitemap.xml.gz