Robots.txt Examples
Robots.txt file content for smh.com.au.
Robots.txt file for: smh.com.au
# Nine Entertainment Co expressly prohibits the use of any Nine
# content or data, including associated metadata, for any machine
# learning and/or artificial intelligence including for the purposes
# of training or development of AI technology, tools and machine
# learning language models.
# view our terms of use - https://login.nine.com.au/terms?client_id=smh

# Sitemaps
Sitemap: https://www.smh.com.au/sitemaps/news/brands/smh
Sitemap: https://www.smh.com.au/sitemaps/smh-sitemaps-videos.xml
Sitemap: https://www.smh.com.au/sitemaps/smh-navigation-pages.xml
Sitemap: https://www.smh.com.au/sitemaps/smh-sitemaps-articles.xml
Sitemap: https://www.smh.com.au/rss/feed.xml

# All visitors
User-agent: *
Allow: /
Disallow: /search?text=*

# Specific agents
User-agent: anthropic-ai
Disallow: /

User-agent: Applebot-Extended
Disallow: /

User-agent: Bytespider
Disallow: /

User-agent: CCBot
Disallow: /

User-agent: ChatGPT-User
Disallow: /

User-agent: ClaudeBot
Disallow: /

User-agent: Claude-Web
Disallow: /

User-agent: cohere-ai
Disallow: /

User-agent: Diffbot
Disallow: /

User-agent: Meta-ExternalAgent
Disallow: /

User-agent: Google-Extended
Disallow: /

User-agent: Google-CloudVertexBot
Disallow: /

User-agent: GPTBot
Disallow: /

User-agent: PerplexityBot
Disallow: /

User-agent: Scrapy
Disallow: /

User-agent: Timpibot
Disallow: /

User-agent: omgili
Disallow: /

User-agent: Omgilibot
Disallow: /

User-agent: Webzio-Extended
Disallow: /

User-agent: YouBot
Disallow: /
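To see how a crawler might interpret these rules, the sketch below feeds a short excerpt of the file to Python's standard-library `urllib.robotparser`. The excerpt, the helper name `build_parser`, and the agent name `ExampleBrowser` are illustrative assumptions and not part of the original file. It only checks the agent-level rules, since different parsers may treat the `Allow: /` and wildcard `Disallow: /search?text=*` combination differently.

```python
from urllib.robotparser import RobotFileParser

# Excerpt of the rules listed above (not the full file).
ROBOTS_EXCERPT = """\
Sitemap: https://www.smh.com.au/sitemaps/smh-sitemaps-articles.xml

User-agent: *
Allow: /
Disallow: /search?text=*

User-agent: GPTBot
Disallow: /

User-agent: ClaudeBot
Disallow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Hypothetical helper: parse robots.txt text without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


if __name__ == "__main__":
    parser = build_parser(ROBOTS_EXCERPT)
    # "ExampleBrowser" is a made-up agent that matches only the "*" group.
    for agent in ("GPTBot", "ClaudeBot", "ExampleBrowser"):
        allowed = parser.can_fetch(agent, "https://www.smh.com.au/")
        print(f"{agent}: {'allowed' if allowed else 'blocked'}")
```

With this excerpt, the agents that have their own `Disallow: /` group are reported as blocked, while an agent that matches only `User-agent: *` is allowed to fetch the home page. Note that robots.txt is advisory: it tells well-behaved crawlers what the site requests and does not enforce access by itself.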