Free tools

Robots.txt Examples

Robots.txt file content for news.com.au.

Robot.txt file for: news.com.au

      User-agent: *

Disallow: /*/comments-*
Disallow: /*/print/
Disallow: /*SIT*
Disallow: /*.swf
Disallow: /printpage/
Disallow: /double-rainbow
Disallow: /double-rainbow/
Disallow: /*-simplify/*
Disallow: /it-test-only/
Disallow: /enewsletters/
Disallow: /search?q=*
Disallow: */rss$
Disallow: */rss2/$
Disallow: */feed$
Disallow: /*/coupons/visit/*
Disallow: /app-route/
Disallow: /app/
Disallow: /finance/business/stockhead/news/*/news-story/*

Sitemap: https://www.news.com.au/sitemap.xml
Sitemap: https://www.news.com.au/news-sitemap.xml
Sitemap: https://www.news.com.au/coupons/sitemap.xml
Sitemap: https://www.news.com.au/video-sitemap.xml

#Agent Specific Disallowed Sections

User-agent: NewsNow
Disallow: /

User-agent: CCBot
Disallow: / 

User-agent: GPTBot
Allow: /

User-agent: ChatGPT-User 
Allow: /

User-agent: anthropic-ai 
Disallow: /

User-agent: cohere-ai
Disallow: / 

User-agent: ia_archiver 
Disallow: / 

User-agent: MJ12bot 
Disallow: / 

User-agent: PiplBot
Disallow: / 

User-agent: Google-Extended
Disallow: /

User-agent: Applebot-Extended
Disallow: /

User-Agent: PerplexityBot 

Disallow: /

User-agent: FacebookBot
Disallow: /

User-agent: Meta-ExternalAgent
Disallow: /

User-agent: Meta-ExternalFetcher
Disallow: /