Free tools

Robots.txt Examples

Robots.txt file content for newsbreak.com.

Robot.txt file for: newsbreak.com

      Sitemap: https://www.newsbreak.com/sitemap.xml
Sitemap: https://www.newsbreak.com/sitemap-publisher.xml
Sitemap: https://www.newsbreak.com/sitemap-local-index.xml
Sitemap: https://www.newsbreak.com/sitemap-news-index.xml

User-agent: *
Disallow: /_api/
Disallow: /api/
Disallow: /channels/
Disallow: /af-landing
Disallow: /share/
Disallow: /trending/
Allow: /trending/*/
Disallow: /privacy
Disallow: /terms
Disallow: /newsletter
Disallow: /t-*
Disallow: /redirect-external
Disallow: /me/
Disallow: /_next/data/*.json
Disallow: /following

User-agent: anthropic-ai
Disallow: /

User-agent: AmazonBot
Disallow: /

User-agent: Claude-Web
Disallow: /

User-agent: cohere-ai
Disallow: /

User-agent: ClaudeBot
Disallow: /

User-agent: ChatGPT-User
Disallow: /

User-agent: GPTBot
Disallow: /

User-agent: CCBot
Disallow: /

User-agent: Google-Extended
Disallow: /

User-agent: DataForSeoBot
Disallow: /

User-agent: Diffbot
Disallow: /

User-agent: FacebookBot
Disallow: /

User-agent: magpie-crawler
Disallow: /

User-agent: NewsNow
Disallow: /

User-agent: news-please
Disallow: /

User-agent: omgili
Disallow: /

User-agent: omgilibot
Disallow: /

User-agent: peer39_crawler
Disallow: /

User-agent: peer39_crawler/1.0
Disallow: /

User-agent: PerplexityBot
Disallow: /

User-agent: Scrapy
Disallow: /

User-agent: TurnitinBot
Disallow: /

User-agent: Applebot-Extended
Disallow: /

User-agent: Meta-ExternalAgent
Disallow: /

User-agent: meta-externalagent
Disallow: /

User-agent: OAI-SearchBot
Disallow: /

User-agent: Bytespider
Disallow: /

User-agent: Bard
Disallow: /

User-agent: Claude
Disallow: /

User-agent: Anthropic
Disallow: /