Robots.txt for newsbreak.com
Sitemap: https://www.newsbreak.com/sitemap.xml Sitemap: https://www.newsbreak.com/sitemap-publisher.xml Sitemap: https://www.newsbreak.com/sitemap-local-index.xml Sitemap: https://www.newsbreak.com/sitemap-news-index.xml User-agent: * Disallow: /_api/ Disallow: /api/ Disallow: /channels/ Disallow: /af-landing Disallow: /share/ Disallow: /trending/ Allow: /trending/*/ Disallow: /privacy Disallow: /terms Disallow: /newsletter Disallow: /t-* Disallow: /redirect-external Disallow: /me/ Disallow: /_next/data/*.json Disallow: /following User-agent: anthropic-ai Disallow: / User-agent: AmazonBot Disallow: / User-agent: Claude-Web Disallow: / User-agent: cohere-ai Disallow: / User-agent: ClaudeBot Disallow: / User-agent: ChatGPT-User Disallow: / User-agent: GPTBot Disallow: / User-agent: CCBot Disallow: / User-agent: Google-Extended Disallow: / User-agent: DataForSeoBot Disallow: / User-agent: Diffbot Disallow: / User-agent: FacebookBot Disallow: / User-agent: magpie-crawler Disallow: / User-agent: NewsNow Disallow: / User-agent: news-please Disallow: / User-agent: omgili Disallow: / User-agent: omgilibot Disallow: / User-agent: peer39_crawler Disallow: / User-agent: peer39_crawler/1.0 Disallow: / User-agent: PerplexityBot Disallow: / User-agent: Scrapy Disallow: / User-agent: TurnitinBot Disallow: / User-agent: Applebot-Extended Disallow: / User-agent: Meta-ExternalAgent Disallow: / User-agent: meta-externalagent Disallow: / User-agent: OAI-SearchBot Disallow: / User-agent: Bytespider Disallow: / User-agent: Bard Disallow: / User-agent: Claude Disallow: / User-agent: Anthropic Disallow: /