Free tools
Robots.txt Examples
Robots.txt file content for nbcnews.com.
Robot.txt file for: nbcnews.com
User-agent: * Disallow: /search/ Disallow: /pages/search/ Disallow: /pages/news-connect Disallow: /error404.aspx Disallow: /widget/ Disallow: /*ns/local_news* Disallow: /bentoapi/ Disallow: /*?*canonicalCard= User-agent: Twitterbot Allow: /*?*canonicalCard= # Disallow Bots User-agent: Amazonbot Disallow: / User-agent: anthropic-ai Disallow: / User-agent: Applebot-Extended Disallow: / User-agent: AwarioRssBot User-agent: AwarioSmartBot Disallow: / User-agent: Bytespider Disallow: / User-agent: CCBot Disallow: / User-agent: ChatGPT-User Disallow: / User-agent: ClaudeBot Disallow: / User-agent: Claude-Web Disallow: / User-agent: cohere-ai Disallow: / User-agent: DataForSeoBot Disallow: / User-agent: Diffbot Disallow: / User-agent: FacebookBot Disallow: / User-agent: Google-Extended Disallow: / User-agent: GPTBot Disallow: / User-agent: magpie-crawler Disallow: / User-agent: Meta-ExternalAgent User-agent: meta-externalagent Disallow: / User-agent: NewsNow Disallow: / User-agent: news-please Disallow: / User-agent: OAI-SearchBot Disallow: / User-agent: omgili Disallow: / User-agent: omgilibot Disallow: / User-agent: peer39_crawler User-agent: peer39_crawler/1.0 Disallow: / User-agent: PerplexityBot Disallow: / User-agent: Scrapy Disallow: / User-agent: TurnitinBot Disallow: / # Sitemaps Sitemap: https://www.nbcnews.com/sitemap/nbcnews/sitemap-index Sitemap: https://www.nbcnews.com/sitemap/nbcnews/sitemap-news Sitemap: https://www.nbcnews.com/sitemap/nbcnews/sitemap-curations Sitemap: https://www.nbcnews.com/sitemap/nbcnews/sitemap-select.xml Sitemap: https://www.nbcnews.com/politics/election-results/sitemap.xml