Free tools
Robots.txt Examples
Robots.txt file content for spiegel.de.
Robot.txt file for: spiegel.de
User-agent: * Allow: / Disallow: /*CR-Dokumentation.pdf$ Disallow: /gutscheine/suche? Disallow: /gutscheine/*?code=* Disallow: /gutscheine/*&code=* User-agent: Mozilla/5.0 (compatible; OGDWCtxCrawler) Allow: / User-agent: GPTBot Disallow: / User-agent: Google-Extended Disallow: / User-agent: Applebot-Extended Disallow: / User-agent: CCBot Disallow: / User-agent: magpie-crawler Disallow: / User-agent: ia_archiver Disallow: / User-Agent: omgili Disallow: / User-Agent: omgilibot Disallow: / User-agent: Baiduspider Disallow: / User-agent: AhrefsBot Disallow: / User-agent: DataForSeoBot Disallow: / User-agent: Yeti Disallow: / User-agent: SemrushBot Disallow: / User-agent: sentibot Disallow: / User-agent: MJ12bot Disallow: / User-agent: Bytespider Disallow: / User-agent: SirdataBot Disallow: / User-agent: LCC Disallow: / User-agent: TurnitinBot Disallow: / User-agent: BLEXBot Disallow: / User-agent: dotbot Disallow: / User-Agent: ImagesiftBot Disallow: / User-agent: anthropic-ai Disallow: / User-agent: Claude-Web Disallow: / User-agent: ClaudeBot Disallow: / User-agent: Timpibot Disallow: / User-agent: cohere-ai Disallow: / User-agent: Meta-ExternalAgent Disallow: / User-agent: FacebookBot Disallow: / User-agent: Diffbot Disallow: / Sitemap: https://www.spiegel.de/sitemaps/news-de.xml Sitemap: https://www.spiegel.de/sitemaps/videos/sitemap.xml Sitemap: https://www.spiegel.de/plus/sitemap.xml Sitemap: https://www.spiegel.de/sitemap.xml Sitemap: https://gutscheine.spiegel.de/sitemap.xml # Legal notice: spiegel.de expressly reserves the right to use its content for commercial text and data mining (§ 44b Urheberrechtsgesetz). # The use of robots or other automated means to access spiegel.de or collect or mine data without the express permission of spiegel.de is strictly prohibited. # spiegel.de may, in its discretion, permit certain automated access to certain spiegel.de pages, # If you would like to apply for permission to crawl spiegel.de, collect or use data, please email syndication@spiegel.de