Free tools

Robots.txt Examples

Robots.txt file content for merkur.de.

Robot.txt file for: merkur.de

      # robots.txt www.merkur.de
# Legal notice: www.merkur.de expressly reserves the right to use its content for commercial text and data mining (§ 44b UrhG).
# The use of robots or other automated means to access www.merkur.de or collect or mine data without the express permission of www.merkur.de is strictly prohibited.

User-agent: *
Disallow: /lightweight-ajax
Disallow: /*?trafficsource
Disallow: /suche/
Disallow: /*?cmp=defrss
Disallow: /test/
Disallow: /west/
Disallow: /bi/bootstrap/
Disallow: /bi/doop/
Disallow: /sso/

Sitemap: https://www.merkur.de/news.xml

User-agent: xovi
Disallow: /

User-agent: sistrix
Disallow: /

User-agent: SearchmetricsBot
Disallow: /

User-agent: bingbot
Disallow: /test/
Disallow: /west/

User-agent: GPTBot
Allow: /ueber-uns/
Disallow: /

User-agent: CCBot
Allow: /ueber-uns/
Disallow: /

User-agent: msnbot
Crawl-delay: 5
Disallow: /test/
Disallow: /west/