Generate valid robots.txt files with presets and custom rules
Robots.txt is a plain text file at the root of a website that instructs web crawlers (Googlebot, Bingbot, etc.) which pages to crawl or avoid. It follows the Robots Exclusion Protocol (REP) and supports User-agent, Allow, Disallow, Crawl-delay, and Sitemap directives (Crawl-delay is a de facto extension honored by crawlers such as Bingbot but ignored by Googlebot).
User-agent: *
Allow: /
Disallow: /admin/
Sitemap: https://example.com/sitemap.xml
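As a minimal sketch of how a crawler evaluates these rules, Python's standard urllib.robotparser can check URLs against a file like the one above (the example.com URLs are placeholders; the blanket Allow: / line is omitted because anything not disallowed is crawlable by default):

```python
# Minimal sketch: check URLs against robots.txt rules with Python's
# standard-library parser. The example.com URLs are illustrative.
from urllib.robotparser import RobotFileParser

rules = """\
User-agent: *
Disallow: /admin/
"""

rp = RobotFileParser()
rp.parse(rules.splitlines())

print(rp.can_fetch("*", "https://example.com/blog/post-1"))  # True  (crawlable)
print(rp.can_fetch("*", "https://example.com/admin/login"))  # False (blocked)
```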
| User-agent token | Crawler |
|---|---|
| * | All crawlers |
| Googlebot | Google Search |
| Bingbot | Microsoft Bing |
| GPTBot | OpenAI (ChatGPT) |
| Google-Extended | Google AI (Gemini) |
| Twitterbot | Twitter/X previews |
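Combining tokens from the table above, a site that wants to opt out of AI-training crawlers while staying visible in search might use rules like these (a sketch, not a recommendation; whether to block these bots is the site owner's call):

```
User-agent: GPTBot
Disallow: /

User-agent: Google-Extended
Disallow: /

User-agent: *
Allow: /
```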
Formula
User-agent: {bot}
Disallow: {path}
Allow: {path}
Sitemap: {url}

User-agent = Which crawler the rules apply to (* = all)
Disallow = Path or directory the crawler should not access
Allow = Path or directory explicitly permitted (under RFC 9309 the longest matching rule wins, so a more specific Allow overrides a broader Disallow, as shown below)
Sitemap = Full URL to XML sitemap for crawler discovery
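For example, a more specific Allow carves an exception out of a broader Disallow (the paths are illustrative):

```
User-agent: *
Disallow: /private/
Allow: /private/press-kit/
```

Everything under /private/ is blocked except /private/press-kit/, because the Allow rule is the longer, more specific match.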
Worked Example
Standard robots.txt for a blog with admin area
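A file for that scenario might look like this (example.com and /admin/ are placeholders for the site's real domain and admin path):

```
User-agent: *
Allow: /
Disallow: /admin/

Sitemap: https://example.com/sitemap.xml
```

Note that Disallow only blocks crawling; a page linked from elsewhere can still appear in search results unless it also carries a noindex directive.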
Did you know? The Robots Exclusion Protocol was first proposed by Martijn Koster in 1994 on the www-talk mailing list. Google helped formalize it, and the IETF published it as RFC 9309 in September 2022, the first official standards-track specification for robots.txt (source: IETF RFC 9309).
Sources
IETF RFC 9309: Robots Exclusion Protocol (https://www.rfc-editor.org/rfc/rfc9309)