What is a robots.txt file?

A robots.txt file sits at the root of your website and tells search engine crawlers which pages or directories they should not access. It is the first file most crawlers request when visiting a site.

Which CMS platforms are auto-detected?

ToolMint's robots.txt generator detects WordPress, Shopify, and OpenCart. For each platform it applies the appropriate default disallow rules — for example, blocking /wp-admin/ for WordPress.

Can search engines ignore robots.txt?

Reputable crawlers such as Googlebot honor robots.txt by convention, but nothing technically enforces it, and the file itself is publicly readable. For truly sensitive content, rely on server-level access controls such as authentication; robots.txt alone does not protect it.

How do I upload the robots.txt file?

Upload the file to your website's root directory so it is accessible at https://yourdomain.com/robots.txt. On most hosts this is the public_html or www folder.

Will this tool overwrite my existing robots.txt?

No. The generator reads your existing file and uses it as a reference, but you download the newly generated content separately. Your live file is unaffected until you manually replace it.

Robots.txt Generator – Platform Detection & Sensitive Path Blocker

Generate a ready-to-use robots.txt file for any website. Enter your URL and the tool automatically detects your CMS (WordPress, Shopify, OpenCart), scans for sensitive paths to block, and reads your existing sitemap reference — producing a well-formed robots.txt you can download and deploy instantly.

Auto Generate robots.txt

Enter a website URL and let the tool automatically generate a recommended robots.txt for that website.

🤖

Automatic detection

Finds existing robots.txt, sitemap references, and likely platform rules automatically.

⚡

One-click workflow

Just enter the site URL and generate a recommended robots.txt without manual setup.

🗂️

Ready to use

Copy or download the generated robots.txt file instantly.

What This Generator Builds for You

Platform Auto-Detector

Enter your site URL and the tool detects your CMS — WordPress, Shopify, OpenCart, or generic — and applies the right disallow rules automatically.

Sensitive Path Scanner

Fetches your existing robots.txt and scans for admin panels, login pages, and other sensitive paths to block from crawlers.

Sitemap Reference Finder

Reads your existing robots.txt and sitemap references and carries them into the generated file automatically.

Robots.txt Download

Copy the generated robots.txt to clipboard or download it as a ready-to-upload file in one click.
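The detection and sitemap steps described above can be sketched roughly as follows. This is a minimal illustration, not the tool's actual logic: the function names detect_platform and find_sitemaps, and the HTML fingerprints in PLATFORM_MARKERS, are assumptions chosen for the example.

```python
# Hypothetical fingerprints: assumptions for this sketch, not the tool's real rules.
PLATFORM_MARKERS = {
    "wordpress": ("/wp-content/", "/wp-includes/"),
    "shopify": ("cdn.shopify.com",),
    "opencart": ("index.php?route=",),
}


def detect_platform(html: str) -> str:
    """Return the first platform whose marker appears in the HTML, else 'generic'."""
    lowered = html.lower()
    for platform, markers in PLATFORM_MARKERS.items():
        if any(marker in lowered for marker in markers):
            return platform
    return "generic"


def find_sitemaps(robots_txt: str) -> list[str]:
    """Collect the values of 'Sitemap:' lines; the field name is case-insensitive."""
    sitemaps = []
    for line in robots_txt.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        key, sep, value = line.partition(":")  # split on the first colon only
        if sep and key.strip().lower() == "sitemap" and value.strip():
            sitemaps.append(value.strip())
    return sitemaps
```

String matching like this is only a heuristic: a page can mention a marker without running that platform, so a real detector would likely combine several signals.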

How to Generate a robots.txt File

Enter your website URL

Type your site's base URL and click Start — the tool fetches your current robots.txt and detects your platform.

Review detected settings

The tool shows your detected platform, sitemap status, and sensitive paths it found and will block.

Check the generated file

Review the auto-built robots.txt with all user-agent rules, disallow paths, and default sitemap reference.

Download robots.txt

Copy to clipboard or download the file, then upload it to your website root so it is served at a URL such as https://example.com/robots.txt.
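After uploading, it can help to confirm where crawlers will look for the file. The sketch below derives the root-level robots.txt URL from any page URL and fetches it with Python's standard library. The helper names robots_url and fetch_robots are hypothetical, and the fetch assumes the site is reachable over the network.

```python
from urllib.parse import urlsplit, urlunsplit
from urllib.request import urlopen


def robots_url(site_url: str) -> str:
    """Return the root-level robots.txt URL for the host that serves site_url."""
    parts = urlsplit(site_url)
    if not parts.scheme or not parts.netloc:
        raise ValueError("expected an absolute URL such as https://example.com")
    # Keep only scheme and host; path, query, and fragment are discarded.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


def fetch_robots(site_url: str, timeout: float = 10.0) -> str:
    """Download the live robots.txt so it can be compared with the generated file."""
    with urlopen(robots_url(site_url), timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")
```

For example, robots_url("https://example.com/blog/post?x=1") returns "https://example.com/robots.txt", which shows that the file always lives at the host root rather than next to the page.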

What to Block in robots.txt (and Common Mistakes)

Robots.txt keeps crawlers from wasting time on pages that do not belong in search results. It controls crawling rather than indexing: a blocked URL can still be listed if other sites link to it, so pages that must stay out of results need a noindex directive or access control instead.

Pages worth blocking include: admin panels and login pages (/wp-admin/, /admin/, /login/), internal search result pages, URL parameter variants that create near-duplicate content (?sort=, ?ref=, ?session=), staging or preview environments, and private API endpoints.

Do not block: your sitemap URL, public content pages you want indexed, CSS and JavaScript files (Google needs these to render and understand your pages, and blocking them is an outdated practice that can hurt rankings), and image files unless you specifically want to keep them out of image search.

The most damaging robots.txt mistake is accidentally disallowing the entire site with "Disallow: /" under "User-agent: *" or Googlebot. That single line stops Google from crawling anything. Always verify your live robots.txt at yourdomain.com/robots.txt after deploying.
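A check for the accidental full-site block described above can be sketched with Python's standard urllib.robotparser module. The function name blocks_whole_site is hypothetical. The standard-library parser uses simple prefix matching and does not understand wildcard patterns, so it may not interpret every rule exactly as Googlebot does.

```python
from urllib.robotparser import RobotFileParser


def blocks_whole_site(robots_txt: str, user_agent: str = "Googlebot") -> bool:
    """Return True if the rules forbid user_agent from fetching the site root."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(user_agent, "/")


# A blanket disallow for Googlebot locks the crawler out of everything.
broken = "User-agent: Googlebot\nDisallow: /\n"
# Blocking only the admin area leaves the rest of the site crawlable.
healthy = "User-agent: *\nDisallow: /wp-admin/\n"

print(blocks_whole_site(broken))   # True
print(blocks_whole_site(healthy))  # False
```

Running a check like this against the generated text before uploading is a cheap guard against the mistake, though it does not replace looking at the live file.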

Robots.txt for WordPress, Shopify, and Static Sites

Different platforms have different directories that need protection. WordPress should block /wp-admin/ (while allowing /wp-admin/admin-ajax.php for AJAX functionality) and search URLs like /?s=. Older guides also block /wp-includes/, but that directory serves core CSS and JavaScript, so blocking it can interfere with rendering. Shopify auto-generates a robots.txt and does not allow full customization; you can only add custom rules via the Shopify robots.txt.liquid template. OpenCart should block /admin/, /catalog/controller/, /install/, and /system/. Static sites (plain HTML, Next.js static export, Hugo, Gatsby) typically only need a minimal robots.txt that allows all crawlers and points to the sitemap. The file generated by this tool handles these cases automatically based on the detected platform and adds a Sitemap: directive pointing to your sitemap.xml so crawlers know where to find it.
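The per-platform rules above could be assembled into a file along these lines. This is a sketch under stated assumptions, not the generator's actual output: build_robots and PLATFORM_RULES are hypothetical names, /wp-includes/ is left out because it serves core CSS and JavaScript, and Shopify is omitted because its file is managed through the robots.txt.liquid template rather than uploaded.

```python
# Hypothetical rule table based on the paths named in the text above.
PLATFORM_RULES = {
    "wordpress": {
        "allow": ["/wp-admin/admin-ajax.php"],
        "disallow": ["/wp-admin/", "/?s="],
    },
    "opencart": {
        "allow": [],
        "disallow": ["/admin/", "/catalog/controller/", "/install/", "/system/"],
    },
    "generic": {"allow": [], "disallow": []},
}


def build_robots(platform: str, sitemap_url: str) -> str:
    """Build robots.txt text for a platform; unknown platforms use the generic rules."""
    rules = PLATFORM_RULES.get(platform, PLATFORM_RULES["generic"])
    lines = ["User-agent: *"]
    lines += [f"Allow: {path}" for path in rules["allow"]]
    lines += [f"Disallow: {path}" for path in rules["disallow"]]
    if not rules["disallow"]:
        lines.append("Disallow:")  # an empty Disallow permits everything
    lines += ["", f"Sitemap: {sitemap_url}"]
    return "\n".join(lines) + "\n"


print(build_robots("wordpress", "https://example.com/sitemap.xml"))
```

Keeping the rules in a plain table makes it easy to review exactly which paths each platform blocks before anything is deployed.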
