robots.txt tells crawlers what to skip. sitemap.xml lists URLs for Google. llms.txt is the new one: a structured, AI-friendly summary of your site that LLMs can actually use.
You probably do not have one yet. Most sites do not. That is an opportunity, not a problem.
What llms.txt is
llms.txt is a file that sits at the root of your domain (yoursite.com/llms.txt) and tells AI models what your site is about, which pages matter, and how your content is organized.
It is written in Markdown, which is deliberate. LLMs parse Markdown natively, better than they parse raw HTML. A well-structured llms.txt gives an AI model a fast, clean snapshot of your site without having to crawl and make sense of your entire navigation.
The proposed standard (published by Jeremy Howard in late 2024) specifies a simple structure:
- Site name and URL at the top
- A short summary of what the site does
- Section headings (docs, blog, products, etc.) with lists of key URLs under each
- Optional context like recent blog posts or featured content
Think of it as "here is my site, written in the format an LLM is most comfortable reading."
How the generator works
You paste your homepage URL. The tool does the rest in about 30 seconds.
- Crawl: fetches your site, discovers pages, and maps the structure.
- Section detection: identifies key content areas (blog, docs, products, pricing, about, etc.) based on URL patterns and page content.
- Link aggregation: groups discovered pages under each section.
- Output generation: builds a properly formatted llms.txt file with site metadata, sections, and links.
You get a ready-to-upload file plus a set of statistics: number of sections, total links, file size, and pages discovered.
What the output contains
A typical generated llms.txt includes:
- Site metadata: name, URL, short description
- Section headings for major content areas (Blog, Docs, Products, Pricing, About, Contact, etc.)
- Page link lists under each section, organized hierarchically
- Link counts per section so you can see distribution at a glance
You see the full file preview in the tool before you download. You can copy it to clipboard or download as text.
Why this matters for AI visibility
AI adoption of llms.txt is early. Major LLM providers have not publicly confirmed using it yet. But the trend is clear: AI models are moving toward structured, machine-readable site context. Sites that ship llms.txt early will have clean ingestion when adoption catches up.
There are three concrete reasons to bother:
It is free to ship. A file at your domain root costs nothing. The upside of being ahead is asymmetric: if it works, you benefit. If it does not, you lost 5 minutes.
It forces clarity about your site. Generating llms.txt makes you think about what your site actually is and which pages matter. Most site owners never do this explicitly, and their content strategy suffers for it.
It stacks with schema and good HTML. llms.txt is not a silver bullet. It works alongside proper schema markup, clean HTML, and quality content. Shipping all of them together is the play.
Where llms.txt fits in the GEO stack
Think of AI visibility optimization as a layered stack:
- Content: is the answer actually on your site?
- HTML structure: can an AI model parse it cleanly?
- Schema markup: do entities and relationships have explicit structure?
- llms.txt: does the site as a whole have machine-readable context?
- Citations and mentions: are trusted sources pointing at you?
You can run this stack in order. Content first, structure second, schema third, llms.txt fourth, off-site mentions fifth. Each layer multiplies the effect of the others. llms.txt is the cheapest layer to ship, which is why most sites should start there even if they have not solved the others yet.
The limitations
The tool crawls publicly accessible pages. If large sections of your site require authentication or heavy JavaScript rendering, they may not appear in the generated file. Re-run the tool after making any content accessible to crawlers.
llms.txt is an emerging standard. Adoption by major LLMs is not guaranteed, and the spec may evolve. What is generated today follows the current specification. If the spec changes significantly, re-run to get the updated format.
Try it yourself
Paste your homepage URL. Get your llms.txt file in 30 seconds. Upload it to your site root. Ship.
Related articles
- Free Schema Generator - pair llms.txt with proper JSON-LD on your key pages.
- Free Page Audit - see how AI-ready your individual pages are.
- Site Crawl - the full in-app crawl and clustering tool for Mentionable users.