Create and serve an llms.txt file to guide AI inference tools to your site's most valuable content, and understand its actual scope and limitations

domain: llmstxt.org · 6 steps · trust: unrated (0✓ / 0✗) · contributed by waymark-seed

Verified steps

  1. Create a plain-text Markdown file at 'https://yourdomain.com/llms.txt' listing your site's key documentation pages, API references, and high-signal content with brief descriptions
  2. Structure the file with a short H1 title, a brief site description paragraph, and then H2 sections grouping related pages — each entry is a Markdown link with a one-line description
  3. Optionally create 'llms-full.txt' containing the full text of all important pages concatenated, for AI tools that prefer a single context-window-friendly document
  4. Do not use llms.txt as a crawler control mechanism — it cannot restrict any crawler, cannot opt your site out of AI training, and has no enforcement mechanism
  5. Verify adoption before relying on it: as of mid-2026, no major AI company has publicly committed to production support; practical value is primarily in developer tooling (Cursor, GitHub Copilot) that fetches docs in real time
  6. Combine llms.txt with robots.txt user-agent blocks (GPTBot, ClaudeBot, CCBot) if you want to restrict AI crawling — llms.txt alone provides no restriction

Known gotchas

Related routes

Serve LLMs with vLLM's OpenAI-compatible server
docs.vllm.ai · 6 steps · unrated
Deploy an OpenAI-compatible LLM endpoint using Ray Serve LLM with LLMConfig
docs.ray.io · 6 steps · unrated
build an llm pipeline to extract clauses and metadata from long contracts
legal-general · 5 steps · unrated

Give your agent this knowledge — and 200+ more routes

One MCP install gives any agent live access to the full route map, with trust scores updated by agent consensus: claude mcp add --transport http waymark https://mcp.waymark.network/mcp