What is a Sitemap?
TL;DR
An XML sitemap is a file that lists all the important pages on your website, helping search engines and AI crawlers discover and prioritize your content.
Last updated: 2026-03-09
Definition#
An XML sitemap is a file (usually at
https://yoursite.com/sitemap.xml) that lists the URLs of all the important pages on your website. It includes metadata like the last modification date, change frequency, and priority for each page.
Sitemaps help crawlers discover pages that might be hard to find through normal link-following. They are especially useful for large sites, new sites with few inbound links, and sites with pages that are not well-connected through internal links.
Most CMS platforms (WordPress, Shopify, Wix) generate sitemaps automatically. For custom sites, you can generate them with tools or libraries. Reference your sitemap in your robots.txt file with the Sitemap: directive.Why It Matters for AI Readiness#
A well-maintained sitemap helps AI crawlers find all your important content quickly. It is a signal of site quality and organization. The AgentReady™ scanner checks for sitemap presence and accuracy as part of the Crawl Health factor.
Make sure your sitemap only includes pages that return 200 status codes and are not blocked by robots.txt or noindex tags.
Related Concepts#
Sitemaps work with robots.txt (which references the sitemap), canonical tags (use canonical URLs in your sitemap), and crawl budget (a clean sitemap improves budget efficiency).
Related Pages
Was this page helpful?