What Is a Sitemap?
A practical explanation of website sitemaps, XML sitemaps, HTML sitemaps, and why they matter for search engines.
A sitemap is a file or page that lists important URLs on a website. Search engines use sitemaps as discovery hints, especially when a site has many pages, deep navigation, new content, or pages that are not linked clearly from the homepage.
The most common SEO sitemap is an XML sitemap. It usually lives at a URL like
https://example.com/sitemap.xml and contains structured entries for pages,
posts, products, categories, or other indexable URLs.
XML sitemaps
An XML sitemap is built for machines. It tells crawlers which URLs exist and can
include optional metadata such as lastmod, changefreq, and priority.
Most modern CMS platforms generate XML sitemaps automatically. WordPress, Shopify, Webflow, and many site builders create either one sitemap file or a sitemap index that links to several smaller child sitemaps.
Use the free Sitemap Validator to inspect whether a sitemap URL is reachable and parseable.
HTML sitemaps
An HTML sitemap is built for people. It is a regular page with links to important sections or pages. It can help visitors and crawlers understand a site's structure, but it is not a replacement for XML sitemap submission.
Why sitemaps matter
Sitemaps do not force search engines to index every URL, but they improve discovery. They are especially useful when:
- A site has new pages that need to be discovered quickly.
- A site has pages that are not linked from many other pages.
- A site is large enough that crawling every URL takes time.
- A site uses multiple templates, languages, or content types.
Where to find a sitemap
Common sitemap locations include:
/sitemap.xml/sitemap_index.xml/wp-sitemap.xmlrobots.txtdirectives such asSitemap: https://example.com/sitemap.xml
The fastest way to check is to use the free Find Sitemap tool, which checks robots.txt and common sitemap paths.
Sitemap maintenance
A useful sitemap should stay clean. Avoid including redirected URLs, duplicate URLs, blocked pages, noindex pages, and URLs that return errors. If a sitemap is stale, search engines may waste crawl budget on pages that no longer matter.
For a quick maintenance check, validate the sitemap, count the listed URLs, and extract the URL list into a spreadsheet so stale or missing pages are easier to spot.