All posts
Sitemap Basics2026/05/06

What Is a Sitemap?

A practical explanation of website sitemaps, XML sitemaps, HTML sitemaps, and why they matter for search engines.

A sitemap is a file or page that lists important URLs on a website. Search engines use sitemaps as discovery hints, especially when a site has many pages, deep navigation, new content, or pages that are not linked clearly from the homepage.

The most common SEO sitemap is an XML sitemap. It usually lives at a URL like https://example.com/sitemap.xml and contains structured entries for pages, posts, products, categories, or other indexable URLs.

XML sitemaps

An XML sitemap is built for machines. It tells crawlers which URLs exist and can include optional metadata such as lastmod, changefreq, and priority.

Most modern CMS platforms generate XML sitemaps automatically. WordPress, Shopify, Webflow, and many site builders create either one sitemap file or a sitemap index that links to several smaller child sitemaps.

Use the free Sitemap Validator to inspect whether a sitemap URL is reachable and parseable.

HTML sitemaps

An HTML sitemap is built for people. It is a regular page with links to important sections or pages. It can help visitors and crawlers understand a site's structure, but it is not a replacement for XML sitemap submission.

Why sitemaps matter

Sitemaps do not force search engines to index every URL, but they improve discovery. They are especially useful when:

  • A site has new pages that need to be discovered quickly.
  • A site has pages that are not linked from many other pages.
  • A site is large enough that crawling every URL takes time.
  • A site uses multiple templates, languages, or content types.

Where to find a sitemap

Common sitemap locations include:

  • /sitemap.xml
  • /sitemap_index.xml
  • /wp-sitemap.xml
  • robots.txt directives such as Sitemap: https://example.com/sitemap.xml

The fastest way to check is to use the free Find Sitemap tool, which checks robots.txt and common sitemap paths.

Sitemap maintenance

A useful sitemap should stay clean. Avoid including redirected URLs, duplicate URLs, blocked pages, noindex pages, and URLs that return errors. If a sitemap is stale, search engines may waste crawl budget on pages that no longer matter.

For a quick maintenance check, validate the sitemap, count the listed URLs, and extract the URL list into a spreadsheet so stale or missing pages are easier to spot.