Sitemap

SEO

Also: XML Sitemap · HTML Sitemap

What it isA list of URLs for search engines
Two typesXML for crawlers, HTML for people
Submit toGoogle Search Console
Watch forStale or broken URLs

Quick definition

A sitemap is a file that lists the URLs on your website. The XML version is built for search engine crawlers, telling them which pages exist and when they were last updated. The HTML version is built for visitors navigating a large site. Most modern sites need both.

What it actually means

Think of a sitemap as a building directory. It doesn't unlock any doors, but it tells the crawler which floors exist and what's on each one. Without it, a crawler has to discover pages by following links, which means anything not well-linked internally might never get found.

The XML sitemap is the one that matters for Search Engine Optimisation (SEO). It's a structured file, usually at yourdomain.com/sitemap.xml, that lists your URLs with optional metadata: when each page was last modified, how often it changes, and how important it is relative to other pages. Submitting it to Google Search Console tells Google where to look and how fresh the content is.

The HTML sitemap is a navigational aid for humans, typically a page with categorised links to every section of the site. It helps on large e-commerce or government sites where the navigation doesn't expose every page. For most small business sites it's optional.

Neither type of sitemap overrides crawl budget or forces indexing. Google decides what to index. The sitemap just helps it make that decision faster and with less guesswork.

A sitemap doesn't guarantee indexing. It's a polite suggestion to a crawler, not a command.

How it shows up

A missing or broken sitemap shows up as slow or incomplete indexing. Pages that exist but aren't linked from anywhere often don't appear in search results until the sitemap is submitted. In Search Console, the Coverage and Sitemaps reports show which URLs were submitted, which were indexed and which were skipped. Skipped doesn't mean penalised, it means Google decided the page wasn't worth indexing, usually because of thin content, duplicate content or a noindex tag.

Where people get this wrong

Assuming submitting a sitemap means Google will index everything in it.Google treats the sitemap as a suggestion, not an instruction. Pages with thin content, blocked robots.txt rules or noindex tags will be skipped regardless.
Including URLs you don't want indexed, like thank-you pages or filtered category URLs.Every URL in your sitemap signals that it deserves attention. Including low-value pages dilutes the signal and can invite crawl budget waste on a large site.
Setting it up once and never updating it.A sitemap full of deleted pages or URLs that return errors confuses crawlers and undermines the trust of the file. Most platforms regenerate it automatically, but it's worth auditing quarterly.

Related terms

Common questions

Do I need a sitemap if my site is small?

For a site with fewer than a handful of well-linked pages, Google will find everything through crawling and a sitemap adds little. Once you have dozens of pages, distinct content sections, or pages that aren't easily navigable, a sitemap starts paying off.

Where do I submit my sitemap?

In Google Search Console under the Sitemaps section. Enter the URL of your sitemap file, usually /sitemap.xml. You can also reference it in your robots.txt file so any crawler finds it automatically without needing manual submission.

What's the difference between an XML sitemap and an HTML sitemap?

The XML sitemap is for crawlers and lives as a structured file at a URL like /sitemap.xml. The HTML sitemap is a webpage for human visitors with links to every section of the site. They serve different purposes and aren't interchangeable.

How do I know if my sitemap is working?

Check the Sitemaps report in Google Search Console. It shows how many URLs you submitted, how many Google has indexed and any errors it found. A large gap between submitted and indexed URLs usually points to a content quality or duplication issue.

Keep exploring

About New Rebellion

New Rebellion is a marketing intelligence consultancy. We build tools, score Australian businesses on how their marketing actually performs, and publish Debrief every day. This dictionary is part of how we work in the open.

How we think →