Definition
A sitemap is either a structured XML file submitted to search engines listing every URL on a site, or a diagram showing the hierarchical relationship between pages used during information architecture work.
There are two sitemaps: the human one (a tree diagram showing how pages relate) and the machine one (an XML file telling search engines what to crawl). Both serve discoverability — humans navigate via the IA the diagram describes; bots follow the URLs in the XML.
The XML sitemap is the cheapest SEO win on most sites. It guarantees search engines know about every page, when each was last updated, and how the site prioritises them. Submitting the sitemap to Google Search Console and Bing Webmaster Tools costs minutes and accelerates indexing of new content from days to hours.
Origin
XML sitemaps emerged from Google's 2005 protocol announcement, later codified at sitemaps.org. Visual sitemaps (the diagram form) date to the earliest days of website information architecture work in the mid-1990s.
How it works
- Generate the XML sitemap automatically from the CMS or build process — keep it always current.
- Include only canonical URLs you want indexed; exclude noindex pages, paginated archives, faceted-search URLs.
- Submit to Google Search Console and Bing Webmaster Tools.
- Keep file under 50,000 URLs / 50MB; split into a sitemap index if larger.
- Reference the sitemap in robots.txt (Sitemap: https://example.com/sitemap.xml).
- Audit monthly — broken URLs in the sitemap erode trust signals to crawlers.
When to use it
Use when
- Every public website. The XML form is essentially mandatory for modern SEO.
- Visual sitemaps during information-architecture work on new builds or redesigns.
Skip when
- On purely private apps not intended for search indexing.
Key metrics
- % of sitemap URLs indexed in Google Search Console.
- Crawl frequency and depth.
- Indexing latency for new pages.
Examples
- The sitemap exposed 40 orphan pages with no internal links.
- Submitting an XML sitemap to Google Search Console is the cheapest SEO win there is.
- The visual sitemap collapsed 11 nav items into 5.
In practice at Makreate
Makreate website builds ship with both a human IA sitemap and a maintained XML sitemap so search engines and users find everything. Every commit that adds or removes a page regenerates the XML — no orphans, no stale references. A recent client's traffic doubled in 90 days simply because their previous sitemap hadn't been updated in 18 months and Google was missing half their content.
Website Design & Development →Common mistakes
- Forgetting to update the XML after restructuring. Orphan pages die quietly.
- Including non-canonical or noindex URLs.
- Letting the file grow beyond 50,000 URLs without splitting into an index.
- Not referencing it in robots.txt.
- Treating a visual sitemap as a deliverable that never ships into actual nav.
Frequently asked
XML sitemap or HTML sitemap?
Both. XML for crawlers, HTML for users (and as a fallback navigation aid). HTML sitemaps also pass internal-link signal.
How often should the sitemap update?
Automatically, on every content change. Manual sitemaps go stale within weeks.
Do I need a sitemap for a small site?
Yes. The cost is near-zero and the speed-to-index benefit is the same.