Sitemaps

Your site's sitemap.xml lists all of the published, public pages on your site. It is available at [site.url]/sitemap.xml. For example, the sitemap of this site can be found at https://docs.gossinteractive.com/sitemap.xml

Our sitemaps conform to the standard sitemap protocol expected by search engines like Google (see Google's support page at https://support.google.com/webmasters/answer/183668).
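Because the sitemap follows the standard protocol, it can be read with any XML parser. The sketch below is illustrative only: the function name parse_sitemap and the SAMPLE document are hypothetical, and it assumes the generated file declares the standard sitemaps.org namespace on its root element.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemap protocol (sitemaps.org).
# Assumption: the generated sitemap.xml declares this namespace.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}

# Hypothetical sample, modelled on the entry format shown in this article.
SAMPLE = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url>
    <loc>https://docs.gossinteractive.com/article/5773/Resources</loc>
    <lastmod>2019-01-29</lastmod>
  </url>
</urlset>"""


def parse_sitemap(xml_text):
    """Return one dict per <url> entry; tags that are absent map to None."""
    root = ET.fromstring(xml_text)
    entries = []
    for url in root.findall("sm:url", SITEMAP_NS):
        entry = {}
        for tag in ("loc", "lastmod", "priority", "changefreq"):
            entry[tag] = url.findtext("sm:" + tag, default=None, namespaces=SITEMAP_NS)
        entries.append(entry)
    return entries


if __name__ == "__main__":
    for entry in parse_sitemap(SAMPLE):
        print(entry["loc"], entry["lastmod"])
```

To read a live sitemap, the text fetched from [site.url]/sitemap.xml could be passed to parse_sitemap in place of SAMPLE.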

Which Articles?

The sitemap includes all of the live non-secured articles on your site. Articles beneath "hidden" articles are included, as are articles that may be excluded from your site search or otherwise hidden from the main content structure and navigation.

To reiterate, the sitemap links to every article unless the article is in a secured area or has display properties set that would return a "404 page not found" result.

Excluding Articles

Articles can still be excluded by relating metadata to them. The standard sitemap metadata (found in the goss/sitemap group of the metadata library) includes an "exclude" property with two values:

  • article excludes the article it is related to
  • article_and_descendants excludes the article it is related to and all articles beneath it in the content tree
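The difference between the two values can be illustrated with a small tree walk. This is a sketch of the described behaviour only, not how the product implements it (the real sitemap is built from a search query). The function sitemap_articles, the dictionary-based tree and the article names are all hypothetical.

```python
def sitemap_articles(tree, exclude, root):
    """Return the article ids that would remain in the sitemap.

    tree:    dict mapping an article id to a list of its child ids (hypothetical structure)
    exclude: dict mapping an article id to "article" or "article_and_descendants"
    root:    id of the article to start from
    """
    included = []
    stack = [root]
    while stack:
        article = stack.pop()
        rule = exclude.get(article)
        if rule == "article_and_descendants":
            # Skip this article and do not visit anything beneath it.
            continue
        if rule != "article":
            included.append(article)
        # With "article", only the article itself is dropped; its children are still visited.
        stack.extend(reversed(tree.get(article, [])))
    return included


if __name__ == "__main__":
    tree = {
        "home": ["news", "about"],
        "news": ["story1", "story2"],
        "about": ["team"],
    }
    exclude = {"news": "article", "about": "article_and_descendants"}
    print(sitemap_articles(tree, exclude, "home"))  # ['home', 'story1', 'story2']
```

In this example "news" is dropped but its two stories remain, while "about" and its child "team" are both dropped.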

XML Tags

The standard sitemap includes the compulsory <loc> tag and the optional <lastmod> tag for each page. For example:

<url>
    <loc>https://docs.gossinteractive.com/article/5773/Resources</loc>
    <lastmod>2019-01-29</lastmod>
</url>

The optional <changefreq> and <priority> tags can be added by relating values from the relevant metadata properties to your article.

For example, relating a priority of 0.5 and a change frequency of weekly to an article would change the XML above to:

<url>
    <loc>https://docs.gossinteractive.com/article/5773/Resources</loc>
    <lastmod>2019-01-29</lastmod>
    <priority>0.5</priority>
    <changefreq>weekly</changefreq>
</url>

Whether search engines and crawlers respect these values is outside of our control.
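The sitemap protocol itself restricts what these two tags may contain: <priority> is a number from 0.0 to 1.0, and <changefreq> is one of always, hourly, daily, weekly, monthly, yearly or never. A minimal sketch of checking a pair of values against those rules follows; the function name check_optional_tags is hypothetical and is not part of the product.

```python
# Values permitted for <changefreq> by the sitemap protocol.
VALID_CHANGEFREQ = {"always", "hourly", "daily", "weekly", "monthly", "yearly", "never"}


def check_optional_tags(priority=None, changefreq=None):
    """Return a list of problems found; an empty list means both values are acceptable."""
    problems = []
    if priority is not None:
        try:
            value = float(priority)
        except ValueError:
            problems.append("priority is not a number: %r" % (priority,))
        else:
            # The protocol allows priorities from 0.0 to 1.0 inclusive.
            if not 0.0 <= value <= 1.0:
                problems.append("priority out of range 0.0-1.0: %r" % (priority,))
    if changefreq is not None and changefreq not in VALID_CHANGEFREQ:
        problems.append("unknown changefreq: %r" % (changefreq,))
    return problems


if __name__ == "__main__":
    print(check_optional_tags("0.5", "weekly"))     # []
    print(check_optional_tags("1.5", "sometimes"))  # two problems reported
```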

How It Works

The sitemap.xml is generated using a SOLR query. The root article of the current subsite is taken as the starting point.

The three metadata properties in the sitemap metadata group have search fields set as:

  • SITEMAPXMLPRIORITY
  • SITEMAPXMLCHANGEFREQ
  • SITEMAPXMLEXCLUDE

When metadata values from these properties are related to an article, the search index entry of that article holds those values in its dynamic "OBJECT_SF_*" fields. It is these search fields that identify the metadata values as "special" and part of the sitemap configuration.

The <lastmod> tag is the "Modified" value visible in the "Properties" tab of the article editor.
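As a rough illustration of the mapping described above, the sketch below turns one search index entry into a <url> element. It is not the product's implementation. Several details are assumptions: that the dynamic field names are the "OBJECT_SF_" prefix followed by the search field name, that the entry is available as a plain dictionary, and the hypothetical "url" and "modified" keys.

```python
import xml.etree.ElementTree as ET

# Assumption: dynamic field name = this prefix + the search field name.
FIELD_PREFIX = "OBJECT_SF_"


def url_element(doc):
    """Build a <url> element from an index entry, or return None if the article is excluded.

    doc is a dict standing in for a search index entry. The "url" and "modified"
    keys are hypothetical names used only for this sketch.
    """
    # Either exclude value removes the article itself from the sitemap.
    if doc.get(FIELD_PREFIX + "SITEMAPXMLEXCLUDE"):
        return None

    url = ET.Element("url")
    ET.SubElement(url, "loc").text = doc["url"]
    ET.SubElement(url, "lastmod").text = doc["modified"]

    # The optional tags are only emitted when the metadata has been related.
    priority = doc.get(FIELD_PREFIX + "SITEMAPXMLPRIORITY")
    if priority is not None:
        ET.SubElement(url, "priority").text = str(priority)
    changefreq = doc.get(FIELD_PREFIX + "SITEMAPXMLCHANGEFREQ")
    if changefreq is not None:
        ET.SubElement(url, "changefreq").text = changefreq
    return url


if __name__ == "__main__":
    doc = {
        "url": "https://docs.gossinteractive.com/article/5773/Resources",
        "modified": "2019-01-29",
        "OBJECT_SF_SITEMAPXMLPRIORITY": "0.5",
        "OBJECT_SF_SITEMAPXMLCHANGEFREQ": "weekly",
    }
    print(ET.tostring(url_element(doc), encoding="unicode"))
```

Excluding the descendants of an article marked article_and_descendants would need knowledge of the content tree, which this single-entry sketch does not attempt.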

Example Metadata

The standard sitemap metadata should already be present in your iCM metadata library (found in the goss/sitemap group). If it isn't, you can download and import the Example Sitemap Metadata (zip, 1KB).

Last modified on 05 April 2022
