Standard Index

The Sitevision Standard Index is the default index of type sv:nodeIndex. It contains data about pages and binaries:

  • sv:sitePage
  • sv:page
  • sv:article
  • sv:structurePage
  • sv:collaborationGroupPage
  • sv:file
  • sv:image

Default query fields ("qf") when using the ExtendedDisMaxParser is:

  • content.analyzed
  • name.analyzed
  • title.analyzed

Default query field ("df") when using the StandardParser is:

  • content.analyzed

Querying the Standard Index

The Standard Index is used in Searcher when no index is explicitly specified .

var searchFactory = require('SearchFactory');

var searcher = searchFactory.getSearcherBuilder().build();

var searchResult = searcher.search('the query', 10);

/* handle search result... */

Though, the Standard Index can also be explicitly specified (i.e. via SearcherBuilder.setIndex).

var searchFactory = require('SearchFactory');
var standardIndexType = require('IndexUtil.IndexType.NODE');

// Get the Standard index
var indexUtil = searchFactory.getIndexUtil();
var standardIndex = indexUtil.getDefaultIndex(standardIndexType);

var searcher = searchFactory.getSearcherBuilder()
                 .setIndex(standardIndex) // Set index
                 .build();

var searchResult = searcher.search('the query', 10);

/* handle search result... */

Fields

acl_allow (solr.StrField, stored, multivalued)

Permission tokens.

anonaccess (solr.BoolField, stored)

Whether or not resource is publicly accessible.

comments (solr.TrieLongField, stored)

Number of page comments.

content.analyzed (solr.TextField, analyzed, multivalued)

Text content in analyzed form. This field uses Stopwords. This is a default query field.

  • Used as "qf" by ExtendedDisMaxParser
  • Used as "df" by StandardParser

content.stemmed.analyzed (solr.TextField, stemmed, multivalued)

Stemmed text content (see content.analyzed for non-stemmed). This field uses Stopwords.

created (solr.TrieDatefield, stored)

Creation date.

createdby (solr.StrField, stored)

The Node identifier of the creator.

heading1.analyzed (solr.TextField, analyzed, multivalued)

First level headings (i.e. <h1> data of page content). This field uses Stopwords

heading2.analyzed (solr.TextField, analyzed, multivalued)

Second level headings (i.e. <h2> data of page content). This field uses Stopwords

heading3.analyzed (solr.TextField, analyzed, multivalued)

Third level headings (i.e. <h3> data of page content). This field uses Stopwords

heading4.analyzed (solr.TextField, analyzed, multivalued)

Fourth level headings (i.e. <h4> data of page content). This field uses Stopwords

headings.analyzed (solr.TextField, analyzed, multivalued)

All headings. The combined values of heading1.analyzed...heading4.analyzed (i.e. <h1>...<h4> data of page content). This field uses Stopwords

id (solr.StrField, stored, unique)

The unique document id (typically the Node identifier of the represented data).

jcrtype (solr.StrField, stored)

The JCR NodeType.

language (solr.StrField, stored)

The language.

languagetag (solr.StrField, stored)

The languagetag.

lastcommented (solr.TrieDateField, stored)

Date of last page comment.

lastindexed (solr.TrieDateField, stored)

Indexing timestamp (i.e. when was this document indexed?)

lastpublished (solr.TrieDateField, stored)

Date of last publish (date of last update for non-publishable nodes, e.g. files/images).

lastpublishedby (solr.StrField, stored)

The Node identifier of the last publishing user (or last updating user for non-publishable nodes, e.g. files/images).

mimetype (solr.StrField, stored)

The MIME (i.e. the "Content-Type").

modified (solr.TrieDateField, stored)

Date of last modification.

modifiedby (solr.StrField, stored)

The Node identifier of the last modifying user.

name (solr.StrField, stored)

The name.

name.analyzed (solr.TextField, analyzed, multivalued)

The name in analyzed form. This is a default query field.

  • Used as "qf" by ExtendedDisMaxParser

name.stemmed.analyzed (solr.TextField, stemmed, multivalued)

The name in stemmed form (see name.analyzed for non-stemmed). This field uses Stopwords

name.sortable (solr.CollationField)

The (multivalued) name in sortable form.

namepath (solr.StrField, stored)

The name-path of the nodes in the structural path from the site.

nodeid (solr.StrField, stored)

The Node identifier.

path (solr.StrField, stored, multivalued)

The Node identifiers of the nodes in the structural path from the site.

published (solr.TrieDateField, stored)

The publish date. Also see lastpublished for "last published date".

publishedby (solr.StrField, stored)

The Node identifier of the publishing user (or updating user for non-publishable nodes, e.g. files/images). Also see lastpublishedby for "last published by".

shortname (solr.StrField, stored)

The short name.

shortname.analyzed (solr.TextField, analyzed, multivalued)

The short name in analyzed form.

site (solr.StrField, stored)

The Node identifier of the sv:site.

size (solr.TrieLongField, stored)

The size - number of bytes of the resource.

summary (solr.TextField, stored, multivalued)

The text content in stored form. Typically used for highlighting and debug. Also see content.analyzed for the text content in analyzed (i.e. searchable) form. This field uses Stopwords

svtype (solr.StrField, stored, multivalued)

Type of resource (site, page, article, structurepage, grouppage, image, file, fileresource, groupresource). Typically used in filter query to limit hits to specific "types".

svtypeval (solr.TrieFloatField, stored)

Numerical representation of type (3.0, 4.0, 5.0, 6.0). Typically used for "boosting" by type.

tags (solr.StrField, stored, multivalued)

The tags (names of sv:tag).

tags.analyzed (solr.TextField, analyzed, multivalued)

The tags in analyzed form.

template (solr.StrField, stored)

The Node identifier of the sv:template.

title (solr.StrField, stored)

The title. The name if no title is present.

title.analyzed (solr.TextField, analyzed)

The title in analyzed form.

title.stemmed.analyzed (solr,TextField, stemmed)

The title in stemmed form (see title.analyzed for non-stemmed). This field uses Stopwords

title.sortable (solr.CollationField)

The title in sortable form.

translationtags (solr.StrField, stored, multivalued) [@since 2024.01.2]

The language tags for translations of this node.

uri (solr.StrField, stored)

The URI.

url (solr.StrField, stored)

The URL.

Dynamic Fields

Sitevision metadata is handled by dynamic fields prefixed with "metadata."

All metadata definition types populates the base fields:

  • metadata.<identifier>
  • metadata.analyzed.<identifier>
  • metadata.sortable.<identifier>

Certain metadata definition types can also populate further dynamic index fields:

A Numeric metadata also populates:

  • metadata.long.<identifier>

A Date metadata also populates:

  • metadata.long.<identifier>
  • metadata.date.<identifier>

metadata.* (solr.StrField, stored, multivalued)

The metadata value as string.

metadata.analyzed.* (solr.TextField, analyzed, multivalued)

The metadata value in analyzed format. Note! This field will use the same analyzer chain as the default query field ("df") of the index, i.e. content.analyzed (hence this field also uses Stopwords).

metadata.sortable.* (solr.CollationField)

The metadata value in sortable format

metadata.long.* (solr.TrieLongField, stored)

The metadata value as number.

metadata.date.* (solr.TrieDateField, stored)

The metadata value as date.