Solr MoreLikeThisQuery pipeline stage configuration specifications
This stage uses the content of the current document to query for similar documents, using Solr’s MoreLikeThis component.
-
The incoming query must include an id
field in order to get recommendations from this stage. The stage returns documents similar to the one specified by this field.
-
Since these secondary queries tend to be large, this stage can impact search performance. You can improve performance by first clustering your documents, then configuring this stage to query a specific document cluster instead of all documents.
|
When entering configuration values in the UI, use unescaped characters, such as \t for the tab character. When entering configuration values in the API, use escaped characters, such as \\t for the tab character.
|
Returns results similar to a given item using Solr's MoreLikeThis component. Provide parameters for the component here. For more information, see https://cwiki.apache.org/confluence/display/solr/MoreLikeThis
skip - boolean
Set to true to skip this stage.
Default: false
label - string
A unique label for this stage.
<= 255 characters
condition - string
Define a conditional script that must result in true or false. This can be used to determine if the stage should process or not.
useQueryParser - booleanrequired
Specifies whether to use the MLT Query Parser. Note, if you choose to use this you MUST specify a document id to run the MLT Query on and a Field to run the MLT with.
Default: true
moreLikeThisFields - array[string]
Specifies the name of the field you want to run the mlt on. NOTE: If you don't supply any fields we will default to using the body field.
docId - string
Specifies the name of the id field we are finding more like this terms on
Default: id
mindf - integer
Specify the frequency at which words will be ignored which occur in at least this many docs
Default: 2
maxdf - integer
Specify the frequency at which words will be ignored which occur in more than this many docs
Default: 10000
count - integer
Specifies the number of similar documents to be returned for each result.
mintf - integer
Specify the frequency below which terms will be ignored in the source doc
minwl - integer
Sets the minimum word length for words to be recognized by the MoreLikeThis
Default: 3
maxwl - integer
Sets the maximum word length
maxqt - integer
Sets the max number of query terms that will be included in any generate query
maxntp - integer
Sets the max number of tokens to parse in each example doc that is not stored with TV support
boost - boolean
Specifies if the query will be boosted by the interesting term relevant
matchInclude - boolean
Specifies whether the response should include the matched doc
matchOffset - integer
Specifies an offset to the main query to find the doc on which the MoreLikeThis query should operate. By default it is 0
interestingTerms - string
Controls how the More Like This component presents the interesting terms. Supports 3 settings, list lists the terms, none lists no terms and details lists the terms with the boosts
Allowed values: listnonedetails