Similar queries based on documents clicked | Similar queries based on co-occurrence in sessions |
---|---|
Queries are considered for recommendation if two queries have similar sets of document IDs clicked according to the signals data. This is directly implemented from the similar queries portion of the Synonym Detection job. This approach can work on both raw and aggregated signals. | Queries are considered for recommendation if two queries have co-occurred in the same session based on the assumption that users search for similar items in a single search session (this may or may not hold true depending on the data). This approach, based on session/user IDs, needs raw signals to work. |
Query-to-Query Session-Based Similarity job dataflow
COLLECTION_NAME_query_recs
) and a dedicated collection and pipeline are created when you enable recommendations for a collection.
At a minimum, you must configure these:
COLLECTION_NAME_signals
solr
query_s
doc_id_s
ps3$
becomes ps3
.(playstation 3, playstation console)
is boosted with a similarity score of 1, provided the minimum match is set to 1 token or 0.5.
mm
param in Solr, this defines the number/fraction of tokens that should overlap if boosting is enabled. Queries and recommendations are split by “ “ (space) and each part is considered a token. If using a less-than sign (<), it must be escaped using a backslash.mm
value is set to 1.COLLECTION_NAME_queries_query_recs
collection:
query_t
recommendation_t
similarity_d
, the similarity scoresource_s
, the approach that generated this pair, one of the following: SessionBased
or ClickedDocumentBased
query_count_l
, the number of times the query occurred in signalsrecommendation_count_l
, the number of times recommendations occurred in signalspair_count_l
, the number of instances of the pair generated in the final recommendations using either of the approachestype_s
, always set to similar_queries
COLLECTION_NAME_queries_query_recs
. is created.