Product Selector

Fusion 5.9
    Fusion 5.9

    Improve RelevancyGetting Started with Managed Fusion

    Signals are events that can be aggregated and used for automatic boosting or recommendations, which are methods to make search results more relevant.

    As an example of boosting, the most popular search results for certain queries can be boosted so they display first (or at least nearer the top) when other users make similar queries.

    Similarly for recommendations, click events or purchase events can be collected as signals and used to display "Customers who viewed this also viewed" or "Best-selling holiday items".

    In a production environment, users' actions generate signals. For the purposes of this tutorial, you will use Query Workbench to generate click signals.

    Lucidworks offers free training to help you get started.

    The Learning Path for Refining Search Results focuses on the Fusion features that help you maximize the relevancy of your search results:

    Refining Search Results

    Visit the LucidAcademy to see the full training catalog.

    Before you begin

    To perform the steps in this part of the tutorial, you must first complete Part 1 - Create a Managed Fusion application, Part 2 - Index Data, and Part 3 - Query Data, which give you an indexed dataset that is configured for faceted search.

    Format display fields

    1. Sign in to Managed Fusion if it is not currently open.

    2. In the Managed Fusion launcher, click the Movie Search app.

    3. Click Querying Querying > Query Workbench.

    4. At the top right, select Display Fields.

    5. In the Name field, click the row to display a dropdown of possible values.

    6. Select title_txt. You can filter the list of possible values.

    7. In the Description field, select id.

    8. Click Display Fields to close the Display Fields panel.

    Enable synthetic signals

    To complete this part of the tutorial, you must generate more signals because not enough signals were created in Part 3 - Query Data. Since this example is a prototype app, users are not generating signals. To create more signals, you must enable synthetic signals in Query Workbench.

    1. At the bottom of the Query Workbench page, click Format Results.

    2. Select Show signal generators and Send click signals.

      Configure signals

    3. Click Save.

    4. Hover over one of the search results. Query Workbench displays controls that include a Simulate button next to a field that lets you specify the number of signals to simulate:

      Controls to simulate signals

    Generate signals

    With synthetic signals enabled, you will generate a simple set of signal data that you can use to generate meaningful recommendations.

    For this tutorial, you will generate signals you can use to boost your favorite sci-fi titles so that they display first.

    1. Search for star wars.

      The top results are not your favorite titles:

      Star Wars search results

    2. In Format Results, Set results per page to 20. This should allow you to scroll for this next part instead of explicitly searching for the titles.

      Next you will generate signals you can use to boost certain titles. Signals are tied to the search query, so your boosted titles will display first in the search results only when users search for star wars.

    3. Hover over "Star Wars: Episode IV - A New Hope".

    4. Set the number of signals to 4000 and click Simulate.

    5. Hover over "Star Wars: Episode V - The Empire Strikes Back".

    6. Set the number of signals to 3000 and click Simulate.

    7. Hover over "Star Wars: Episode VI - Return of the Jedi".

    8. Set the number of signals to 2000 and click Simulate.

      With synthetic signals enabled, you can also send a single signal by clicking the underlined movie title.
    9. In the upper right, click Save. The Save Pipeline window displays. By default, you will overwrite the existing query pipeline for the selected collection (in this case, Movie_Search_signals).

    10. Click Save pipeline.

    Explore the raw signals

    When you create a collection, two corresponding collections are also created automatically:

    • COLLECTION_NAME_signals for raw signals. For example, Movie_Search_signals.

    • COLLECTION_NAME_signals_aggr for aggregated signals. For example, Movie_Search_signals_aggr.

    You can use Query Workbench to explore the data in the _signals collection.

    1. In the collection picker in the upper left, select Movie_Search_signals.

    2. Click Querying Querying > Query Workbench. The signals data displays.

      Signal data

      Your signals collection contains several types of signals, such as click signals and search result signals.

    3. To display click signals, enter type:click and click Search Search.

    4. For any of the results, click show fields.

      The count_i field displays the number of click signals you generated for this event. For example, given the corresponding doc_id for Star Wars: Episode IV - A New Hope, the count_i equals 4000.

      Number of click signals for document 260

    5. Click hide fields.

    Optional: Format signals fields

    You can configure your search view to display fields that are meaningful for your investigation. For example, you can display the document ID and the number of click signals.

    1. Click Display Fields.

    2. For the Name field, select doc_id.

    3. For the Description field, select count_i.

      Doc ID and count fields for click signals

    Explore the aggregated signal data

    Aggregation jobs are created automatically when you create an app. However, you need to run the aggregation job manually:

    1. Navigate to Collections Collections > Jobs.

    2. In the job list, select Movie_Search_click_signals_aggregation.

    3. Click Run.

    4. Click Start. The job may take a few minutes to complete.

    5. When the success message displays, in the collections picker in the upper left, select Movie_Search_signals_aggr.

    6. Click Querying Querying > Query Workbench.

      The aggregated signals data displays. If the aggregated signals data does not display, wait a minute and then reload your browser, or click Search Search in Query Workbench.

      Aggregated signals

    7. Click Display Fields.

    8. In the Name field, select doc_id.

    9. In the Description field, select aggr_count_i.

    10. For the result named 1210, click show fields.

      Aggregated signal fields

      The fields for aggregated signals are very similar to the fields for raw signals, with additional fields to describe the aggregation:

      • aggr_count_i. Number of signals that have been aggregated. For example, 3000.

      • aggr_id_s. Name of the aggregation job.

      • aggr_job_id_s. Job ID.

      • aggr_type_s. Aggregation type.

    11. In the upper right, click Save.

      The Save Pipeline screen displays. By default, you will overwrite the existing query pipeline for the selected collection. For example, Movie_Search.

    12. Click Save pipeline.

    View the search results with and without default boosting

    1. In the collections picker in the upper left, select Movie_Search.

    2. Open Query Workbench. Navigate to Querying Querying > Query Workbench.

    3. Search for star wars.

      "Star Wars: Episode IV - A New Hope" is the first search result, followed by Episode V and then VI. These search results are automatically boosted by the default configuration of the Boost with Signals query pipeline stage, which boosts on the id field.

      Star wars movies boosted

    4. Click Compare. Another preview panel opens. Now the working pipeline is on the right and a static snapshot of that same pipeline is on the left.

      Compare two pipelines

      In this view, you can compare results from one query pipeline side by side with another query pipeline. In this case, you will compare results for the same pipeline (Movie_Search) with and without the Boost with Signals stage enabled.

    5. Turn off the Boost with Signals stage.

      Boosting comparison

      The search results on the right display as they did before you generated synthetic click signals. Rank-change indicators indicate which results moved up or down as a result of turning off boosting.

    6. Turn on the Boost with Signals stage again to restore the boosted results.

    7. To close the comparison preview panel, click Close Close.