Fusion

Version 5.3
How To
Documentation
    Learn More

      Fusion 5.0.2 Release Notes

      Release date: 18 December 2019

      Component versions:

      Solr 8.3.1

      ZooKeeper 3.5.6

      Spark 2.4.3

      Upgrade Instructions

      To upgrade from Fusion 5.0.x to 5.0.2 using the GKE installation script (setup_f5_gke.sh), run the following command:

      ./setup_f5_gke.sh -c <cluster> -r <release> -n <namespace> -p <project> -z <zone> -i <instance_type> --upgrade

      To upgrade from Fusion 5.0.x to 5.0.2 using Helm, run the following commands:

      helm repo update
      helm upgrade ${RELEASE} "lucidworks/fusion" --timeout 180 --namespace "${NAMESPACE}" --values "${RELEASE}_${NAMESPACE}_fusion_values.yaml"

      New features

      Predictive Merchandiser

      Fusion 5.0.2 integrates Predictive Merchandiser as a component of Fusion.

      If you have a Fusion license, you can access Predictive Merchandiser by navigating to Relevance > Rules and selecting Merchandiser.

      To learn more, see Predictive Merchandiser.

      Jupyter Integration

      Starting with Fusion 5.0.2, we now provide a Jupyter service that can be run from the Fusion Helm chart. Jupyter is an open-source web application that allows you to create and share documents that contain live code, equations, visualizations, and narrative text.

      • Run/Debug Spark code in Scala/Python (replacement for spark-shell)

      • Run SQL queries via Fusion SQL

      • Debug Scala and SQL transforms in PBL jobs

      • Everything else for which Jupyter is designed

      To learn more, see Jupyter Support in Fusion.

      Improvements

      • The HTML parser now parses the application/xhtml+xml media type by default.

      • The HTML parser can filter HTML content using jsoup. New HTML parser configuration options include:

        Configuration option

        Default value

        Description

        excludeFilters

        N/A

        Jsoup-formatted selectors for elements to exclude from the HTML

        filterBeforeMapping

        False

        Apply exclude filters before performing HTML field mapping

        filterBeforeExtractingLinks

        False

        Apply exclude filters before performing link extraction

      • Index pipelines can now receive nested JSON documents through the Parallel Bulk Loader.

      • The Fusion UI now supports undo for deleted rules.

      • A new boolean property, shareState, is introduced for Javascript Query Stages. When set to true, a single JavaScript engine will be used for all queries on a Javascript query stage. Previously, a unique engine was used for each thread.

        Under certain conditions, this can improve performance significantly.

      • If a rule creation API is incorrectly formatted in JSON, Fusion now produces a detailed error message.

      • Improved the output from phrase extraction jobs. It is now more useful for non-signals datasets.

      • Improvement made in error propagation between services.

      • The Windows Share SMB 2/3 Connector now parses data into individual Solr fields instead of a large JSON document.

      • Various UI improvements and fixes.

      Bug Fixes

      • Fixed a bug that sometimes resulted in the <app>_recommender_models collection to delete the <app>_recommender_models collection.

      • The collection_alias field is now required for time-based partitioning in the Fusion SQL service.

      • The roles selector in the New User panel now correctly displays the available role types.

      • Fixed a bug that prevented the Box Connector from correctly indexing PowerPoint files.

      • Fixed a bug that prevented the Box Connector from indexing Japanese text with UTF-8 character encoding.

      • Fixed a bug that prevented V2 connector jobs from succeeding if a fetcher was defined without a phase name.

      • Fixed a bug that produced an Out of Memory (OOM) error when indexing large JSON files with a V2 connector.

      • V1 connector stateful jobs are no longer dependent on server IPs. Re-crawling after restarting V1 connectors now works as expected.

      • Fixed a bug that removed CJK characters from file names when uploading a file.

      • Fixed a bug the prevented the Jobs API from scheduling a recently created job.

      • Fixed a bug that sometimes caused DefaultHostInfoMetricReporter to fail on Java 11.

      • Boost lists will now utilize the correct term, orig_score, when rewriting a query.

      • Corrections made to the default synonym detection job settings.

      • Fixed a bug that sometimes caused pytests to fail when using the fusion user.

      • Logstash is now enabled by default.

      Known issues

      • If High performance mode is selected on the Web connector configuration, the crawl does not start.

        This bug is fixed in Fusion 5.1.0.
      • Deploying an application into the webapps service requires a non-empty, non-root context path.

        This bug is fixed in Fusion 5.1.0.