Managed Fusion 5.9.12
Originally released on April 16, 2025 and re-released on May 14, 2025, this maintenance release provides support for chunking in Lucidworks AI, model hosting with Ray, and important security upgrades and bug fixes.
If you were upgraded to Managed Fusion 5.9.12 and rolled back to Managed Fusion 5.9.9, you will be upgraded to the re-released Managed Fusion 5.9.12. This ensures you benefit from critical stability, security, and compatibility improvements included in the re-released version. |
For supported Kubernetes versions and key component versions, see Platform support and component versions. |
Key highlights
Improved relevance in large documents with chunking
Fusion 5.9.12 introduces document chunking, a major advancement for search and generative AI quality and performance. Chunking breaks large documents into smaller, meaningful segments—called chunks—that are stored and indexed using Solr’s block join capabilities. Each chunk captures either lexical content (for keyword-based search) or semantic vectors (for neural search), enabling Fusion to retrieve the most relevant part of a document, rather than treating the document as a single block.
This improves:
-
Search relevance: Users get results that point to the most relevant sections within large documents, not just documents that match overall.
-
Neural search precision: Vector chunks improve hybrid scoring by aligning semantic relevance with specific lexical content.
-
Scalability and maintainability: Updates or deletions are applied at the chunk level, ensuring consistency and avoiding stale or orphaned content.
-
Faceted search and UX: Results can be grouped and ranked more accurately, especially in use cases where dense documents contain multiple topics.
Document chunking is particularly valuable in knowledge management and technical content domains, where retrieving the right paragraph can be more important than retrieving the right document.
-
A new LWAI Chunker index pipeline stage uses one of the available chunking strategies (chunkers) for the specified LW AI model to provide optimized storage and retrieval. The chunkers asynchronously split the provided text in various ways such as by sentence, new line, semantics, and regular expression (regex) syntax.
-
The Chunking Neural Hybrid Query pipeline stage now detects chunked documents and retrieves the most relevant lexical and vector segments for hybrid search.
-
Updates and deletions now ensure consistent chunk synchronization to prevent orphaned data.
-
This feature includes a new Lucidworks AI Async Chunking API.
Click Get Started below to see how to enable chunking in Fusion:
Contact your Lucidworks account manager to confirm that your license includes this feature. |
Model hosting with Ray
Managed Fusion 5.9.12 introduces support for model hosting with Ray, replacing the previous Seldon-based approach. Ray offers a more scalable and efficient architecture for serving machine learning models, with native support for distributed inference, autoscaling, and streamlined deployment. This transition simplifies Managed Fusion’s AI infrastructure, enhances performance, and aligns with modern MLOps practices to make deploying and managing models faster, more reliable, and easier to monitor.
For more information, see Develop and deploy a machine learning model with Ray.
-
The Seldon vectorize stages have been renamed to Ray/Seldon Vectorize Field and Ray/Seldon Vectorize Query.
-
There are two new jobs for deploying models with Ray:
If you previously deployed a model with Seldon, you can deploy the same model with Ray. Just follow the instructions in Develop and deploy a machine learning model with Ray, and deploy the model with a different name to avoid conflicts. When you have verified that the model is working after deployment with Ray, you can delete the Seldon model using the Delete Seldon Core Model Deployment job. |
AI and machine learning features
Managed Fusion’s machine learning services now run on Python 3.10 and Java 11, bringing improved performance, security, and compatibility with the latest libraries. These upgrades enhance model execution speed, memory efficiency, and long-term support, ensuring Managed Fusion’s machine learning capabilities remain optimized for your evolving AI workloads. No configuration changes are required to take advantage of these improvements.
Seamless multi-region Solr replication with CrossDC
For clients contracted for 99.99% availability (also known as "Four 9s"), this release introduces built-in support for Apache Solr’s CrossDC (Multi-Region Solr) feature, enabling seamless replication between Solr clusters using Kafka. Previously, deploying CrossDC required manual configuration, but now it’s fully integrated into Managed Fusion’s packaging, making it more efficient to scale your deployments across distributed environments.
With this update, Managed Fusion collections are automatically configured for CrossDC replication, ensuring that all Solr objects and data remain synchronized across target clusters. This enhancement provides improved high availability, disaster recovery, and multi-region search capabilities.
Bug fixes
-
Fixed an issue where ConfigSync removed job schedules during upgrade.
In some cases, upgrading to Managed Fusion 5.9.11 could result in ConfigSync removing job schedules from the cluster without consistently reapplying them, leading to missing or disabled schedules post-upgrade. Managed Fusion 5.9.12 resolves this issue, ensuring job schedules remain intact across upgrades and ConfigSync behaves predictably in all environments.
-
Fixed incorrect image repository for Solr in Helm charts.
In Managed Fusion 5.9.11, the Helm chart specified an internal Lucidworks Artifactory repository for the Solr image. This has been corrected in 5.9.12 so the Solr image repository is either empty or points to
lucidworks/fusion-solr
, aligning with other components and simplifying deployment for external environments. -
Added support for configuring the Spark version used by Fusion.
Managed Fusion 5.9.12 now lets your choose whether to use Spark 3.4.1 (introduced in Fusion 5.9.10) or the earlier Spark 3.2.2 version used in Fusion 5.9.9. This flexibility helps maintain compatibility with legacy Python (3.7.3) and Scala environments, especially for apps that depend on specific Spark runtime behaviors.
When Spark 3.4.1 is enabled, custom Python jobs require Python 3.10. Contact Lucidworks to request Spark 3.2.2 for your Managed Fusion deployment.
-
Fixed incorrect
started-by
values for datasource jobs in the job history.In previous versions, datasource jobs started from the Managed Fusion UI were incorrectly shown as started by
default-subject
instead of the actual user. Managed Fusion now correctly records and displays the initiating user in the job history, restoring accurate audit information for datasource operations. -
Fixed a schema loading issue that prevented older apps from working with the Schema API.
Managed Fusion now correctly handles both
managed-schema
andmanaged-schema.xml
files when reading Solr config sets, ensuring backward compatibility with apps created before the move to template-based config sets. This prevents Schema API failures caused by unhandled exceptions during schema file lookup. -
Scheduled jobs now correctly trigger dependent jobs.
In Managed Fusion 5.9.12, we fixed an issue that prevented scheduled jobs from triggering other jobs based on their success or failure status. This includes jobs configured to run “on_success_or_failure” or using the “Start + Interval” option. With this fix, dependent jobs now execute as expected, restoring reliable job chaining and scheduling workflows.
-
Fixed an issue that prevented updates to existing scheduled job triggers in the Schedulers view.
This bug was caused by inconsistencies in how the API returned UTC timestamps, particularly for times after 12:00 UTC. The Admin UI now correctly detects changes and allows updates to trigger times without requiring the entry to be deleted and recreated.
-
Improved reliability of scheduled jobs in the
job-config
service.This release resolves several issues that could interfere with job scheduling and history visibility in Managed Fusion environments:
-
Stronger recovery from infrastructure interruptions: Ensures the scheduler recovers if all
job-config
pods briefly lose connection to ZooKeeper. -
Correct permission handling: Fixes cases where jobs could not be scheduled due to mismatches between user permissions and service account behavior.
-
Restored visibility of system job history: Fixes an issue where system jobs such as delete-old-system-logs and delete-old-job-history were missing from the UI despite running normally in the background.
-
Reliable schedule creation in all app states: Fixes an issue where adding a new schedule from the Run dialog appeared to succeed but did not persist the configuration in some apps.
-
-
Fixed a simulation failure in the Index Workbench when configuring new datasources.
Managed Fusion 5.9.12 resolves an issue where Index Workbench failed to simulate results after configuring a new datasource, displaying the error “Failed to simulate results from a working pipeline.” This fix restores full functionality to the Index Workbench, allowing you to preview and configure indexing workflows in one place without switching between multiple views.
-
Fixed a bug that caused aborted jobs to appear twice in the job history.
Previously, when you manually aborted a job, it was recorded twice in the job history. This duplication has been resolved, and each aborted job now appears only once in the history log.
-
Fixed an issue that prevented segment-based rule filtering from working correctly in Commerce Studio. Managed Fusion now honors the
lw.rules.target_segment
parameter, ensuring only matching rules are triggered and improving rule targeting and safety. -
This release eliminates extra warning messages in the API Gateway related to undetermined service ports. Previously, the gateway logged repeated warnings about missing
primary-port-name
labels, even though this did not impact functionality. This fix reduces unnecessary log noise and improves the clarity of your logs.
Removals
For full details on removals, see Deprecations and Removals.
-
The Tika Server Parser is removed in this release.
Use the Tika Asynchronous Parser instead. Asynchronous Tika parsing performs parsing in the background. This allows Managed Fusion to continue indexing documents while the parser is processing others, resulting in improved indexing performance for large numbers of documents.
-
MLeap is removed from the
ml-model
service. MLeap was deprecated in Managed Fusion 5.2.0 and was no longer used by Managed Fusion.
Platform Support and Component Versions
Kubernetes platform support
Lucidworks has tested and validated support for the following Kubernetes platform and versions:
-
Google Kubernetes Engine (GKE): 1.29, 1.30, 1.31
For more information on Kubernetes version support, see the Kubernetes support policy.
Component versions
The following table details the versions of key components that may be critical to deployments and upgrades.
Component | Version |
---|---|
Solr |
fusion-solr 5.9.12 |
ZooKeeper |
3.9.1 |
Spark |
3.4.1 |
Ingress Controllers |
Nginx, Ambassador (Envoy), GKE Ingress Controller |
More information about support dates can be found at Lucidworks Fusion Product Lifecycle.