LWAI Vectorize Query is a Fusion query pipeline stage that generates a vector based on the current query string (q parameter).
Important: This feature is currently only available to clients who have contracted with Lucidworks for features related to Neural Hybrid Search and Lucidworks AI.
To use this stage, non-admin Fusion users must be granted the PUT,POST,GET:/LWAI-ACCOUNT-NAME/** permission in Fusion, where LWAI-ACCOUNT-NAME is the Lucidworks AI API Account Name defined in Lucidworks AI Gateway when this stage is configured.
This query stage must be placed before the Solr Query stage.

Configurable vector quantization method

In Fusion 5.9.13 and later, you can configure the vector quantization method. Quantization converts high-precision float vectors into compact 8-bit integer vectors, significantly lowering storage and compute costs. By default, no quantization is performed; you enable it by selecting a method. To select the quantization method, go to Model Configuration in the stage configuration and enter the vectorQuantizationMethod parameter with the value for the desired method:
[Image: Vector quantization method configuration in an LWAI pipeline stage]
Available methods are:
  • min-max creates tensors of embeddings and converts them to uint8 by normalizing them to the range [0, 255].
    This method loses precision when evaluated against non-quantized vectors.
    Test it against your data to see if the loss is acceptable.
  • max-scale finds the maximum absolute value along each embedding, normalizes the embeddings by scaling them to a range of -127 to 127, and returns the quantized embeddings as an 8-bit integer tensor.
    This method has no loss at the ten-thousandths place during evaluation against non-quantized vectors.
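
The two methods described above can be illustrated with a minimal pure-Python sketch. This is not the Lucidworks AI implementation: the function names are hypothetical, and the sketch assumes that the minimum, maximum, and maximum absolute value are computed per embedding and that values are rounded to the nearest integer.

```python
from typing import List


def quantize_min_max(vec: List[float]) -> List[int]:
    """Map a float vector onto unsigned 8-bit values in [0, 255].

    Assumption: min and max are taken per vector, not per batch.
    """
    lo, hi = min(vec), max(vec)
    if hi == lo:
        # Constant vector: every component maps to 0.
        return [0] * len(vec)
    scale = 255.0 / (hi - lo)
    return [int(round((x - lo) * scale)) for x in vec]


def quantize_max_scale(vec: List[float]) -> List[int]:
    """Map a float vector onto signed 8-bit values in [-127, 127].

    The vector is divided by its largest absolute value, then scaled by 127.
    """
    max_abs = max(abs(x) for x in vec)
    if max_abs == 0:
        # All-zero vector: nothing to scale.
        return [0] * len(vec)
    scale = 127.0 / max_abs
    return [int(round(x * scale)) for x in vec]


embedding = [-1.0, -0.25, 0.75, 1.0]
min_max_result = quantize_min_max(embedding)
max_scale_result = quantize_max_scale(embedding)
print(min_max_result)
print(max_scale_result)
```

In this sketch, max-scale keeps zero at zero and preserves the sign of each component, while min-max shifts the whole vector so that its smallest component becomes 0.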

Configuration

When entering configuration values in the UI, use unescaped characters, such as \t for the tab character. When entering configuration values in the API, use escaped characters, such as \\t for the tab character.
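
One way to read this rule is that the API accepts JSON, and JSON requires a backslash inside a string to be escaped. The following sketch shows that behavior using Python's standard json module. The field name "delimiter" is hypothetical and used only for illustration; it is not a documented parameter of this stage.

```python
import json

# The value as it would be typed in the UI: a backslash followed by "t".
ui_value = r"\t"

# Serializing to JSON escapes the backslash, producing \\t in the request body.
# "delimiter" is a made-up field name for this example.
api_body = json.dumps({"delimiter": ui_value})
print(api_body)

# Parsing the JSON body recovers the same two-character value.
round_trip = json.loads(api_body)["delimiter"]
```

A JSON library performs this escaping automatically; manual escaping is only needed when the request body is written by hand.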