Lucidworks AI
Configure a Lucidworks AI Gateway integration
account.yaml.
The file for a single integration should look similar to this one:
PUT,POST,GET:/LWAI-ACCOUNT-NAME/** where LWAI-ACCOUNT-NAME must match the value of fusion.lwai.account[n].name in the integration YAML.
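As an illustration of the naming rule, the following minimal sketch builds the permission string from an account name and checks that a permission targets the same name as the integration YAML. The function names and the sample account name are hypothetical; only the permission pattern itself comes from the text above.

```python
def lwai_permission(account_name: str) -> str:
    """Build the Fusion permission string for a Lucidworks AI account name."""
    if not account_name:
        raise ValueError("account name must not be empty")
    return f"PUT,POST,GET:/{account_name}/**"


def permission_matches(permission: str, yaml_account_name: str) -> bool:
    """Return True when the permission targets the account named in the YAML."""
    return permission == lwai_permission(yaml_account_name)


# Hypothetical account name standing in for fusion.lwai.account[n].name.
example_permission = lwai_permission("my-lwai-account")
```

A permission built for a different account name fails the check, which mirrors the requirement that the two names match exactly.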
Configure the LWAI Prediction index stage
This stage requires the PUT,POST,GET:/LWAI-ACCOUNT-NAME/** permission in Fusion, where LWAI-ACCOUNT-NAME is the Lucidworks AI API Account Name defined in Lucidworks AI Gateway when this stage is configured.
To configure this stage:
PUT,POST,GET:/LWAI-ACCOUNT-NAME/**
Your Fusion account name must match the name of the account that you selected in the Account Name dropdown.
For more information about models, see:
{destination name}_t is the full response.
_lw_ai_properties_ss contains the Lucidworks account, boolean setting for async, use case, input for the call, and the collection.
_lw_ai_request_count is the number of GET requests by predictionId, and _lw_ai_success_count is the number of responses without errors. These two fields are used for debugging only. Depending on the deployment, the most useful measure is the ratio of _lw_ai_success_count to _lw_ai_request_count. Adjust the deployment as much as possible to bring that ratio to 1.0.
enriched_ss contains the use case. This can be used as a boolean value to verify if the use case indexed successfully.
{destination name}_t is the full response.
_lw_ai_{use case}_t is the response.response object, which is the raw model output.
_lw_ai_{use case}_response_s is the full response.
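The debugging ratio described above can be computed per document with a few lines of code. This is a minimal sketch that assumes the indexed document is available as a plain dictionary; the function name and the sample values are hypothetical.

```python
from typing import Optional


def success_ratio(doc: dict) -> Optional[float]:
    """Return _lw_ai_success_count / _lw_ai_request_count, or None if no requests."""
    requests = doc.get("_lw_ai_request_count", 0)
    successes = doc.get("_lw_ai_success_count", 0)
    if not requests:
        # No GET requests were recorded, so the ratio is undefined.
        return None
    return successes / requests


# Hypothetical document: 4 GET requests, 3 responses without errors.
sample_doc = {"_lw_ai_request_count": 4, "_lw_ai_success_count": 3}
ratio = success_ratio(sample_doc)
```

A ratio below 1.0 means some requests for the prediction returned errors.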
The useCaseConfig parameter is only applicable to certain use cases.
useCaseConfig information for each applicable use case is described in Async Prediction API.
useCaseConfig information for each applicable use case is described in Prediction API.
The modelConfig parameters are common to generative AI use cases.
modelConfig information is described in Async Prediction API.
modelConfig information is described in Prediction API.
"apiKey" is the value in the model’s "[OPENAI_API_KEY]" field. For more information, see Authentication API keys.
"apiKey" is either of the values generated by Azure in the model’s "[KEY1 or KEY2]" field. For requirements to use Azure models, see Generative AI models.
"apiKey" is the value in the model’s "[BASE64_ENCODED_GOOGLE_SERVICE_ACCOUNT_KEY]" field. For more information, see Create and delete Google service account keys.
Remove the Apache Tika stage from your parser because it can cause datasource failures with the following error: “The following components failed: [class com.lucidworks.connectors.service.components.job.processor.DefaultDataProcessor : Only Tika Container parser can support Async Parsing.]”
Replace the Solr Indexer stage with the Solr Partial Update Indexer stage with the following settings:
Enable Concurrency Control set to off
Reject Update if Solr Document is not Present set to off
Process All Pipeline Doc Fields set to on
Allow reserved fields set to on
Update Type, Field Name & Value in Updates
Configure the LWAI Prediction query stage
This stage requires the PUT,POST,GET:/LWAI-ACCOUNT-NAME/** permission in Fusion, where LWAI-ACCOUNT-NAME is the Lucidworks AI API Account Name defined in Lucidworks AI Gateway when this stage is configured.
To configure this stage:
signals or access_control.
PUT,POST,GET:/LWAI-ACCOUNT-NAME/**
Your Fusion account name must match the name of the account that you selected in the Account Name dropdown.
For more information about models, see:
{destination name}_t is the full response.
_lw_ai_properties_ss contains the Lucidworks account, boolean setting for async, use case, input for the call, and the collection.
_lw_ai_{use case}_t is the response.response object, which is the raw model output.
_lw_ai_{use case}_response_s is the full response.
The body and source fields are required.
body - description_t is the contents of the document.
source - link_t is the URL/ID of the document.
The title and date fields are optional.
title - title_t is the title of the document.
date - _lw_file_modified_tdt is the creation date of the document in epoch time format.
The useCaseConfig parameter is only applicable to certain use cases. For more information, see the Async Prediction API and the Prediction API.
The memoryUuid parameter is required in the Standalone Query Rewriter use case, and is optional in the RAG use case.
The modelConfig parameters are common to generative AI use cases. For more information, see Prediction API.
"apiKey" is the value in the model’s "[OPENAI_API_KEY]" field. For more information, see Authentication API keys.
"apiKey" is either of the values generated by Azure in the model’s "[KEY1 or KEY2]" field. For requirements to use Azure models, see Generative AI models.
"apiKey" is the value in the model’s "[BASE64_ENCODED_GOOGLE_SERVICE_ACCOUNT_KEY]" field. For more information, see Create and delete Google service account keys.
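The mapping between the body, source, title, and date keys and their document fields, described earlier in this section, can be sketched as a small lookup. This is only an illustration of the required/optional rule, not the stage's actual implementation; the function name, the dictionary-based document, and the sample values are assumptions.

```python
# Mapping from the key used by the stage to the document field named in the text.
FIELD_MAP = {
    "body": "description_t",
    "source": "link_t",
    "title": "title_t",
    "date": "_lw_file_modified_tdt",
}
REQUIRED_KEYS = ("body", "source")


def map_document(solr_doc: dict) -> dict:
    """Pick the mapped fields from a document, enforcing the required ones."""
    mapped = {}
    for key, solr_field in FIELD_MAP.items():
        value = solr_doc.get(solr_field)
        if value is None:
            if key in REQUIRED_KEYS:
                raise ValueError(f"missing required field {solr_field!r} for {key!r}")
            continue  # optional keys (title, date) are simply skipped
        mapped[key] = value
    return mapped


# Hypothetical document with only the required fields and a title.
sample = map_document({
    "description_t": "Document contents",
    "link_t": "https://example.com/doc/1",
    "title_t": "Example title",
})
```

Here the optional date key is omitted from the result because the sample document has no _lw_file_modified_tdt value, while a document missing description_t or link_t raises an error.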
Configure the LWAI Vectorize pipeline
When this option is selected, the phone field is indexed into both the phone_s single-valued field and the phone_ss multi-valued field. If this option is not selected, the phone field is indexed into only the phone_s single-valued field.
When this option is selected, the name text field with a value of John Smith is indexed into both the name_t and name_s fields, allowing relevant search using the name_t field (by matching to a Smith query) and also proper faceting and sorting using the name_s field (using John Smith for sorting or faceting). If this option is not selected, the name text field is indexed into only the name_t text field by default.
{Destination Field}_b is a boolean value that indicates whether the vector has been indexed.
{Destination Field} is the vector field.
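The phone and name examples above follow a simple suffix pattern, which the following sketch makes explicit. The function names and boolean flags are hypothetical stand-ins for the two options; the sketch only reproduces the field names given in the text and is not how Fusion implements the mapping.

```python
from typing import List


def string_field_targets(field: str, also_multi_valued: bool) -> List[str]:
    """Fields a string value is indexed into, per the phone example."""
    targets = [f"{field}_s"]  # single-valued field is always written
    if also_multi_valued:
        targets.append(f"{field}_ss")  # multi-valued copy when the option is on
    return targets


def text_field_targets(field: str, also_string: bool) -> List[str]:
    """Fields a text value is indexed into, per the name example."""
    targets = [f"{field}_t"]  # text field used for relevant search
    if also_string:
        targets.append(f"{field}_s")  # string copy for faceting and sorting
    return targets


phone_targets = string_field_targets("phone", also_multi_valued=True)
name_targets = text_field_targets("name", also_string=True)
```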
The useCaseConfig parameter that is common to embedding use cases is dataType, but each use case may have other parameters. The value for the query stage is query.
Adds the _lw_fields_ss multi-valued field to the document, which lists all fields that are being sent to Solr.
Allows commit=true and optimize=true to be passed to Solr when specified as request parameters coming into this pipeline. Document commands that specify commit or optimize are still respected even if this checkbox is not selected.