Connector Configuration Reference
Configure AEM V2 Connector
`jcr:primaryType`. In this example, the AEM connector is configured to crawl the AEM Type `cq:Page`, which represents web content pages.
`jcr:primaryType` into Fusion. In this example, the value of `NY_FairHealth.pdf` is `dam:Asset`.
`jcr:(.*)`.
Setting | Notes |
---|---|
AEM URL | Required. This is the URL used to access the AEM Admin UI. |
AEM Username | Required. The user must have sufficient permissions to read content paths and, if security trimming is needed, to access the Users/Group APIs. |
AEM Password | Required. |
Page Batch Size | Number of documents to fetch per page request. A higher value can increase crawling speed but also increases memory usage. |
Thread wait (ms) | Number of milliseconds to wait between fetch requests. This property can be used to throttle a crawl if necessary. |
Paths to search | Required. |
Paths that should not be fetched | Java regex for paths that should not be fetched. |
AEM Types | Required. AEM document types (`jcr:primaryType` values) to include in the index. Examples: `cq:Page`, `dam:Asset`. |
Attachment types | File extensions to index. |
Content Property Include Regexes | A list of regex strings of content properties to include in indexed documents. Example: `jcr:.*`. |
Content Property Exclude Regexes | A list of regex strings of content properties to exclude from indexed documents. Example: `sling:.*`. |
Enable Security Trimming | Enable this setting to filter query results based on the user ID passed in with the query. |
Group Mappings | AEM user groups mapped to indexed values in the security trimming field, which are used to filter content based on the user ID passed in with the query. |
Cache Expire Time (m) | Specifies how long a query is cached, in minutes. |
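
The include and exclude regex settings in the table above can be read as a two-step filter over content property names. The Python sketch below illustrates that reading only; it is not the connector's implementation. It assumes that a pattern must match the whole property name and that an exclude match overrides an include match, neither of which is stated in this reference. The function `filter_properties` and the sample properties are hypothetical.

```python
import re


def filter_properties(props, include_patterns, exclude_patterns):
    """Keep properties whose name matches an include regex and no exclude regex.

    Assumption: whole-name matching, and exclude takes precedence over include.
    """
    includes = [re.compile(p) for p in include_patterns]
    excludes = [re.compile(p) for p in exclude_patterns]
    kept = {}
    for name, value in props.items():
        included = any(rx.fullmatch(name) for rx in includes)
        excluded = any(rx.fullmatch(name) for rx in excludes)
        if included and not excluded:
            kept[name] = value
    return kept


# Hypothetical properties of a single AEM node.
sample = {
    "jcr:primaryType": "cq:Page",
    "jcr:title": "Home",
    "sling:resourceType": "site/components/page",
    "cq:lastModified": "2020-01-01",
}

# Patterns taken from the examples in the settings table.
result = filter_properties(sample, [r"jcr:.*"], [r"sling:.*"])
print(sorted(result))  # ['jcr:primaryType', 'jcr:title']
```

For patterns this simple, Python and Java regex syntax agree; more complex expressions should be checked against Java regex rules, since the reference describes the path exclusion setting as a Java regex.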
`/bin/querybuilder.json` endpoint data is mandatory and must exist in order for a document to be indexed.
Note the list of fields that can appear in an indexed document:
Field | Source | Comments |
---|---|---|
id | <AEM_URL>/bin/querybuilder.json | Field: path |
content_txt | <AEM_URL>/bin/querybuilder.json | Whole data in text format. |
<rest fields> | <AEM_URL>/bin/querybuilder.json | All top level fields of JSON object. |
body_t | <AEM_URL>/crx/de/download.jsp | Used if path ends with one of Attachment types OR path does not end with: /jcr:content . |
body_t | <AEM_URL><id> | Used if there is no jcr data. If response status code is something other than 200, Fusion assumes there is no file to download under that path. |
body_t | [content_txt] | Defaults to [content_txt] if body_t is empty. |
parentPage | Id of document that contains attachment or link. | Populated in case of attachment/link. |
type | File extension of the path. | Populated in case of attachment/link. |
file_size | <AEM_URL>/bin/querybuilder.json | :jcr:data; used if jcr data is not empty. |
file_size | <AEM_URL>/bin/querybuilder.json | dam:size; used if jcr data is empty. |
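
The `body_t` and `file_size` rows above describe a chain of fallbacks. The Python sketch below shows one possible reading of that chain and is not the connector's code. It assumes the sources are tried in the order they appear in the table, and that a non-200 response from `<AEM_URL><id>` is represented as `None`. The functions `resolve_body` and `resolve_file_size` and all sample values are hypothetical.

```python
def resolve_body(path, attachment_types, jcr_body, direct_body, content_txt):
    """Pick the value of body_t following the table rows (assumed order).

    jcr_body:    text obtained via <AEM_URL>/crx/de/download.jsp, or None
    direct_body: text obtained via <AEM_URL><id>, or None on a non-200 response
    content_txt: the whole querybuilder record as text
    """
    lowered = path.lower()
    is_attachment = any(
        lowered.endswith("." + ext.lower().lstrip(".")) for ext in attachment_types
    )
    body = None
    # Download is attempted for attachments or paths not ending in /jcr:content.
    if is_attachment or not path.endswith("/jcr:content"):
        # Fall back to the direct URL when there is no jcr data.
        body = jcr_body if jcr_body else direct_body
    # body_t defaults to content_txt when it is still empty.
    return body if body else content_txt


def resolve_file_size(record):
    """Use :jcr:data when present, otherwise dam:size."""
    jcr_data = record.get(":jcr:data")
    if jcr_data is not None:
        return jcr_data
    return record.get("dam:size")


print(resolve_body("/content/dam/a.pdf", ["pdf"], None, "direct text", "meta"))
print(resolve_file_size({"dam:size": 512}))
```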
`cq:Page` and `cq:PageContent` then the results could include duplicated data.
Configure Remote V2 Connectors
`remote-connectors` or `admin` role.
`remote-connectors` role by default, you can create one. No API or UI permissions are required for the role.
`values.yaml` file, configure this section as needed:
`enabled` to `true` to enable the backend ingress.
`pathtype` to `Prefix` or `Exact`.
`path` to the path where the backend will be available.
`host` to the host where the backend will be available.
`ingressClassName` to one of the following:
`nginx` for Nginx Ingress Controller
`alb` for AWS Application Load Balancer (ALB)
`logging.config` property is optional. If not set, logging messages are sent to the console.
`plain-text` to `true`.
`connectors-backend` pod shuts down and is replaced by a new pod. Once the connector shuts down, connector configuration and job execution are disabled. To prevent that from happening, you should restart the connector as soon as possible.
You can use Linux scripts and utilities to restart the connector automatically, such as Monit.
`max-grpc-retries` bridge parameters.
`job-expiration-duration-seconds` parameter. The default value is `120` seconds.
`connector-plugins` entry in your `values.yaml` file: