Connectors Configuration Reference

Connectors are the conduit between Fusion and your external data sources. Connectors retrieve your data and import it into Fusion Server.

Installation instructions:

Some connectors are pre-installed:

Database connectors

Name

Description

Source Compatibility

Platform Version

Security Trimming

Couchbase V2

The Couchbase V2 connector uses the Couchbase Java client to retrieve data stored in Couchbase.

Couchbase Server 6.0.3 Enterprise Edition (V2)

V2

No

JDBC

The JDBC connector fetches documents from a relational database via SQL queries. Under the hood, this connector implements the Solr DataImportHandler (DIH) plugin.

Any JDBC compliant Database

V1

No

MongoDB

Retrieve data from a MongoDB instance.

MongoDB 2.6, 3.0, 3.2, 3.4

MongoDB 2.6, 3.0, 3.2, 3.4

Fusion Server 4.2.2 and above also supports MongoDB 3.6, 4.0

V1

Filesystem connectors

Name

Description

Source Compatibility

Platform Version

Security Trimming

Box.com

The Box connector retrieves data from a Box.com cloud-based data repository. To fetch content from multiple Box users, you must create a Box app that uses OAuth 2.0 with JWT server authentication. For limited testing using a single user account, you can create a Box app that uses Standard OAuth 2.0 authentication.

N/A

V1

Yes

File Upload

The File Upload connector provides a convenient way to quickly ingest data from your local filesystem. It’s available in the Quickstart wizard in addition to the Index Workbench and the Datasources page.

N/A

V1

No

FTP

Retrieve documents using the File Transfer Protocol (FTP).

N/A

V1

No

Google Cloud

All

V2

Yes

Google Drive

The Google Drive connector is used to index the documents in a Google Drive account.

Google V3 API

V1

Yes

HDFS

Hadoop Distributed File System (HDFS). It traverses the Hadoop file system as it would a regular Unix filesystem.

2.7.1

V1

No

Local Filesystem V2

This connector traverses a network file system (NFS), where a shared drive is mounted to the same location on all hosts in the cluster that are running this connector.

All

V2

No

OneDrive

OneDrive is a file hosting service that is part of the Microsoft Office Online services. The Fusion OneDrive connector crawls a OneDrive for Business instance and retrieves data from it for indexing within Fusion.

All

V2

Yes

S3

The S3 connector can access AWS S3 buckets in native format.

All

V1

No

Windows Share SMB2/3

The Windows Share connector can access content in a Windows Share or Server Message Block (SMB 2 and 3 protocols)/Common Internet File System (CIFS) filesystem. Available for Fusion Server version 4.0.2 and later.

SMB 2, 3 protocols

V1

Yes

Hadoop cluster connectors

Name

Description

Source Compatibility

Platform Version

Security Trimming

Cloudera

The Cloudera Connector is a MapReduce-enabled crawler that is compatible with Cloudera CDH v4.x and v5.x.

4.x, 5.x

V1

No

Hortonworks

The Hortonworks Connector is a MapReduce-enabled crawler that is compatible with Hortonworks Data Platform v2.x.

2.x

V1

No

Repository connectors

Name

Description

Source Compatibility

Platform Version

Security Trimming

Active Directory Connector for ACLs V2

Connector overview:

  1. The Active Directory Connector for ACLs V2 plugin is a special connector that supports other data sources with collecting Active Control List (ACL) data.

  2. It indexes ACL information into a configured "sidecar" Solr collection, for use by other connectors.

  3. The AD for ACLs plugin connects to LDAP, AD, and Azure AD instances to add objects directly to a special collection for use in security trimming queries.

  4. V2 extends the Azure AD model from V1 to return a delta link at the end of a crawl, allowing incremental delta searches for groups and users.

N/A

V2

Yes

Alfresco

The Alfresco Connector is a crawler for the Alfresco server, which adheres to the Content Management Interoperability Services (CMIS) standard.

CMIS 1.1 compliant versions

V1

Yes

Azure

The Azure connector is used to crawl an Azure instance. Its connector type is "lucid.azure" and its plugin type is "azure".

Blob and Table storage

V1

No

Confluence

Retrieve data from the Atlassian Confluence Wiki CMS. You can configure this datasource to crawl pages, spaces, blog posts, comments, and attachments.

Confluence Server 5.5 and later

Confluence Cloud

V1

Yes

Drupal

The Drupal connector requires Drupal’s Services 7.x3.11 Module REST API. Refer to this page to install the necessary packages: www.drupal.org/node/783236

Drupal 7.x

V1

No

GitHub

The GitHub connector retrieves data from GitHub repositories using the GitHub REST API.

N/A

V1

No

JIRA

The JIRA connector retrieves data from a instance of Atlassian’s JIRA issue tracking system.

6.x, 7.x

V1

Yes

Salesforce

Salesforce REST API to extract data from a Salesforce repository via a Salesforce Connected App.

N/A

V1

Yes

ServiceNow

The ServiceNow Datasource retrieves data from ServiceNow repository via the ServiceNow REST API. ServiceNow records are stored in named tables.

N/A

V1

Yes

SharePoint V2

The SharePoint connector retrieves content and metadata from an on-premises SharePoint repository.

V2

Yes

SharePoint V1 (deprecated after 4.2.3)

The SharePoint connector retrieves content and metadata from an on-premises SharePoint repository.

2010, 2013, 2016, Online

V1

Yes

SharePoint Online V1 (deprecated after 4.2.3)

The SharePoint Online V1 connector retrieves data from cloud-based SharePoint repositories. Authentication requires a Sharepoint user who has permissions to access Sharepoint via the SOAP API. This user must be registered with the Sharepoint Online authentication server; it is not necessarily the same as the user in Active Directory or LDAP.

N/A

V1

Yes

Sitecore

This connector provides full crawl and incremental crawl support for Sitecore versions 8.x and 9.x. It indexes document content and all metadata.

8.x, 9.x

V2

No

Solr Index

A Solr connector pulls documents from an external standalone Solr instance or SolrCloud cluster using Solr’s javabin response type and streaming response parser.

All

V1

No

Zendesk

The Zendesk connector uses the Zendesk REST API to retrieve tickets and their associated comments and attachments from a Zendesk repository.

N/A

V1

Yes

Script connectors

Name

Description

Source Compatibility

Platform Version

Security Trimming

Javascript

The Javascript connector allows users to write ad-hoc document retrieval routines to fetch content from filesystems and websites.

All

V1

No

Social media connectors

Name

Description

Source Compatibility

Platform Version

Security Trimming

Jive

Retrieve content from a Jive instance.

REST API +V3.12

V1

Yes

Slack

The Slack connector is used to retrieve data from a Slack service. The connector sends requests to the Slack REST API.

All

V1

Yes

Twitter Search

The Twitter Search connector uses Twitter’s search API to query Twitter for tweets that match specific parameters. It allows querying for any keyword, location or other query terms.

All

V1

No

Twitter Stream

The Twitter Stream connector uses Twitter’s streaming API to continually index Twitter. The datasource can be configured to limit tweets or it can be run indefinitely, until Twitter cuts off your access or you stop the datasource. This connector only retrieves tweets created after the datasource has been started.

All

V1

No

Web connectors

Name

Description

Source Compatibility

Platform Version

Security Trimming

Web

The Web connector retrieves data from a Web site using HTTP and starting from a specified URL.

N/A

Yes

No