Crawl a Subset of OneDrive

OneDrive is a file hosting service that is part of the Microsoft Office Online services. The Fusion OneDrive connector crawls a OneDrive for Business instance and retrieves data from it for indexing within Fusion.

You can optionally configure the OneDrive connector to crawl only the drives of users that you specify.

To limit the crawl to user-specific drives:
  1. Install the OneDrive connector.

  2. In the Fusion UI, navigate to the OneDrive connector configuration at Indexing > Datasources > Add > OneDrive.

  3. In the User principal name (UPN) filter field, list the user principal names (UPNs) of the users whose documents you want to retrieve.

    • A user’s UPN is the name they use to log in to OneDrive.

    • The User principal name (UPN) filter field is an array, so you can set multiple UPNs.
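Because the filter takes an array of UPNs, it can help to clean up the list before entering it. The following is a minimal sketch, not part of Fusion: the function name `build_upn_filter` is hypothetical, and it assumes that UPNs have the usual `user@domain` shape and can be compared case-insensitively.

```python
from typing import Iterable, List


def build_upn_filter(raw_entries: Iterable[str]) -> List[str]:
    """Return a clean, de-duplicated list of UPNs (hypothetical helper).

    Assumptions: a UPN looks like user@domain, and two UPNs that differ
    only in letter case refer to the same user.
    """
    seen = set()
    result: List[str] = []
    for entry in raw_entries:
        upn = entry.strip()
        if not upn:
            continue  # ignore blank entries
        local, sep, domain = upn.partition("@")
        if not (local and sep and domain):
            raise ValueError(f"not a UPN (expected user@domain): {upn!r}")
        key = upn.lower()
        if key not in seen:
            seen.add(key)
            result.append(upn)  # keep the first spelling encountered
    return result
```

For example, `build_upn_filter([" alice@example.com ", "ALICE@example.com", "bob@example.com"])` yields two entries, one per user, which can then be entered as separate values in the filter field.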

Troubleshooting tips:
  • Each UPN is validated by retrieving that user’s drives. If a request fails, the failure is logged in $fusion_home/var/log/connectors/connectors-rpc/connectors-rpc.log.

  • If validation fails for every configured UPN, the crawl job doesn’t start.

  • If at least one UPN is valid, then the validation succeeds and the job starts. While the job is running, requests to invalid UPNs fail and are logged.

  • If more than 10 UPNs are set, validation is skipped for performance reasons.
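
The validation rules in the tips above can be summarized in a short sketch. This is an illustration of the described behavior only, not the connector's actual code: `plan_crawl` and the `can_retrieve_drives` callback are hypothetical names, and the handling of an empty filter (crawl proceeds without validation) is an assumption.

```python
from typing import Callable, Iterable, List, Tuple

# Threshold taken from the tips above: validation is skipped above 10 UPNs.
MAX_VALIDATED_UPNS = 10


def plan_crawl(
    upns: Iterable[str],
    can_retrieve_drives: Callable[[str], bool],
) -> Tuple[bool, List[str]]:
    """Return (job_starts, invalid_upns) for a configured UPN filter.

    can_retrieve_drives is a hypothetical stand-in for the request that
    retrieves a user's drives; it returns False when that request fails.
    """
    upn_list = list(upns)
    if not upn_list:
        # Assumption: no filter configured, so there is nothing to validate.
        return True, []
    if len(upn_list) > MAX_VALIDATED_UPNS:
        # Validation is skipped, so no invalid UPNs are known up front.
        return True, []
    invalid = [upn for upn in upn_list if not can_retrieve_drives(upn)]
    # The job starts as long as at least one UPN passed validation.
    job_starts = len(invalid) < len(upn_list)
    return job_starts, invalid
```

With one valid and one invalid UPN, the sketch reports that the job starts and lists the invalid UPN, which corresponds to the entries you would then look for in connectors-rpc.log.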