Getting Data In
There are many methods for getting data into Fusion, depending on the type of data:
-
Your searchable content, signals, and logs are all ingested by indexing them in Solr.
-
connectors, machine learning models, JDBC drivers, analytics catalogs, and Banana dashboards can be uploaded to the blob store.
-
The Parallel Bulk Loader (PBL) can perform distributed reads on any data source that implements the SparkSQL (2.2.1 or later) Data Sources API.
-
Though signals are normally indexed, you can also import them.
-
In some cases, you might find that it’s best to use other ingestion methods, such as Hive, Pig, or pushing data to a REST API endpoint.
Lucidworks offers free training to help you get started with Fusion. Check out the Ingesting Data course, which focuses on understanding connectors and how to configure datasources, parsers, and index pipelines: Visit the LucidAcademy to see the full training catalog. |