/api/connectors/datasources/{id}/db
endpoints allow looking into the crawl database and dropping tables or clearing the database.
The connectors that support the crawl database are currently lucid.fs
and lucid.solrxml
. The lucid.anda
connector also uses a crawl database, but it is not the same database, and does not have a REST API or other interface to access it.
/api/connectors/datasources/<id>/db
will include several sections detailing the database structure:
counters
: The counters
section reports the document counts of database activities, such as table inserts.ops
: The ops
section reports on database operations that have occurred, such as initiating tables, retrieving items, processing items and table drops.tables
: The tables
section lists the tables in the database with a count of the number of items in each table. Inspecting the items is described in the next section./api/connectors/datasources/<id>/db/<table>
will be empty. When dropping the database, note that no documents will be removed from the index. However, the crawl database will be empty, so on the next datasource run, all documents will be treated as though they were never seen by the connectors.
When dropping tables, be aware that the items
table does not delete documents from the index, but instead changes the database so database considers them new documents. When dropping other tables, such as the errors
table, it will merely clear out old error messages.
/api/connectors/datasources/<id>/db/items/<item>
retrieves information about an item or items.
A DELETE request removes the information from the Crawl Database only. Note that this does not affect the Solr Index.
validate=true
element in the create datasource command only validates the datasource. It does not automatically save the datasource. An example using this element is:[https://FUSION_HOST:8764/api/apps/]APP_NAME/datasources?validate=true
The POST /datasources
section of the API specification allows you to set the validate
element for testing and use.