Process All Pipeline Doc Fields
option, you will receive results similar to the following:"body_t":"[{\"field\":\"I’m the content from JSON!\"}]"
_lw_async_parsing_id_i
_lw_async_parsing_fail_count_i
Field | Notes |
---|---|
Tika Endpoints | List of Tika server endpoints that are in the cluster. The endpoints will be automatically updated. You do not need to specify the endpoints. |
Pipeline ID | The Pipeline ID is determined by the partial-update index pipeline you created earlier. |
Find Filter Queries | This is an optional filter to limit the documents fetched by the Tika Async parser. |
Finder batch size | This is the Solr rows batch size of the requests. |
Max bytes returned | The maximum number of characters indexed. |
Max embedded documents | This determines the amount of embedded documents that should be parsed until additional embedded documents are ignored. Use -1 for unlimited documents. |
Parse timeout | This configures the length of time for parsing in milliseconds. |
Max Parse Failure | If a document fails to parse, you can configure the reattempts up to your designated amount of times. |
Insert Batch Max Docs | This controls the batch size in number of documents before submitting to the index pipeline. |
Insert Batch Max Bytes | This controls the batch size number of bytes contained in the documents before submitting to the index pipeline. |
$FUSION_HOME/var/log/tika-server/tika-async.log