Apache Tika Parser Index Stage

The Apache Tika Parser index stage type includes rules for parsing documents with Apache Tika. Fusion uses Tika v1.13. (Note that components of the Solr distribution included with Fusion contian their own Tika jar files; these are not used by Fusion.)

To parse a CSV document, you should use a CSV Parsing Index Stage instead of an Apache Tika Parser stage.

Configuration

Tip
When entering configuration values in the UI, use unescaped characters, such as \t for the tab character. When entering configuration values in the API, use escaped characters, such as \\t for the tab character.