Skip to main content

Documentation Index

Fetch the complete documentation index at: https://doc.lucidworks.com/llms.txt

Use this file to discover all available pages before exploring further.

This parser breaks down incoming CSV files into the most efficient components for Fusion to index. It produces one new document per row from the CSV input, excluding comment rows and header rows. If your CSV file contains a column named id, this column is consumed to populate the document’s unique identifier (Solr’s uniqueKey) and is not available as a stored field. This behavior occurs because the default value of the parser’s Document ID Source Field parameter is also id. When a CSV column matches this parameter:
  • The column’s value is used to generate the document ID.
  • The column does not appear in the indexed document as a field.
If you need to preserve your id column data as a regular field, use one of these options:
  • Change the column header from id to another name such as record_id or item_id. This is the simplest solution.
  • In the CSV parser stage configuration, set the Prefix parsed fields with parameter to a value such as csv_. This makes the id column appear as csv_id in your indexed documents.
  • In the Index Workbench’s parser configuration, set the Document ID Source Field to a different column name. This allows id to be treated as a normal field, but you must specify a different column to use as the document identifier.
See Parsers Overview for information about configuring the Document ID Source Field parameter.
When entering configuration values in the UI, use unescaped characters, such as \t for the tab character. When entering configuration values in the API, use escaped characters, such as \\t for the tab character.