This parser breaks down incoming CSV files into the most efficient components for Lucidworks Search to index. It produces one new document per row from the CSV input, excluding comment rows and header rows. If your CSV file contains a column namedDocumentation Index
Fetch the complete documentation index at: https://doc.lucidworks.com/llms.txt
Use this file to discover all available pages before exploring further.
id, this column is consumed to populate the document’s unique identifier (Solr’s uniqueKey) and is not available as a stored field.
This behavior occurs because the default value of the parser’s Document ID Source Field parameter is also id. When a CSV column matches this parameter:
- The column’s value is used to generate the document ID.
- The column does not appear in the indexed document as a field.
id column data as a regular field, use one of these options:
- Change the column header from
idto another name such asrecord_idoritem_id. This is the simplest solution. - In the CSV parser stage configuration, set the Prefix parsed fields with parameter to a value such as
csv_. This makes theidcolumn appear ascsv_idin your indexed documents. - In the Index Workbench’s parser configuration, set the Document ID Source Field to a different column name. This allows
idto be treated as a normal field, but you must specify a different column to use as the document identifier.