Gazetteers and OpenNLP Tools
The following video shows how to configure a Gazetteer Lookup Extraction stage in combination with OpenNLP:Uploading Lookup Lists to Lucidworks Search Blob Store
Lucidworks Search includes a number of lookup lists in the directory athttps://EXAMPLE_COMPANY.lucidworks.cloud/data/nlp/gazetteer.
To use the supplied lists or a list of your own data, each must list be uploaded to Lucidworks Search
using the Blob Store API
in order to make the list contents available to the Gazetteer Lookup Extraction stage.
For example, to identify color names, you would first compile a list of color terms, one entry per line in a text file with suffix .lst and then
upload that file using the Lucidworks Search REST API endpoint api/blobs/<listfilename>, as per the following example which
uses the curl command-line utility, where ‘admin’ is the name of a user with admin privileges, and ‘pass’ is that user’s password:
Replace
EXAMPLE_COMPANY with the name provided by your Lucidworks representative.Name Lookup Example
Define a lookup-extractor to identify mentions of certain celebrities in text fielddescription_t: