HDFS Connector and Datasource Configuration

Hadoop Distributed File System (HDFS). It traverses the Hadoop file system as it would a regular Unix filesystem.

This connector can only be used with the default Hadoop shipped with Fusion.

See also the Hadoop connector, a connector for HDFS filesystem which uses MapReduce to distribute the crawl processes. When there is a lot of content to process, the MapReduce-enabled connector will be significantly faster.