HDFS Connector and Datasource Configuration
The HDFS connector crawls a Hadoop Distributed File System (HDFS). It traverses the Hadoop file system as it would a regular Unix filesystem.
This connector can only be used with the default Hadoop shipped with Fusion.
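The traversal described above can be pictured as a recursive, depth-first directory walk. The following is a minimal sketch of that pattern against a local Unix filesystem, not the connector's actual implementation; the function name `walk_files` is hypothetical and used only for illustration.

```python
import os
from typing import Iterator


def walk_files(root: str) -> Iterator[str]:
    """Yield the path of every file under root, depth-first, in sorted order.

    Illustrative only: this walks a local directory tree. It is an assumed
    stand-in for the way a crawler might walk a hierarchical filesystem.
    """
    for entry in sorted(os.listdir(root)):
        path = os.path.join(root, entry)
        if os.path.isdir(path):
            # Descend into the subdirectory before moving to the next entry.
            yield from walk_files(path)
        else:
            # A regular file: this is where a crawler would fetch and index it.
            yield path
```

Calling `list(walk_files("/some/dir"))` returns the files in a stable order. Against HDFS, the local `os` calls would be replaced by the equivalent directory-listing calls of an HDFS client, but the shape of the walk stays the same.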
See also the Hadoop connector, a connector for the HDFS filesystem that uses MapReduce to distribute the crawl processes. When there is a lot of content to process, the MapReduce-enabled connector will be significantly faster.