How To
Documentation
    Learn More

      Mann-Whitney (mann_whitney)

      The mann_whitney function performs a Mann-Whitney U-test on two random samples drawn with different queries. The mann_whitney function takes four parameters:

      1. The numeric field to perform the Mann-Whitney U-test on

      2. The Lucene/Solr query for drawing the first sample

      3. The Lucene/Solr query for drawing the second sample

      4. The sample size for both samples

      Sample syntax

      select mann_whitney(sepal_width_d, "species_s:versicolor", "species_s:virginica", 150) as u,
             p_value
      from iris

      Result set

      The result set for the mann_whitney function contains one row with the results of the mann_whitney test. The mann_whitney function returns the U-statistic. The p_value field can be selected to return the p-value for the Mann-Whitney test.

      Sample result set in Apache Zeppelin

      Sample result

      Visualization

      Sample visualization of the mann_whitney function using the Apache Zeppelin number visualization:

      Sample visualization