How To
Documentation
    Learn More

      Machine Learning Jobs

      Fusion AI provides these job types to perform machine learning tasks.

      Signals analysis

      These jobs analyze a collection of signals in order to perform query rewriting, signals aggregation, or experiment analysis.

      • Ground Truth

        Estimate ground truth queries using click signals and query signals, with document relevance per query determined using a click/skip formula.

      Query rewriting

      These jobs produce data that can be used for query rewriting or to inform updates to the synonyms.txt file.

      • Head/Tail Analysis

        Perform head/tail analysis of queries from collections of raw or aggregated signals, to identify underperforming queries and the reasons. This information is valuable for improving overall conversions, Solr configurations, auto-suggest, product catalogs, and SEO/SEM strategies, in order to improve conversion rates.

      • Phrase Extraction

        Identify multi-word phrases in signals.

      • Synonym and Similar Queries Detection Jobs

        Use this job to generate pairs of synonyms and pairs of similar queries. Two words are considered potential synonyms when they are used in a similar context in similar queries.

      • Token and Phrase Spell Correction

        Detect misspellings in queries or documents using the numbers of occurrences of words and phrases.

      Signals aggregation

      Experiment analysis

      • Ranking Metrics

        Calculate relevance metrics (nDCG and so on) by replaying ground truth queries against catalog data using variants from an experiment.

      • SQL-Based Experiment Metric (deprecated)

        This job is created by an experiment in order to calculate an objective.

        SQL-Based Experiment Metric job is deprecated as of Fusion AI 4.0.2.

      Collaborative recommenders

      These jobs analyze signals and generate matrices used to provide collaborative recommendations.

      Content-based recommenders

      Content-based recommenders create matrices of similar items based on their content.

      Content analysis

      Data ingest

      • Parallel Bulk Loader

        The Parallel Bulk Loader (PBL) job enables bulk ingestion of structured and semi-structured data from big data systems, NoSQL databases, and common file formats like Parquet and Avro.

      Legacy machine learning jobs

      • Legacy Item Recommender

        Compute user recommendations based on a pre-computed item similarity model.

      • Legacy Item Similarity

        Use this job when you only want to compute item-to-item similarities. This method is more lightweight than the generic Recommendations job.

        Legacy Item Similarity job is deprecated as of Fusion AI 4.1.0. Use the ALS recommender job instead.