- Audit all Spark jobs for natural key support.
- Audit SQL Aggregation jobs for natural key usage.
- For the BPR Recommender job, audit the generated aggregation SQL to ensure that it uses a natural key projection.
- Support partitioning in all Spark jobs, as specified by the config options.
- Support external data sources for all jobs (Spark, NLP, Clustering, Recommender), including external Spark source support for NLP.
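
As a rough illustration of the SQL audit items above, the check below tests whether a generated aggregation query projects every natural key column. It is a minimal sketch, not the jobs' actual tooling: the function name `projects_natural_key`, the column names, and the sample query are all hypothetical, and the parsing only handles a single-level `SELECT ... FROM` whose select items contain no commas inside function calls.

```python
import re
from typing import Sequence


def projects_natural_key(sql: str, key_columns: Sequence[str]) -> bool:
    """Return True if every key column appears in the SELECT list.

    Hypothetical helper: assumes one simple SELECT ... FROM with no
    subqueries or comma-containing expressions in the projection.
    """
    match = re.search(r"\bselect\b(.*?)\bfrom\b", sql,
                      flags=re.IGNORECASE | re.DOTALL)
    if match is None:
        return False

    projected = set()
    for item in match.group(1).split(","):
        tokens = item.strip().split()
        if not tokens:
            continue
        # The output name is the alias after AS, or the bare column itself;
        # either way it is the last token. Drop any table qualifier.
        name = tokens[-1].split(".")[-1].strip('`"').lower()
        projected.add(name)

    return all(col.lower() in projected for col in key_columns)


# Hypothetical generated aggregation SQL and natural key columns.
generated_sql = (
    "SELECT user_id, t.item_id, SUM(weight) AS weight "
    "FROM signals t GROUP BY user_id, t.item_id"
)
print(projects_natural_key(generated_sql, ["user_id", "item_id"]))
```

A real audit would more likely inspect the query plan or use a proper SQL parser; the string-based check is only meant to show what "using a natural key projection" could look like as a testable condition.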