Steps

Enable AQE with spark.sql.adaptive.enabled=true and confirm the Spark version supports it (AQE requires Spark 3.0+ and is enabled by default from Spark 3.2)
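Enable skew join optimization with spark.sql.adaptive.skewJoin.enabled=true and set spark.sql.adaptive.skewJoin.skewedPartitionFactor and spark.sql.adaptive.skewJoin.skewedPartitionThresholdInBytes to match the data distribution; a partition is treated as skewed only when it is larger than both the byte threshold and the factor times the median partition size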
Run the query and inspect the Spark UI's SQL tab for AQEShuffleRead nodes (named CustomShuffleReader in Spark 3.0 and 3.1); verify from the node's skewed-partition metrics that skewed partitions were split
Enable dynamic partition pruning with spark.sql.optimizer.dynamicPartitionPruning.enabled=true (the default) and confirm that the fact table scan in the physical plan lists a dynamicpruningexpression among its PartitionFilters
Compare runtime and shuffle bytes before and after AQE using the Spark UI metrics to validate the improvement
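
The settings and the skew rule from the steps above can be sketched in Python as follows. This is a minimal illustration, not a tuned configuration: the numeric values are Spark's documented defaults used as placeholders, the helper names (AQE_CONF, apply_conf, skewed_partitions) are hypothetical, and the median calculation only approximates what Spark computes internally. apply_conf assumes you pass in an existing SparkSession.

```python
from statistics import median

# Settings named in the steps. The factor and threshold are Spark's
# documented defaults, used here as placeholders; tune them to your data.
AQE_CONF = {
    "spark.sql.adaptive.enabled": "true",
    "spark.sql.adaptive.skewJoin.enabled": "true",
    "spark.sql.adaptive.skewJoin.skewedPartitionFactor": "5",
    "spark.sql.adaptive.skewJoin.skewedPartitionThresholdInBytes": "256MB",
    "spark.sql.optimizer.dynamicPartitionPruning.enabled": "true",
}


def apply_conf(spark, conf=AQE_CONF):
    """Apply each setting to an existing SparkSession via spark.conf.set."""
    for key, value in conf.items():
        spark.conf.set(key, value)


def skewed_partitions(sizes_bytes, factor=5, threshold_bytes=256 * 1024 * 1024):
    """Return the indices of partitions the skew rule would flag.

    A partition counts as skewed only if it exceeds BOTH
    factor * median size AND the absolute byte threshold.
    """
    med = median(sizes_bytes)
    return [
        i
        for i, size in enumerate(sizes_bytes)
        if size > factor * med and size > threshold_bytes
    ]
```

Feeding skewed_partitions the shuffle partition sizes observed in the Spark UI gives a quick estimate of whether a given factor and threshold would flag the partitions you expect before rerunning the query.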

Known gotchas

AQE skew join splitting applies only to shuffle-based joins: sort-merge joins in Spark 3.0 and 3.1, plus shuffled hash joins from Spark 3.2; broadcast joins are never split because they do not shuffle the large side, so very small tables should still be broadcast explicitly
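Dynamic partition pruning is injected only when the fact table is partitioned on the join key and the dimension side carries a selective filter; with the default spark.sql.optimizer.dynamicPartitionPruning.reuseBroadcastOnly=true it also requires a broadcast hash join, so a dimension table larger than the broadcast threshold (spark.sql.autoBroadcastJoinThreshold) gets no pruning filter and the full fact table is scanned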
AQE re-optimizes the query plan at runtime, so plans are not reproducible across runs; this complicates benchmarking because two identical queries may produce different plans depending on runtime statistics, so record the final plan (AdaptiveSparkPlan isFinalPlan=true) for each run rather than the initial one
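
When comparing runs, a small text check on the physical plan can flag which of the markers discussed above are present. The sketch below assumes you already have the plan as a string (for example, copied from the SQL tab or captured from explain output); plan_markers is a hypothetical helper, and the marker strings are the node and expression names as they typically appear in Spark 3.x plans, which may differ in other versions.

```python
def plan_markers(plan_text):
    """Report which AQE and DPP markers appear in a physical plan string."""
    lowered = plan_text.lower()
    return {
        # True once AQE has finished re-optimizing the plan.
        "final_plan": "isfinalplan=true" in lowered,
        # AQEShuffleRead in Spark 3.2+, CustomShuffleReader in 3.0 and 3.1.
        "aqe_shuffle_read": (
            "aqeshuffleread" in lowered or "customshufflereader" in lowered
        ),
        # Present in the scan's PartitionFilters when DPP was injected.
        "dynamic_pruning": "dynamicpruningexpression" in lowered,
    }
```

Storing this dictionary alongside each benchmark run makes it easier to tell whether a runtime difference comes from a plan change rather than from noise.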

Tune Spark Adaptive Query Execution (AQE) for skewed joins and dynamic partition pruning