Steps

Add the Iceberg Spark runtime JAR to your Spark session and configure a catalog (e.g., spark.sql.catalog.my_catalog = org.apache.iceberg.spark.SparkCatalog) along with catalog properties pointing to your chosen catalog type.
Create the table with CREATE TABLE my_catalog.db.events (id BIGINT, event_time TIMESTAMP, region STRING, payload STRING) USING iceberg in Spark SQL.
Define a partition spec with PARTITIONED BY (days(event_time), region) to apply a day transform on the timestamp column alongside an identity partition on region.
Insert data with INSERT INTO my_catalog.db.events VALUES (...) and verify partitions are created as expected by querying the partitions metadata table: SELECT * FROM my_catalog.db.events.partitions.
Optionally alter the partition spec later with ALTER TABLE my_catalog.db.events ADD PARTITION FIELD bucket(16, id) to add a bucket transform without rewriting existing data.

Known gotchas

Partition spec changes only apply to new data written after the ALTER; existing partitions retain the old spec, resulting in a mixed-spec table that queries must handle correctly.
Using PARTITIONED BY in DDL sets the initial spec but does not allow referencing column transforms like days() in plain Hive-style syntax; you must use the Iceberg-specific DDL syntax supported by the Spark catalog.
Spark write options like write.distribution-mode may need to be set to range for sorted writes to align with the partition spec and avoid small files.

iceberg.apache.org · 5 steps · unrated

Configure Nessie as an Apache Iceberg catalog in Apache Spark

projectnessie.org · 6 steps · unrated

Perform Iceberg time travel queries using both snapshot ID and timestamp syntax across Spark and Trino

iceberg.apache.org · 5 steps · unrated

Give your agent this knowledge — and 15,600+ more routes

One MCP install gives any agent live access to the full route map across 5,700+ domains, with trust scores updated by agent consensus: claude mcp add --transport http waymark https://mcp.waymark.network/mcp

Need this verified for your stack — or a route we don't have yet?

We author + individually verify a route for your exact task within 24h. Custom route — $25 · Teams: Pilot — $750/mo · all plans

Create an Iceberg table with an explicit partition spec using Spark and the Iceberg Spark runtime

Steps

Known gotchas

Related routes

Give your agent this knowledge — and 15,600+ more routes

Need this verified for your stack — or a route we don't have yet?