Steps

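Assign explicit uid() strings to all stateful operators in the job graph before taking the savepoint; Flink uses these UIDs to match state between the savepoint and the new job topology
Trigger a savepoint on the running job using the Flink CLI: flink savepoint <job-id> s3://bucket/savepoints/ or via the REST API: POST /jobs/{jobId}/savepoints with the target directory in the request body (the call is asynchronous and returns a request ID that you poll for completion)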
Modify the job (schema changes, operator additions, topology refactoring) while keeping the same operator UIDs for operators whose state must be preserved
Restart the updated job from the savepoint: flink run --fromSavepoint s3://bucket/savepoints/savepoint-abc123 my-job.jar
For operators removed in the new topology, pass the --allowNonRestoredState flag (short form -n) to flink run so the restore skips their orphaned state instead of failing; that state is dropped, so use the flag only when the removal is intentional
Validate the restored job by checking operator metrics and output correctness before canceling the old job or deleting the savepoint
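
The trigger and restore steps above can be scripted. The following is a minimal sketch, not an official client: it only builds the REST request and the CLI argument list, and sends nothing. The base URL http://localhost:8081, the job ID a1b2c3, and both function names are assumptions made for illustration; the paths and flags are the ones named in the steps.

```python
import json
import urllib.request


def savepoint_trigger_request(base_url, job_id, target_dir):
    """Build (but do not send) the POST that asks Flink to take a savepoint."""
    body = json.dumps({"target-directory": target_dir}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/jobs/{job_id}/savepoints",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def restore_command(savepoint_path, jar, allow_non_restored_state=False):
    """Build the `flink run` argument list for restoring from a savepoint."""
    cmd = ["flink", "run", "--fromSavepoint", savepoint_path]
    if allow_non_restored_state:
        # Only needed when stateful operators were removed from the topology.
        cmd.append("--allowNonRestoredState")
    cmd.append(jar)  # options must come before the jar
    return cmd


# Hypothetical values for illustration only.
req = savepoint_trigger_request(
    "http://localhost:8081", "a1b2c3", "s3://bucket/savepoints/"
)
cmd = restore_command("s3://bucket/savepoints/savepoint-abc123", "my-job.jar")
```

To actually trigger the savepoint, the request would be passed to urllib.request.urlopen(req) against a reachable JobManager, and cmd could be handed to subprocess.run. Both are left out here so the sketch runs without a cluster.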

Known gotchas

Changing the serialization format of a state type (e.g., changing a POJO's fields in a way Flink's schema evolution does not support) without providing a state migration or a compatible serializer causes a deserialization failure on restore
Savepoints, unlike checkpoints, are not automatically managed or expired by Flink; you must delete old savepoints yourself to reclaim storage (for example with flink savepoint -d <savepoint-path>)
Operators without explicit uid() assignments get auto-generated UIDs based on their position in the topology; any structural change (reordering, insertion) shifts those UIDs and breaks savepoint restore
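
The UID-matching behavior behind these gotchas can be illustrated with a toy model. This is a sketch under stated assumptions, not Flink's implementation: auto_uid stands in for Flink's generated IDs by hashing an operator's path from the source, and the operator names, the dedup-v1 uid, and all function names are hypothetical.

```python
import hashlib


def auto_uid(upstream_path):
    """Toy stand-in for a generated ID: a hash of the operator's path from the source."""
    return hashlib.sha1("/".join(upstream_path).encode()).hexdigest()[:8]


def assign_uids(operators, explicit=None):
    """Map operator name -> uid, preferring explicit uids over generated ones."""
    explicit = explicit or {}
    return {
        name: explicit.get(name, auto_uid(operators[: i + 1]))
        for i, name in enumerate(operators)
    }


def restore(savepoint_state, job_uids, allow_non_restored_state=False):
    """Return the state each operator in the new job receives, matched by uid."""
    known = set(job_uids.values())
    orphaned = sorted(uid for uid in savepoint_state if uid not in known)
    if orphaned and not allow_non_restored_state:
        raise RuntimeError(f"savepoint state has no matching operator: {orphaned}")
    # Operators with no matching state start empty (None in this model).
    return {name: savepoint_state.get(uid) for name, uid in job_uids.items()}


old_ops = ["source", "dedup", "sink"]
new_ops = ["source", "filter", "dedup", "sink"]  # "filter" inserted upstream

# Without explicit uids, inserting "filter" changes dedup's generated id,
# so the saved state no longer maps to any operator and restore fails.
savepoint_auto = {assign_uids(old_ops)["dedup"]: {"seen": 42}}
try:
    restore(savepoint_auto, assign_uids(new_ops))
    auto_restore_ok = True
except RuntimeError:
    auto_restore_ok = False

# With an explicit uid, the state follows the operator across the change.
explicit = {"dedup": "dedup-v1"}
savepoint_explicit = {assign_uids(old_ops, explicit)["dedup"]: {"seen": 42}}
restored = restore(savepoint_explicit, assign_uids(new_ops, explicit))
```

In this model, passing allow_non_restored_state=True to the first restore makes it succeed, but dedup comes back with empty state. That is the trade-off the flag makes: the job starts, and the unmatched state is discarded.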

Configure Flink savepoints for stateful job upgrades and migration to a new operator topology
