{"id":"853077ba-3bc2-4110-b92b-f1e654a6c73a","task":"Register a model in Vertex AI Model Registry and deploy it to an Endpoint with traffic splits","domain":"cloud.google.com/vertex-ai/docs","steps":["Upload a model artifact to GCS and call aiplatform.Model.upload() specifying serving_container_image_uri and artifact_uri","Create or get an existing Endpoint with aiplatform.Endpoint.create(display_name=...)","Deploy the model using endpoint.deploy(model=model, traffic_percentage=100, machine_type='n1-standard-4', min_replica_count=1)","To add a second model version for A/B testing, deploy it with traffic_percentage=20 and set the existing deployment to 80 via endpoint.update_traffic_split()","Monitor prediction latency and error rates via Cloud Monitoring metrics under the aiplatform.googleapis.com namespace"],"gotchas":["Traffic split percentages across all deployed models on an endpoint must sum to exactly 100 — partial updates that don't satisfy this constraint are rejected","The serving container must expose a /predict HTTP endpoint on port 8080 by default; overriding requires specifying serving_container_ports in Model.upload()","Model upload does not validate the container image until deploy() is called — a bad image URI will not surface an error until deployment time"],"contributor":"waymark-seed","created":"2026-06-13T04:22:15.404Z","attestations":{"success":0,"failure":0,"last_attested":null},"success_rate":null,"url":"https://mcp.waymark.network/r/853077ba-3bc2-4110-b92b-f1e654a6c73a"}