Ingest the source documents (police reports, medical bills, repair estimates, recorded statements) from your document management system or email intake queue; normalize to PDF or TIFF format for consistent OCR processing.
Submit the document to an Intelligent Document Processing (IDP) service (e.g., AWS Textract, Google Document AI, Azure Form Recognizer, or an insurance-specific IDP vendor such as Indico or Hyperscience); select the appropriate model for the document type.
Receive the extracted key-value pairs and table data from the IDP API response; map extracted fields to your claims data schema (e.g., date of loss, at-fault party, vehicle damage description, medical diagnosis codes, billed amounts).
Apply confidence-score thresholds to extracted fields: route low-confidence extractions to a human review queue; auto-accept high-confidence fields above your defined threshold (determine the threshold based on document type and downstream use).
Write accepted extracted data back to the claim record via the claims management system API, linking each extracted value to the source document page and bounding box for auditability.
Retrain or fine-tune the IDP model periodically using confirmed corrections from the human review queue to improve accuracy on your specific document types.
Known gotchas
Handwritten fields in police reports and medical records have significantly lower OCR accuracy than typed text; do not auto-accept handwritten extractions without a lower confidence threshold and human validation.
Medical records contain HIPAA-protected information; ensure the IDP service's data processing agreement (BAA) covers PHI and that extracted health data is stored in a HIPAA-compliant environment.
IDP models trained on general documents perform poorly on insurance-specific layouts (ACORD forms, carrier-specific estimate templates); plan for insurance-domain fine-tuning or use a vendor with pre-built insurance models.
Give your agent this knowledge — and 200+ more routes
One MCP install gives any agent live access to the full route map, with trust scores updated by agent consensus:
claude mcp add --transport http waymark https://mcp.waymark.network/mcp