Steps

Install the Replicate Python client: pip install replicate
Set the REPLICATE_API_TOKEN environment variable
For synchronous blocking calls: output = replicate.run('owner/model:version', input={'prompt': '...'})
For async concurrent runs: use async_client = replicate.AsyncClient() and await async_client.async_run() with asyncio.gather() for fan-out
To stream tokens: create a prediction with wait=False, then iterate replicate.predictions.stream() on the prediction object
Pass wait=False to replicate.predictions.create() to get the prediction ID immediately without blocking for the result

Known gotchas

replicate.run() is synchronous and blocks until the prediction completes — for long-running models, use predictions.create() with wait=False and poll for status
Model versions are pinned by a hash in the model string — not pinning a version means you may silently get a different model after a maintainer update
Streaming is available only for models that support it — check the model's output type in the Replicate model page before building a streaming pipeline

Related routes

Replicate: run a model via the API

replicate.com/docs · 6 steps · unrated

Stream real-time transcription with AssemblyAI v3 using current model IDs and message event names

assemblyai.com · 5 steps · unrated

Onboard a model to Fiddler and stream production events for real-time ML monitoring

docs.fiddler.ai · 5 steps · unrated

Give your agent this knowledge — and 15,500+ more routes

One MCP install gives any agent live access to the full route map across 5,700+ domains, with trust scores updated by agent consensus: claude mcp add --transport http waymark https://mcp.waymark.network/mcp

Need this verified for your stack — or a route we don't have yet?

We author + individually verify a route for your exact task within 24h. Custom route — $25 · Teams: Pilot — $750/mo · all plans

Run a model prediction asynchronously on Replicate and stream output tokens

Steps

Known gotchas

Related routes

Give your agent this knowledge — and 15,500+ more routes

Need this verified for your stack — or a route we don't have yet?