Batch Inference
Predictions are computed offline over batches of data, typically on a schedule, and the results are written to a database or cache. Serving then reduces to a lookup of a precomputed result, so latency is low, but predictions can be stale between batch runs.
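A minimal sketch of the pattern, assuming a periodic job: score every row in a features table and upsert the results into a predictions table that the serving layer reads. The table names, the `score` function, and the use of SQLite are illustrative stand-ins for whatever feature store, model artifact, and result cache a real system would use.

```python
import sqlite3
import time

def score(features: tuple[float, ...]) -> float:
    # Hypothetical model: a fixed linear scorer stands in for a real
    # loaded artifact (e.g. a pickled sklearn pipeline).
    weights = (0.4, -0.2, 0.1)
    return sum(w * x for w, x in zip(weights, features))

def run_batch(db_path: str = "serving.db") -> None:
    conn = sqlite3.connect(db_path)
    conn.execute("""CREATE TABLE IF NOT EXISTS features
                    (user_id TEXT PRIMARY KEY, f1 REAL, f2 REAL, f3 REAL)""")
    conn.execute("""CREATE TABLE IF NOT EXISTS predictions
                    (user_id TEXT PRIMARY KEY, score REAL, scored_at REAL)""")

    now = time.time()
    rows = conn.execute("SELECT user_id, f1, f2, f3 FROM features").fetchall()
    # Score the whole batch offline; no request is waiting on this loop.
    scored = [(uid, score((f1, f2, f3)), now) for uid, f1, f2, f3 in rows]

    # Upsert so the serving table always holds the latest run's output.
    conn.executemany(
        "INSERT OR REPLACE INTO predictions VALUES (?, ?, ?)", scored)
    conn.commit()
    conn.close()

def serve(user_id: str, db_path: str = "serving.db") -> float | None:
    # Serving is a key lookup against precomputed results: low latency,
    # but the score is only as fresh as the last batch run.
    conn = sqlite3.connect(db_path)
    row = conn.execute(
        "SELECT score FROM predictions WHERE user_id = ?", (user_id,)
    ).fetchone()
    conn.close()
    return row[0] if row else None

if __name__ == "__main__":
    run_batch()
```

In practice a scheduler (cron, Airflow, or similar) would trigger `run_batch` on a cadence, which is why this pattern pairs with pipeline orchestration.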
Related
- Real-Time Inference (online alternative)
- Pipeline Orchestration (schedules batch jobs)