Real-Time Prediction Pattern
← Back to ML System Design Patterns
Model serves predictions on demand. Fresh results but latency/cost constrained. Required for search ranking, fraud detection, and interactive recommendations.
Related
- Batch Prediction Pattern (stale but fast)
- Real-Time Inference (implementation)