Serving Frameworks
Tools for deploying and serving ML models in production:
- TorchServe: serving for PyTorch models
- TensorFlow Serving: serving for TensorFlow models
- Triton Inference Server: NVIDIA's multi-framework inference server
- BentoML: framework-agnostic, packages models into containerized services
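
Whichever framework is used, the deployed model typically sits behind an HTTP or gRPC prediction endpoint. As a minimal sketch, the snippet below calls a TensorFlow Serving REST endpoint; the model name `my_model`, the host/port, and the input values are placeholder assumptions, not part of this note.

```python
# Minimal client sketch for a TensorFlow Serving REST endpoint.
# Assumes a model named "my_model" is already being served on localhost:8501
# (TensorFlow Serving's default REST port); names and inputs are placeholders.
import requests

SERVER_URL = "http://localhost:8501/v1/models/my_model:predict"

# TensorFlow Serving's REST predict API takes a JSON body with an
# "instances" list, one entry per example.
payload = {"instances": [[1.0, 2.0, 3.0, 4.0]]}

response = requests.post(SERVER_URL, json=payload, timeout=10)
response.raise_for_status()

# The response JSON contains a "predictions" list aligned with "instances".
print(response.json()["predictions"])
```

TorchServe and Triton expose similar REST/gRPC prediction APIs, so the same client pattern carries over with different URLs and payload formats.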
Related
- Real-Time Inference (what these frameworks enable)
- Model Optimization (optimize before serving)