Two-Tower Model
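Separate encoders (towers) for the query and the item, each mapping its input into a shared embedding space; a query-item pair is scored by the dot product of the two embeddings. Because the item tower never sees the query, item embeddings can be precomputed and indexed offline, so serving only needs one query encoding plus a nearest-neighbor lookup. Used in recommendation and search at scale (YouTube, Google, Pinterest).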
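A minimal sketch of the pattern in NumPy, assuming tiny randomly initialized MLPs in place of trained towers. The names (`make_tower`, `encode`, `retrieve`), the feature sizes, and the embedding dimension are illustrative assumptions, not part of any particular production system.

```python
import numpy as np

rng = np.random.default_rng(0)
EMBED_DIM = 8  # shared embedding size for both towers (assumed value)


def make_tower(in_dim, hidden_dim, out_dim, rng):
    """Random weights for a 2-layer MLP standing in for a trained tower."""
    return {
        "w1": rng.normal(scale=0.1, size=(in_dim, hidden_dim)),
        "w2": rng.normal(scale=0.1, size=(hidden_dim, out_dim)),
    }


def encode(tower, x):
    """Map features of shape (n, in_dim) to embeddings of shape (n, out_dim)."""
    hidden = np.maximum(x @ tower["w1"], 0.0)  # ReLU
    return hidden @ tower["w2"]


# The two towers have different inputs but the same output dimension.
query_tower = make_tower(in_dim=5, hidden_dim=16, out_dim=EMBED_DIM, rng=rng)
item_tower = make_tower(in_dim=12, hidden_dim=16, out_dim=EMBED_DIM, rng=rng)

# Offline step: encode every item once and keep the embeddings.
item_features = rng.normal(size=(1000, 12))  # hypothetical item features
item_embeddings = encode(item_tower, item_features)  # shape (1000, EMBED_DIM)


def retrieve(query_features, k=5):
    """Online step: encode one query, score all items by dot product."""
    q = encode(query_tower, query_features[None, :])[0]  # shape (EMBED_DIM,)
    scores = item_embeddings @ q  # one dot product per item
    top = np.argsort(-scores)[:k]  # indices of the k highest scores
    return top, scores[top]


top_ids, top_scores = retrieve(rng.normal(size=5), k=5)
```

The brute-force scan in `retrieve` is only for illustration; with large catalogs the precomputed `item_embeddings` would typically be loaded into an approximate nearest-neighbor index instead.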
Related
- Embedding-Based Retrieval (retrieval mechanism)
- Recommendation System Design (common architecture)