MillWheel to Dataflow Mapping

Back to Data Systems Mapping

Google InternalOpen SourceGCP
SystemMillWheelApache Beam (streaming)Dataflow
ConceptStream processingStream processingManaged stream processing

MillWheel (2013 paper) was Google’s stream processing framework with exactly-once semantics and low latency. Its concepts heavily influenced Apache Beam’s streaming model and the Dataflow service on GCP. Key contributions: watermarks, windowing, and exactly-once processing guarantees.


google-internal mapping millwheel dataflow