Software Engineering KB

Home

❯

03 Data Management

❯

06 Data Pipelines

❯

01 Concept

❯

Apache Hadoop

Apache Hadoop

Feb 10, 20261 min read

  • data-pipelines
  • batch
  • hadoop

Apache Hadoop

← Back to Batch Processing

Open source framework for distributed batch processing of large datasets. Core components: MapReduce (processing), HDFS (storage), and YARN (resource management).

Key Properties

  • MapReduce
  • HDFS
  • YARN

data-pipelines batch hadoop


Graph View

  • Apache Hadoop
  • Key Properties

Backlinks

  • Batch Processing
  • Apache Spark
  • HDFS
  • MapReduce (Pipelines)
  • YARN

Created with Quartz v4.5.2 © 2026

  • GitHub