HDFS

Back to Apache Hadoop

Hadoop Distributed File System. Distributes data across a cluster of commodity machines with replication for fault tolerance. Optimized for large sequential reads.

data-pipelines batch hadoop hdfs