hadoop

An open-source framework designed to store and process large datasets across clusters of computers.

Hadoop Distributed File System (HDFS)

  • A storage system within the Hadoop framework that partitions and distributes files across multiple nodes, facilitating parallel data access and fault tolerance.