An open-source framework designed to store and process large datasets across clusters of computers.
Hadoop Distributed File System (HDFS)
- A storage system within the Hadoop framework that partitions and distributes files across multiple nodes, facilitating parallel data access and fault tolerance.