I have 10 TB data and the whole size of the combined cluster is 14 TB
I have set the Replication factor to 2.
In this case how it will replicate the data?
Due to the Replication factor, the minimum size of the storage on the cluster must be double the size of the Data,
Is this something that Hadoop can't solve, or its a loophole?