The distributed copy command, distcp, is a utility for copying large data sets between two DFS clusters. The distcp command submits a regular MapReduce job that performs a file-by-file copy.
Usage:
hadoop distcp <source> <destination>
Example:
hadoop distcp hdfs://namenode1:9820/source_dir hdfs://namenode2:9820/destination_dir
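When the same copy is scripted for several directories, it can help to assemble the source and destination URIs from their parts. The following is a minimal sketch only: build_distcp_cmd is a hypothetical helper (not part of Hadoop), and the host names, port 9820, and paths are placeholders taken from the example above. It prints the command rather than running it, so it can be checked before submitting the job.

```shell
#!/bin/sh
# Hypothetical helper (not part of Hadoop): assembles a distcp command line.
# Arguments: source NameNode host, destination NameNode host, NameNode port,
# absolute source path, absolute destination path.
# Assumes both clusters expose the NameNode on the same port.
build_distcp_cmd() {
    src_host=$1
    dst_host=$2
    port=$3
    src_path=$4
    dst_path=$5
    printf 'hadoop distcp hdfs://%s:%s%s hdfs://%s:%s%s\n' \
        "$src_host" "$port" "$src_path" \
        "$dst_host" "$port" "$dst_path"
}

# Placeholder values mirroring the example above.
cmd=$(build_distcp_cmd namenode1 namenode2 9820 /source_dir /destination_dir)

# Print the command for review; it is not executed here.
echo "$cmd"
```

Printing the command first keeps the sketch safe to run on a machine without Hadoop installed; once the output looks right, the printed line can be run as-is on a host with access to both clusters.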