With pig, we are not taking data anywhere. The data that we have is in hdfs, the processing that is taking place is happening across the hadoop cluster itself with the help of map reduce. Pig is not a processing place. It is just doing the work of semantics that is the pig compiler's duty is to transform a pig latin program into a series of Java Map Reduce task. Pig helps by just allowing the user to focus on semantics.