Why is using a Combiner function not preferred in shuffling?

Although a combiner function reduces the amount of data that is shuffled, it is not the preferred method, and merging keys is used instead. My question is: why is it not preferred?

Jan 23, 2019 in Big Data Hadoop by slayer
• 29,370 points • 874 views

1 answer to this question.

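A Combiner function is not preferred because you cannot control how many times it runs, or whether it runs at all: the framework may invoke it zero, one, or several times on a mapper's output. It can therefore only be treated as an optional optimization, never as a step the job's correctness depends on.
The default Shuffle & Sort mechanism, which sorts the keys (alphabetically for text keys) and distributes them to reducers by hash, is always applied and is therefore preferred over relying on a Combiner function.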
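
To make the "no control over how many times" point concrete, here is a minimal sketch in plain Python. It is a simulation, not Hadoop code: `run_job`, `combine_sum`, `combine_mean`, and the sample data are all hypothetical names invented for this illustration. It shows that a sum combiner gives the same result however many times it is applied, while a mean combiner does not, which is why a job cannot depend on the Combiner running.

```python
from collections import defaultdict


def combine_sum(pairs):
    """Sum the values of each key (usable as combiner and as reducer)."""
    totals = defaultdict(int)
    for key, value in pairs:
        totals[key] += value
    return sorted(totals.items())


def combine_mean(pairs):
    """Average the values of each key (NOT safe to use as a combiner)."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return sorted((key, sum(vals) / len(vals)) for key, vals in groups.items())


def run_job(map_outputs, combiner, reducer, combiner_passes):
    """Simulate one job (hypothetical helper, not a Hadoop API).

    map_outputs     -- one list of (key, value) pairs per mapper
    combiner_passes -- how many times the "framework" chooses to run
                       the combiner on each mapper's output (0 = never)
    """
    shuffled = []
    for output in map_outputs:
        for _ in range(combiner_passes):
            output = combiner(output)
        shuffled.extend(output)
    # Stand-in for shuffle & sort: bring equal keys together, in key order.
    shuffled.sort(key=lambda pair: pair[0])
    return dict(reducer(shuffled))


# Assumed sample data: the output of two mappers.
MAP_OUTPUTS = [
    [("a", 1), ("a", 2), ("b", 10)],
    [("a", 6), ("b", 20)],
]

if __name__ == "__main__":
    for passes in (0, 1, 3):
        print("sum, passes =", passes,
              run_job(MAP_OUTPUTS, combine_sum, combine_sum, passes))
    for passes in (0, 1):
        print("mean, passes =", passes,
              run_job(MAP_OUTPUTS, combine_mean, combine_mean, passes))
```

With the sum, every number of passes yields the same totals. With the mean, skipping the combiner gives 3.0 for key `a`, but running it once gives 3.75, because an average of averages is not the overall average. Since the number of passes is not under the job author's control, only the first kind of function is safe as a Combiner.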

answered Jan 23, 2019 by Omkar
• 69,220 points

