Which one is better MR or Spark

0 votes
There is a Mapper only job which takes data from a source and with minimal processing it puts in HDFS. Will the same job give better performance in spark? If it depends on the size of source data please explain to me about things where spark is better and where map reduce is better.
Jul 15, 2019 in Big Data Hadoop by Lohit
1,852 views

1 answer to this question.

0 votes

imageThe above difference clearly points out that Apache Spark is way better than Hadoop MapReduce or in other words, more suitable for the real-time analytics. However it would be interesting to know that what makes Spark better than MapReduce. But before that you should what exactly these technologies are. Read below-

MapReduce is a methodology for processing huge amounts of data in a parallel and distributed setting. The two tasks that are undertaken in the MapReduce programming are the Mapper and the Reducer. Mapper takes up the job of sorting the data that is available and the Reducer is entrusted with the task of combining the data and converting it into smaller chunks. MapReduce along with HDFS and YARN are the three important components of Hadoop systems.Spark is a new and rapidly growing open source technology that works very well on cluster of computer nodes. Speed is one of the hallmarks of Apache Spark. The developers working in this environment get an application programming interface that is based on the framework of RDD (Resilient Distributed Dataset). RDD is nothing but the abstraction provided by Spark that lets you segregate nodes into smaller divisions on the cluster in order to independently process the data.

answered Jul 15, 2019 by Daniel

Related Questions In Big Data Hadoop

0 votes
1 answer

Which is better Neo4j or HBASE ?

For your requirement, I would recommend you ...READ MORE

answered Apr 26, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
787 views
0 votes
1 answer

Which is better to create a Hadoop Job? MapRed or MapReduce package?

There is no much difference between the ...READ MORE

answered May 13, 2019 in Big Data Hadoop by ravikiran
• 4,620 points

edited May 14, 2019 by Omkar 771 views
0 votes
1 answer

Which is helpful SVN or Git for Hadoop Development?

The official location for Hadoop is the ...READ MORE

answered Dec 7, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,190 views
0 votes
1 answer

Which side join is taken by default by hive? Map-side or Reduce-side?

The syntax for Map-side join and Reduce-side ...READ MORE

answered Dec 13, 2018 in Big Data Hadoop by Omkar
• 69,220 points
1,141 views
+1 vote
1 answer

Hadoop Mapreduce word count Program

Firstly you need to understand the concept ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 11,380 points
11,032 views
0 votes
1 answer

hadoop.mapred vs hadoop.mapreduce?

org.apache.hadoop.mapred is the Old API  org.apache.hadoop.mapreduce is the ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 11,380 points
2,540 views
+2 votes
11 answers

hadoop fs -put command?

Hi, You can create one directory in HDFS ...READ MORE

answered Mar 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
108,852 views
–1 vote
1 answer

Hadoop dfs -ls command?

In your case there is no difference ...READ MORE

answered Mar 16, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
4,615 views
0 votes
1 answer

which one is better bigdata or devops for career

I think Big Data will be good ...READ MORE

answered Aug 26, 2019 in Big Data Hadoop by Karan
2,492 views
0 votes
2 answers
webinar REGISTER FOR FREE WEBINAR X
REGISTER NOW
webinar_success Thank you for registering Join Edureka Meetup community for 100+ Free Webinars each month JOIN MEETUP GROUP