Which one is better MR or Spark?

0 votes
There is a Mapper only job which takes data from a source and with minimal processing it puts in HDFS. Will the same job give better performance in spark? If it depends on the size of source data please explain to me about things where spark is better and where map reduce is better.
Jul 15 in Big Data Hadoop by Lohit
55 views

1 answer to this question.

0 votes

imageThe above difference clearly points out that Apache Spark is way better than Hadoop MapReduce or in other words, more suitable for the real-time analytics. However it would be interesting to know that what makes Spark better than MapReduce. But before that you should what exactly these technologies are. Read below-

MapReduce is a methodology for processing huge amounts of data in a parallel and distributed setting. The two tasks that are undertaken in the MapReduce programming are the Mapper and the Reducer. Mapper takes up the job of sorting the data that is available and the Reducer is entrusted with the task of combining the data and converting it into smaller chunks. MapReduce along with HDFS and YARN are the three important components of Hadoop systems.Spark is a new and rapidly growing open source technology that works very well on cluster of computer nodes. Speed is one of the hallmarks of Apache Spark. The developers working in this environment get an application programming interface that is based on the framework of RDD (Resilient Distributed Dataset). RDD is nothing but the abstraction provided by Spark that lets you segregate nodes into smaller divisions on the cluster in order to independently process the data.

answered Jul 15 by Daniel

Related Questions In Big Data Hadoop

0 votes
1 answer

Which is better Neo4j or HBASE ?

For your requirement, I would recommend you ...READ MORE

answered Apr 26, 2018 in Big Data Hadoop by nitinrawat895
• 10,800 points
72 views
0 votes
1 answer

Which is better to create a Hadoop Job? MapRed or MapReduce package?

There is no much difference between the ...READ MORE

answered May 13 in Big Data Hadoop by ravikiran
• 4,580 points

edited May 14 by Omkar 71 views
0 votes
1 answer

Which is helpful SVN or Git for Hadoop Development?

The official location for Hadoop is the ...READ MORE

answered Dec 7, 2018 in Big Data Hadoop by Frankie
• 9,810 points
129 views
0 votes
1 answer

Which side join is taken by default by hive? Map-side or Reduce-side?

The syntax for Map-side join and Reduce-side ...READ MORE

answered Dec 13, 2018 in Big Data Hadoop by Omkar
• 68,480 points
129 views
0 votes
1 answer

Hadoop Mapreduce word count Program

Firstly you need to understand the concept ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 10,800 points
3,573 views
0 votes
1 answer

hadoop.mapred vs hadoop.mapreduce?

org.apache.hadoop.mapred is the Old API  org.apache.hadoop.mapreduce is the ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 10,800 points
453 views
+1 vote
11 answers

hadoop fs -put command?

put syntax: put <localSrc> <dest> copy syntax: copyFr ...READ MORE

answered Dec 7, 2018 in Big Data Hadoop by Aditya
18,417 views
0 votes
1 answer

Hadoop dfs -ls command?

In your case there is no difference ...READ MORE

answered Mar 16, 2018 in Big Data Hadoop by kurt_cobain
• 9,280 points
1,332 views
0 votes
1 answer

which one is better bigdata or devops for career

I think Big Data will be good ...READ MORE

answered Aug 26 in Big Data Hadoop by Karan
143 views
0 votes
2 answers