How is Apache Spark different from the Hadoop approach?

Question

I'm new to this big data world. But from what I've learnt from online sources, Spark is better than Hadoop because it is fast.
How ?

BD Master · Answer

In Hadoop MapReduce the input data is on disk, you perform a&#160;map&#160;and a&#160;reduce&#160;and put the result back on disk. Apache Spark allows more complex pipelines. Maybe you need to&#160;map&#160;twice but don't need to&#160;reduce. Maybe you need to&#160;reduce&#160;then&#160;map&#160;then&#160;reduce&#160;again. The Spark API makes it very intuitive to set up very complex pipelines with dozens of steps.You could implement the same complex pipeline with MapReduce too. But then between each stage, you write to disk and read it back. Spark avoids this overhead when possible. Keeping data in-memory is one way. But very often even that is not necessary. One stage can just pass the computed data to the next stage without ever storing the whole data anywhere.This is not an option with MapReduce, because one MapReduce does not know about the next. It has to complete fully before the next one can start. That is why Spark can be more efficient for complex computation.The API, especially in Scala, is very clean too. A classical MapReduce is often a single line. It's very empowering to use.

How is Apache Spark different from the Hadoop approach

Your comment on this question:

1 answer to this question.

Your answer

Your comment on this answer:

Related Questions In Apache Spark

How is RDD in Spark different from Distributed Storage Management? Can anyone help me with this ?

What is the difference between Apache Spark SQLContext vs HiveContext?

How to save and retrieve the Spark RDD from HDFS?

How to print the contents of RDD in Apache Spark?

Hadoop Mapreduce word count Program

hadoop fs -put command?

Hadoop dfs -ls command?

I installed Spark but while executing command, I am getting ‘hadoop’ command not found error?

Is there any way to uncache RDD?

SQLInterpreter in Spark

Subscribe to our Newsletter, and get personalized recommendations.

TRENDING CERTIFICATION COURSES

TRENDING MASTERS COURSES

COMPANY

WORK WITH US

DOWNLOAD APP

CATEGORIES

CATEGORIES

TRENDING BLOG ARTICLES

TRENDING BLOG ARTICLES