What is the difference between persist and cache in apache spark

How can we differenciate between persist() and cache() in Spark?

Jul 3, 2019 in Apache Spark by Nihal

recategorized Jul 4, 2019 by Gitika • 5,569 views

1 answer to this question.

Hi,

persist () allows the user to specify the storage level whereas cache () uses the default storage level in Spark.

answered Jul 3, 2019 by Gitika
• 65,730 points

Using cash technique we can save intermediate results in memory only.

Using persist we can save the intermediate results in five storage levels which are memory only, memory and disc, disc only, memory only serialisation and memory and disc serialisation

answered Dec 27, 2022 by Deepthi

edited Mar 5, 2025

Related Questions In Apache Spark

+1 vote

3 answers

What is the difference between rdd and dataframes in Apache Spark ?

Comparison between Spark RDD vs DataFrame 1. Release ...READ MORE

answered Aug 28, 2018 in Apache Spark by shams
• 3,670 points • 46,465 views

0 votes

1 answer

What is the difference between Apache Spark SQLContext vs HiveContext?

Spark 2.0+ Spark 2.0 provides native window functions ...READ MORE

answered May 26, 2018 in Apache Spark by nitinrawat895
• 11,380 points • 6,155 views

0 votes

1 answer

What is the difference between spark streaming and spark structured streaming?

Hi@akhtar Generally, Spark streaming is used for real time ...READ MORE

answered Feb 4, 2020 in Apache Spark by MD
• 95,460 points • 5,194 views

0 votes

1 answer

What's the difference between 'filter' and 'where' in Spark SQL?

Both 'filter' and 'where' in Spark SQL ...READ MORE

answered May 23, 2018 in Apache Spark by nitinrawat895
• 11,380 points • 36,622 views

+1 vote

2 answers

How do I get number of columns in each line from a delimited file??

Instead of spliting on '\n'. You should ...READ MORE

answered Aug 7, 2019 in Apache Spark by ashish
• 7,739 views

+1 vote

1 answer

Hadoop Mapreduce word count Program

Firstly you need to understand the concept ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 11,380 points • 14,519 views

0 votes

1 answer

hadoop.mapred vs hadoop.mapreduce?

org.apache.hadoop.mapred is the Old API org.apache.hadoop.mapreduce is the ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 11,380 points • 5,242 views

+2 votes

11 answers

hadoop fs -put command?

Hi, You can create one directory in HDFS ...READ MORE

answered Mar 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points • 121,611 views

0 votes

1 answer

What is the differentiate between equals() and == in Scala. Is the latter the same as == in Java?

Hey, When we try to compare two instances ...READ MORE

answered Jul 29, 2019 in Apache Spark by Gitika
• 65,730 points • 2,481 views

–2 votes

1 answer

What is the difference in Java’s “If..Else” and Scala’s “If..Else”?

Hey, Java’s “If. Else”: In Java, “If. Else” is a statement, ...READ MORE

answered Jul 31, 2019 in Apache Spark by Gitika
• 65,730 points • 3,388 views

Subscribe to our Newsletter, and get personalized recommendations.

REGISTER FOR FREE WEBINAR

Thank you for registering Join Edureka Meetup community for 100+ Free Webinars each month JOIN MEETUP GROUP