How to increase Garbage Collection speed

0 votes
Please help me with Garbage Collection problem. I have created a Spark application which leaves out a lot of garbage. I know I have to optimize it but for now, I need a quick fix. Please tell me how to fasten the garbage collection interval.
Mar 8, 2019 in Apache Spark by Rohan
1,352 views

1 answer to this question.

0 votes

The time interval between Garbage Collection is set to 30min by default. If you want the Garbage Collection to happen sooner, then reduce the time. Try this:

val sc = new SparkContext(new SparkConf())
./bin/spark-submit <all your existing options> --spark.cleaner.periodicGC.interval=20
answered Mar 8, 2019 by Pavitra

Related Questions In Apache Spark

0 votes
1 answer

How to increase the amount of data to be transferred to shuffle service at the same time?

The amount of data to be transferred ...READ MORE

answered Mar 1, 2019 in Apache Spark by Omkar
• 69,210 points
715 views
0 votes
1 answer

How to increase Spark memory for execution?

Probably the spill is because you have ...READ MORE

answered Mar 7, 2019 in Apache Spark by Pavitra

edited Mar 8, 2019 947 views
0 votes
1 answer

How to increase wait time to launch data-local task?

You can increase the locality wait time ...READ MORE

answered Mar 11, 2019 in Apache Spark by Raj
399 views
0 votes
1 answer

How to increase Spark listener bus event queue capacity?

The default capacity of listener bus is ...READ MORE

answered Mar 12, 2019 in Apache Spark by Raj
6,528 views
0 votes
1 answer

How to increase worker timeout in Spark application?

By default, the timeout is set to ...READ MORE

answered Mar 25, 2019 in Apache Spark by Hari
8,428 views
0 votes
1 answer

How to increase HDFS replication level in Spark?

Hi @Raunak. You can change the replication ...READ MORE

answered Mar 27, 2019 in Apache Spark by Yash
1,345 views
+1 vote
2 answers
+1 vote
1 answer

Hadoop Mapreduce word count Program

Firstly you need to understand the concept ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 11,380 points
10,611 views
0 votes
1 answer

hadoop.mapred vs hadoop.mapreduce?

org.apache.hadoop.mapred is the Old API  org.apache.hadoop.mapreduce is the ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 11,380 points
2,212 views
+2 votes
11 answers

hadoop fs -put command?

Hi, You can create one directory in HDFS ...READ MORE

answered Mar 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
104,857 views
webinar REGISTER FOR FREE WEBINAR X
REGISTER NOW
webinar_success Thank you for registering Join Edureka Meetup community for 100+ Free Webinars each month JOIN MEETUP GROUP