How to increase Garbage Collection speed?

Please help me with a garbage collection problem. I have created a Spark application that generates a lot of garbage. I know I have to optimize it, but for now I need a quick fix. Please tell me how to shorten the garbage collection interval.
Mar 7, 2019 in Apache Spark by Rohan

1 answer to this question.


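The interval between periodic garbage collections is controlled by the spark.cleaner.periodicGC.interval property, which is set to 30min by default. If you want garbage collection to happen sooner, reduce this value. Spark properties are passed to spark-submit with --conf, and it is safest to give the value an explicit time unit (for example 20min), since a bare number may not be read as minutes.

In your application, create the context from a plain SparkConf so that settings passed at submit time are picked up:

val sc = new SparkContext(new SparkConf())

Then submit with:

./bin/spark-submit <all your existing options> --conf spark.cleaner.periodicGC.interval=20min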
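
If changing the submit command is not convenient, the same property can also be set in code before the context is created. The following is only a minimal sketch of that approach, not part of the command above: the object name PeriodicGcExample and the application name are hypothetical, and it assumes the job is still launched through spark-submit so that the master URL is supplied there.

```scala
import org.apache.spark.{SparkConf, SparkContext}

// Hypothetical example object; the name is chosen for illustration only.
object PeriodicGcExample {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf()
      .setAppName("periodic-gc-example") // hypothetical application name
      // Ask for a periodic GC every 20 minutes instead of the 30min default.
      .set("spark.cleaner.periodicGC.interval", "20min")

    val sc = new SparkContext(conf)
    try {
      // Print the effective value to confirm the setting was applied.
      println(sc.getConf.get("spark.cleaner.periodicGC.interval"))
    } finally {
      sc.stop()
    }
  }
}
```

Values set directly on SparkConf take precedence over values passed with --conf, so use one place or the other to avoid confusion.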
answered Mar 7, 2019 by Pavitra
