How to use ftp scheme using Yarn in Spark application

Question

Hi. I want to use FTP for my Spark application. I have learnt that Spark supports this but the problem is that I am using Yarn with spark and Yarn does not support this. I need to have the scheme in the local disk. I want to know how I can download schemes for Yarn. Please help

score 0 · Answer 1 · Mar 28, 2019

In case Yarn does not support schemes that Spark supports, you will have to download the schemes on local disk before adding to Yarn's cache. You can use the in-built feature of Spark that let's you download the schemes that you want. Refer to the below command to do this:

val sc = new SparkContext(new SparkConf())

./bin/spark-submit <all your existing options> --spark.yarn.dist.forceDownloadSchemes= <list of schemes>

answered Mar 28, 2019 by Raj

How to use ftp scheme using Yarn in Spark application

Your comment on this question:

1 answer to this question.

Your answer

Your comment on this answer:

Related Questions In Apache Spark

How to get SQL configuration in Spark using Python?

How to set executors for static allocation in Spark Yarn?

How to use Spark jars for Yarn distribution?

How to create paired RDD using subString method in Spark?

How do I get number of columns in each line from a delimited file??

Hadoop Mapreduce word count Program

hadoop.mapred vs hadoop.mapreduce?

hadoop fs -put command?

How to increase worker timeout in Spark application?

How to change the spark Session configuration in Pyspark?

Subscribe to our Newsletter, and get personalized recommendations.

TRENDING CERTIFICATION COURSES

TRENDING MASTERS COURSES

COMPANY

WORK WITH US

DOWNLOAD APP

CATEGORIES

CATEGORIES

TRENDING BLOG ARTICLES

TRENDING BLOG ARTICLES