Why Partitions are immutable in Spark

Question

Can anyone explain why partition are immutable in spark?

Gitika · Answer 1 · Jul 3, 2019

Hi,

Every transformation generates a new partition. Partitions use HDFS API so that partition is immutable, distributed and fault tolerance. Partition also aware of data locality.

answered Jul 3, 2019 by Gitika
• 65,730 points

score 0 · Answer 2 · Aug 25, 2022

Partitions use HDFS API.

answered Aug 25, 2022 by anonymous

edited Mar 5

Why Partitions are immutable in Spark

Your comment on this question:

1 answer to this question.

Your answer

Your comment on this answer:

Your comment on this answer:

Related Questions In Apache Spark

What are the levels of parallelism in spark streaming ?

what are the memory issues in spark ?

what are the job optimization Technics in spark and scala ?

Changing Column position in spark dataframe

Hadoop Mapreduce word count Program

hadoop fs -put command?

Hadoop dfs -ls command?

Is there a way to copy data from one one Hadoop distributed file system(HDFS) to another HDFS?

By default how many partitions are created in RDD in Apache spark?

What are some of the things you can monitor in the Spark Web UI?

Subscribe to our Newsletter, and get personalized recommendations.

TRENDING CERTIFICATION COURSES

TRENDING MASTERS COURSES

COMPANY

WORK WITH US

DOWNLOAD APP

CATEGORIES

CATEGORIES

TRENDING BLOG ARTICLES

TRENDING BLOG ARTICLES