Parquet File

Question

I'm new to Spark. What is a Parquet file?

Data_Nerd · Answer 1 · Jun 4, 2018

Parquet is a columnar file format supported by many data processing systems besides Spark. Spark SQL can both read and write Parquet files, and the format is widely considered one of the best for big data analytics.
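To make "columnar" a bit more concrete, here is a minimal sketch in plain Python. It is not Parquet's actual on-disk layout (the real format also adds row groups, encodings, compression and footer metadata), and the sample records and the `to_columns` helper are made up for illustration. It only shows the basic idea: values of the same column are stored together, so a query that needs one column does not have to touch the others.

```python
# Hypothetical sample records; the field names are made up for illustration.
rows = [
    {"id": 1, "name": "alice", "score": 90},
    {"id": 2, "name": "bob", "score": 75},
    {"id": 3, "name": "carol", "score": 82},
]


def to_columns(records):
    """Pivot a list of row dicts into a dict of column lists."""
    columns = {}
    for record in records:
        for field, value in record.items():
            columns.setdefault(field, []).append(value)
    return columns


columns = to_columns(rows)

# Row layout: averaging "score" walks over every full record.
avg_from_rows = sum(r["score"] for r in rows) / len(rows)

# Column layout: the same query reads one list and ignores "id" and "name".
avg_from_columns = sum(columns["score"]) / len(columns["score"])

print(columns["score"])
print(avg_from_rows == avg_from_columns)
```

In an analytics workload that usually reads a few columns out of many, this is roughly why a columnar layout tends to mean less I/O and better compression than a row-oriented file such as CSV.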

answered Jun 4, 2018 by Data_Nerd
