is not a Parquet file. expected magic number at tail [80, 65, 82, 49] but found [51, 53, 10, 10]


Hi,

I tried to load a CSV file in SparkR, but it fails with the error shown in the title.



Can anyone tell me why I am getting this error?

Thank You

Feb 3 in Apache Spark by akhtar

1 answer to this question.


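Hi @akhtar,

You are reading a CSV file, but Spark is trying to parse it as a Parquet file. This typically happens when read.df is called without a source: SparkR then falls back to the default data source (spark.sql.sources.default), which is Parquet unless configured otherwise. The Parquet reader expects the file to end with the magic number PAR1 (bytes 80, 65, 82, 49) and fails when it finds CSV text instead. To avoid this, specify the csv source explicitly, as in the command below.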

df <- read.df(csvPath, "csv", header = "true", inferSchema = "true", na.strings = "NA")
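To see what the error message is comparing, here is a minimal base R sketch. The helper name has_parquet_magic is hypothetical (it is not part of SparkR), and it assumes the file is on the local filesystem rather than HDFS. It reads the last four bytes of a file and checks them against the Parquet magic number; the last two lines decode the byte values from the error message.

```r
# Hypothetical helper (not part of SparkR): returns TRUE if the local file
# at `path` ends with the 4-byte Parquet magic "PAR1" (bytes 80, 65, 82, 49).
has_parquet_magic <- function(path) {
  size <- file.size(path)
  if (is.na(size) || size < 4) return(FALSE)
  con <- file(path, "rb")
  on.exit(close(con))
  # Jump to the last 4 bytes and read them as raw values.
  seek(con, where = size - 4, origin = "start")
  tail_bytes <- readBin(con, what = "raw", n = 4)
  identical(as.integer(tail_bytes), c(80L, 65L, 82L, 49L))
}

# Decode the byte values quoted in the error message.
rawToChar(as.raw(c(80, 65, 82, 49)))  # expected tail: "PAR1"
rawToChar(as.raw(c(51, 53, 10, 10)))  # found tail: "35" followed by two newlines
```

The bytes that were found decode to ordinary text ending in newlines, which is consistent with the end of a CSV file rather than a Parquet footer.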

Thank You

answered Feb 3 by MD
