is not a Parquet file. expected magic number at tail [80, 65, 82, 49] but found [51, 53, 10, 10]

+1 vote

Hi,

I tried to load a CSV file in SparkR, but it shows me the below error:

... is not a Parquet file. expected magic number at tail [80, 65, 82, 49] but found [51, 53, 10, 10]

Can anyone tell me why I am getting this error?

Thank You

Feb 3, 2020 in Apache Spark by akhtar
• 38,260 points
18,212 views

1 answer to this question.

0 votes

Hi@akhtar,

Here you are trying to read a CSV file, but read.df defaults to the Parquet source (spark.sql.sources.default), so Spark checks the tail of the file for the Parquet magic bytes "PAR1" (80 65 82 49 in ASCII). The bytes it found instead (51 53 10 10, i.e. "35" followed by two newlines) are just text from your CSV. Pass "csv" as the source explicitly to avoid this:

df <- read.df(csvPath, "csv", header = "true", inferSchema = "true", na.strings = "NA")
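For reference, here is a minimal end-to-end sketch of the same fix. The file path and app name are placeholders, and it assumes Spark 2.x or later, where sparkR.session() is available:

library(SparkR)

# Start (or reuse) a Spark session; the app name is arbitrary.
sparkR.session(appName = "csv-read-example")

csvPath <- "/path/to/your/file.csv"  # placeholder path

# Naming "csv" as the source keeps Spark from assuming Parquet.
df <- read.df(csvPath, source = "csv",
              header = "true", inferSchema = "true", na.strings = "NA")

printSchema(df)  # inferred column types
head(df)         # first rows as a local data.frame

sparkR.session.stop()

If you want to confirm what is actually sitting at the tail of the file, plain base R can read the last four bytes; a genuine Parquet file always ends in "PAR1" (decimal 80 65 82 49):

con <- file(csvPath, "rb")
seek(con, where = -4, origin = "end")  # jump to the last 4 bytes
readBin(con, what = "raw", n = 4)      # prints 50 41 52 31 (hex) for a real Parquet file
close(con)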

Thank You

answered Feb 3, 2020 by MD
• 95,460 points

Related Questions In Apache Spark

0 votes
1 answer

What is a Parquet file in Spark?

Hey, Parquet is a columnar format file supported ...READ MORE

answered Jul 2, 2019 in Apache Spark by Gitika
• 65,770 points
1,348 views
0 votes
1 answer

Is it better to have one large parquet file or lots of smaller parquet files?

Ideally, you would use snappy compression (default) ...READ MORE

answered May 23, 2018 in Apache Spark by nitinrawat895
• 11,380 points
13,728 views
+1 vote
1 answer

How can I write a text file in HDFS not from an RDD, in Spark program?

Yes, you can go ahead and write ...READ MORE

answered May 29, 2018 in Apache Spark by Shubham
• 13,490 points
8,453 views
0 votes
1 answer

error: identifier expected but ']' found.

Hi, you can try this: remove the brackets from ...READ MORE

answered Jul 3, 2019 in Apache Spark by Gitika
• 65,770 points
5,531 views
0 votes
1 answer

Is it possible to run Apache Spark without Hadoop?

Though Spark and Hadoop were the frameworks designed ...READ MORE

answered May 3, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
1,247 views
0 votes
1 answer

What do we exactly mean by “Hadoop” – the definition of Hadoop?

The official definition of Apache Hadoop given ...READ MORE

answered Mar 16, 2018 in Big Data Hadoop by Shubham
1,871 views
+1 vote
1 answer

Hadoop Mapreduce word count Program

Firstly you need to understand the concept ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 11,380 points
11,027 views
0 votes
1 answer

The number of stages in a job is equal to the number of RDDs in the DAG. However, under one of the given conditions, the scheduler can truncate the lineage. Identify it.

Hi@Edureka, Spark's internal scheduler may truncate the lineage of the RDD graph ...READ MORE

answered Nov 26, 2020 in Apache Spark by MD
• 95,460 points
4,011 views