Scala: org.apache.poi.openxml4j.exceptions.InvalidFormatException: Your InputStream was neither an OLE2 stream, nor an OOXML stream

Question

Hi Team,I am trying to read an excel file using Spark CLI, but I am getting "org.apache.poi.openxml4j.exceptions.InvalidFormatException: Your InputStream was neither an OLE2 stream, nor an OOXML stream" error.Below is the code I am using:import com.crealytics.spark.excel

val df = spark.read.format("com.crealytics.spark.excel")
.option("useHeader", "true")
.option("startColumn", 0)
.option("treatEmptyValuesAsNulls", "false")
.option("inferSchema", "false")
.option("location", "/home/Desktop/lucky/logs.xlsx")
.option("addColorColumns", "False")
.load()

Raman · Answer

Try executing the below code,def readExcel(file: String): DataFrame = sqlContext.read
&#160; &#160;&#160;.format("com.crealytics.spark.excel")
&#8203;    .option("location", file)
&#8203;    .option("useHeader", "true")
&#8203;    .option("treatEmptyValuesAsNulls", "true")
&#8203;    .option("inferSchema", "true")
&#8203;    .option("addColorColumns", "False")
&#8203;    .load()

val data = readExcel("path to your excel file")
&#8203;
data.show(false)If you want to know more about Apache Spark Scala, It's highly recommended to go for the&#160;Spark Certification Course today.

Scala org apache poi openxml4j exceptions InvalidFormatException Your InputStream was neither an OLE2 stream nor an OOXML stream

Your comment on this question:

1 answer to this question.

Your answer

Your comment on this answer:

Related Questions In Apache Spark

How to create RDD from an external file source in scala?

Error: value textfile is not a member of org.apache.spark.SparkContext

Error : split value is not a member of org.apache.spark.sql.Row

org.apache.spark.sql.AnalysisException: cannot resolve given input columns

How do I get number of columns in each line from a delimited file??

Hadoop Mapreduce word count Program

hadoop.mapred vs hadoop.mapreduce?

hadoop fs -put command?

How to execute a function in apache-scala?

Spark: Error while instantiating "org.apache.spark.sql.hive.HiveSessionState"

Subscribe to our Newsletter, and get personalized recommendations.

TRENDING CERTIFICATION COURSES

TRENDING MASTERS COURSES

COMPANY

WORK WITH US

DOWNLOAD APP

CATEGORIES

CATEGORIES

TRENDING BLOG ARTICLES

TRENDING BLOG ARTICLES