Load/save in Spark

Question

1) The load command seems to load only Parquet files:
val a = spark.read.load("employee.parquet") -- works fine
val a = spark.read.load("employee.txt") -- error

2) val a = spark.read.format("csv").load("employee.txt") -- works fine

3) What is the difference between points 1 and 2 when loading a text file as data?

4) Apart from Parquet files, no other format is accepted by a statement like this:
val a = spark.read.load("text/csv/json")

5) What is the difference between these two statements?
val a = spark.read.json()
val a = spark.read.format("json").load("a.json")

Firoz · Answer

spark.read.load works for employee.parquet but not for employee.txt because, when no format is given, load assumes the data source is in Parquet format. To read any other format, specify it with the format function and then call load, as in spark.read.format("csv").load("employee.txt").

Spark SQL can automatically infer the schema of a JSON dataset and load it as a Dataset[Row]. This conversion can be done with spark.read.json() on a JSON file.

spark.read.json("a.json") and spark.read.format("json").load("a.json") do the same thing; the first is shorthand for the second.
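The three cases from the question can be put side by side in a minimal sketch. It assumes a local SparkSession and that employee.parquet, employee.txt and a.json exist in the working directory; the object name LoadFormats and the local[*] master are illustrative choices, not part of the original post. The fallback format used by load is controlled by the spark.sql.sources.default setting, which is parquet unless it has been changed.

```scala
import org.apache.spark.sql.SparkSession

// Hypothetical driver object, for illustration only.
object LoadFormats {
  def main(args: Array[String]): Unit = {
    // Assumption: running locally; adjust master/appName for a real cluster.
    val spark = SparkSession.builder()
      .appName("load-formats-sketch")
      .master("local[*]")
      .getOrCreate()

    // No format given: load falls back to spark.sql.sources.default
    // (parquet unless reconfigured), so this reads a Parquet file.
    val fromParquet = spark.read.load("employee.parquet")

    // Explicit format: the file extension does not matter,
    // only the format passed to format(...).
    val fromCsv = spark.read.format("csv").load("employee.txt")

    // Shorthand and long form for JSON; both infer the schema from the data.
    val jsonShort = spark.read.json("a.json")
    val jsonLong  = spark.read.format("json").load("a.json")

    // Print the inferred schemas so they can be compared by eye.
    jsonShort.printSchema()
    jsonLong.printSchema()

    spark.stop()
  }
}
```

Calling spark.read.load("employee.txt") without a format fails because Spark tries to parse the text file as Parquet, which matches the error described in point 1.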



