Parquet to ORC format in Spark

0 votes
I am currently having data on my hadoop Consumption table(xyz table) which is stored in Parquet format. I have a requirement to convert the data from this (xyz table) to ORC format using Hive.

I did create a temp table with ORC compression and tried to load the data from the xyz table to the new temp table. But I am unable to see the data.

I don't need the solution, because I want to try it by myself. Can you just mention the steps?
Feb 14 in Apache Spark by Suraj
193 views

1 answer to this question.

0 votes

I appreciate that you want to try it by yourself. Here are the steps:

Step 1) First you need to create a table from Parquet table with "Stored As Text" 

Step 2) Secondly you can create A table from previous output as "Stored As ORC" 

Step 3) After that you can drop intermediate table.

answered Feb 14 by Anjali

Related Questions In Apache Spark

0 votes
1 answer

Efficient way to read specific columns from parquet file in spark

As parquet is a column based storage ...READ MORE

answered Apr 20, 2018 in Apache Spark by kurt_cobain
• 9,240 points
1,114 views
0 votes
1 answer

How to convert rdd object to dataframe in spark

SqlContext has a number of createDataFrame methods ...READ MORE

answered May 30, 2018 in Apache Spark by nitinrawat895
• 10,490 points
1,178 views
0 votes
1 answer
0 votes
1 answer

Ways to create RDD in Apache Spark

There are two popular ways using which ...READ MORE

answered Jun 19, 2018 in Apache Spark by nitinrawat895
• 10,490 points
1,226 views
0 votes
1 answer
0 votes
1 answer

Hadoop Mapreduce word count Program

Firstly you need to understand the concept ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 10,490 points
2,344 views
0 votes
1 answer

hadoop.mapred vs hadoop.mapreduce?

org.apache.hadoop.mapred is the Old API  org.apache.hadoop.mapreduce is the ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 10,490 points
238 views
0 votes
10 answers

hadoop fs -put command?

put syntax: put <localSrc> <dest> copy syntax: copyFr ...READ MORE

answered Dec 7, 2018 in Big Data Hadoop by Aditya
12,035 views
0 votes
4 answers

How to change the spark Session configuration in Pyspark?

You can dynamically load properties. First create ...READ MORE

answered Dec 10, 2018 in Apache Spark by Vini
12,084 views
0 votes
6 answers