Parquet to ORC format in Spark

0 votes
I am currently having data on my hadoop Consumption table(xyz table) which is stored in Parquet format. I have a requirement to convert the data from this (xyz table) to ORC format using Hive.

I did create a temp table with ORC compression and tried to load the data from the xyz table to the new temp table. But I am unable to see the data.

I don't need the solution, because I want to try it by myself. Can you just mention the steps?
Feb 14, 2019 in Apache Spark by Suraj
623 views

1 answer to this question.

0 votes

I appreciate that you want to try it by yourself. Here are the steps:

Step 1) First you need to create a table from Parquet table with "Stored As Text" 

Step 2) Secondly you can create A table from previous output as "Stored As ORC" 

Step 3) After that you can drop intermediate table.

answered Feb 14, 2019 by Anjali

Related Questions In Apache Spark

0 votes
1 answer

Efficient way to read specific columns from parquet file in spark

As parquet is a column based storage ...READ MORE

answered Apr 20, 2018 in Apache Spark by kurt_cobain
• 9,310 points
2,384 views
0 votes
1 answer

How to convert rdd object to dataframe in spark

SqlContext has a number of createDataFrame methods ...READ MORE

answered May 30, 2018 in Apache Spark by nitinrawat895
• 10,920 points
2,278 views
0 votes
1 answer
0 votes
1 answer

Ways to create RDD in Apache Spark

There are two popular ways using which ...READ MORE

answered Jun 19, 2018 in Apache Spark by nitinrawat895
• 10,920 points
2,562 views
+1 vote
2 answers
+1 vote
1 answer

Hadoop Mapreduce word count Program

Firstly you need to understand the concept ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 10,920 points
5,085 views
0 votes
1 answer

hadoop.mapred vs hadoop.mapreduce?

org.apache.hadoop.mapred is the Old API  org.apache.hadoop.mapreduce is the ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 10,920 points
724 views
+1 vote
11 answers

hadoop fs -put command?

put syntax: put <localSrc> <dest> copy syntax: copyF ...READ MORE

answered Dec 7, 2018 in Big Data Hadoop by Aditya
30,102 views
0 votes
4 answers

How to change the spark Session configuration in Pyspark?

You can dynamically load properties. First create ...READ MORE

answered Dec 10, 2018 in Apache Spark by Vini
30,503 views
0 votes
7 answers