Most voted questions in Big Data Hadoop

0 votes
1 answer

Difference between ZKFC and ZooKeeper

Hey, The key difference between both the components ...READ MORE

Jul 31, 2019 in Big Data Hadoop by Sunny
2,262 views
0 votes
1 answer

Hive: Assign subquery to variable

Refer to this example: Step1: Check table test1 ...READ MORE

Jul 30, 2019 in Big Data Hadoop by Tina
5,191 views
0 votes
1 answer

FAILED Error: java.io.IOException: Type mismatch in key from map

This error is thrown when the parameters ...READ MORE

Jul 30, 2019 in Big Data Hadoop by Rishi
955 views
0 votes
1 answer

Converting CSV files to Parquet

--Create Hive external Table for existing data CREATE ...READ MORE

Jul 30, 2019 in Big Data Hadoop by Yogi
6,003 views
0 votes
1 answer

java.lang.IllegalArgumentException: Unable to instantiate SparkSession with Hive support because Hive classes are not found.

 Try using the below build.sbt, code.scala and command ...READ MORE

Jul 30, 2019 in Big Data Hadoop by Kishan
6,187 views
0 votes
1 answer

What is "java.sql.SQLException: org.apache.thrift.transport.TTransportException" in Hive?

This particular exception is related to Hive logs. ...READ MORE

Jul 30, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
4,763 views
0 votes
1 answer

What is the difference between S3n, S3a, and S3?

Your doubt is quite an interesting one. Yes, ...READ MORE

Jul 30, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
8,777 views
0 votes
1 answer

How to choose number of Executors and Executor cores?

Refer to the below when you are ...READ MORE

Jul 29, 2019 in Big Data Hadoop by Veer
2,973 views
0 votes
1 answer

Error while Loading data to Hive.

Your data node is not running that ...READ MORE

Jul 29, 2019 in Big Data Hadoop by Quill
400 views
0 votes
1 answer

Killing oozie Jobs by userid

Suppose you want to kill the jobs ...READ MORE

Jul 29, 2019 in Big Data Hadoop by Esha
916 views
0 votes
1 answer

Difference between put and copyFromLocal

-put and -copyFromLocal are almost the same command ...READ MORE

Jul 29, 2019 in Big Data Hadoop by Neeraj
8,841 views
0 votes
1 answer

Multiple Output format in Hadoop

Each reducer uses an OutputFormat to write ...READ MORE

Jul 26, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
1,224 views
0 votes
1 answer

Does HQL query go to all the nodes to process data?

InputSplits are created by logical division of ...READ MORE

Jul 26, 2019 in Big Data Hadoop by Tina
196 views
0 votes
1 answer

Why is Hive called a data warehouse?

Hive is a data warehouse infrastructure tool ...READ MORE

Jul 26, 2019 in Big Data Hadoop by Joshua
3,956 views
0 votes
1 answer

Write data into a Hive table in Parquet format

Below is an example query which you ...READ MORE

Jul 26, 2019 in Big Data Hadoop by Sadu
1,834 views
0 votes
1 answer

Explanation of OutputCollector vs Context

Both codes contain different API of Map ...READ MORE

Jul 26, 2019 in Big Data Hadoop by Rasheed
775 views
0 votes
2 answers

Load data into the hive table

Hi, We can use a normal insert query ...READ MORE

Oct 16, 2020 in Big Data Hadoop by MD
• 95,360 points
1,109 views
0 votes
1 answer

Query regarding ArrayBuffer

It's because that is the syntax. This ...READ MORE

Jul 26, 2019 in Big Data Hadoop by Rishi
148 views
0 votes
1 answer

How to find incorrect files/records in Hive?

A value with a wrong datatype causes ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Ritu
1,360 views
0 votes
1 answer

Hive Query: Split value

You can use the SUBSTR() in hive ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Paul
185 views
0 votes
0 answers

Sqoop exclude tables

How to exclude tables in sqoop if ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Will
293 views
0 votes
1 answer

Parquet & Avro file handling with Hive

For avro you can follow the format ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Tarun
1,455 views
0 votes
1 answer

Delimiter on the data

You can use this: import org.apache.spark.sql.functions.struct val df = ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Vinay
144 views
0 votes
1 answer

Hive query: Join tables based on ID

After creating the tables a1 and b1 ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Tarun
259 views
0 votes
1 answer

How can I append data to an existing file in HDFS?

You have to do some configurations as ...READ MORE

Jul 25, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
6,254 views
0 votes
1 answer

Limit for Namenode Quantity

Each file Schema = 150bytes   Block schema ...READ MORE

Jul 25, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
134 views
0 votes
1 answer

Current Date in Hive

Try this instead: select from_unixtime(unix_timestamp()); If you have an ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Tina
1,783 views
0 votes
1 answer

Classes implementing InputFormat frequently

FileInputFormat : Base class for all file-based InputFormats Other ...READ MORE

Jul 24, 2019 in Big Data Hadoop by Reshma
190 views
0 votes
1 answer

Mapreduce: custom Input format

Here's a list of Input Formats: CombineFileInputFormat CombineS ...READ MORE

Jul 24, 2019 in Big Data Hadoop by Veer
617 views
0 votes
1 answer

Hive- unable to load data into table

The command you are typing is incorrect. ...READ MORE

Jul 24, 2019 in Big Data Hadoop by Firoz
748 views
0 votes
1 answer

Index LZO files

You can do it using the following ...READ MORE

Jul 24, 2019 in Big Data Hadoop by Rishi
288 views
0 votes
1 answer

Produce compressed data from map reduce

It is straightforward and you can achieve ...READ MORE

Jul 24, 2019 in Big Data Hadoop by John
183 views
0 votes
1 answer

Running Mapreduce on compressed data

It is very straightforward, no need ...READ MORE

Jul 24, 2019 in Big Data Hadoop by Nanda
193 views
0 votes
1 answer

How to Compress Map output?

With MR2, now we should set conf.set("mapreduce.map.output.compress", true)  conf.set("mapreduce.output.fileoutputformat.compress", ...READ MORE

Jul 24, 2019 in Big Data Hadoop by Varun
648 views
0 votes
1 answer

How to download large Hadoop data? [closed]

I would prefer you to download million songs ...READ MORE

Jul 23, 2019 in Big Data Hadoop by ravikiran
• 4,620 points

edited Jul 25, 2019 by ravikiran
437 views
0 votes
0 answers

pattern mapreduce

i need to write some mapreduce pattern ...READ MORE

Jul 23, 2019 in Big Data Hadoop by tiza
150 views
0 votes
1 answer

How can we retrieve the complete HQL query from Hive, Spark, and Tez?

To get full query running for the ...READ MORE

Jul 23, 2019 in Big Data Hadoop by Lohit
2,232 views
0 votes
1 answer

HBase shell failed to connect

Enter the below command in the terminal ...READ MORE

Jul 23, 2019 in Big Data Hadoop by Joshua
737 views
0 votes
1 answer

Unable to run Sqoop script to copy data to Cassandra

Unfortunately, this can't be achieved with open ...READ MORE

Jul 23, 2019 in Big Data Hadoop by Shri
670 views
0 votes
2 answers

Not Able to read the file from hdfs location

Please make sure you connect to spark2-shell ...READ MORE

Jul 14, 2020 in Big Data Hadoop by Shantanu
• 190 points
822 views
0 votes
1 answer

How to import data to hbase database?

First you have to have the file ...READ MORE

Jul 23, 2019 in Big Data Hadoop by Kiran
1,145 views
0 votes
1 answer

How to restart failed Namenode?

You need to solve the issue which ...READ MORE

Jul 23, 2019 in Big Data Hadoop by Ishan
1,373 views
0 votes
1 answer

Name node RAM metadata

For the above requirement, the memory consumption ...READ MORE

Jul 23, 2019 in Big Data Hadoop by Reshma
514 views
0 votes
1 answer

How can we ignore header line while loading data into Pig?

You can use the following code: A = ...READ MORE

Jul 22, 2019 in Big Data Hadoop by kiran
189 views
0 votes
1 answer

How to use Sqoop in a Java program?

You can use the following sample code for ...READ MORE

Jul 22, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
985 views
0 votes
2 answers

Namenode is not getting started

Hi, You can do one thing, first delete all ...READ MORE

Aug 5, 2019 in Big Data Hadoop by Namisha
400 views
0 votes
1 answer

How many partitions can one table have?

Well, there are two kinds of partitions: 1. ...READ MORE

Jul 22, 2019 in Big Data Hadoop by Kunal
180 views