Most voted questions in Big Data Hadoop

0 votes
1 answer

Does Caching stand as the only advantage in Spark compared to Hadoop?

Spark has much lower per job and ...READ MORE

Jul 31, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
369 views
0 votes
1 answer
0 votes
0 answers
0 votes
1 answer

Different between zkfc and zookeeper

Hey, The key difference between both the components ...READ MORE

Jul 31, 2019 in Big Data Hadoop by Sunny
1 flag 3,470 views
0 votes
1 answer

Hive: Assign subquery to variable

Refer to this example: Step1: Check table test1 ...READ MORE

Jul 30, 2019 in Big Data Hadoop by Tina
7,044 views
0 votes
1 answer

FAILEDError: java.io.IOException: Type mismatch in key from map

This error is thrown when the parameters ...READ MORE

Jul 30, 2019 in Big Data Hadoop by Rishi
1,497 views
0 votes
1 answer

Converting CSV files to Parquet

--Create Hive external Table for existing data CREATE ...READ MORE

Jul 30, 2019 in Big Data Hadoop by Yogi
6,881 views
0 votes
1 answer

java.lang.IllegalArgumentException: Unable to instantiate SparkSession with Hive support because Hive classes are not found.

 Try using the below build.sbt, code.scala and command ...READ MORE

Jul 30, 2019 in Big Data Hadoop by Kishan
7,324 views
0 votes
1 answer

What is "java.sql.SQLException: org.apache.thrift.transport.TTransportException" in Hive?

This particular exception is related to Hive logs. ...READ MORE

Jul 30, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
6,582 views
0 votes
1 answer

What is the difference between S3n, S3a, and S3?

Your doubt is quite an interesting one. Yes, ...READ MORE

Jul 30, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
10,357 views
0 votes
1 answer

How to choose number of Executors and Executor cores?

Refer to the below when you are ...READ MORE

Jul 29, 2019 in Big Data Hadoop by Veer
3,504 views
0 votes
1 answer

Error while Loading data to Hive.

Your data node is not running that ...READ MORE

Jul 29, 2019 in Big Data Hadoop by Quill
914 views
0 votes
1 answer

Killing oozie Jobs by userid

Suppose you want to kill the jobs ...READ MORE

Jul 29, 2019 in Big Data Hadoop by Esha
1,619 views
0 votes
1 answer

Difference between put and copyFromLocal

-Put and -copyFromLocal is almost same command ...READ MORE

Jul 29, 2019 in Big Data Hadoop by Neeraj
11,946 views
0 votes
1 answer

Multiple Output format in Hadoop

Each reducer uses an OutputFormat to write ...READ MORE

Jul 26, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
1,745 views
0 votes
1 answer

Does HQL query go to all the nodes to process data?

InputSplits are created by logical division of ...READ MORE

Jul 26, 2019 in Big Data Hadoop by Tina
492 views
0 votes
1 answer

Why is Hive called as Data Warehouse?

Hive is a data warehouse infrastructure tool ...READ MORE

Jul 26, 2019 in Big Data Hadoop by Joshua
5,743 views
0 votes
1 answer

Write data into hive table in a parquet

Below is an example query which you ...READ MORE

Jul 26, 2019 in Big Data Hadoop by Sadu
2,659 views
0 votes
1 answer

Explanation to outputcollector vs context

Both codes contain different API of Map ...READ MORE

Jul 26, 2019 in Big Data Hadoop by Rasheed
1,408 views
0 votes
2 answers

Load data into the hive table

Hi, We can use a normal insert query ...READ MORE

Oct 16, 2020 in Big Data Hadoop by MD
• 95,440 points
1,772 views
0 votes
1 answer

Query regarding ArrayBuffer

It's because that is the syntax. This ...READ MORE

Jul 26, 2019 in Big Data Hadoop by Rishi
321 views
0 votes
1 answer

How to find in incorrect file/records in hive?

A value with a wrong datatype causes ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Ritu
2,267 views
0 votes
1 answer

Hive Query: Split value

You can use the SUBSTR() in hive ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Paul
460 views
0 votes
0 answers

Sqoop exclude tables

How to exclude tables in sqoop if ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Will
490 views
0 votes
1 answer

Parquet & Avro File hadling with HIVE

For avro you can follow the format ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Tarun
2,334 views
0 votes
1 answer

Delimiter on the data

You can use this: import org.apache.spark.sql.functions.struct val df = ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Vinay
346 views
0 votes
1 answer

Hive query: Join tables based on ID

After creating the tables a1 and b1 ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Tarun
711 views
0 votes
1 answer

How can I append data to an existing file in HDFS?

You have to do some configurations as ...READ MORE

Jul 25, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
8,093 views
0 votes
1 answer

Limit for Namenode Quantity

Each file Schema = 150bytes   Block schema ...READ MORE

Jul 25, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
343 views
0 votes
1 answer

Current Date in Hive

Try this instead: select from_unixtime(unix_timestamp()); If you have an ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Tina
2,486 views
0 votes
1 answer

Classes implementing InputFormat frequently

FileInputFormat : Base class for all file-based InputFormats Other ...READ MORE

Jul 24, 2019 in Big Data Hadoop by Reshma
412 views
0 votes
1 answer

Mapreduce: custom Input format

Here's a list of Input Formats: CombineFileInputFormat CombineS ...READ MORE

Jul 24, 2019 in Big Data Hadoop by Veer
879 views
0 votes
1 answer

Hive- unable to load data into table

The command you are typing is incorrect. ...READ MORE

Jul 24, 2019 in Big Data Hadoop by Firoz
1,150 views
0 votes
1 answer

Index LZO files

You can do it using the following ...READ MORE

Jul 24, 2019 in Big Data Hadoop by Rishi
612 views
0 votes
1 answer

Produce compressed data from map reduce

 It is straight forward and you can achieve ...READ MORE

Jul 24, 2019 in Big Data Hadoop by John
367 views
0 votes
1 answer

Running Mapreduce on compressed data

It is very straight forward, no need ...READ MORE

Jul 24, 2019 in Big Data Hadoop by Nanda
656 views
0 votes
1 answer

How to Compress Map output?

With MR2, now we should set conf.set("mapreduce.map.output.compress", true)  conf.set("mapreduce.output.fileoutputformat.compress", ...READ MORE

Jul 24, 2019 in Big Data Hadoop by Varun
1,233 views
0 votes
1 answer

How to download Large Hadoop[]closed] Data?

I would prefer you to download million songs ...READ MORE

Jul 23, 2019 in Big Data Hadoop by ravikiran
• 4,620 points

edited Jul 25, 2019 by ravikiran 957 views
0 votes
0 answers

pattern mapreduce

i need to write some mapreduce pattern ...READ MORE

Jul 23, 2019 in Big Data Hadoop by tiza
372 views
0 votes
1 answer

How can we retrieve/get complete HQL hive query from hive,spark and tez?

To get full query running for the ...READ MORE

Jul 23, 2019 in Big Data Hadoop by Lohit
2,951 views
0 votes
1 answer

HBase shell failed to connect

Enter the below command in the terminal ...READ MORE

Jul 23, 2019 in Big Data Hadoop by Joshua
1,507 views
0 votes
1 answer

Unable to run Sqoop scirpt to copy data to Cassandra

Unfortunately, this can't be achieved with open ...READ MORE

Jul 23, 2019 in Big Data Hadoop by Shri
998 views
0 votes
2 answers

Not Able to read the file from hdfs location

Please make sure you connect to spark2-shell ...READ MORE

Jul 14, 2020 in Big Data Hadoop by Shantanu
• 190 points
1,797 views
0 votes
1 answer

How to import data to hbase database?

First you have to have the file ...READ MORE

Jul 23, 2019 in Big Data Hadoop by Kiran
2,420 views
0 votes
1 answer

How to restart failed Namenode?

You need to solve the issue which ...READ MORE

Jul 23, 2019 in Big Data Hadoop by Ishan
2,688 views
0 votes
1 answer

Name node RAM metadata

For the above requirement, the memory consumption ...READ MORE

Jul 23, 2019 in Big Data Hadoop by Reshma
1,336 views
0 votes
1 answer

How can we ignore header line while loading data into Pig?

You can use the following code: A = ...READ MORE

Jul 22, 2019 in Big Data Hadoop by kiran
522 views
0 votes
1 answer

How to Sqoop in a Java Program?

You can use the following sample code for ...READ MORE

Jul 22, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
1,313 views
0 votes
2 answers

Namenode is not getting started

Hi, You can do one thing, first delete all ...READ MORE

Aug 5, 2019 in Big Data Hadoop by Namisha
768 views