Most voted questions in Big Data Hadoop

0 votes
1 answer

Does Caching stand as the only advantage in Spark compared to Hadoop?

Spark has much lower per job and ...READ MORE

Jul 31, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
368 views
0 votes
1 answer
0 votes
0 answers
0 votes
1 answer

Different between zkfc and zookeeper

Hey, The key difference between both the components ...READ MORE

Jul 31, 2019 in Big Data Hadoop by Sunny
1 flag 3,467 views
0 votes
1 answer

Hive: Assign subquery to variable

Refer to this example: Step1: Check table test1 ...READ MORE

Jul 30, 2019 in Big Data Hadoop by Tina
7,044 views
0 votes
1 answer

FAILEDError: java.io.IOException: Type mismatch in key from map

This error is thrown when the parameters ...READ MORE

Jul 30, 2019 in Big Data Hadoop by Rishi
1,495 views
0 votes
1 answer

Converting CSV files to Parquet

--Create Hive external Table for existing data CREATE ...READ MORE

Jul 30, 2019 in Big Data Hadoop by Yogi
6,878 views
0 votes
1 answer

java.lang.IllegalArgumentException: Unable to instantiate SparkSession with Hive support because Hive classes are not found.

 Try using the below build.sbt, code.scala and command ...READ MORE

Jul 30, 2019 in Big Data Hadoop by Kishan
7,320 views
0 votes
1 answer

What is "java.sql.SQLException: org.apache.thrift.transport.TTransportException" in Hive?

This particular exception is related to Hive logs. ...READ MORE

Jul 30, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
6,581 views
0 votes
1 answer

What is the difference between S3n, S3a, and S3?

Your doubt is quite an interesting one. Yes, ...READ MORE

Jul 30, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
10,354 views
0 votes
1 answer

How to choose number of Executors and Executor cores?

Refer to the below when you are ...READ MORE

Jul 29, 2019 in Big Data Hadoop by Veer
3,501 views
0 votes
1 answer

Error while Loading data to Hive.

Your data node is not running that ...READ MORE

Jul 29, 2019 in Big Data Hadoop by Quill
909 views
0 votes
1 answer

Killing oozie Jobs by userid

Suppose you want to kill the jobs ...READ MORE

Jul 29, 2019 in Big Data Hadoop by Esha
1,619 views
0 votes
1 answer

Difference between put and copyFromLocal

-Put and -copyFromLocal is almost same command ...READ MORE

Jul 29, 2019 in Big Data Hadoop by Neeraj
11,938 views
0 votes
1 answer

Multiple Output format in Hadoop

Each reducer uses an OutputFormat to write ...READ MORE

Jul 26, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
1,744 views
0 votes
1 answer

Does HQL query go to all the nodes to process data?

InputSplits are created by logical division of ...READ MORE

Jul 26, 2019 in Big Data Hadoop by Tina
491 views
0 votes
1 answer

Why is Hive called as Data Warehouse?

Hive is a data warehouse infrastructure tool ...READ MORE

Jul 26, 2019 in Big Data Hadoop by Joshua
5,740 views
0 votes
1 answer

Write data into hive table in a parquet

Below is an example query which you ...READ MORE

Jul 26, 2019 in Big Data Hadoop by Sadu
2,655 views
0 votes
1 answer

Explanation to outputcollector vs context

Both codes contain different API of Map ...READ MORE

Jul 26, 2019 in Big Data Hadoop by Rasheed
1,407 views
0 votes
2 answers

Load data into the hive table

Hi, We can use a normal insert query ...READ MORE

Oct 16, 2020 in Big Data Hadoop by MD
• 95,440 points
1,772 views
0 votes
1 answer

Query regarding ArrayBuffer

It's because that is the syntax. This ...READ MORE

Jul 26, 2019 in Big Data Hadoop by Rishi
321 views
0 votes
1 answer

How to find in incorrect file/records in hive?

A value with a wrong datatype causes ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Ritu
2,261 views
0 votes
1 answer

Hive Query: Split value

You can use the SUBSTR() in hive ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Paul
460 views
0 votes
0 answers

Sqoop exclude tables

How to exclude tables in sqoop if ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Will
490 views
0 votes
1 answer

Parquet & Avro File hadling with HIVE

For avro you can follow the format ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Tarun
2,329 views
0 votes
1 answer

Delimiter on the data

You can use this: import org.apache.spark.sql.functions.struct val df = ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Vinay
345 views
0 votes
1 answer

Hive query: Join tables based on ID

After creating the tables a1 and b1 ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Tarun
711 views
0 votes
1 answer

How can I append data to an existing file in HDFS?

You have to do some configurations as ...READ MORE

Jul 25, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
8,080 views
0 votes
1 answer

Limit for Namenode Quantity

Each file Schema = 150bytes   Block schema ...READ MORE

Jul 25, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
342 views
0 votes
1 answer

Current Date in Hive

Try this instead: select from_unixtime(unix_timestamp()); If you have an ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Tina
2,483 views
0 votes
1 answer

Classes implementing InputFormat frequently

FileInputFormat : Base class for all file-based InputFormats Other ...READ MORE

Jul 24, 2019 in Big Data Hadoop by Reshma
412 views
0 votes
1 answer

Mapreduce: custom Input format

Here's a list of Input Formats: CombineFileInputFormat CombineS ...READ MORE

Jul 24, 2019 in Big Data Hadoop by Veer
879 views
0 votes
1 answer

Hive- unable to load data into table

The command you are typing is incorrect. ...READ MORE

Jul 24, 2019 in Big Data Hadoop by Firoz
1,149 views
0 votes
1 answer

Index LZO files

You can do it using the following ...READ MORE

Jul 24, 2019 in Big Data Hadoop by Rishi
612 views
0 votes
1 answer

Produce compressed data from map reduce

 It is straight forward and you can achieve ...READ MORE

Jul 24, 2019 in Big Data Hadoop by John
367 views
0 votes
1 answer

Running Mapreduce on compressed data

It is very straight forward, no need ...READ MORE

Jul 24, 2019 in Big Data Hadoop by Nanda
655 views
0 votes
1 answer

How to Compress Map output?

With MR2, now we should set conf.set("mapreduce.map.output.compress", true)  conf.set("mapreduce.output.fileoutputformat.compress", ...READ MORE

Jul 24, 2019 in Big Data Hadoop by Varun
1,233 views
0 votes
1 answer

How to download Large Hadoop[]closed] Data?

I would prefer you to download million songs ...READ MORE

Jul 23, 2019 in Big Data Hadoop by ravikiran
• 4,620 points

edited Jul 25, 2019 by ravikiran 955 views
0 votes
0 answers

pattern mapreduce

i need to write some mapreduce pattern ...READ MORE

Jul 23, 2019 in Big Data Hadoop by tiza
372 views
0 votes
1 answer

How can we retrieve/get complete HQL hive query from hive,spark and tez?

To get full query running for the ...READ MORE

Jul 23, 2019 in Big Data Hadoop by Lohit
2,950 views
0 votes
1 answer

HBase shell failed to connect

Enter the below command in the terminal ...READ MORE

Jul 23, 2019 in Big Data Hadoop by Joshua
1,504 views
0 votes
1 answer

Unable to run Sqoop scirpt to copy data to Cassandra

Unfortunately, this can't be achieved with open ...READ MORE

Jul 23, 2019 in Big Data Hadoop by Shri
997 views
0 votes
2 answers

Not Able to read the file from hdfs location

Please make sure you connect to spark2-shell ...READ MORE

Jul 14, 2020 in Big Data Hadoop by Shantanu
• 190 points
1,794 views
0 votes
1 answer

How to import data to hbase database?

First you have to have the file ...READ MORE

Jul 23, 2019 in Big Data Hadoop by Kiran
2,416 views
0 votes
1 answer

How to restart failed Namenode?

You need to solve the issue which ...READ MORE

Jul 23, 2019 in Big Data Hadoop by Ishan
2,682 views
0 votes
1 answer

Name node RAM metadata

For the above requirement, the memory consumption ...READ MORE

Jul 23, 2019 in Big Data Hadoop by Reshma
1,331 views
0 votes
1 answer

How can we ignore header line while loading data into Pig?

You can use the following code: A = ...READ MORE

Jul 22, 2019 in Big Data Hadoop by kiran
522 views
0 votes
1 answer

How to Sqoop in a Java Program?

You can use the following sample code for ...READ MORE

Jul 22, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
1,306 views
0 votes
2 answers

Namenode is not getting started

Hi, You can do one thing, first delete all ...READ MORE

Aug 5, 2019 in Big Data Hadoop by Namisha
764 views