Most voted questions in Big Data Hadoop

0 votes
1 answer

How many partitions can do for one table?

Well, there are two kinds of partitions: 1. ...READ MORE

Jul 22, 2019 in Big Data Hadoop by Kunal
522 views
0 votes
1 answer

Can we use different input and output format classes?

Yes, InputFormatClass and OutputFormatClass are independent of ...READ MORE

Jul 22, 2019 in Big Data Hadoop by Jishan
466 views
0 votes
1 answer

Why do we use job.waitForCompletion(true) ?

The main reason for job.waitForCompletion exists is that ...READ MORE

Jul 22, 2019 in Big Data Hadoop by Kiran
924 views
0 votes
1 answer

Mapreduce: What is the use of setting the name of the job?

Job job = new Job(conf,"job_name"), is just ...READ MORE

Jul 22, 2019 in Big Data Hadoop by Madhu
911 views
0 votes
1 answer

Output types of mapper and reducer does not match

job.setOutputValueClass will set the types expected as ...READ MORE

Jul 22, 2019 in Big Data Hadoop by Reena
2,429 views
0 votes
1 answer

Not a host:port pair: PBUF?

Hey. This error usually occurs when the ...READ MORE

Jul 22, 2019 in Big Data Hadoop by Esha
1,006 views
0 votes
1 answer

Mutliple Output Format in Hadoop

Each reducer uses an OutputFormat to write ...READ MORE

Jul 19, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
441 views
0 votes
1 answer

How to fix corrupt files on HDFS

1 - Spark if following slave/master architecture. So ...READ MORE

Jul 18, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
918 views
0 votes
1 answer

Sqoop export not working

The issue that you might be getting ...READ MORE

Jul 16, 2019 in Big Data Hadoop by Tina
2,428 views
0 votes
1 answer

Spark Vs Hive LLAP Question

While Apache Hive and Spark SQL perform ...READ MORE

Jul 16, 2019 in Big Data Hadoop by Karan
3,397 views
0 votes
1 answer

Pig: Difference between inner bag and outer bag

Outer Bag: An outer bag is nothing but ...READ MORE

Jul 16, 2019 in Big Data Hadoop by Firoz
2,136 views
0 votes
1 answer

CSV integration with Hadoop

For integrating Hadoop with CSV, we can use ...READ MORE

Jul 16, 2019 in Big Data Hadoop by Krish
780 views
0 votes
1 answer

RDMBS integration with Hadoop

About integrating RDBMS with Hadoop, you can ...READ MORE

Jul 16, 2019 in Big Data Hadoop by Nanda
477 views
0 votes
1 answer

How to execute combiner and partitioning program without adding package name?

Yes, it is possible to do so ...READ MORE

Jul 16, 2019 in Big Data Hadoop by Raman
748 views
0 votes
1 answer

Output Splitting problem in Hadoop

When you are loading two different files, ...READ MORE

Jul 16, 2019 in Big Data Hadoop by Sayni
869 views
0 votes
1 answer

Copy file from local to hdfs from the spark job in yarn mode

Please refer to the below code: import org.apache.hadoop.conf.Configuration import ...READ MORE

Jul 16, 2019 in Big Data Hadoop by Raj
10,852 views
0 votes
1 answer

How does data gets split in Sqoop?

I will drop the answer in the ...READ MORE

Jul 16, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
8,552 views
0 votes
1 answer

how to store images and videos on to HDFS?

HDFS is capable to accept data in ...READ MORE

Jul 16, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
2,581 views
0 votes
1 answer

Pig script not dumping the data correctly

The first column is denoted by $0, ...READ MORE

Jul 15, 2019 in Big Data Hadoop by George
986 views
0 votes
2 answers

Error in running hdfs commands

Hey, You can check this command also  hadoop fs ...READ MORE

Aug 5, 2019 in Big Data Hadoop by Rashi
1,241 views
0 votes
1 answer

How to Import the complete directory into HDFS?

First, need to create a directory in Hadoop: $ ...READ MORE

Jul 15, 2019 in Big Data Hadoop by Hiran
535 views
0 votes
1 answer

Which one is better MR or Spark?

The above difference clearly points out that ...READ MORE

Jul 15, 2019 in Big Data Hadoop by Daniel
1,575 views
0 votes
1 answer

Sqoop: Dsqoop.export.records.per.statement option

Dsqoop.export.records.per.statement=1 is as the name suggests how many ...READ MORE

Jul 15, 2019 in Big Data Hadoop by Krish
2,715 views
0 votes
1 answer

Load data into teradata using sqoop

The general syntax to do this as ...READ MORE

Jul 15, 2019 in Big Data Hadoop by Ritu
1,827 views
0 votes
2 answers

Exception in thread "main" java.lang.UnsupportedClassVersionError

Hi, keep in mind do not forget to ...READ MORE

Aug 5, 2019 in Big Data Hadoop by Rashi
1,078 views
0 votes
1 answer

Import JSON file into hive

There are two ways to load json ...READ MORE

Jul 15, 2019 in Big Data Hadoop by Guru
14,092 views
0 votes
1 answer

Hive Query to sort data

If you are trying to sort first ...READ MORE

Jul 14, 2019 in Big Data Hadoop by Tina
604 views
0 votes
1 answer

Creating a hive script and execute in the edureka cloudlab.

Please remove -f option from hive arguments and use hql extension ...READ MORE

Jul 14, 2019 in Big Data Hadoop by Karan
527 views
0 votes
2 answers

How to set up Map and Reduce Tasks?

Hi, The number of map tasks for a ...READ MORE

Aug 5, 2019 in Big Data Hadoop by Rashi
1,854 views
0 votes
1 answer

Unable to Locate WinUtils Library in Hadoop binary path.

If you are facing this problem while running a ...READ MORE

Jul 11, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
1,824 views
0 votes
1 answer

Primary keys in apache Spark

from pyspark.sql.functions import monotonically_increasing_id df.withColumn("id", monotonically_increasing_id()).show() Verify the second ...READ MORE

Jul 11, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
1,086 views
0 votes
1 answer

How to remove duplicate records from Hive table?

A record is duplicate if there are ...READ MORE

Jul 11, 2019 in Big Data Hadoop by Bhuvan
9,298 views
0 votes
1 answer

Hive: ParseException line 5:20 missing EOF at '.' near

In the stored as parameter, you have specified the ...READ MORE

Jul 11, 2019 in Big Data Hadoop by Kim
16,826 views
0 votes
1 answer

Not able to upload files to hdfs: No such file or directory

The reason for this error is that ...READ MORE

Jul 11, 2019 in Big Data Hadoop by Srujan
3,543 views
0 votes
1 answer

HDFS: Upload file from local file system to Edureka cloud lab

Step 1: Go to My Lab and ...READ MORE

Jul 11, 2019 in Big Data Hadoop by Zora
1,142 views
0 votes
1 answer

Command to copy file from desktop to hdfs

You can use either of the below ...READ MORE

Jul 11, 2019 in Big Data Hadoop by Firoz
3,822 views
0 votes
1 answer

How to retrieve/get complete HQL hive query from hive,spark and tez?

To get full query running for the ...READ MORE

Jul 11, 2019 in Big Data Hadoop by Yogi
3,761 views
0 votes
1 answer

Pig: Does these jar files run in NodeManager to produce result?

Below is what happens with the map-reduce ...READ MORE

Jul 10, 2019 in Big Data Hadoop by Lucy
461 views
0 votes
1 answer

Is data sent to some other location to be processed in pig?

With pig, we are not taking data ...READ MORE

Jul 10, 2019 in Big Data Hadoop by Adi
498 views
0 votes
1 answer

When and when not to use PigStore?

The Pig storage is not used only ...READ MORE

Jul 10, 2019 in Big Data Hadoop by Rushil
514 views
0 votes
1 answer

Use of Group ALL in Pig

Suppose we have a data set as ...READ MORE

Jul 10, 2019 in Big Data Hadoop by Roshan
2,866 views
0 votes
1 answer

How to import data to hbase database?

There are 3 Ways to Load Data ...READ MORE

Jul 10, 2019 in Big Data Hadoop by Karan Reddy
5,019 views
0 votes
1 answer

Sqoop ERROR 1290 (HY000): The MySQL server is running with the --secure-file-priv option

If your dataset is in the FTP ...READ MORE

Jul 10, 2019 in Big Data Hadoop by Kunal
2,128 views
0 votes
1 answer

Syntax to mention that the data is enclosed in double quotes in pig

So, we will execute the below command, new_A_2 ...READ MORE

Jul 10, 2019 in Big Data Hadoop by Esha
1,108 views
0 votes
1 answer

How can we ignore header line while loading data into Pig?

Suppose you need to load this in ...READ MORE

Jul 10, 2019 in Big Data Hadoop by Kiran
2,480 views
0 votes
1 answer

Can we use different i/p and o/p format classes in mapreduce code?

Yes, InputFormatClass and OutputFormatClass are independent of ...READ MORE

Jul 10, 2019 in Big Data Hadoop by Jimmy
607 views
0 votes
1 answer

Why do we use job.waitForCompletion(true)?

The main reason for job.waitForCompletion exists is ...READ MORE

Jul 10, 2019 in Big Data Hadoop by Josh
4,463 views
0 votes
1 answer

What is the use of setting the name of the job in mapreduce?

Job job = new Job(conf,"job_name") is just used ...READ MORE

Jul 10, 2019 in Big Data Hadoop by Umesh
399 views
0 votes
1 answer

How to solve error caused due to output types of mapper and reducer not matching?

job.setOutputValueClass will set the types expected as ...READ MORE

Jul 9, 2019 in Big Data Hadoop by Rishab
910 views
0 votes
1 answer

Integration of hbase

For HBase Pig integration, you can refer to ...READ MORE

Jul 9, 2019 in Big Data Hadoop by Kiran
639 views