Most viewed questions in Big Data Hadoop

0 votes
1 answer

How to run Hadoop 2.1.0 in Windows System?

Same problem but with recent Hadoop v. ...READ MORE

Sep 11, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
2,716 views
0 votes
1 answer

What is SerDe in Hive?

Hey, The SerDe interface allows you to instruct ...READ MORE

May 14, 2019 in Big Data Hadoop by Gitika
• 65,730 points
2,710 views
0 votes
1 answer

How to set input split settings?

It can be controlled by setting the ...READ MORE

Dec 27, 2018 in Big Data Hadoop by Omkar
• 69,180 points
2,702 views
0 votes
1 answer

Why does one remove or add nodes in a Hadoop cluster frequently?

One of the most attractive features of ...READ MORE

Dec 14, 2018 in Big Data Hadoop by Frankie
• 9,830 points
2,699 views
0 votes
1 answer

How to perform a word count on a dataframe column?

You can use the below code to ...READ MORE

Jan 22, 2019 in Big Data Hadoop by Omkar
• 69,180 points
2,697 views
0 votes
1 answer

Is there any way to get the column name along with the output while execute any query in Hive?

If we want to see the column ...READ MORE

Dec 8, 2020 in Big Data Hadoop by Gitika
• 65,730 points
2,693 views
0 votes
1 answer

How to run MapReduce program?

Eclipse is already installed in the Edureka ...READ MORE

Feb 22, 2019 in Big Data Hadoop by Suman
2,687 views
0 votes
1 answer

NameNode Status Check

There's one more way of checking the ...READ MORE

Jun 18, 2018 in Big Data Hadoop by shams
• 3,670 points
2,687 views
0 votes
1 answer

ERROR 2997: Encountered IOException. Directory does not exist.

Hi. I think the reason for this ...READ MORE

May 23, 2019 in Big Data Hadoop by Britney
2,686 views
–1 vote
1 answer

Sqoop split-by problem

Hello. The -m or --num-mappers is just a ...READ MORE

Dec 19, 2018 in Big Data Hadoop by Omkar
• 69,180 points
2,685 views
+1 vote
1 answer

Information Unavailable:Running jps command in Hadoop

Hi, I think this is because the process ...READ MORE

Jan 21, 2020 in Big Data Hadoop by anonymous
2,682 views
0 votes
1 answer

How to get started with Hadoop and do some development using Eclipse IDE?

Alright, there are couple of things that ...READ MORE

Apr 4, 2018 in Big Data Hadoop by Ashish
• 2,650 points
2,682 views
0 votes
1 answer

Query regarding String and Varchar in Hive

Varchar datatype is also saved internally as ...READ MORE

Feb 20, 2019 in Big Data Hadoop by Omkar
• 69,180 points
2,673 views
0 votes
1 answer

What is the Path of hdfs.site.xml?

Here's where you can find the file: /etc/hadoop/[service ...READ MORE

Jan 4, 2019 in Big Data Hadoop by Omkar
• 69,180 points
2,671 views
0 votes
2 answers

How to set up Map and Reduce Tasks?

Hi, The number of map tasks for a ...READ MORE

Aug 5, 2019 in Big Data Hadoop by Rashi
2,655 views
0 votes
1 answer

Zookeeper Quorum value: zookeeper.ClientCnxn: Client session timed out

Enter the below command on the Web ...READ MORE

Feb 21, 2019 in Big Data Hadoop by Omkar
• 69,180 points
2,648 views
0 votes
1 answer

Name node is in safe mode. Not able to leave

In order to forcefully let the namenode ...READ MORE

Dec 7, 2020 in Big Data Hadoop by Gitika
• 65,730 points
2,647 views
0 votes
1 answer

Sqoop error while importing data from MySql

As the error states, the target directory ...READ MORE

Feb 11, 2019 in Big Data Hadoop by Omkar
• 69,180 points
2,647 views
0 votes
1 answer

Getting error in JAVA_HOME variable.

First of all check the hadoop version ...READ MORE

May 11, 2018 in Big Data Hadoop by Shubham
• 13,490 points
2,640 views
0 votes
1 answer

When is an identity mapper/reducer used?

1.One of the simplest example of Iterative ...READ MORE

Apr 3, 2018 in Big Data Hadoop by Ashish
• 2,650 points
2,637 views
0 votes
1 answer

Hadoop: error: Cannot lock storage /app/hadoop/tmp/dfs/name. The directory is already locked.

Hi, Check your dfs.name.dir and dfs.data.dir path in hdfs-site.xml. Or ...READ MORE

Aug 5, 2019 in Big Data Hadoop by Gitika
• 65,730 points
2,636 views
0 votes
1 answer

How to use mountable HDFS on Centos?

Hey. You can use the following commands ...READ MORE

Nov 26, 2018 in Big Data Hadoop by Omkar
• 69,180 points
2,636 views
0 votes
1 answer

Why CREATE EXTERNAL keyword is used in Hive?

Hey, The create external keyword is used to create a ...READ MORE

May 15, 2019 in Big Data Hadoop by Gitika
• 65,730 points
2,635 views
0 votes
1 answer

What is the best way to integrate SAS with Hadoop without losing the parallel processing capacity of Hadoop

One of the major pushes at SAS ...READ MORE

Oct 16, 2018 in Big Data Hadoop by Frankie
• 9,830 points
2,631 views
0 votes
1 answer

How do we authenticate user from http.signature.secret file?

hadoop.http.authentication.signature.secret.file: The signature secret file for signing ...READ MORE

Apr 24, 2018 in Big Data Hadoop by Shubham
• 13,490 points
2,627 views
–1 vote
1 answer

How many FSimage files will be created in hard disk?

In Hdfs, data and metadata are decoupled. ...READ MORE

Dec 21, 2018 in Big Data Hadoop by Omkar
• 69,180 points
2,621 views
0 votes
1 answer

What Distributed Cache is actually used for in Hadoop?

Basically distributed cache allows you to cache ...READ MORE

Apr 3, 2018 in Big Data Hadoop by Ashish
• 2,650 points
2,615 views
0 votes
2 answers

Load data into the hive table

Hi, We can use a normal insert query ...READ MORE

Oct 16, 2020 in Big Data Hadoop by MD
• 95,460 points
2,607 views
0 votes
2 answers

E0901: Namenode [localhost:8020] not allowed, not in Oozies whitelist

You can add this below property in oozie-site.xml: <property> <name>oozie.service.HadoopAccessorService.jobTracker.whitelist</name> <value>myaddress:8020</value> </property> Hope ...READ MORE

Aug 6, 2019 in Big Data Hadoop by Shiva
2,604 views
0 votes
1 answer

PIG Error while loading file: -bash: syntax error near unexpected token `('

Since your data is delimited with tabs ...READ MORE

Jul 31, 2019 in Big Data Hadoop by Ritu
2,601 views
0 votes
1 answer

Is there any way to include a python package with Hadoop streaming job?

See I don't know the answer to ...READ MORE

May 10, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,595 views
0 votes
1 answer

com.mysql.jdbc.exceptions.jdbc4.CommunicationsException: Communications link failure

Sqoop is not able to communicate with ...READ MORE

Jan 29, 2019 in Big Data Hadoop by Omkar
• 69,180 points
2,593 views
0 votes
1 answer

Hive Execution Engine

The default execution engine for Hive is mr. To ...READ MORE

Jun 4, 2019 in Big Data Hadoop by Varun
2,589 views
0 votes
1 answer

How to create initial database without using Hive schema tool?

Hey, If for some reason you decide not ...READ MORE

May 29, 2019 in Big Data Hadoop by Gitika
• 65,730 points
2,587 views
+1 vote
1 answer

How to print data without brackets using PIG Dump?

Please refer to the below commands: student = ...READ MORE

Dec 18, 2018 in Big Data Hadoop by Omkar
• 69,180 points
2,583 views
0 votes
1 answer

Iterate twice on values (MapReduce)

We have to cache the values from ...READ MORE

Oct 1, 2018 in Big Data Hadoop by digger
• 26,740 points
2,581 views
0 votes
1 answer

Installing Spark on Windows

Follow the below steps to install Spark ...READ MORE

Feb 20, 2019 in Big Data Hadoop by Omkar
• 69,180 points
2,578 views
–1 vote
1 answer

How to use multiple spark version?

You can use the  SPARK_MAJOR_VERSION for this. Suppose ...READ MORE

Dec 27, 2018 in Big Data Hadoop by Omkar
• 69,180 points
2,576 views
0 votes
1 answer

How should I connect Hadoop in C#?

The Uri should be your Cluster Name, ...READ MORE

Aug 7, 2018 in Big Data Hadoop by Neha
• 6,300 points
2,576 views
0 votes
1 answer

Datanode not running on Hadoop-2.7.3

Hey, I solved this problem by removing hadoop ...READ MORE

Jun 14, 2019 in Big Data Hadoop by Gitika
• 65,730 points
2,572 views
0 votes
1 answer

How to use data compression in sqoop import?

You can enable data compression from the ...READ MORE

May 15, 2019 in Big Data Hadoop by Rocky
2,571 views
0 votes
1 answer

What is the difference between MapRed and MapReduce?

Firstly, we must learn that these are ...READ MORE

Mar 6, 2019 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,571 views
0 votes
1 answer

Not able to start namenode with ./start-all.sh

Try this: stop all the daemons: ./stop-all.sh format the namenode: cd ...READ MORE

Jan 3, 2019 in Big Data Hadoop by Omkar
• 69,180 points
2,569 views
0 votes
1 answer

Which data type is used to store the data in HBase table column?

Hey, Byte Array,  Put p = new Put(Bytes.toBytes("John Smith")); All ...READ MORE

May 29, 2019 in Big Data Hadoop by Gitika
• 65,730 points
2,568 views
0 votes
1 answer

What is the meaning of Write Ahead Log in Hadoop?

Write Ahead Log (WAL) is a file ...READ MORE

Nov 21, 2018 in Big Data Hadoop by Sunil
2,566 views
0 votes
1 answer

Sqoop: Oracle Connection String

The format for connection string for Oracle ...READ MORE

Jun 6, 2019 in Big Data Hadoop by Varun
2,564 views
0 votes
1 answer

Unable to execute HDFS commands on the terminal

Maybe because you have not started hadoop ...READ MORE

Feb 7, 2019 in Big Data Hadoop by Omkar
• 69,180 points
2,561 views
0 votes
1 answer

Can you please help with Hive Query to get FirstName, MiddleName, LastName and Suffix from FullName in Hive. Thank you

Hey, You can get first name, middle name, ...READ MORE

May 15, 2019 in Big Data Hadoop by Gitika
• 65,730 points

edited May 15, 2019 by Gitika 2,554 views
+1 vote
1 answer

Hadoop: Using composite-key

You can use a Writable, something like ...READ MORE

Nov 12, 2018 in Big Data Hadoop by Omkar
• 69,180 points
2,554 views
0 votes
1 answer

Configuring JDBC Clients for LDAP Authentication with HiveServer2

JDBC client requires a connection URL as ...READ MORE

Jun 3, 2019 in Big Data Hadoop by Raman
2,546 views