Most viewed questions in Big Data Hadoop

–1 vote
1 answer

Setting Hive properties

Hey. It's definitely not a stupid question. ...READ MORE

Jan 5, 2019 in Big Data Hadoop by Omkar
• 69,230 points
2,157 views
0 votes
1 answer

Hadoop job getting executed in local mode only?

Basically, you have not configured the mapreduce.framework.name ...READ MORE

May 3, 2018 in Big Data Hadoop by Atul
• 10,240 points
2,144 views
0 votes
1 answer

Is it compulsory to have the hadoop user as sudo user?

No, it is not important to have ...READ MORE

May 8, 2018 in Big Data Hadoop by Shubham
• 13,490 points
2,137 views
0 votes
1 answer

ERROR 2997: Encountered IOException. Directory does not exist.

Hi. I think the reason for this ...READ MORE

May 23, 2019 in Big Data Hadoop by Britney
2,135 views
0 votes
1 answer

User:hdfs not allowed to do 'DECRYPT_EEK' on 'key'.

Hi@Shilpa, I don't think you have given permission ...READ MORE

Nov 20, 2020 in Big Data Hadoop by MD
• 95,440 points
2,133 views
0 votes
1 answer

Hadoop: Not able to start Namenode

Stop all running server 1) stop-all.sh Edit the ...READ MORE

Nov 8, 2018 in Big Data Hadoop by Omkar
• 69,230 points
2,117 views
0 votes
1 answer

INFO hdfs.DFSClient: Could not obtain block blk_-from any node: java.io.IOException: No live nodes contain current block

Hi@akhtar, I think your Datanodes are not in ...READ MORE

Apr 17, 2020 in Big Data Hadoop by MD
• 95,440 points
2,115 views
0 votes
1 answer

Which data type is used to store the data in HBase table column?

Hey, Byte Array,  Put p = new Put(Bytes.toBytes("John Smith")); All ...READ MORE

May 29, 2019 in Big Data Hadoop by Gitika
• 65,910 points
2,104 views
0 votes
1 answer

failed to start the Namenode format in hadoop generation 2

Hi, You can check this out: 1. Delete all ...READ MORE

Jul 2, 2019 in Big Data Hadoop by Ria

edited Jul 2, 2019 by Gitika 2,100 views
0 votes
1 answer

Hadoop: How to keep duplicates in Hive using collect_set()?

SELECT hash_id, COLLECT_LIST(num_of_cats) AS ...READ MORE

Nov 2, 2018 in Big Data Hadoop by Omkar
• 69,230 points
2,097 views
0 votes
1 answer

How to find the used cache in HDFS

hdfs dfsadmin -report This command tells fs ...READ MORE

May 4, 2018 in Big Data Hadoop by Shubham
• 13,490 points
2,097 views
0 votes
2 answers

Which framework allows random, realtime read/write access to hundreds of terabytes of data?

For this type of case you can ...READ MORE

Aug 14, 2018 in Big Data Hadoop by anonymous
2,093 views
0 votes
1 answer

Mention what are the main configuration parameters that user need to specify to run MapReduce Job?

Hi, The user of the MapReduce framework needs ...READ MORE

Jun 10, 2019 in Big Data Hadoop by Gitika
• 65,910 points
2,091 views
+1 vote
1 answer

starting hive gave me exception

Hi@Hans, You may got this error, if you ...READ MORE

May 20, 2020 in Big Data Hadoop by MD
• 95,440 points
2,086 views
0 votes
1 answer

How to perform a word count on a dataframe column?

You can use the below code to ...READ MORE

Jan 22, 2019 in Big Data Hadoop by Omkar
• 69,230 points
2,084 views
0 votes
1 answer

Change permissions of a file in HDFS.

Hi@akhtar, You can use the Chmod command to ...READ MORE

Oct 1, 2020 in Big Data Hadoop by MD
• 95,440 points
2,078 views
0 votes
1 answer

com.mysql.jdbc.exceptions.jdbc4.CommunicationsException: Communications link failure

Sqoop is not able to communicate with ...READ MORE

Jan 29, 2019 in Big Data Hadoop by Omkar
• 69,230 points
2,073 views
0 votes
1 answer

Sqoop: Oracle Connection String

The format for connection string for Oracle ...READ MORE

Jun 6, 2019 in Big Data Hadoop by Varun
2,072 views
0 votes
1 answer

Exception in thread "main" java.lang.NoClassDefFoundError

As far I can make out from ...READ MORE

Aug 9, 2019 in Big Data Hadoop by Arpit
2,067 views
0 votes
1 answer

Setting Impala host and port in Hue

To set the Impala server details, you ...READ MORE

May 24, 2019 in Big Data Hadoop by Kunal
2,066 views
0 votes
2 answers

Hadoop query regarding setJarByClass method of Job class

http://www.bigdataspeak.com/2014/06/what-is-need-to-use-jobsetjarbycla ...READ MORE

Feb 7, 2019 in Big Data Hadoop by siri
2,066 views
–1 vote
1 answer

How many FSimage files will be created in hard disk?

In Hdfs, data and metadata are decoupled. ...READ MORE

Dec 21, 2018 in Big Data Hadoop by Omkar
• 69,230 points
2,064 views
0 votes
1 answer

What is the Path of hdfs.site.xml?

Here's where you can find the file: /etc/hadoop/[service ...READ MORE

Jan 4, 2019 in Big Data Hadoop by Omkar
• 69,230 points
2,062 views
+1 vote
1 answer

How to print data without brackets using PIG Dump?

Please refer to the below commands: student = ...READ MORE

Dec 18, 2018 in Big Data Hadoop by Omkar
• 69,230 points
2,058 views
0 votes
1 answer

Relationship between Spark, Hadoop and Cassandra?

Spark is a distributed in memory processing ...READ MORE

Mar 26, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,043 views
+1 vote
1 answer

Streaming facebook data using Flume framework!!!

Hi You can find the facebook 'API' from ...READ MORE

Jan 21, 2020 in Big Data Hadoop by anonymous
2,032 views
0 votes
1 answer

Changing hostname in fs.default.name property puts Hadoop in safemode

First of all, in Safe mode the ...READ MORE

Apr 27, 2018 in Big Data Hadoop by Shubham
• 13,490 points
2,031 views
0 votes
1 answer

How to create initial database without using Hive schema tool?

Hey, If for some reason you decide not ...READ MORE

May 29, 2019 in Big Data Hadoop by Gitika
• 65,910 points
2,028 views
0 votes
1 answer

Getting error in JAVA_HOME variable.

First of all check the hadoop version ...READ MORE

May 11, 2018 in Big Data Hadoop by Shubham
• 13,490 points
2,026 views
0 votes
0 answers

How I can kill the jobs using jobID running in local mode with Hadoop

I am Running hadoop jobs in local ...READ MORE

Aug 26, 2020 in Big Data Hadoop by kamboj
• 140 points
2,021 views
0 votes
1 answer

Namenode not getting started

You need to delete the contents of the ...READ MORE

Apr 23, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
2,017 views
0 votes
1 answer

How to change default block size in Hadoop Cluster?

Hi@akhtar, You can change the default block size ...READ MORE

Oct 3, 2020 in Big Data Hadoop by MD
• 95,440 points
2,016 views
0 votes
1 answer

Error connecting hive using python pyhs2

Seems like the host IP is not ...READ MORE

Jan 11, 2019 in Big Data Hadoop by Omkar
• 69,230 points
2,016 views
0 votes
1 answer

When is an identity mapper/reducer used?

1.One of the simplest example of Iterative ...READ MORE

Apr 3, 2018 in Big Data Hadoop by Ashish
• 2,650 points
2,015 views
0 votes
1 answer

Which among the following are the Features of Hadoop?

Apache Hadoop is a collection of open-source ...READ MORE

Dec 9, 2021 in Big Data Hadoop by Kavya
• 700 points
2,014 views
0 votes
1 answer

Why does one remove or add nodes in a Hadoop cluster frequently?

One of the most attractive features of ...READ MORE

Dec 14, 2018 in Big Data Hadoop by Frankie
• 9,830 points
2,007 views
0 votes
1 answer

JDBC connection to Hive using TLS/SSL

Hi. The properties are right but in ...READ MORE

May 31, 2019 in Big Data Hadoop by Karan
2,006 views
0 votes
1 answer

How to use data compression in sqoop import?

You can enable data compression from the ...READ MORE

May 15, 2019 in Big Data Hadoop by Rocky
2,006 views
+1 vote
1 answer

I installed Spark but while executing command, I am getting ‘hadoop’ command not found error?

For accessing Hadoop commands & HDFS, you ...READ MORE

Mar 21, 2018 in Big Data Hadoop by Shubham
• 13,490 points
2,004 views
0 votes
1 answer

Installing Spark on Windows

Follow the below steps to install Spark ...READ MORE

Feb 20, 2019 in Big Data Hadoop by Omkar
• 69,230 points
2,003 views
0 votes
1 answer

How do I join 2 tables in PIG using 2 fields?

Here, we have two tables: Tab1 having columns ...READ MORE

Dec 13, 2018 in Big Data Hadoop by Omkar
• 69,230 points
1,999 views
0 votes
1 answer

Does setup & clean methods in Hadoop MapReduce is executed after each task?

Setup & cleanup methods are executed after ...READ MORE

May 8, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,997 views
0 votes
1 answer

Is there any way to include a python package with Hadoop streaming job?

See I don't know the answer to ...READ MORE

May 10, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,996 views
0 votes
1 answer

How to add mysql database for Hue Query app?

Hi. To add mysql database in the config, open ...READ MORE

May 27, 2019 in Big Data Hadoop by Vasu
1,995 views
0 votes
1 answer

Use of MapReduce in PIG

Apache Pig programs are written in a ...READ MORE

Jul 25, 2018 in Big Data Hadoop by shams
• 3,670 points
1,991 views
0 votes
2 answers

How to see MySql service is running or not using linux command?

Hi, You can simply run the following commands ...READ MORE

Jan 21, 2020 in Big Data Hadoop by anonymous
1,990 views
0 votes
1 answer

What is the best way to integrate SAS with Hadoop without losing the parallel processing capacity of Hadoop

One of the major pushes at SAS ...READ MORE

Oct 16, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,988 views
0 votes
1 answer

How to enable impersonation in HiveServer2?

You have to configure 2 files for ...READ MORE

Jun 3, 2019 in Big Data Hadoop by Reshma
1,987 views
0 votes
1 answer

Not able to start namenode with ./start-all.sh

Try this: stop all the daemons: ./stop-all.sh format the namenode: cd ...READ MORE

Jan 3, 2019 in Big Data Hadoop by Omkar
• 69,230 points
1,983 views
0 votes
1 answer

How to find the number of blocks for a file in Hadoop?

Hi@akhtar, You can use Hadoop file system command to ...READ MORE

Oct 13, 2020 in Big Data Hadoop by MD
• 95,440 points
1,977 views