Trending questions in Big Data Hadoop

0 votes
1 answer

How to create a FileSystem object that can be used for reading from and writing to HDFS?

Read operation on HDFS In order to read ...READ MORE

Mar 21, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points

edited Mar 22, 2018 by nitinrawat895 2,679 views
0 votes
1 answer

Not able to start Job History Server in Hadoop 2.8.1

You have to start JobHistoryServer process specifically ...READ MORE

Mar 30, 2018 in Big Data Hadoop by Ashish
• 2,650 points
2,330 views
0 votes
1 answer

Cassandra for datawarehouse

Actually, Cassandra being a NoSQL database with ...READ MORE

Apr 20, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,314 views
0 votes
1 answer

What are the advantages & disadvantages of Hadoop Dockerization?

As you are already having a Hadoop ...READ MORE

Apr 18, 2018 in Big Data Hadoop by coldcode
• 2,080 points
1,365 views
0 votes
1 answer

When is an identity mapper/reducer used?

1.One of the simplest example of Iterative ...READ MORE

Apr 3, 2018 in Big Data Hadoop by Ashish
• 2,650 points
1,999 views
+1 vote
1 answer

Why is jar file required to execute a MR code instead of class file?

We use hadoop keyword to invoke the ...READ MORE

Apr 24, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,015 views
0 votes
1 answer

How to find hadoop distribution and version?

Just Use the command Hadoop version ...READ MORE

Apr 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points

edited Apr 6, 2018 by kurt_cobain 1,807 views
0 votes
1 answer

How to work with distributed cache in Hadoop?

The problem with your code is that ...READ MORE

Apr 20, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,170 views
0 votes
1 answer

How to upload file to HDFS in Ubuntu

you can use  hadoop fs -copyFromLocal  "/home/ritwi ...READ MORE

Apr 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,252 views
0 votes
1 answer

IDE for writing and running hadoop jobs?

You can use the Eclipse IDE. It's ...READ MORE

Apr 10, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,609 views
0 votes
1 answer

What Distributed Cache is actually used for in Hadoop?

Basically distributed cache allows you to cache ...READ MORE

Apr 3, 2018 in Big Data Hadoop by Ashish
• 2,650 points
1,851 views
0 votes
1 answer

How to get started with Hadoop and do some development using Eclipse IDE?

Alright, there are couple of things that ...READ MORE

Apr 4, 2018 in Big Data Hadoop by Ashish
• 2,650 points
1,760 views
0 votes
1 answer

Can we use apache Mahout without Hadoop dependency?

There is a number of algorithm implementations ...READ MORE

Apr 26, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
771 views
0 votes
1 answer

How can I use my host machine’s web browser to check my HDFS services running in the VM?

The sole purpose of the virtual machine ...READ MORE

Apr 18, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,092 views
0 votes
1 answer

In which directory Hadoop should be installed?

You can install Hadoop anywhere. You just ...READ MORE

Apr 17, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,173 views
0 votes
1 answer

Where Big Data tools like Hadoop and Spark comes into picture when we talk about ETL?

When we talk about ETL, ETL means ...READ MORE

May 3, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
413 views
0 votes
1 answer

What happend while formatting hadoop namenode?

Basically, NameNode is the master daemon that ...READ MORE

May 4, 2018 in Big Data Hadoop by Ashish
• 2,650 points
408 views
0 votes
1 answer

Replication Factor in Hadoop?

If you have provided a replication factor ...READ MORE

Apr 18, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,059 views
0 votes
1 answer

Why ResourceManager crashes after sometime or while accessing HDFS in Hadoop 2.8.1 and Ubuntu 16.04?

I was facing the same problem and ...READ MORE

Apr 15, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,152 views
0 votes
1 answer

What is the function of getLocalCacheArchives method?

We use distributed cache to share those ...READ MORE

Apr 29, 2018 in Big Data Hadoop by Shubham
• 13,490 points
543 views
0 votes
1 answer

How to configure secondary namenode in Hadoop 2.x ?

bin/hadoop-daemon.sh start [namenode | secondarynamenode | datanode ...READ MORE

Apr 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,539 views
0 votes
1 answer

Relationship between Spark, Hadoop and Cassandra?

Spark is a distributed in memory processing ...READ MORE

Mar 26, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,008 views
0 votes
1 answer

Getting error in Hadoop Streaming: Type mismatch in Key from Map

In Hadoop streaming you have to customize ...READ MORE

Apr 18, 2018 in Big Data Hadoop by coldcode
• 2,080 points
1,010 views
0 votes
1 answer

Different ports in a Hadoop cluster environment?

Below image will help you in understanding ...READ MORE

Apr 6, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,535 views
0 votes
1 answer

Map and Reduce task memory settings in Hadoop YARN

It's preferable and generally, it is recommended ...READ MORE

Apr 18, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,003 views
0 votes
1 answer

How do I parse pdf file using MapReduce?

If you have your own custom InputFormat(WholeFileInputFormat). ...READ MORE

Apr 11, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,269 views
0 votes
1 answer

HortonWorks Hadoop encryption tools and data security

There are many tools available for encrypting ...READ MORE

Apr 20, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
857 views
0 votes
1 answer

How do I print hadoop properties in command line?

You can use the following command to get ...READ MORE

Apr 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,460 views
0 votes
1 answer

How to install and configure a multi-node Hadoop cluster?

I would recommend you to install Cent ...READ MORE

Mar 22, 2018 in Big Data Hadoop by Shubham
• 13,490 points
2,133 views
0 votes
1 answer

Which is better Neo4j or HBASE ?

For your requirement, I would recommend you ...READ MORE

Apr 26, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
578 views
0 votes
1 answer

HDFS Location in Hadoop?

Try using the same path where you ...READ MORE

Apr 19, 2018 in Big Data Hadoop by Shubham
• 13,490 points
876 views
0 votes
1 answer

Error in Hadoop Mapreduce

The file that you are referring here ...READ MORE

Apr 19, 2018 in Big Data Hadoop by Shubham
• 13,490 points
874 views
0 votes
1 answer

How can we send data from MongoDB to Hadoop?

The MongoDB Connector for Hadoop reads data ...READ MORE

Mar 27, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,894 views
0 votes
1 answer

What are the different ways to load data from Hadoop to Azure Data Lake?

I would recommend you to go through ...READ MORE

Apr 18, 2018 in Big Data Hadoop by coldcode
• 2,080 points
898 views
0 votes
1 answer

What does Context mean in Hadoop MapR terms ?

The new API makes extensive use of ...READ MORE

Apr 6, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,412 views
0 votes
1 answer

How to perform Near-real time analytics using Hadoop?

To perform Near-real time analytics, you have ...READ MORE

Apr 24, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
566 views
0 votes
1 answer

How to checkout Hadoop 2.6.0 from git

Clone the following Git repository: git clone git ...READ MORE

Apr 23, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
618 views
0 votes
1 answer

Hadoop for data migration

You can use Hadoop for these kinds ...READ MORE

Apr 18, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
813 views
0 votes
1 answer

Why am I not able to see the Hadoop daemons that are running?

I guess you are starting the services ...READ MORE

Apr 18, 2018 in Big Data Hadoop by Shubham
• 13,490 points
797 views
0 votes
1 answer

How to implement data locality in Hadoop MapReduce?

You can use this getFileBlockLocations method of ...READ MORE

Apr 20, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
707 views
+1 vote
1 answer

I installed Spark but while executing command, I am getting ‘hadoop’ command not found error?

For accessing Hadoop commands & HDFS, you ...READ MORE

Mar 21, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,969 views
+2 votes
1 answer

Is Kafka and Zookeeper are required in a Big Data Cluster?

Apache Kafka is one of the components ...READ MORE

Mar 23, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,871 views
0 votes
1 answer

Getting error while building Hadoop core jar using ant.

I think you are missing libtool library. ...READ MORE

Apr 18, 2018 in Big Data Hadoop by coldcode
• 2,080 points
774 views
0 votes
1 answer

Why we are configuring mapred.job.tracker in YARN?

I really dont know the reason behind ...READ MORE

Mar 29, 2018 in Big Data Hadoop by Ashish
• 2,650 points
1,619 views
0 votes
1 answer

Are some Pig real time use cases available?

You can refer to this website https://acadgild.com/blog/pig-u ...READ MORE

Apr 21, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
666 views
0 votes
1 answer

Why Java Code in Hadoop uses own Data Types instead of basic Data types?

Hadoop provides us Writable interface based data ...READ MORE

Apr 15, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
884 views
0 votes
1 answer

Hadoop demo Code not working

You can use this command: "hadoop fs ...READ MORE

Apr 21, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
618 views
0 votes
1 answer

How Hadoop replication works?

dfs.replication is by default 3 and you ...READ MORE

Apr 19, 2018 in Big Data Hadoop by Shubham
• 13,490 points
669 views
0 votes
1 answer

Hadoop single node cluster set up issues

As far as the error, there's a ...READ MORE

Apr 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
655 views
0 votes
1 answer

How Impala is fast compared to Hive in terms of query response?

Impala provides faster response as it uses MPP(massively ...READ MORE

Mar 21, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,881 views