What are the advantages & disadvantages of Hadoop Dockerization?

I am working on a Hadoop cluster that uses Hue, Flume & Cassandra. I have heard about Docker & have an idea of how it works. Before actually deploying the cluster in a real-time environment, I want to consider the advantages & disadvantages of using Docker containers for Hadoop.

I guess portability is one of the major benefits of using Docker, but I am interested in knowing more and comparing. Can anyone help me out with this?
Apr 18, 2018 in Big Data Hadoop by Shubham

1 answer to this question.

Since you already have a Hadoop cluster, you know how difficult it is to reproduce that environment. The next important point is isolation: Docker helps you isolate the environment, so its dependencies will not conflict with other applications on your host machine.

From the Hadoop perspective, Docker makes it easy to set up a multi-node cluster. You can set up one Hadoop container, then replicate that container and change the settings for each node.
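
To make the "replicate and change the settings" idea concrete, here is a minimal Python sketch. It only builds the per-node settings as plain dictionaries and does not call Docker itself. The image name, network name, hostnames, and the HADOOP_ROLE and MASTER_HOST variables are all hypothetical placeholders, not settings from a real Hadoop image.

```python
from copy import deepcopy

# Hypothetical base settings for one Hadoop container. The image name,
# network name and environment keys are illustrative assumptions only.
BASE_NODE = {
    "image": "example/hadoop:latest",
    "network": "hadoop-net",
    "environment": {"HADOOP_ROLE": "worker"},
}


def make_cluster(worker_count):
    """Replicate the base definition and adjust the settings per node."""
    master = deepcopy(BASE_NODE)
    master["hostname"] = "hadoop-master"
    master["environment"]["HADOOP_ROLE"] = "master"
    nodes = {"hadoop-master": master}

    for i in range(1, worker_count + 1):
        # deepcopy keeps each node's nested settings independent.
        worker = deepcopy(BASE_NODE)
        name = f"hadoop-worker-{i}"
        worker["hostname"] = name
        # Each worker is told where to find the master (assumed variable).
        worker["environment"]["MASTER_HOST"] = "hadoop-master"
        nodes[name] = worker
    return nodes


if __name__ == "__main__":
    for name, node in make_cluster(2).items():
        print(name, node["environment"])
```

In practice you would feed settings like these into whatever you use to start the containers, for example a Compose file or a small script. The point is only that every node starts from the same definition and differs in a few values.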

That said, if setting up a multi-node cluster is not painful for you and you have no issues with dependencies, you do not have to use Docker containers just because they are a hot topic. It depends only on your needs and what is easiest for you.
answered Apr 18, 2018 by coldcode
