What are the advantages & disadvantages of Hadoop Dockerization?

0 votes
I am working on a Hadoop cluster which is using Hue, Flume & Cassandra. I have heard about Docker & have an idea about how it works. Before actually deploying the cluster in a real time environment, I want to consider the advantages & disadvantages of using docker container for Hadoop?

I guess the portability is one of the major benefit of using docker, but I am interested in knowing and comparing more. Can anyone help me out on this?
Apr 18, 2018 in Big Data Hadoop by Shubham
• 12,150 points
215 views

1 answer to this question.

Your answer

Your name to display (optional):
Privacy: Your email address will only be used for sending these notifications.
0 votes
As you are already having a Hadoop cluster, you can understand it is difficult to reproduce this environment. Then next important thing is, docker helps you to isolate the environment which will not conflict any dependencies with other applications present in your host machine.

Talking from the perspective of Hadoop, to easily setup a multi node cluster. You can setup one docker Hadoop container. Then replicate the container and the change the setting. So, it will be very easy to setup a multi node cluster.

But again I would like to add, if you have no pain in setting a multi node cluster or you have no issue with the dependencies, So, you do not have to use the docker container because it is a hot topic. It only depends on your need and your ease.
answered Apr 18, 2018 by coldcode
• 1,980 points

Related Questions In Big Data Hadoop

0 votes
1 answer
0 votes
1 answer

What are the different ways of Installing Hadoop into our local machine?

Hadoop runs on Unix and on Windows. ...READ MORE

answered Aug 3, 2018 in Big Data Hadoop by Neha
• 6,140 points
94 views
0 votes
1 answer

What do we exactly mean by “Hadoop” – the definition of Hadoop?

The official definition of Apache Hadoop given ...READ MORE

answered Mar 16, 2018 in Big Data Hadoop by Shubham
119 views
0 votes
12 answers

What is the difference between Hadoop/HDFS & HBase?

HDFS is a distributed file system whereas ...READ MORE

answered Apr 26 in Big Data Hadoop by Arihar
• 160 points
4,154 views
0 votes
1 answer

What is the use of sequence file in Hadoop?

Sequence files are binary files containing serialized ...READ MORE

answered Apr 5, 2018 in Big Data Hadoop by Ashish
• 2,630 points
395 views
0 votes
1 answer

What are the hardware requirements for installing Hadoop on my Laptop?

You can either install Apache Hadoop on ...READ MORE

answered Apr 10, 2018 in Big Data Hadoop by Shubham
• 12,150 points
790 views
0 votes
12 answers

What is Zookeeper? What is the purpose of Zookeeper in Hadoop Ecosystem?

Hey, Apache Zookeeper says that it is a ...READ MORE

answered Apr 29 in Big Data Hadoop by Gitika
• 8,100 points
2,425 views
0 votes
1 answer

Best way of starting & stopping the Hadoop daemons with command line

First way is to use start-all.sh & ...READ MORE

answered Apr 15, 2018 in Big Data Hadoop by Shubham
• 12,150 points
692 views
0 votes
1 answer
0 votes
1 answer

What are the different ways to load data from Hadoop to Azure Data Lake?

I would recommend you to go through ...READ MORE

answered Apr 18, 2018 in Big Data Hadoop by coldcode
• 1,980 points
43 views

© 2018 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
"PMP®","PMI®", "PMI-ACP®" and "PMBOK®" are registered marks of the Project Management Institute, Inc. MongoDB®, Mongo and the leaf logo are the registered trademarks of MongoDB, Inc.