How data distribution is done in Hadoop?

0 votes

If there is any new nodes added in Hadoop how the distribution of data process will occur?

Apr 3, 2019 in Big Data Hadoop by sunny
194 views

1 answer to this question.

0 votes

To understand how or what are the process for the data distribution in Hadoop can be done , I will come up with the procedure how it works:

 Hadoop is a distributed file system which follows Master Slave Architecture for Data distribution. In this architecture there is a cluster which consists of one single Name node(Master node) and Data nodes (slave nodes).

Here the Name node and Data node will be working to distribute the file in a structural way with limited memory byte in each nodes.

Name node has the function to manage and maintain the Data nodes. It records the Meta data of the actual data . Meta data is something which keep the location of the block ,the size of the block. Name node also responsible to records if any modification made to the files . For example: If any file is deleted , the Name node will immediately record it.

Data nodes are the slave nodes which is responsible to store the actual data ,where data will be stored in different Data nodes. It also sends the heartbeat a kind of report to Name node periodically to make sure that the Data nodes are working properly or not. In every 3 secs the process will repeat by default. 

I hope it will be helpful .

answered Apr 4, 2019 by Gitika
• 25,460 points

Related Questions In Big Data Hadoop

0 votes
1 answer

What is Modeling data in Hadoop and how to do it?

I suggest spending some time with Apache ...READ MORE

answered Sep 19, 2018 in Big Data Hadoop by Frankie
• 9,810 points
158 views
0 votes
1 answer

How does Hadoop process data which is split across multiple boundaries in an HDFS?

I found some comments: from the Hadoop ...READ MORE

answered Jul 1, 2019 in Big Data Hadoop by ravikiran
• 4,600 points
53 views
+1 vote
1 answer

Is Hadoop only Framework in Big Data Ecosystem ?

Actually there are many other frameworks, one of ...READ MORE

answered Mar 26, 2018 in Big Data Hadoop by Ashish
• 2,630 points
100 views
+1 vote
1 answer

Hadoop Mapreduce word count Program

Firstly you need to understand the concept ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 10,840 points
3,915 views
0 votes
1 answer

hadoop.mapred vs hadoop.mapreduce?

org.apache.hadoop.mapred is the Old API  org.apache.hadoop.mapreduce is the ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 10,840 points
538 views
+1 vote
11 answers

hadoop fs -put command?

put syntax: put <localSrc> <dest> copy syntax: copyFr ...READ MORE

answered Dec 7, 2018 in Big Data Hadoop by Aditya
20,817 views
0 votes
1 answer

Hadoop dfs -ls command?

In your case there is no difference ...READ MORE

answered Mar 16, 2018 in Big Data Hadoop by kurt_cobain
• 9,290 points
1,476 views
0 votes
12 answers

What is Zookeeper? What is the purpose of Zookeeper in Hadoop Ecosystem?

Hey, Apache Zookeeper says that it is a ...READ MORE

answered Apr 29, 2019 in Big Data Hadoop by Gitika
• 25,460 points
7,089 views
0 votes
1 answer

How Big is Big data?

First question which strikes that how BIG ...READ MORE

answered Apr 4, 2019 in Big Data Hadoop by Gitika
• 25,460 points
86 views