How is data distributed in Hadoop?

If new nodes are added to a Hadoop cluster, how is data distributed across them?

Apr 3 in Big Data Hadoop by sunny

1 answer to this question.

To understand how data distribution works in Hadoop, here is an overview of the process:

HDFS, the Hadoop Distributed File System, follows a master/slave architecture for data distribution. A cluster consists of a single NameNode (the master node) and multiple DataNodes (the slave nodes).

The NameNode and DataNodes work together to distribute each file in a structured way: the file is split into fixed-size blocks, so each node stores only a limited number of bytes of it.
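To make the block idea concrete, here is a minimal sketch in Python. It is a toy model, not Hadoop code: the function names are made up for illustration, the 128 MB block size is the default in Hadoop 2.x and later, and the round-robin assignment is an assumption that stands in for the real placement policy (which is rack-aware and also replicates each block).

```python
from itertools import cycle

MB = 1024 * 1024

def split_into_blocks(file_size, block_size):
    """Return the sizes (in bytes) of the blocks a file is cut into."""
    if block_size <= 0:
        raise ValueError("block_size must be positive")
    full, rest = divmod(file_size, block_size)
    # All blocks are full-sized except possibly the last one.
    return [block_size] * full + ([rest] if rest else [])

def assign_blocks(block_sizes, data_nodes):
    """Assign block ids to DataNodes round-robin (assumption, not HDFS's policy)."""
    placement = {node: [] for node in data_nodes}
    for block_id, node in zip(range(len(block_sizes)), cycle(data_nodes)):
        placement[node].append(block_id)
    return placement

# A 300 MB file with 128 MB blocks becomes two full blocks and one 44 MB block.
blocks = split_into_blocks(300 * MB, 128 * MB)
placement = assign_blocks(blocks, ["dn1", "dn2"])
```

With two hypothetical DataNodes named dn1 and dn2, the first and third blocks land on dn1 and the second on dn2.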

The NameNode manages and maintains the DataNodes. It records the metadata for the actual data, such as the location and size of each block. The NameNode is also responsible for recording any modification made to files. For example, if a file is deleted, the NameNode records it immediately.
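As a rough illustration of that bookkeeping, the sketch below models the metadata as a mapping from file path to block records, plus an ordered log of changes. The class and field names (ToyNameNode, BlockInfo, edit_log) are hypothetical and chosen for this example; they are not Hadoop's actual classes.

```python
from dataclasses import dataclass, field

@dataclass
class BlockInfo:
    block_id: int
    size: int        # block size in bytes
    locations: list  # names of the DataNodes holding this block

@dataclass
class ToyNameNode:
    files: dict = field(default_factory=dict)     # path -> list of BlockInfo
    edit_log: list = field(default_factory=list)  # ordered record of changes

    def add_file(self, path, blocks):
        self.files[path] = list(blocks)
        self.edit_log.append(("ADD", path))

    def delete_file(self, path):
        # Only metadata is touched here; the data itself lives on DataNodes.
        del self.files[path]
        self.edit_log.append(("DELETE", path))

    def locate(self, path):
        """Return (block_id, size, locations) for every block of a file."""
        return [(b.block_id, b.size, b.locations) for b in self.files[path]]

nn = ToyNameNode()
nn.add_file("/logs/a.txt", [BlockInfo(0, 128, ["dn1"]), BlockInfo(1, 44, ["dn2"])])
first_lookup = nn.locate("/logs/a.txt")
nn.delete_file("/logs/a.txt")
```

After the deletion, the path is gone from the metadata and the change shows up as the last entry in the log.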

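DataNodes are the slave nodes responsible for storing the actual data, which is spread across different DataNodes. Each DataNode also sends a heartbeat, a kind of status report, to the NameNode periodically so the NameNode can tell whether it is working properly. By default, a heartbeat is sent every 3 seconds. When a new DataNode is added, it registers with the NameNode and becomes a candidate for new blocks; existing blocks are not moved to it automatically, so the HDFS balancer is usually run to even out the data.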
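The heartbeat check can be sketched as follows. This is a simplified model, not the NameNode's implementation: the HeartbeatMonitor class is hypothetical, timestamps are passed in explicitly to keep the example deterministic, and the 30-second timeout is an arbitrary value picked for the sketch rather than Hadoop's real setting.

```python
HEARTBEAT_INTERVAL = 3  # seconds, the default interval mentioned above
DEAD_AFTER = 30         # hypothetical timeout chosen for this sketch

class HeartbeatMonitor:
    def __init__(self):
        self.last_seen = {}  # DataNode name -> time of its latest heartbeat

    def heartbeat(self, node, now):
        self.last_seen[node] = now

    def live_nodes(self, now):
        """Nodes whose latest heartbeat is recent enough to count as alive."""
        return sorted(n for n, t in self.last_seen.items() if now - t <= DEAD_AFTER)

monitor = HeartbeatMonitor()
monitor.heartbeat("dn2", now=0)  # dn2 reports once, then goes silent
for t in range(0, 31, HEARTBEAT_INTERVAL):
    monitor.heartbeat("dn1", now=t)  # dn1 keeps reporting every 3 seconds

alive = monitor.live_nodes(now=31)
```

At time 31, dn1 still counts as alive because its last heartbeat arrived at time 30, while dn2 has been silent for longer than the timeout.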

I hope this helps.

answered Apr 4 by Gitika
