In Hadoop, what does dfs.replication mean?

0 votes

If I create a file named filmdata.txt in HDFS and set dfs.replication=1, is there exactly one copy of the file (a single filmdata.txt)? Or does Hadoop create an additional replica alongside the main file? In short: with dfs.replication=1, are there one or two copies of filmdata.txt in total?

Sep 25, 2018 in Big Data Hadoop by Neha
• 6,280 points
94 views

1 answer to this question.

0 votes

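dfs.replication sets the total number of copies that HDFS keeps of each block of a file, counting the original. So, if you set dfs.replication=1, there will be only one copy of the file in the file system, with no additional replicas. Whatever the value, the file appears only once in the HDFS namespace; replication happens at the block level underneath.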

Check the Apache Documentation for the other configuration parameters.

OR

To ensure high availability of data, Hadoop replicates the data.

When a file is stored in HDFS, the Hadoop framework splits it into a set of blocks (64 MB by default in Hadoop 1.x, 128 MB in Hadoop 2.x and later), and these blocks are then replicated across the cluster nodes. The dfs.replication setting specifies how many replicas of each block are kept.

The default value for dfs.replication is 3, but this is configurable depending on your cluster setup.
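
To make the arithmetic concrete, here is a rough sketch in Python. It is not part of Hadoop: the function name hdfs_storage_footprint and the 300 MB size for filmdata.txt are assumptions made up for illustration. It only estimates how many blocks a file is split into and how many block replicas the cluster ends up storing for a given dfs.replication value.

```python
import math


def hdfs_storage_footprint(file_size_mb, block_size_mb=128, replication=3):
    """Estimate the HDFS footprint of a single file.

    Hypothetical helper for illustration only, not a Hadoop API.
    Returns (blocks, block_replicas, raw_storage_mb).
    """
    if file_size_mb < 0 or block_size_mb <= 0 or replication < 1:
        raise ValueError("sizes must be positive and replication at least 1")

    # The file is cut into fixed-size blocks; the last block may be partial.
    blocks = math.ceil(file_size_mb / block_size_mb)

    # Every block is stored `replication` times in total (original included).
    block_replicas = blocks * replication

    # A partial last block only occupies its real size, so raw usage is
    # simply the file size multiplied by the replication factor.
    raw_storage_mb = file_size_mb * replication

    return blocks, block_replicas, raw_storage_mb


if __name__ == "__main__":
    # Assumed size for filmdata.txt: 300 MB.
    for factor in (1, 3):
        blocks, replicas, raw_mb = hdfs_storage_footprint(300, replication=factor)
        print(f"dfs.replication={factor}: {blocks} blocks, "
              f"{replicas} block replicas, {raw_mb} MB of raw storage")
```

Under these assumptions, dfs.replication=1 gives 3 blocks stored once each, while dfs.replication=3 gives the same 3 blocks stored three times each. In both cases a directory listing still shows a single filmdata.txt.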

answered Sep 25, 2018 by Frankie
• 9,810 points
