In Hadoop what does dfs replicate mean

0 votes

If I made one file named filmdata.txt in hdfs, if I set dfs.replication=1,so is it totally one file(one filmdata.txt)?or besides the main file(filmdata.txt) hadoop will create another replication file. shortly say:if set dfs.replication=1,there are totally one filmdata.txt,or two filmdata.txt? 

Sep 25, 2018 in Big Data Hadoop by Neha
• 6,300 points
1,929 views

1 answer to this question.

0 votes

The total number of files in the file system will be what's specified in the dfs.replication factor. So, if you set dfs.replication=1, then there will be only one copy of the file in the file system.

Check the Apache Documentation for the other configuration parameters.

                                              OR 

To ensure high availability of data, Hadoop replicates the data.

When we are storing the files into HDFS, hadoop framework splits the file into set of blocks( 64 MB or 128 MB) and then these blocks will be replicated across the cluster nodes.The configuration dfs.replication is to specify how many replications are required.

The default value for dfs.replication is 3, But this is configurable depends on your cluster setup.

answered Sep 25, 2018 by Frankie
• 9,810 points

Related Questions In Big Data Hadoop

0 votes
1 answer

What does Context mean in Hadoop MapR terms ?

The new API makes extensive use of ...READ MORE

answered Apr 6, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
728 views
0 votes
1 answer

What does count command in Hadoop tells?

The output of the count command are: ...READ MORE

answered Apr 23, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
933 views
0 votes
1 answer

Hadoop: What does RDD mean?

RDD is the acronym for Resilient Distribution ...READ MORE

answered Nov 21, 2018 in Big Data Hadoop by Deepak
416 views
0 votes
1 answer

What does a secondary namenode do in Hadoop?

The Secondary namenode is mainly used as a ...READ MORE

answered Dec 21, 2018 in Big Data Hadoop by Omkar
• 69,150 points
603 views
+1 vote
1 answer

Hadoop Mapreduce word count Program

Firstly you need to understand the concept ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 11,380 points
8,035 views
0 votes
1 answer

hadoop.mapred vs hadoop.mapreduce?

org.apache.hadoop.mapred is the Old API  org.apache.hadoop.mapreduce is the ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 11,380 points
1,366 views
+2 votes
11 answers

hadoop fs -put command?

Hi, You can create one directory in HDFS ...READ MORE

answered Mar 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
66,843 views
–1 vote
1 answer

Hadoop dfs -ls command?

In your case there is no difference ...READ MORE

answered Mar 16, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
3,059 views
0 votes
1 answer

What does AM in “AM Container” mean in Hadoop

Here's another link from Hadoop which may ...READ MORE

answered Nov 2, 2018 in Big Data Hadoop by Frankie
• 9,810 points
1,028 views
0 votes
1 answer

What are the problems in creating volume in mapr hadoop?

It sounds like the user you are ...READ MORE

answered Aug 22, 2018 in Big Data Hadoop by Frankie
• 9,810 points
301 views