In Hadoop, what does dfs.replication mean?

0 votes

If I create a file named filmdata.txt in HDFS and set dfs.replication=1, is there exactly one copy of the file (one filmdata.txt)? Or will Hadoop create an additional replica besides the main file? In short: with dfs.replication=1, is there one filmdata.txt in total, or two?

Sep 25, 2018 in Big Data Hadoop by Neha
• 6,140 points
55 views

1 answer to this question.

0 votes

The dfs.replication factor sets how many copies of each block of a file HDFS keeps in the cluster; it does not create extra files in the namespace. So, if you set dfs.replication=1, there will be only one copy of filmdata.txt in the file system, and no additional replica is created.

Check the Apache Documentation for the other configuration parameters.

OR

To ensure high availability of data, Hadoop replicates the data.

When a file is stored in HDFS, the Hadoop framework splits it into a set of blocks (64 MB or 128 MB by default, depending on the Hadoop version), and these blocks are then replicated across the cluster nodes. The dfs.replication setting specifies how many replicas of each block are kept.
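The block arithmetic above can be sketched in a few lines of Python. This is only an illustrative model, not Hadoop code: the function name hdfs_footprint and the 300 MB file size are made up for the example, and it assumes the last block is not padded to the full block size.

```python
import math

def hdfs_footprint(file_size_mb, block_size_mb=128, replication=3):
    """Rough model of how one file is stored in HDFS (hypothetical helper).

    Returns (blocks, block_replicas, raw_storage_mb).
    """
    # The file is split into fixed-size blocks; the last one may be smaller.
    blocks = math.ceil(file_size_mb / block_size_mb)
    # Every block is stored `replication` times across the cluster.
    block_replicas = blocks * replication
    # Assumption: a partial last block only uses the space it needs,
    # so raw usage is simply file size times the replication factor.
    raw_storage_mb = file_size_mb * replication
    return blocks, block_replicas, raw_storage_mb

# A hypothetical 300 MB filmdata.txt with 128 MB blocks:
single = hdfs_footprint(300, block_size_mb=128, replication=1)
triple = hdfs_footprint(300, block_size_mb=128, replication=3)
print(single)  # blocks, replicas, raw MB with dfs.replication=1
print(triple)  # the same file with dfs.replication=3
```

Either way the namespace shows a single filmdata.txt; only the number of stored block replicas changes.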

The default value for dfs.replication is 3, but this is configurable depending on your cluster setup.
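The property is normally set in hdfs-site.xml as a name/value pair inside a configuration element. As a small sketch, the Python below just renders that entry so you can see its shape; the helper name replication_property is made up for this example, and the value 1 is taken from the question rather than being a recommendation.

```python
import xml.etree.ElementTree as ET

def replication_property(value):
    """Build the <configuration> fragment for dfs.replication (hypothetical helper)."""
    conf = ET.Element("configuration")
    prop = ET.SubElement(conf, "property")
    ET.SubElement(prop, "name").text = "dfs.replication"
    ET.SubElement(prop, "value").text = str(value)
    # encoding="unicode" returns a str without an XML declaration
    return ET.tostring(conf, encoding="unicode")

snippet = replication_property(1)
print(snippet)
```

Note that this setting acts as the default for newly written files; files already in HDFS keep the replication factor they were created with unless it is changed explicitly.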

answered Sep 25, 2018 by Frankie
• 9,570 points

