How does Hadoop accesses the files which are distributed among different boundaries?

Question

ravikiran · Answer

Hadoop's MapReduce function does not work on physical blocks of the file, instead, it is designed to work upon the logical memory or in simpler words, the input splits.&#160;These Input splits are dependent on the location where the file is written. A record may map two mappers.The HDFS is designed in such a way that each and every file is written into it is split into blocks of 128 MB each and each block is replicated 3 times by default.for example, consider a file. The data in this file can begin in block a and end in block b.HDFS does not track the location of the data. Instead, it solely depends upon the logical input splits. It is these input splits which depict the start and end of any particular file.for more information, you can go through this article.

How does Hadoop accesses the files which are distributed among different boundaries

Your comment on this question:

1 answer to this question.

Your answer

Your comment on this answer:

Related Questions In Big Data Hadoop

How can Hadoop process the records that are split across the block boundaries?

Which among the following are the Features of Hadoop?

What are the different ways to load data from Hadoop to Azure Data Lake?

What are some of the famous visualization tools which can be integrated with Hadoop & Hive?

Hadoop Mapreduce word count Program

hadoop.mapred vs hadoop.mapreduce?

hadoop fs -put command?

Hadoop dfs -ls command?

How does Hadoop process data which is split across multiple boundaries in an HDFS?

Explain how the MongoDB MapReduce and Hadoop MapReduce are similar to each other and explain the differences if there are any.

Subscribe to our Newsletter, and get personalized recommendations.

TRENDING CERTIFICATION COURSES

TRENDING MASTERS COURSES

COMPANY

WORK WITH US

DOWNLOAD APP

CATEGORIES

CATEGORIES

TRENDING BLOG ARTICLES

TRENDING BLOG ARTICLES