Passing keys and values to the reducers during a standard sort and shuffle phase of MapReduce

Question

I am working with MapReduce and want to know how keys and values are presented and passed to the reducers during the standard sort and shuffle phase of MapReduce.

Can someone help?

nitinrawat895 · Answer

Let me explain the whole scenario. The Reducer has 3 primary phases:

1. Shuffle: The Reducer copies the sorted output from each Mapper using HTTP across the network.

2. Sort: The framework merge sorts the Reducer inputs by key (since different Mappers may have output the same key).

The shuffle and sort phases occur simultaneously, i.e. while outputs are being fetched they are merged.

Secondary sort: To achieve a secondary sort on the values returned by the value iterator, the application should extend the key with the secondary key and define a grouping comparator. The keys will be sorted using the entire key, but will be grouped using the grouping comparator to decide which keys and values are sent in the same call to reduce.

3. Reduce: In this phase the reduce(Object, Iterable, Context) method is called for each <key, (collection of values)> in the sorted inputs.

The output of the reduce task is typically written to a RecordWriter via TaskInputOutputContext.write(Object, Object).

The output of the Reducer is not re-sorted.

Hope this helps!
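To make the sort-versus-group distinction concrete, here is a minimal plain-Java sketch that simulates what a reducer would see. It does not use the Hadoop API: the names ShuffleSortSketch, CompositeKey, Pair, SORT and GROUP are hypothetical and exist only for this illustration. In a real job you would register your comparators on the Job with setSortComparatorClass and setGroupingComparatorClass instead.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Plain-Java simulation (no Hadoop dependency) of shuffle, sort and grouping.
final class ShuffleSortSketch {

    // Hypothetical composite key: the natural key extended with a secondary key.
    static final class CompositeKey {
        final String natural;
        final int secondary;
        CompositeKey(String natural, int secondary) {
            this.natural = natural;
            this.secondary = secondary;
        }
    }

    // One <key, value> record emitted by a mapper.
    static final class Pair {
        final CompositeKey key;
        final String value;
        Pair(CompositeKey key, String value) {
            this.key = key;
            this.value = value;
        }
    }

    // Sort comparator: orders records by the entire key (natural, then secondary).
    static final Comparator<CompositeKey> SORT =
        Comparator.comparing((CompositeKey k) -> k.natural)
                  .thenComparingInt(k -> k.secondary);

    // Grouping comparator: only the natural key decides which records
    // end up in the same (simulated) reduce call.
    static final Comparator<CompositeKey> GROUP =
        Comparator.comparing((CompositeKey k) -> k.natural);

    static Map<String, List<String>> shuffleSortGroup(List<List<Pair>> mapperOutputs) {
        // Shuffle: gather the output of every mapper.
        List<Pair> fetched = new ArrayList<>();
        for (List<Pair> output : mapperOutputs) {
            fetched.addAll(output);
        }
        // Sort: the framework merge sorts pre-sorted runs; a full sort
        // is used here for brevity and yields the same order.
        fetched.sort(Comparator.comparing((Pair p) -> p.key, SORT));

        // Group: start a new value list whenever the grouping comparator
        // reports a different key than the previous record.
        Map<String, List<String>> groups = new LinkedHashMap<>();
        CompositeKey current = null;
        for (Pair p : fetched) {
            if (current == null || GROUP.compare(current, p.key) != 0) {
                current = p.key;
                groups.put(current.natural, new ArrayList<>());
            }
            groups.get(current.natural).add(p.value);
        }
        return groups;
    }

    // Two simulated mappers that both emit records for keys "a" and "b".
    static Map<String, List<String>> demo() {
        List<Pair> mapper1 = List.of(
            new Pair(new CompositeKey("a", 2), "a2"),
            new Pair(new CompositeKey("b", 1), "b1"));
        List<Pair> mapper2 = List.of(
            new Pair(new CompositeKey("a", 1), "a1"),
            new Pair(new CompositeKey("b", 3), "b3"));
        return shuffleSortGroup(List.of(mapper1, mapper2));
    }

    public static void main(String[] args) {
        // Each entry stands for one reduce call: key -> values in secondary-key order.
        System.out.println(demo()); // prints {a=[a1, a2], b=[b1, b3]}
    }
}
```

Each map entry corresponds to one call to reduce: the values for "a" arrive from both mappers in a single list, already ordered by the secondary key, and the groups themselves arrive in key order.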
