When is an identity mapper/reducer used?

0 votes
I have two questions:

1. When is iterative MapReduce used?

2. When is identity MapReduce used?
Apr 3, 2018 in Big Data Hadoop by kurt_cobain
• 9,240 points
816 views

1 answer to this question.

0 votes
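1. One of the simplest examples of iterative MapReduce is K-Means clustering.

The same input is mapped to clusters on every pass, and the output of one pass (the updated cluster centers) is used to form the new clusters in the next pass. The job is repeated until the clusters stop changing.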
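To make the iteration concrete, here is a minimal pure-Python sketch of that loop. It is not Hadoop code: the names `map_phase`, `reduce_phase` and `kmeans` are hypothetical, the points are one-dimensional numbers, and a simple `for` loop stands in for the driver that would resubmit the job in a real cluster.

```python
from collections import defaultdict

def map_phase(points, centroids):
    """Map: emit (index of nearest centroid, point) for every input point."""
    for p in points:
        nearest = min(range(len(centroids)), key=lambda i: abs(p - centroids[i]))
        yield nearest, p

def reduce_phase(pairs, centroids):
    """Reduce: average the points assigned to each centroid."""
    groups = defaultdict(list)
    for cid, p in pairs:
        groups[cid].append(p)
    # A centroid that received no points keeps its old position.
    return [sum(groups[i]) / len(groups[i]) if groups[i] else c
            for i, c in enumerate(centroids)]

def kmeans(points, centroids, max_iters=20):
    """Driver: rerun the same map/reduce pass until the centroids stop moving."""
    for _ in range(max_iters):
        new_centroids = reduce_phase(map_phase(points, centroids), centroids)
        if new_centroids == centroids:
            break  # converged: another pass would change nothing
        centroids = new_centroids
    return centroids
```

Each pass reads the same `points`; only the centroids, which are the output of the previous pass, change between iterations.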

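2. Identity mappers and reducers do no processing of their own: they simply emit every input key-value pair unchanged. In the old API they are the classes IdentityMapper and IdentityReducer in the package org.apache.hadoop.mapred.lib. In the new API (org.apache.hadoop.mapreduce), the base Mapper and Reducer classes behave as identity functions by default.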

Identity mappers and reducers are like the identity function in mathematics: they do not transform the input and return it unchanged as output.

An identity mapper can be used (among other things) when you only want to sort your input: the map step changes nothing, and the framework's shuffle sorts the records by key.
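The sort-only case can be illustrated with a toy, single-process simulation. This is only a sketch in plain Python, not the Hadoop API: `run_job`, `identity_mapper` and `identity_reducer` are hypothetical names, and an in-memory sort stands in for the framework's shuffle.

```python
from itertools import groupby

def identity_mapper(key, value):
    """Emit the input pair unchanged."""
    yield key, value

def identity_reducer(key, values):
    """Emit one output pair per incoming value, with no aggregation."""
    for value in values:
        yield key, value

def run_job(records, mapper, reducer):
    """Toy job: map every record, sort and group by key, then reduce."""
    mapped = [pair for k, v in records for pair in mapper(k, v)]
    mapped.sort(key=lambda kv: kv[0])  # stands in for the shuffle's sort by key
    out = []
    for key, group in groupby(mapped, key=lambda kv: kv[0]):
        out.extend(reducer(key, [v for _, v in group]))
    return out

records = [("pear", 3), ("apple", 1), ("fig", 2), ("apple", 7)]
print(run_job(records, identity_mapper, identity_reducer))
# [('apple', 1), ('apple', 7), ('fig', 2), ('pear', 3)]
```

Neither function touches the data, yet the output comes back ordered by key, because the sorting happens between the map and reduce steps.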

The identity reducer is a bit different. Using it does not mean that the reduce step is skipped: the reduce step still runs, and the related shuffling and sorting are still performed, but no aggregation takes place.
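The difference between an identity reducer and an aggregating one can be sketched as follows. Again this is plain Python under stated assumptions, not Hadoop code: `shuffle`, `identity_reduce`, `sum_reduce` and `reduce_all` are hypothetical names chosen for illustration.

```python
from collections import defaultdict

def shuffle(pairs):
    """Group values by key and return the groups in sorted key order."""
    groups = defaultdict(list)
    for k, v in pairs:
        groups[k].append(v)
    return sorted(groups.items())

def identity_reduce(key, values):
    """No aggregation: every value is written out with its key."""
    return [(key, v) for v in values]

def sum_reduce(key, values):
    """Aggregation: all values of a key collapse into one total."""
    return [(key, sum(values))]

def reduce_all(pairs, reducer):
    """Run the shuffle, then apply the reducer to each key group."""
    out = []
    for key, values in shuffle(pairs):
        out.extend(reducer(key, values))
    return out

pairs = [("b", 2), ("a", 1), ("b", 5)]
print(reduce_all(pairs, identity_reduce))  # [('a', 1), ('b', 2), ('b', 5)]
print(reduce_all(pairs, sum_reduce))       # [('a', 1), ('b', 7)]
```

With the identity reducer the records are still grouped and ordered by key, but the number of output records equals the number of input records.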

Hope this helps
answered Apr 3, 2018 by Ashish
• 2,630 points
