Can one implement combiner and reducer separately?

0 votes

I was looking at a few Hadoop projects on the internet and on GitHub. In many of those programs, the reducer class is also used as the combiner. Is that simply because it suits those use cases, or is it the only way to implement a combiner? In other words, can I implement the combiner and the reducer separately in a MapReduce program?

Apr 10, 2018 in Big Data Hadoop by Shubham
• 13,250 points
210 views

1 answer to this question.

0 votes

Yes, you can write a combiner that is separate from the reducer, but the combiner is still implemented against the reducer interface. It is important to understand where a combiner should be used. The primary goal of a combiner is to minimize the number of key/value pairs that are shuffled across the network between mappers and reducers, and thus to save as much bandwidth as possible.

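You can think of a combiner as a mini reducer that may be called several times during the map phase to reduce the set of key/value pairs that will eventually be sent to the reducer. This is why a combiner must implement the Reducer interface (old org.apache.hadoop.mapred API) or extend the Reducer class (org.apache.hadoop.mapreduce API). Because the combiner's output becomes the reducer's input, its output key/value types must match the map output types, and since Hadoop does not guarantee how many times the combiner runs (possibly not at all), the job must produce the same result with or without it.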
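To illustrate when a separate combiner is worth writing, here is a minimal plain-Java sketch that simulates the idea without any Hadoop dependency. It assumes an averaging job: the reducer cannot be reused as the combiner because an average of averages is wrong, so the combiner only merges partial (sum, count) pairs and the reducer does the final division. The names `CombinerSketch`, `SumCount`, `combine` and `reduce` are hypothetical and not part of the Hadoop API; in a real job the two classes would each extend `Reducer` and be registered with `job.setCombinerClass(...)` and `job.setReducerClass(...)`.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class CombinerSketch {

    // Partial aggregate for one key: this plays the role of the map output value.
    // (Hypothetical type; a real Hadoop job would use a Writable here.)
    static final class SumCount {
        final long sum;
        final long count;

        SumCount(long sum, long count) {
            this.sum = sum;
            this.count = count;
        }
    }

    // Combiner logic: merges partials on the map side.
    // Input and output types are the same, so it can run zero, one or many times.
    static SumCount combine(List<SumCount> partials) {
        long sum = 0;
        long count = 0;
        for (SumCount p : partials) {
            sum += p.sum;
            count += p.count;
        }
        return new SumCount(sum, count);
    }

    // Reducer logic: merges whatever partials arrive, then performs the final division.
    static double reduce(List<SumCount> partials) {
        SumCount total = combine(partials);
        return (double) total.sum / total.count;
    }

    public static void main(String[] args) {
        // Values for a single key, as emitted by two simulated mappers.
        List<SumCount> mapper1 = Arrays.asList(
                new SumCount(1, 1), new SumCount(2, 1), new SumCount(3, 1));
        List<SumCount> mapper2 = Arrays.asList(new SumCount(10, 1));

        // With the combiner: each mapper's output is pre-aggregated before the "shuffle".
        double withCombiner = reduce(Arrays.asList(combine(mapper1), combine(mapper2)));

        // Without the combiner: the reducer sees every raw pair.
        List<SumCount> all = new ArrayList<>(mapper1);
        all.addAll(mapper2);
        double withoutCombiner = reduce(all);

        System.out.println(withCombiner + " " + withoutCombiner);
    }
}
```

In this sketch the reducer receives two pairs instead of four when the combiner runs, yet both paths give the same average, which is the property a combiner has to preserve. For operations such as sum or count, where merging partial results is the same operation as the final reduce, reusing the reducer class as the combiner is usually enough.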

answered Apr 10, 2018 by Ashish
• 2,630 points
