Speculative Execution in Hadoop

Question

What do you know about the Speculative Execution?

kurt_cobain · Answer 1 · Aug 28, 2018

In Hadoop, Speculative Execution is a process that takes place during the slower execution of a task at a node. In this process, the master node starts executing another instance of that same task on the other node. And the task which is finished first is accepted and the execution of other is stopped by killing that.

answered Aug 28, 2018 by kurt_cobain
• 9,350 points

@Kurt_cobin,

What if Speculative execution is set to false for mappers and one node goes down in-between while processing the task, in that case will another node will execute the another instance of the same task or not.

Means, Speculative execution only affects if it is slow or also in case of nodes goes down.

commented Aug 13, 2020 by kamboj
• 140 points

Hello, @Kamboj,

Hadoop does not fix or diagnose slow-running tasks. Instead, it tries to detect when a task is running slower than expected and launches another, an equivalent task as a backup (the backup task is called a speculative task). This process is called speculative execution in Hadoop.

There may be various reasons for the slowdown of tasks, including hardware degradation or software misconfiguration, but it may be difficult to detect causes since the tasks still complete successfully, although more time is taken than the expected time. Hadoop doesn’t try to diagnose and fix slow running tasks, instead, it tries to detect them and runs backup tasks for them. This is called speculative execution in Hadoop. These backup tasks are called Speculative tasks in Hadoop.

I hope this explanation will help you to understand your query.

commented Aug 13, 2020 by Gitika
• 65,730 points

Hi@Kamboj,

According to my knowledge, Speculative execution-only effects if the tasks are taking much time then expected. It will not look if any node goes down for some reason. when a task is running slower than expected, then launches another, an equivalent task as a backup. This process is called speculative execution in Hadoop.

commented Aug 13, 2020 by MD
• 95,460 points

Thanks for the update MD

I was also expecting the same but I have run the task by setting Speculative execution false and executed a job and maps started running on different nodes. Then I have stopped on node in-between and I observed that the map which was running on that node did not got run by any other working node and task stuck in running state.

The same scenario worked file when Speculative execution was set to true and task got completed.

commented Aug 14, 2020 by kamboj
• 140 points

Hi Kamboj,

Ok understood. But this can be automatically done if some node goes down and failure tasks will assign to another node. You can check this below blog once. It has a good discussion related to task failure and its recovery.

http://timepasstechies.com/handling-failures-hadoopmapreduce-yarn/

commented Aug 14, 2020 by MD
• 95,460 points

Thanks MD,

The shared link has detailed information about the failure and recovery of nodes. Waiting time helped me to understand the actual scenario. I have re-run the same task with (speculative false) and found that failed node jobs got completed after waiting for 17 minutes. But on RM, completed jobs showing corresponding to the nodes on which initially that jobs were submit and that nodes went down.