Why do we use job.waitForCompletion(true)?

Question

I have seen this line used in many MapReduce programs. I am new to MapReduce, so I don't understand its significance.

Why do we use job.waitForCompletion(true)? Is it because the code uses threads internally?

score 0 · Answer 1 · Jul 22, 2019

job.waitForCompletion exists mainly because the call submits the job and returns only when the job has finished. It returns the job's success or failure status as a boolean, which you can use to decide whether further steps should run. The boolean argument is a verbose flag: passing true prints the job's progress to the console while you wait.
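To make the "decide whether further steps should run" part concrete, here is a minimal sketch in plain Java. It does not use Hadoop: BlockingJob is a hypothetical stand-in that only copies the shape of Job.waitForCompletion(boolean verbose), and ChainedDriver and runAll are names made up for this illustration. In a real driver the object would be an org.apache.hadoop.mapreduce.Job, and the usual single-job idiom is System.exit(job.waitForCompletion(true) ? 0 : 1).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical stand-in for a Hadoop Job; only the blocking call is modeled. */
interface BlockingJob {
    // Same shape as Job.waitForCompletion(boolean verbose):
    // blocks until the job ends and returns true only on success.
    boolean waitForCompletion(boolean verbose) throws Exception;
}

class ChainedDriver {
    /** Runs the jobs one after another and stops at the first failure. */
    static int runAll(List<BlockingJob> jobs) throws Exception {
        for (BlockingJob job : jobs) {
            boolean succeeded = job.waitForCompletion(true);
            if (!succeeded) {
                return 1; // later steps depend on this one, so skip them
            }
        }
        return 0; // every step succeeded
    }

    public static void main(String[] args) throws Exception {
        List<String> executed = new ArrayList<>();
        BlockingJob step1 = verbose -> { executed.add("step1"); return true; };
        BlockingJob step2 = verbose -> { executed.add("step2"); return false; }; // simulated failure
        BlockingJob step3 = verbose -> { executed.add("step3"); return true; };

        int exitCode = runAll(Arrays.asList(step1, step2, step3));
        System.out.println("executed " + executed + ", exit code " + exitCode);
    }
}
```

Because the call blocks, the driver needs no callbacks or polling: the next step simply starts on the line after the previous one returns.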

Actually, the input file is split into blocks, and each block is processed by its own map task, typically on a different node. All the map tasks run in parallel, and their output is fed to the reducers once they are done. There is no synchronization to worry about in the way you would in a multi-threaded program. In a multi-threaded program, all the threads run on the same machine, and because they share some of the data you have to synchronize them. Here, threads are not involved unless you use them yourself to submit several jobs in parallel and wait for their completion. In that case, use a separate Job instance per thread.
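If you do submit jobs from several threads, the pattern could look roughly like the sketch below. Again this is plain Java with no Hadoop dependency: FakeJob is a hypothetical stand-in for a real Job, its sleep only simulates cluster work, and ParallelSubmitter, runInParallel and the name-to-outcome map are assumptions made for the example. The point is only that each thread creates and waits on its own job object.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

class ParallelSubmitter {
    /** Hypothetical stand-in for a Hadoop Job with a simulated outcome. */
    static final class FakeJob {
        private final String name;
        private final boolean willSucceed;

        FakeJob(String name, boolean willSucceed) {
            this.name = name;
            this.willSucceed = willSucceed;
        }

        boolean waitForCompletion(boolean verbose) throws InterruptedException {
            Thread.sleep(50); // pretend the cluster is busy
            if (verbose) {
                System.out.println(name + " finished, success=" + willSucceed);
            }
            return willSucceed;
        }
    }

    /** Runs one job per thread; returns true only if every job succeeded. */
    static boolean runInParallel(Map<String, Boolean> plan) throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(Math.max(1, plan.size()));
        try {
            List<Future<Boolean>> results = new ArrayList<>();
            for (Map.Entry<String, Boolean> entry : plan.entrySet()) {
                final String name = entry.getKey();
                final boolean outcome = entry.getValue();
                // Each task builds its own job instance; nothing is shared between threads.
                Callable<Boolean> task = () -> new FakeJob(name, outcome).waitForCompletion(true);
                results.add(pool.submit(task));
            }
            boolean allSucceeded = true;
            for (Future<Boolean> result : results) {
                if (!result.get()) { // get() blocks until that thread's job is done
                    allSucceeded = false;
                }
            }
            return allSucceeded;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        Map<String, Boolean> plan = new LinkedHashMap<>();
        plan.put("job-a", true);
        plan.put("job-b", true);
        System.out.println("all succeeded: " + runInParallel(plan));
    }
}
```

The threads here exist only on the client side to keep several blocking calls in flight at once; the map and reduce tasks themselves still run on the cluster.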