Basically, when we want to run multiple jobs parallel to each other, we can use Fork.
When fork is used we have to use Join as an end node to fork. Basically, Fork and Join work together. For each fork, there should be a join. As Join assumes all the node are a child of a single fork. We also use fork and join for running multiple independent jobs for proper utilization of the cluster.
Let’s see how fork is implemented:
<workflow-app xmlns = "uri:oozie:workflow:0.4" name = "simple-Workflow">
<start to = "fork_node" />
<fork name = "fork_node">
<path start = "Create_External_Table"/>
<path start = "Create_orc_Table"/>