1.
Which of these is not similar to Oozie?
Correct Answer
C. Anatellae
Explanation
Anatellae is not similar to Oozie because it is not a workflow scheduler or coordinator tool like Oozie. Oozie is used for managing and scheduling Hadoop jobs, while Anatellae is not specifically designed for this purpose. GETL, Apache Crunch, and Apatar are all similar to Oozie as they are also used for data integration and processing in a distributed computing environment.
2.
What are the two parts of Oozie?
Correct Answer
B. Workflow and coordinator engine
Explanation
The correct answer is Workflow and coordinator engine. Oozie is a workflow scheduler system that is used to manage Apache Hadoop jobs. It consists of two main components: the workflow engine and the coordinator engine. The workflow engine is responsible for defining and executing workflows, which are a collection of actions that need to be executed in a specific order. The coordinator engine is used to schedule and manage the execution of these workflows at specified times or intervals. Therefore, the two parts of Oozie are the workflow engine and the coordinator engine.
3.
Oozie Workflow is a collection of actions arranged in...
Correct Answer
D. Directed acyclic grapH
Explanation
An Oozie Workflow is a collection of actions that are arranged in a directed acyclic graph. This means that the actions are organized in a way that they have a specific order of execution, and there are no cycles or loops in the graph. The absence of cycles ensures that the workflow can be executed in a deterministic manner without any infinite loops or circular dependencies.
4.
What type of Workflow node trigger the execution of a computational task?
Correct Answer
B. Action node
Explanation
An action node triggers the execution of a computational task in a workflow. Unlike other types of nodes like mapreduce, control flow, or trigger nodes, the action node is specifically designed to initiate the execution of a task within the workflow. It serves as a starting point for carrying out a specific action or operation, which could involve data processing, calculations, or any other computational task. Therefore, an action node is the correct choice for triggering the execution of a computational task within a workflow.
5.
Which of these files cannot be found in the Oozie Workflow application?
Correct Answer
C. pHP scripts
Explanation
In an Oozie Workflow application, the files that cannot be found are PHP scripts. Oozie is a workflow scheduler system used to manage Apache Hadoop jobs. It primarily supports Java MapReduce jobs, Pig scripts, and Hive scripts. However, it does not support PHP scripts, making them the correct answer.
6.
In how many segments can we use Oozie coordinator?
Correct Answer
B. 3
Explanation
Oozie coordinator can be used in 3 segments. This means that Oozie coordinator can be divided into 3 distinct parts or sections. Each segment represents a specific functionality or feature of Oozie coordinator.
7.
The execution of data in Oozie depends on...
Correct Answer
B. Time and data
Explanation
The execution of data in Oozie depends on time and data because Oozie is a workflow scheduler system used to manage and coordinate the execution of tasks in a Hadoop cluster. It allows users to define workflows that specify the order and dependencies of tasks to be executed. The scheduling of these tasks is based on time triggers and the availability of data. Oozie ensures that tasks are executed at the specified time and only when the required data is available, ensuring the correct execution of the workflow.
8.
The chaining together of Workflow results is called...
Correct Answer
A. Data application pipeline
Explanation
The chaining together of Workflow results is called a data application pipeline. This refers to the process of connecting multiple workflows or tasks in a sequence to create a pipeline for the flow of data. It allows for the efficient and automated execution of tasks in a specific order, ensuring that the output of one task serves as the input for the next task. This helps in streamlining data processing and analysis, making it easier to manage and track the flow of data within a system.
9.
Which of these is not true about the security in Oozie?
Correct Answer
A. User is allowed to alter another user's job
Explanation
The statement "User is allowed to alter another user's job" is not true about the security in Oozie. Oozie provides security measures to ensure that users are not allowed to edit or alter other user's jobs. This is an important aspect of security to prevent unauthorized access and modifications to jobs, ensuring the integrity and privacy of each user's work.
10.
Which of these nodes must be used in pairs?
Correct Answer
A. Fork and join node
Explanation
Fork and join nodes must be used in pairs because they are used to split a flow into multiple parallel paths (fork) and then merge those paths back into a single flow (join). The fork node creates multiple threads of execution, allowing different tasks to be performed simultaneously. The join node waits for all the threads to complete before continuing with the execution. Therefore, using only one of these nodes without its corresponding pair would result in an incomplete or incorrect flow of execution.