1.
Replicate supports which parallelism?
Correct Answer
B. Component parallelism
2.
Broadcast component support which parallelism?
Correct Answer(s)
A. Data parallelism
B. Component parallelism
Explanation
Data parallelism and component parallelism are both supported by the broadcast component. Data parallelism refers to the parallel execution of the same operation on different data sets, while component parallelism refers to the parallel execution of different components of a system. In the case of the broadcast component, it supports both types of parallelism, allowing for parallel execution of operations on different data sets as well as parallel execution of different components within the system. Pipeline parallelism, on the other hand, refers to the parallel execution of consecutive stages of a pipeline, which is not supported by the broadcast component.
3.
Which is the system admin which is installed in Unix or Windows NT?
Correct Answer
B. CO>OP
Explanation
The correct answer is CO>OP. CO>OP is a system admin that is installed in both Unix and Windows NT operating systems. It is a commonly used tool for managing and configuring various aspects of the system, such as user accounts, file permissions, network settings, and system security. CO>OP provides a user-friendly interface and a set of commands that allow system administrators to efficiently perform their tasks and maintain the system's stability and security.
4.
Which compoment is advanced version of Aggregate?
Correct Answer
B. Rollup
Explanation
The advanced version of Aggregate is Rollup. Rollup is a powerful SQL operation that performs multiple levels of aggregation on a set of columns. It generates a result set that represents a hierarchy of dimensions. This allows for more comprehensive and detailed analysis of data compared to the basic aggregation provided by the Aggregate component.
5.
Types of Layout.
Correct Answer
C. A & B
Explanation
The correct answer is A & B because the given options list two types of layout, AI_SERIAL and AI_MFS, which suggests that both A and B are types of layout. Therefore, the correct answer is A & B.
6.
Max core is used in which component?
Correct Answer
D. All the above
Explanation
The correct answer is "all the above". Max core is used in all the mentioned components: sort, join, and rollup. This means that the maximum number of cores available is utilized in these components for efficient processing and performance.
7.
Mx how many count values can be used in join component?
Correct Answer
B. 20
Explanation
The answer is 20 because the join component in Mx allows for a maximum of 20 count values to be used.
8.
Vector field is used in which component?
Correct Answer
C. Normalize
Explanation
A vector field is used in the process of normalizing data. Normalization is a technique used to standardize data and bring it to a common scale. It involves transforming the data so that it falls within a specific range or distribution. In the context of data processing or analysis, a vector field can be used to represent the data and perform normalization operations on it. Therefore, the correct answer is "normalize."
9.
Command to generate DML:
Correct Answer
D. M_db gendml
Explanation
The correct answer is "m_db gendml". This command is used to generate Data Manipulation Language (DML) statements. The other options mentioned are not valid commands for generating DML.
10.
Which belongs to multistage component?
Correct Answer
B. Rollup
Explanation
The term "multistage component" suggests that it is referring to a component that has multiple stages or levels. Among the given options, "rollup" is the only term that is commonly associated with multiple stages. Rollup is a term used in various fields, such as data analysis and project management, to describe the process of summarizing or consolidating information from multiple lower-level components or stages into a higher-level component or stage. Therefore, "rollup" is the most appropriate choice for a multistage component.
11.
Which belongs to the partition components
Correct Answer(s)
A. Gather
B. Partition by key
C. Concatenate
D. Broadcast
Explanation
The given options "gather," "partition by key," "concatenate," and "broadcast" all belong to the partition components. These components are used in various data processing and analysis tasks. "Gather" is used to collect and combine data from different sources. "Partition by key" is used to divide data into separate partitions based on a specific key. "Concatenate" is used to combine or merge data sets. "Broadcast" is used to efficiently distribute data to all nodes in a cluster. Therefore, all of these options belong to the partition components.
12.
Relation of sandbox and project:
Correct Answer(s)
A. 1 sandbox, 1 project
C. Multiple sandbox, 1 project
Explanation
The correct answer implies that there can be one sandbox associated with one project, as well as multiple sandboxes associated with one project. This suggests that a project can have multiple sandboxes for different purposes or stages of development.
13.
Maximum how many components can be included in Phasing?
Correct Answer
D. 20
Explanation
Phasing is a technique used in genetic research to identify and analyze genetic variations. It involves determining the haplotypes, or combinations of alleles, in an individual's genome. Each haplotype represents a distinct component. Therefore, the maximum number of components that can be included in phasing is 20, as stated in the answer.
14.
How many inputs and how many outputs in join as default?
Correct Answer
B. 2 input 1 output
Explanation
The correct answer is 2 input 1 output. This means that the join operation, by default, takes two inputs and produces one output.
15.
What is the use of checkpoint?
Correct Answer
B. Recovery
Explanation
Checkpoints are used for recovery purposes. In case of any failure or interruption during the execution of a graph or program, checkpoints help in resuming the execution from a previously saved state. This ensures that the progress made before the failure is not lost and allows for the continuation of the execution without starting from scratch. Therefore, the use of checkpoints is primarily for recovery and ensuring the reliability of the execution process.
16.
Which is used for version control?
Correct Answer
C. EME
Explanation
EME is used for version control.
17.
Is it possible to run a graph without EME?
Correct Answer
A. YES
Explanation
It is possible to run a graph without EME (Encryption Media Extension). EME is a technology that enables the playback of encrypted media in web browsers. However, not all graphs or applications require encrypted media playback. Therefore, it is possible to run a graph without EME if it does not involve the playback of encrypted media.
18.
Is it possible to run a graph without co>op?
Correct Answer
B. No
Explanation
It is not possible to run a graph without cooperation (co>op). Cooperation is essential for the smooth functioning of a graph as it involves the coordination and collaboration of multiple components or nodes. Without cooperation, the nodes would not be able to communicate, exchange information, or work together effectively, leading to the inability to run the graph successfully.
19.
What are the ways of MFS?
Correct Answer
E. All the above
Explanation
The correct answer is "All the above". This means that all of the options mentioned (2, 3, 4, and 8) are ways of MFS (Mobile Financial Services). This suggests that there are multiple ways in which MFS can be conducted or implemented, and all of the options provided are examples of these different ways.
20.
Are phasing and checkpoint interrelated?
Correct Answer
A. Yes
Explanation
Phasing and checkpoint are interrelated because they both refer to different stages in a project or process. Phasing involves dividing a project into distinct stages or phases, each with its own objectives and deliverables. Checkpoints, on the other hand, are specific points in time or milestones within each phase where progress is reviewed and evaluated. These checkpoints ensure that the project is on track and that the objectives are being met. Therefore, phasing and checkpoints are interconnected as the checkpoints help to monitor and assess the progress of each phase in the overall project.
21.
Which component is used to avoid duplicate records?
Correct Answer(s)
A. Rollup
B. Dedup
C. Join
Explanation
The components used to avoid duplicate records are Rollup, dedup, and Join. Rollup is used to summarize data and remove duplicates by grouping records based on certain criteria. Deduplication (dedup) is a process of identifying and eliminating duplicate records from a dataset. Join is used to combine data from multiple tables based on a common field, which can help in identifying and removing duplicate records.
22.
Which is the partition component?
Correct Answer
C. Broadcast
Explanation
The correct answer is "broadcast" because in the context of partitioning data, broadcasting refers to sending a copy of a dataset to all the worker nodes in a cluster. This allows the data to be available for all the nodes to perform operations or computations on it in parallel. Broadcasting is commonly used when a small dataset needs to be shared across all the nodes efficiently.
23.
Which is the advanced version of the Aggregate component?
Correct Answer
C. Rollup
Explanation
The advanced version of the Aggregate component is the rollup. The rollup function allows for more complex calculations and summarization of data. It can be used to perform calculations on multiple levels of a hierarchy, such as subtotals and grand totals. This advanced feature provides a more comprehensive analysis of the data and allows for deeper insights to be gained.
24.
When the graph is executed what happens?
Correct Answer
C. Deployed script is generated
Explanation
When the graph is executed, a deployed script is generated. This means that the graph is processed and converted into a script that can be executed. The script contains all the necessary instructions and code to perform the tasks specified in the graph. Once the script is generated, it can be run to execute the graph and produce the desired results.
25.
Which is a key based component?
Correct Answer
A. Merge
Explanation
Merge is a key based component because it combines two or more sets of data based on a common key. It takes the common key from each set and matches them to create a single set that includes all the data from the original sets. This process is commonly used in database management systems and data analysis to merge datasets and perform operations on them based on their shared key values.
26.
When you want the output to be in a sorted format which component you opt for?
Correct Answer
A. Merge
Explanation
When you want the output to be in a sorted format, the best component to opt for is "Merge." Merge is a sorting algorithm that works by dividing the unsorted list into sublists, sorting the sublists, and then merging them back together in a sorted order. It is an efficient and commonly used method for sorting data, ensuring that the output is in the desired sorted format.
27.
Flow buffer is automatically embedded in which version?
Correct Answer
B. >1.8 version
Explanation
The correct answer is ">1.8 version" because starting from version 1.8, the flow buffer is automatically embedded. This means that in versions after 1.8, users do not need to manually add or configure the flow buffer as it is already included by default.
28.
When you use a priority in joining?
Correct Answer
B. Full outer join
Explanation
When you use a full outer join, it includes all the rows from both tables being joined, regardless of whether there is a match or not. This means that it will return all the rows from the left table and all the rows from the right table, and if there is no match, it will fill the missing values with NULL. Therefore, a full outer join is used when you want to include all the data from both tables, even if there are no matching values.
29.
What is the count value of redefine?
Correct Answer
E. Not mentioned
30.
Which is not possible using redefine?
Correct Answer
A. Can change the value
Explanation
The "redefine" keyword is used in programming to override a previously defined function or variable. When using "redefine", it is not possible to change the value of a variable or function because it is only used to modify the behavior or implementation of the existing entity, not its value. Therefore, changing the value is not possible using "redefine".
31.
In-memory is used in which component?
Correct Answer(s)
A. Join
B. Rollup
C. Scan
Explanation
In-memory is used in the join, rollup, and scan components. These components require data to be stored in memory for faster processing and retrieval. Join involves combining data from multiple tables, rollup involves summarizing data at different levels of granularity, and scan involves reading and processing data from a database or file. Storing data in memory allows for quicker access and manipulation of the data, resulting in improved performance and efficiency.
32.
Inputs should be sorted for join component note without using in memory.
Correct Answer
A. Yes
Explanation
The answer "yes" suggests that the inputs should be sorted before using them in the join component note. Sorting the inputs ensures that the join operation can be performed efficiently and accurately. By sorting the inputs, the join component can easily match corresponding records from the two sets of data. This helps in improving the performance and accuracy of the join operation without the need for using additional memory.
33.
Driving port in the join is for which table?
Correct Answer
B. Large table
Explanation
The driving port in the join is for the larger table. This means that the join operation is being performed based on the values in the larger table, and the smaller table is being matched against it. The larger table is the one that determines the order and execution of the join operation.
34.
Which component follows ordered data?
Correct Answer(s)
A. Concat
D. Interleave
Explanation
The components "concat" and "interleave" follow ordered data. The "concat" component is used to concatenate or combine multiple data streams in a specified order. The "interleave" component is used to interleave or merge multiple data streams in a specific sequence. Both of these components ensure that the data is arranged in a particular order, making them suitable for working with ordered data.
35.
In which departition component round-robin concept is used?
Correct Answer
C. Interleave
Explanation
In the interleave component of the departition process, the round-robin concept is used. This means that the data is distributed evenly across multiple partitions in a cyclic manner. Each partition receives an equal share of the data, ensuring a balanced distribution and efficient processing. This allows for parallel processing of the data in a distributed system, improving overall performance and throughput.
36.
What are the 2 main keys used in sort with in group?
Correct Answer(s)
A. Major
B. Minor
Explanation
In the context of sorting within a group, the terms "major" and "minor" refer to the primary and secondary keys used for sorting. The major key is the primary criterion for sorting, while the minor key is used as a secondary criterion in case of ties or equal values in the major key. By utilizing both major and minor keys, the sorting algorithm can achieve a more precise and specific ordering of the elements within a group.
37.
Which component gives cumulative output?
Correct Answer
C. Scan
Explanation
The component that gives cumulative output is "scan". This component is used to calculate running totals or cumulative sums of a given input. It takes a sequence of values as input and produces a sequence of output values where each output value is the sum of all the previous input values up to that point. Therefore, the scan component is responsible for generating cumulative output.
38.
Which component is used to call the ksh file?
Correct Answer
C. Run program
Explanation
The component used to call the ksh file is "run program". This component is responsible for executing the ksh file and initiating its functionality. It allows the user to run the program written in the ksh file and perform the desired operations.
39.
With which component you will fetch the first 100 records alone?
Correct Answer
C. Leading record
Explanation
The correct answer is "leading record". The leading record refers to the first record in a set or sequence. By using the leading record component, you can fetch only the first 100 records from a larger set of records. This component allows you to specifically retrieve the initial portion of the data, in this case, the first 100 records.
40.
Ways to execute the graph.
Correct Answer
C. GDE/Unix
Explanation
The correct answer is GDE/Unix because GDE (Graphical Development Environment) is a software tool used for designing and executing graphs. Unix is an operating system that is commonly used in computer systems. Therefore, executing the graph using GDE on a Unix operating system is a valid and possible option. The other options mentioned (GDE, Unix, and none of the above) do not specify the combination of GDE and Unix, making them incorrect choices.
41.
Reformat has_______________
Correct Answer
A. Multiple transform and one dml
Explanation
The correct answer is "multiple transform and one dml". This means that the "reformat" operation involves multiple transformations and only one data manipulation language (DML) operation. This suggests that the data is being reformatted and modified in various ways, but only one specific action is being taken on the data.
42.
Compare record component is used______
Correct Answer
C. Compare 2 input files
Explanation
The compare record component is used to compare two input files. This component allows for a comparison of the records in both files, identifying any differences or similarities between them. It is commonly used in data integration or data quality processes to ensure consistency and accuracy between different versions or sources of data.
43.
Generate record component need DML to specified.
Correct Answer
A. Yes
Explanation
The explanation for the given correct answer is that in order to generate a record component, a Data Manipulation Language (DML) statement needs to be specified. DML statements are used to manipulate data in a database, such as inserting, updating, or deleting records. Therefore, in order to generate a record component, it is necessary to specify the DML statement that will be used to manipulate the data.
44.
Which component is used to read the serial type of file?
Correct Answer
B. Adhoc multiple file
Explanation
The adhoc multiple file component is used to read the serial type of file. This component allows for the reading of multiple files in a serial manner, meaning that each file is read one after the other. It is specifically designed for handling multiple files and is commonly used in data processing tasks where sequential reading of files is required.
45.
Can private project share the objects with other projects?
Correct Answer
B. No
Explanation
Private projects cannot share objects with other projects. Private projects are isolated and have restricted access, meaning that the objects within a private project cannot be accessed or shared with other projects.
46.
Use of abinitiorc file.
Correct Answer
B. Stores the abinitio env settings
Explanation
The abinitiorc file is used to store the abinitio environment settings. These settings include various configurations and preferences that are specific to the abinitio application. By storing these settings in the abinitiorc file, users can easily access and manage their preferred environment settings, ensuring consistency and efficiency in their abinitio workflows.
47.
Command for checkin:
Correct Answer
A. Air project import - basedir
Explanation
The correct answer is "air project import - basedir". This command is used to import a project in the air format, with the specified base directory. It allows users to transfer a project from one location to another or to import a project that was previously exported. The "basedir" parameter specifies the base directory where the project files are located.
48.
Command for check out:
Correct Answer
B. Air project export - basedir
Explanation
The correct answer is "air project export - basedir" because this command is used to export a project and the "-basedir" flag specifies the base directory for the export. This command allows the user to export a project to a specific directory, making it easier to organize and manage project files.
49.
Embedding of xfr,dml,sql is______________
Correct Answer
B. Bad
50.
What happens when you give unsorted data for the join component?
Correct Answer
C. Gives incorrect results
Explanation
When unsorted data is given for the join component, it can lead to incorrect or unexpected results. The join component relies on matching records based on a specific condition, such as a common key. If the data is unsorted, the join component may not be able to properly match the records, resulting in incorrect output or errors. Therefore, the graph fails when unsorted data is provided for the join component.