1.
Point out the correct statement.
Correct Answer
A. Raw data is original source of data
Explanation
The correct statement is that raw data is the original source of data. Raw data refers to the unprocessed and unorganized data that is collected directly from the source. It has not undergone any manipulation or analysis. Preprocessed data, on the other hand, refers to the data that has been cleaned, transformed, and organized for further analysis. Therefore, raw data is the initial and untouched form of data that serves as the foundation for any data processing or analysis.
2.
Data that summarize all observations in a category are called __________ data.
Correct Answer
B. Summarized
Explanation
Summarized data refers to data that has been condensed or aggregated to provide an overview or summary of the observations within a category. This type of data is useful for providing a high-level understanding of the information and can help to identify patterns or trends. In contrast, raw data refers to the original, unprocessed data that has not been summarized or analyzed. Therefore, the correct answer is "summarized".
3.
Which of the following is an example of raw data?
Correct Answer
D. All of the mentioned
Explanation
All of the mentioned options are examples of raw data because they are all original and unprocessed data that have been directly collected from a source. The original swath files generated from a sonar system, the initial time-series file of temperature values, and the real-time GPS-encoded navigation file are all examples of raw data that can be further analyzed and processed to extract meaningful information. Therefore, the correct answer is "all of the mentioned."
4.
Point out the correct statement.
Correct Answer
A. Primary data is original source of data
Explanation
Primary data refers to the data that is collected firsthand from the original source. This means that primary data is the original source of data as it is collected directly by the researcher or organization for a specific purpose. Secondary data, on the other hand, is data that has been collected by someone else for their own purpose and is used by others for their research or analysis. Therefore, the correct statement is that primary data is the original source of data.
5.
Which of the following data is put into a formula to produce commonly accepted results?
Correct Answer
B. Processed
Explanation
Processed data refers to information that has been manipulated, organized, or transformed in some way to produce meaningful results. This data has undergone a series of steps or operations to extract valuable insights or to make it suitable for analysis. By using processed data in a formula, we can generate commonly accepted results that are reliable and accurate. Therefore, processed data is the correct choice as it is the type of data that is typically used in formulas to produce desired outcomes.
6.
Which of the following is another name for raw data?
Correct Answer
B. eggy data
Explanation
The answer key lists "eggy data", but the term is not standard and is rarely, if ever, seen in practice. More widely used names for raw data are "source data", "primary data", and "atomic data". Whatever the name, raw data is unprocessed data collected directly from a source, before any manipulation or analysis.
7.
Which type of data is generated by a POS terminal in a busy supermarket each day?
Correct Answer
A. Source
Explanation
The correct answer is "Source" because a POS (Point of Sale) terminal in a busy supermarket generates raw data or the original data at the point of sale. This data includes information about the products sold, quantities, prices, and payment methods used. It is the starting point or the source of data that can be further processed, synchronized, or used for various analytical purposes.
8.
Point out the correct statement.
Correct Answer
C. Nearly 80% of data analysis is spent on the cleaning and preparing data
Explanation
The correct statement is that nearly 80% of the time in a data analysis is spent cleaning and preparing data. The figure is an often-quoted estimate rather than a precise measurement, but it reflects how much effort goes into getting data into a usable state. That work includes removing duplicates, handling missing values, and transforming the data into a consistent format, so that later analysis and decisions rest on accurate data.
9.
Which of the following packages is used for tidy data?
Correct Answer
A. tidyr
Explanation
The tidyr package for R is used to create tidy data. In tidy data, each variable has its own column, each observation has its own row, and each value has its own cell. tidyr provides functions that reshape data into this format.
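As a rough illustration of the tidy layout, the sketch below reshapes a small wide table into one row per observation. It assumes the tidyr package is installed; the data frame and its column names (country, y2015, y2016, cases) are made up for the example.

```r
library(tidyr)  # assumption: tidyr is installed

# Hypothetical wide table: one column per year, so "year" is hidden in the headers.
wide <- data.frame(
  country = c("A", "B"),
  y2015 = c(1, 2),
  y2016 = c(3, 4)
)

# Move the year columns into a single "year" variable with one value per row.
long <- pivot_longer(
  wide,
  cols = c(y2015, y2016),
  names_to = "year",
  values_to = "cases"
)

print(long)
```

After the reshape, country, year, and cases are each a column, and every row is a single country-year observation.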
10.
Which of the following processes involves structuring datasets to facilitate analysis?
Correct Answer
A. Data tidying
Explanation
Data tidying involves structuring datasets to facilitate analysis. This process includes cleaning and organizing the data, dealing with missing values, standardizing variables, and transforming the data into a format that is suitable for analysis. By tidying the data, it becomes easier to perform various analytical tasks such as data exploration, visualization, and modeling.
11.
Raw data in the real world is tidy and properly formatted.
Correct Answer
B. False
Explanation
Raw data in the real world is often messy and unstructured, with inconsistencies, missing values, and formatting issues. It requires cleaning and organizing to make it tidy and properly formatted. Therefore, the statement that raw data in the real world is tidy and properly formatted is false.
12.
Which of the following functions is used for loading flat files?
Correct Answer
C. read.table
Explanation
The function "read.table" is used for loading flat files. It is the general-purpose base R function for reading a delimited text file into a data frame. By default it splits fields on whitespace, so a CSV file needs sep = "," (or the read.csv wrapper, which sets that for you). "read.data" and "read.sheet" are not base R functions, and "none of the mentioned" is incorrect because "read.table" does load flat files.
13.
Which of the following is an important parameter of the read.table function?
Correct Answer
D. all of the mentioned
Explanation
The correct answer is "all of the mentioned." All three parameters (file, header, and sep) matter when calling read.table. "file" names the file to read and is the only one of the three without a default. "header" indicates whether the first line holds column names, and "sep" gives the field separator. "header" and "sep" default to FALSE and whitespace respectively, but they usually have to be set to match the file.
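A minimal sketch of those parameters in use follows. To keep it self-contained it passes the content through read.table's text argument in place of a real file; the sample names and scores are invented for illustration.

```r
# Hypothetical comma-separated content standing in for a file on disk.
csv_text <- "name,score
alice,90
bob,85"

# text = lets read.table() parse a string instead of opening a file.
# header = TRUE treats the first line as column names; sep = "," splits on commas.
scores <- read.table(text = csv_text, header = TRUE, sep = ",")

print(scores)
str(scores)
```

With a real file, the first argument would be the path, for example read.table("scores.csv", header = TRUE, sep = ","), where the file name is again hypothetical.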
14.
Which of the following will set the character that represents a missing value?
Correct Answer
B. na.strings
Explanation
na.strings is an argument of read.table (and related functions such as read.csv), not a standalone command. It takes a character vector of strings that should be read as NA, the missing-value marker in R. The default is "NA". Therefore, the correct answer is na.strings.
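The effect of na.strings can be sketched as follows. The sentinel values "-999" and "missing" and the sample readings are assumptions chosen for the example, not values taken from any real dataset.

```r
# Hypothetical sensor export that marks gaps with -999 or the word "missing".
raw_text <- "id,temp
1,21.5
2,-999
3,missing"

# Every string listed in na.strings is read as NA instead of as data.
readings <- read.table(
  text = raw_text,
  header = TRUE,
  sep = ",",
  na.strings = c("-999", "missing")
)

print(readings)
print(is.na(readings$temp))
```

Because the placeholder strings become NA, the temp column can still be read as numeric rather than falling back to character.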
15.
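Which of the following packages is used for reading Excel data?
Correct Answer
A. xlsx
Explanation
The correct answer is "xlsx" (R package names are case-sensitive, so the name is all lowercase). The xlsx package reads and writes Excel files from R, for example with its read.xlsx function. It relies on Java through the rJava package.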
16.
Which of the following can be used to view all the tables in memory?
Correct Answer
A. tables
Explanation
The correct answer is "tables". The tables() function from the data.table package lists every data.table currently held in memory, along with details such as the number of rows and the column names, so the user can see which tables are available.
17.
Which of the following is used to extract data from HTML code of websites?
Correct Answer
A. Webscraping
Explanation
Webscraping is the process of extracting data from the HTML code of websites. It involves using automated tools or scripts to navigate through the website's structure and retrieve specific information. Webdredging and webcleaning are not commonly used terms for this process. Therefore, the correct answer is Webscraping.
18.
Which of the following functions is used to read data off web pages?
Correct Answer
B. readLines
Explanation
The correct answer is "readLines", written in R as readLines() with no dot. It reads text lines from a connection, which can be a file or a URL, and returns them as a character vector with one element per line. That makes it a simple way to pull the raw HTML of a web page into R. The other options, "read.web" and "read.Line", are not base R functions.
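A small sketch of readLines() on page content is shown below. To stay runnable offline it reads from a textConnection over made-up HTML; with a live page the connection would instead come from url() with the page address.

```r
# Hypothetical page content; a real call would use a connection such as url("https://example.com").
page_text <- c("<html>", "<title>Example page</title>", "</html>")

con <- textConnection(page_text)
html_lines <- readLines(con)  # one character element per line
close(con)

# Pick out the line holding the title and strip the tags around it.
title_line <- grep("<title>", html_lines, value = TRUE)
title <- gsub("<[^>]+>", "", title_line)

print(title)
```

readLines() only fetches the text; pulling structured data out of the HTML still needs pattern matching as above or a dedicated parsing package.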
19.
Which of the following functions gives information about top-level data?
Correct Answer
A. head
Explanation
The function "head" gives information about the top-level data. It returns the first few rows of a dataset (six by default), allowing users to quickly view the structure and content of the data. It is commonly used during data exploration to get a glimpse of a dataset before performing further operations.
20.
Point out the correct statement.
Correct Answer
D. none of the mentioned
Explanation
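The correct statement is "none of the mentioned." In R, head and tail operate on the elements of a vector or the rows of a data frame, not on the characters inside a string. Given a single string, each returns the whole string unchanged; extracting leading or trailing characters is a job for substr.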
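The behaviour of head and tail on a data frame and on a single string can be checked with a short sketch. The data frame and the word "dataset" are arbitrary sample values.

```r
# Hypothetical ten-row data frame.
df <- data.frame(x = 1:10, y = letters[1:10])

first_rows <- head(df, 3)  # rows 1 to 3
last_rows <- tail(df, 3)   # rows 8 to 10

# A single string is a character vector of length one,
# so head() returns that one element, not its first characters.
word <- "dataset"
whole <- head(word, 3)
prefix <- substr(word, 1, 3)  # character-level extraction

print(first_rows)
print(last_rows)
print(whole)
print(prefix)
```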
21.
Which of the following functions is used for quantiles of quantitative values?
Correct Answer
A. quantile
Explanation
The correct answer is "quantile". The quantile function computes sample quantiles of a numeric vector, that is, the values below which a given fraction of the data falls. By default it returns the minimum, the three quartiles, and the maximum, and its probs argument selects other cut points such as the 10th or 90th percentile. It is commonly used to summarize the distribution and spread of data.
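A brief sketch of quantile() on a made-up numeric vector, first with the defaults and then with explicit probabilities:

```r
# Hypothetical measurements.
values <- c(2, 4, 4, 5, 7, 9, 10, 12, 15, 20)

# Default: minimum, quartiles, and maximum (0%, 25%, 50%, 75%, 100%).
q <- quantile(values)

# probs picks other cut points, here the 10th and 90th percentiles.
tails <- quantile(values, probs = c(0.1, 0.9))

print(q)
print(tails)
```

The result is a named numeric vector, so individual quantiles can be looked up by name, for example q[["50%"]] for the median.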
22.
Who is a data scientist?
Correct Answer
D. All of the mentioned
Explanation
A data scientist is someone who possesses a combination of skills in programming, statistics, and mathematics. They are proficient in coding and software development, allowing them to manipulate and analyze large datasets. They also have a deep understanding of statistical concepts and techniques, enabling them to extract meaningful insights from data. Additionally, their mathematical skills help them build models and algorithms to solve complex problems. Therefore, the correct answer is "All of the mentioned" as a data scientist requires expertise in software programming, statistics, and mathematics.
23.
Which of the following functions is used for casting data frames?
Correct Answer
A. dcast
Explanation
The function "dcast" is used for casting data frames. It comes from the reshape2 package (data.table provides its own version) and reshapes a data frame from long to wide format. The user supplies a formula naming the variables that identify rows and columns, along with the variable whose values fill the resulting wide table. Therefore, "dcast" is the correct answer.
24.
The mutate function is used for casting as multi-dimensional arrays.
Correct Answer
B. False
Explanation
The statement is false. mutate, from the dplyr package, adds new columns to a data frame or modifies existing ones; it does not cast anything. Casting a molten data frame into a multi-dimensional array is done with acast from the reshape2 package.
25.
How do you check if “m” is a matrix data object in R?
Correct Answer
C. is.matrix()
Explanation
The correct answer is "is.matrix()". In R, calling is.matrix(m) returns TRUE if "m" is a matrix and FALSE otherwise.
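A quick sketch of the matrix check on three sample objects; the objects themselves are arbitrary.

```r
m <- matrix(1:6, nrow = 2)   # a 2 x 3 matrix
v <- 1:6                     # a plain vector, no dim attribute
df <- data.frame(a = 1:2)    # a data frame, which is a list, not a matrix

print(is.matrix(m))   # TRUE
print(is.matrix(v))   # FALSE
print(is.matrix(df))  # FALSE
```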
26.
What is the intersection of a column and a row on an Excel worksheet called?
Correct Answer
D. Cell
Explanation
The intersection of a column and a row on an Excel worksheet is called a cell. A cell is the smallest unit of a worksheet and is identified by a unique address, which is a combination of the column letter and the row number. It is where data can be entered, edited, and displayed.
27.
What type of chart will you use to compare the performance of two employees in the year 2016?
Correct Answer
B. Line chart
Explanation
A line chart is the most suitable type of chart to compare the performance of two employees in the year 2016. This type of chart is effective for showing trends and changes over time. By plotting the performance data for each employee on a line chart, it becomes easy to compare their progress and identify any patterns or differences in their performance throughout the year. The line chart allows for a clear visualization of how each employee's performance fluctuates over time, making it an ideal choice for this comparison.
28.
The __________ feature of MS Excel quickly completes a series of data.
Correct Answer
B. Auto fill
Explanation
The Auto fill feature of MS Excel quickly completes a series of data by automatically filling in the subsequent values based on the pattern established by the initial selection. This saves time and effort by eliminating the need to manually enter each value in the series.
29.
What type of chart is useful for comparing parts of a whole?
Correct Answer
A. Pie chart
Explanation
A pie chart is useful for comparing parts of a whole because it visually represents the proportion of each category or component in relation to the whole. The chart is divided into slices, with each slice representing a specific category or component. The size of each slice is proportional to its percentage or share of the whole, making it easy to compare and understand the relative importance or distribution of each part. This type of chart is commonly used to show market share, budget allocation, or demographic distribution.
30.
What type of chart will you use to compare the sales performance of two products?
Correct Answer
A. Column chart
Explanation
A column chart is the most suitable type of chart to compare the performance of sales of two products. This chart displays the data in vertical columns, with each column representing a different product. The height of each column represents the sales performance, allowing for easy visual comparison between the two products. A line chart, on the other hand, is more suitable for showing trends over time, while a pie chart is used to represent proportions or percentages of a whole. Therefore, the correct answer is a column chart.
31.
Strong Artificial Intelligence is __________
Correct Answer
A. the embodiment of human intellectual capabilities within a computer
Explanation
Strong Artificial Intelligence refers to the concept of creating a computer system or program that possesses the same intellectual capabilities as a human being. It involves the development of technology that can understand, learn, and reason like a human, and can perform tasks that would typically require human intelligence. This includes abilities such as problem-solving, decision-making, natural language processing, and even emotions. Strong AI aims to replicate and embody human intellectual capabilities within a computer system, enabling it to exhibit intelligent behavior and perform tasks that would be considered intelligent if done by a human.
32.
Which of the following are examples of software development tools?
Correct Answer
D. All of the mentioned
Explanation
The correct answer is "all of the mentioned". Debuggers, editors, assemblers, compilers, and interpreters are all software development tools. Debuggers help find and fix errors in code, and editors are used to write and modify it. Assemblers translate assembly language into machine code, compilers translate high-level source code into machine code or an intermediate form, and interpreters execute source code directly. Therefore, all of these options are tools commonly used in software development.
33.
Which is the first AI programming language?
Correct Answer
D. LISP
Explanation
LISP is considered the first AI programming language because it was specifically designed for the development of artificial intelligence applications. It was developed in the late 1950s and early 1960s by John McCarthy and his team at MIT. LISP stands for "LISt Processing" and it introduced many concepts that are still used in AI programming today, such as symbolic processing, recursion, and dynamic typing. Its focus on symbolic manipulation and its ability to represent and manipulate knowledge made it a popular choice for AI research and development.
34.
Which of the following algorithms is an online search algorithm?
Correct Answer
C. Hill-climbing search algorithm
Explanation
Hill-climbing search is an online search algorithm because it keeps only the current state in memory and chooses its next move from the neighbors of that state, using a heuristic evaluation. It does not need the whole search space in advance, which suits real-time or dynamic environments where the agent learns about states only by visiting them. Breadth-first and depth-first search, in their standard forms, are usually presented as offline algorithms that plan over a known search space before acting.
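The idea can be sketched in a few lines. This is a minimal illustration, not a reference implementation: the objective function, the integer state space, and the step size of one are all assumptions made for the example.

```r
# Hypothetical objective with a single peak at x = 3.
score <- function(x) -(x - 3)^2

hill_climb <- function(start, score, max_steps = 100) {
  current <- start
  for (step in seq_len(max_steps)) {
    # Only the neighbours of the current state are examined.
    neighbours <- c(current - 1, current + 1)
    best <- neighbours[which.max(sapply(neighbours, score))]
    # Stop when no neighbour improves on the current state.
    if (score(best) <= score(current)) break
    current <- best
  }
  current
}

result <- hill_climb(start = 0, score = score)
print(result)
```

The loop never stores more than the current state and its immediate neighbours, which is the property that makes the method usable online. On an objective with several peaks the same loop can stop at a local maximum.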
35.
How are new states generated in a genetic algorithm?
Correct Answer
D. Both Mutation & Cross-over
Explanation
In genetic algorithms, new states are generated through both mutation and cross-over. Mutation involves randomly altering certain genes in an individual's genetic code, introducing new variations. On the other hand, cross-over involves combining genetic material from two parent individuals to create offspring with a mix of their traits. By utilizing both mutation and cross-over, genetic algorithms can explore a wide range of possibilities and potentially find optimal solutions by gradually improving and evolving the population over successive generations.
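The two operators can be sketched on bit-string individuals. The encoding, the single-point crossover, the mutation rate of 0.1, and the function names crossover and mutate_bits are all choices made for this illustration; real genetic algorithms vary on each of them.

```r
set.seed(42)  # fixed seed so the sketch is repeatable

# Single-point crossover: genes up to a random cut come from p1, the rest from p2.
crossover <- function(p1, p2) {
  point <- sample(seq_len(length(p1) - 1), 1)
  c(p1[seq_len(point)], p2[(point + 1):length(p2)])
}

# Mutation: flip each bit independently with probability `rate`.
mutate_bits <- function(genes, rate = 0.1) {
  flip <- runif(length(genes)) < rate
  genes[flip] <- 1 - genes[flip]
  genes
}

parent_a <- rep(0, 8)
parent_b <- rep(1, 8)

offspring <- crossover(parent_a, parent_b)
child <- mutate_bits(offspring)

print(offspring)
print(child)
```

Crossover recombines material already present in the parents, while mutation can introduce values that neither parent carried at a given position.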
36.
What is the name for information sent from robot sensors to robot controllers?
Correct Answer
C. Feedback
Explanation
Feedback is the correct answer because it refers to the information that is sent from robot sensors to robot controllers. Feedback is crucial for the robot controllers to make necessary adjustments and decisions based on the input received from the sensors. It helps in monitoring and controlling the robot's actions and ensuring its proper functioning. Temperature, pressure, and signal are not specifically related to the information sent from sensors to controllers, making them incorrect options.
37.
Which of the following terms refers to the rotational motion of a robot arm?
Correct Answer
D. Roll
Explanation
Roll refers to the rotational motion of a robot arm. In robotics, roll is the movement of the arm around its longitudinal axis, similar to how a rolling pin moves. This motion allows the robot arm to rotate and reposition itself in a circular or rolling manner. Swivel refers to a horizontal rotation, axle refers to a shaft on which a wheel rotates, and retrograde refers to a backward or reverse motion.
38.
What is the complex system of structured messages?
Correct Answer
A. Languages
Explanation
Languages are the complex systems of structured messages. They consist of a set of rules and symbols that allow individuals to communicate and convey meaning. Languages are composed of words, which are the basic units of meaning, and they can be expressed through signs or speech. Through languages, individuals can convey complex ideas, express emotions, and engage in social interactions. Therefore, languages are the most suitable option as the complex system of structured message.
39.
Which of the following contains output segments of AI programming?
Correct Answer
D. All of the mentioned
Explanation
All of the mentioned options contain output segments of AI programming. Printed language and synthesized output can be used to communicate the results or information generated by AI systems. The manipulation of physical objects can also be an output of AI programming, where AI systems can control robots or machines to perform specific tasks. Locomotion, or the movement of objects or robots, can also be an output of AI programming where AI systems can control the motion of autonomous vehicles or robots. Therefore, all of the mentioned options contain output segments of AI programming.
40.
When a top-level function is entered, what does the LISP processor do?
Correct Answer
B. It prints the result returned by the function
Explanation
When a top-level function is entered, the LISP processor prints the result returned by the function. This means that when the function is executed, the output or result of the function is displayed or printed by the LISP processor. This allows the user to see the outcome of the function and understand the result of their code.