Data Science & Ai Applications

Approved & Edited by ProProfs Editorial Team
The editorial team at ProProfs Quizzes consists of a select group of subject experts, trivia writers, and quiz masters who have authored over 10,000 quizzes taken by more than 100 million users. This team includes our in-house seasoned quiz moderators and subject matter experts. Our editorial experts, spread across the world, are rigorously trained using our comprehensive guidelines to ensure that you receive the highest quality quizzes.
Learn about Our Editorial Process
| By Isie
I
Isie
Community Contributor
Quizzes Created: 1 | Total Attempts: 1,016
Questions: 40 | Attempts: 1,018

SettingsSettingsSettings
Data Science Quizzes & Trivia

.


Questions and Answers
  • 1. 

    Point out the correct statement.

    • A.

      Raw data is original source of data

    • B.

      Preprocessed data is original source of data

    • C.

      Raw data is the data obtained after processing steps

    • D.

      None of the mentioned

    Correct Answer
    A. Raw data is original source of data
    Explanation
    The correct statement is that raw data is the original source of data. Raw data refers to the unprocessed and unorganized data that is collected directly from the source. It has not undergone any manipulation or analysis. Preprocessed data, on the other hand, refers to the data that has been cleaned, transformed, and organized for further analysis. Therefore, raw data is the initial and untouched form of data that serves as the foundation for any data processing or analysis.

    Rate this question:

  • 2. 

        Data that summarize all observations in a category are called __________ data.

    • A.

      Frequency

    • B.

      Summarized

    • C.

      raw

    • D.

      None of the mentioned

    Correct Answer
    B. Summarized
    Explanation
    Summarized data refers to data that has been condensed or aggregated to provide an overview or summary of the observations within a category. This type of data is useful for providing a high-level understanding of the information and can help to identify patterns or trends. In contrast, raw data refers to the original, unprocessed data that has not been summarized or analyzed. Therefore, the correct answer is "summarized".

    Rate this question:

  • 3. 

       Which of the following is an example of raw data?

    • A.

      Original swath files generated from a sonar system

    • B.

      Initial time-series file of temperature values

    • C.

      A real-time GPS-encoded navigation file

    • D.

      All of the mentioned

    Correct Answer
    D. All of the mentioned
    Explanation
    All of the mentioned options are examples of raw data because they are all original and unprocessed data that have been directly collected from a source. The original swath files generated from a sonar system, the initial time-series file of temperature values, and the real-time GPS-encoded navigation file are all examples of raw data that can be further analyzed and processed to extract meaningful information. Therefore, the correct answer is "all of the mentioned."

    Rate this question:

  • 4. 

      Point out the correct statement.

    • A.

      Primary data is original source of data

    • B.

      Secondary data is original source of data

    • C.

      Questions are obtained after data processing steps

    • D.

      None of the Mentioned

    Correct Answer
    A. Primary data is original source of data
    Explanation
    Primary data refers to the data that is collected firsthand from the original source. This means that primary data is the original source of data as it is collected directly by the researcher or organization for a specific purpose. Secondary data, on the other hand, is data that has been collected by someone else for their own purpose and is used by others for their research or analysis. Therefore, the correct statement is that primary data is the original source of data.

    Rate this question:

  • 5. 

        Which of the following data is put into a formula to produce commonly accepted results?

    • A.

      Raw

    • B.

      Processed

    • C.

      Synchronized

    • D.

      All of the Mentioned

    Correct Answer
    B. Processed
    Explanation
    Processed data refers to information that has been manipulated, organized, or transformed in some way to produce meaningful results. This data has undergone a series of steps or operations to extract valuable insights or to make it suitable for analysis. By using processed data in a formula, we can generate commonly accepted results that are reliable and accurate. Therefore, processed data is the correct choice as it is the type of data that is typically used in formulas to produce desired outcomes.

    Rate this question:

  • 6. 

       Which of the following is another name for raw data?

    • A.

      destination data

    • B.

      eggy data

    • C.

      secondary

    • D.

      machine learning

    Correct Answer
    B. eggy data
    Explanation
    The term "eggy data" is not commonly used or recognized in the context of raw data. It is likely a made-up or incorrect term. Raw data refers to unprocessed and unorganized data that is directly collected from a source. It has not undergone any manipulation or analysis.

    Rate this question:

  • 7. 

      Which type of data is generated by POS terminal in a busy supermarket each day?

    • A.

      Source

    • B.

      Processed

    • C.

      Synchronized

    • D.

      All of the mentioned

    Correct Answer
    A. Source
    Explanation
    The correct answer is "Source" because a POS (Point of Sale) terminal in a busy supermarket generates raw data or the original data at the point of sale. This data includes information about the products sold, quantities, prices, and payment methods used. It is the starting point or the source of data that can be further processed, synchronized, or used for various analytical purposes.

    Rate this question:

  • 8. 

      Point out the correct statement.

    • A.

      Nearly 80% of data analysis is spent on wrangling data

    • B.

      Nearly 20% of data analysis is spent on data dredging

    • C.

      Nearly 80% of data analysis is spent on the cleaning and preparing data

    • D.

      None of the mentioned

    Correct Answer
    C. Nearly 80% of data analysis is spent on the cleaning and preparing data
    Explanation
    The correct statement is that nearly 80% of data analysis is spent on the cleaning and preparing data. This means that a significant amount of time in data analysis is dedicated to organizing and formatting the data in a way that is suitable for analysis. This process involves tasks such as removing duplicates, handling missing values, and transforming the data into a usable format. By spending a majority of the time on data cleaning and preparation, analysts ensure that the data is accurate and reliable for further analysis and decision-making.

    Rate this question:

  • 9. 

    Which of the following package is used for tidy data?

    • A.

      tidyr

    • B.

      souryr

    • C.

      NumPy

    • D.

      None

    Correct Answer
    A. tidyr
    Explanation
    The tidyr package is used for tidy data. Tidy data is a structured format where each variable has its own column and each observation has its own row. The tidyr package provides functions to easily manipulate and transform data to conform to this tidy format.

    Rate this question:

  • 10. 

    Which of the following process involves structuring datasets to facilitate analysis?

    • A.

      Data tidying

    • B.

      Data mining

    • C.

      Data booting

    • D.

      none

    Correct Answer
    A. Data tidying
    Explanation
    Data tidying involves structuring datasets to facilitate analysis. This process includes cleaning and organizing the data, dealing with missing values, standardizing variables, and transforming the data into a format that is suitable for analysis. By tidying the data, it becomes easier to perform various analytical tasks such as data exploration, visualization, and modeling.

    Rate this question:

  • 11. 

       Raw data in the real-world is tidy and properly formatted.

    • A.

      True

    • B.

      False

    Correct Answer
    A. True
    Explanation
    Raw data in the real world is often messy and unstructured, with inconsistencies, missing values, and formatting issues. It requires cleaning and organizing to make it tidy and properly formatted. Therefore, the statement that raw data in the real world is tidy and properly formatted is false.

    Rate this question:

  • 12. 

      Which of the following function is used for loading flat files?

    • A.

      read.data

    • B.

      Read.sheet

    • C.

      read.table

    • D.

      None of the mentioned

    Correct Answer
    C. read.table
    Explanation
    The function "read.table" is used for loading flat files. This function is commonly used in R programming to read data from a tabular file, such as a CSV file, into a data frame. It automatically detects the structure of the file and imports the data accordingly. "read.data" and "read.sheet" are not valid functions in R, and "none of the mentioned" is not the correct answer because "read.table" is indeed used for loading flat files.

    Rate this question:

  • 13. 

      Which of the following is an important parameter of read.table function?

    • A.

      file

    • B.

      Header

    • C.

      Sep

    • D.

      all of the mentioned

    Correct Answer
    D. all of the mentioned
    Explanation
    The correct answer is "all of the mentioned." This is because all three parameters - file, header, and sep - are important in the read.table function. The "file" parameter specifies the file to be read, the "header" parameter indicates whether the file has a header row, and the "sep" parameter specifies the separator used in the file. Therefore, all of these parameters are necessary for the function to properly read the data from a file.

    Rate this question:

  • 14. 

    Which of the following will set the character that represents missing value?

    • A.

      Na.quote

    • B.

      na.strings

    • C.

      nrows

    • D.

      All of the mentioned

    Correct Answer
    B. na.strings
    Explanation
    The na.strings command is used to set the character that represents missing values in R. By using this command, any string specified will be treated as a missing value in the dataset. Therefore, the correct answer is na.strings.

    Rate this question:

  • 15. 

        Which of the following package is used for reading excel data?

    • A.

      Xlsx

    • B.

      xlsc

    • C.

      Read.sheet

    • D.

      all of the mentioned

    Correct Answer
    A. Xlsx
    Explanation
    The correct answer is "xlsx". The xlsx package is used for reading excel data.

    Rate this question:

  • 16. 

       Which of the following can be used to view all the tables in memory?

    • A.

      Tables

    • B.

      alltable

    • C.

      Table

    • D.

      None

    Correct Answer
    A. Tables
    Explanation
    The correct answer is "tables". This command is used to view all the tables that are currently stored in memory. By executing the "tables" command, a list of all the tables will be displayed, allowing the user to see what tables are available for use or manipulation.

    Rate this question:

  • 17. 

    Which of the following is used to extract data from HTML code of websites?

    • A.

      Webscraping

    • B.

      Webdredging

    • C.

      Webcleaning

    • D.

      All of the mentioned

    Correct Answer
    A. Webscraping
    Explanation
    Webscraping is the process of extracting data from the HTML code of websites. It involves using automated tools or scripts to navigate through the website's structure and retrieve specific information. Webdredging and webcleaning are not commonly used terms for this process. Therefore, the correct answer is Webscraping.

    Rate this question:

  • 18. 

    Which of the following function is used to read data off the webpages?

    • A.

      read.web

    • B.

      Read.Lines

    • C.

      read.Line

    • D.

      all of the mentioned

    Correct Answer
    B. Read.Lines
    Explanation
    The correct answer is "read.Lines." This function is used to read data off webpages. It allows for reading multiple lines of data at once, making it suitable for extracting information from webpages that contain multiple lines of text. The other options, "read.web" and "read.Line," are not valid functions in R. Therefore, "read.Lines" is the correct answer.

    Rate this question:

  • 19. 

    Which of the following function gives information about top level data?

    • A.

      Head

    • B.

      tail

    • C.

      Summary

    • D.

      None of the mention

    Correct Answer
    A. Head
    Explanation
    The function "head" gives information about the top level data. It returns the first few rows of a dataset, allowing users to quickly view the structure and content of the data. This function is commonly used in data analysis and exploration tasks to get a glimpse of the dataset before performing further operations.

    Rate this question:

  • 20. 

        Point out the correct statement.

    • A.

      head function work on string

    • B.

      tail function work on string

    • C.

      Head function work on string but tail function do not

    • D.

      none of the mentioned

    Correct Answer
    D. none of the mentioned
    Explanation
    The correct statement is "none of the mentioned." This means that neither the head function nor the tail function work on a string.

    Rate this question:

  • 21. 

    Which of the following function is used for quantiles of quantitative values?

    • A.

      quantile

    • B.

      quantity

    • C.

      quantiles

    • D.

      all of the mentioned

    Correct Answer
    A. quantile
    Explanation
    The correct answer is "quantile". The quantile function is used to calculate the quantiles of quantitative values. It is a statistical function that divides a dataset into equal-sized intervals or groups. These intervals represent the percentiles or quantiles of the data. The quantile function is commonly used in data analysis and statistics to understand the distribution and spread of data.

    Rate this question:

  • 22. 

    Who is a data scientist?

    • A.

      Software Programmer

    • B.

      Statistician

    • C.

      Mathematician

    • D.

      All of the mentioned

    Correct Answer
    D. All of the mentioned
    Explanation
    A data scientist is someone who possesses a combination of skills in programming, statistics, and mathematics. They are proficient in coding and software development, allowing them to manipulate and analyze large datasets. They also have a deep understanding of statistical concepts and techniques, enabling them to extract meaningful insights from data. Additionally, their mathematical skills help them build models and algorithms to solve complex problems. Therefore, the correct answer is "All of the mentioned" as a data scientist requires expertise in software programming, statistics, and mathematics.

    Rate this question:

  • 23. 

    Which of the following function is used for casting data frames?

    • A.

      Dcast

    • B.

      Ucast

    • C.

      Rcast

    • D.

      all of the mentioned

    Correct Answer
    A. Dcast
    Explanation
    The function "dcast" is used for casting data frames. This function is used to reshape the data frame from long to wide format, allowing for a more organized and structured representation of the data. It takes a data frame as input and allows the user to specify the variables to be used as row and column identifiers, as well as the values to be populated in the resulting wide format. Therefore, "dcast" is the correct answer for this question.

    Rate this question:

  • 24. 

      Mutate function is used for casting as multi dimensional arrays.

    • A.

      TRUE

    • B.

      FALSE.

    Correct Answer
    B. FALSE.
    Explanation
    The statement is false because the mutate function is not used for casting as multi-dimensional arrays. The mutate function is used in programming languages like JavaScript to modify or change the value of an existing array, but it does not have any specific functionality for casting multi-dimensional arrays.

    Rate this question:

  • 25. 

      How do you check if “m” is a matrix data object in R?

    • A.

      As.matrix()

    • B.

      M.matrix()

    • C.

      And.matrix()

    • D.

      None

    Correct Answer
    C. And.matrix()
    Explanation
    The correct answer is "And.matrix()". In R, the function "is.matrix()" is used to check if an object is a matrix. However, in the given options, there is no "is.matrix()" function available. Therefore, the correct option would be "And.matrix()" which suggests that we can use the "and" operator to check if "m" is a matrix data object.

    Rate this question:

  • 26. 

    What is the intersection of a column and a row on a Excel worksheet called?

    • A.

      Column

    • B.

      Value

    • C.

      Address

    • D.

      Cell

    Correct Answer
    D. Cell
    Explanation
    The intersection of a column and a row on an Excel worksheet is called a cell. A cell is the smallest unit of a worksheet and is identified by a unique address, which is a combination of the column letter and the row number. It is where data can be entered, edited, and displayed.

    Rate this question:

  • 27. 

    What type of chart will you use to compare performance of two employees in the year 2016?

    • A.

      Column matrix

    • B.

      Line chart

    • C.

      Pie chart

    • D.

      Dot chart

    Correct Answer
    B. Line chart
    Explanation
    A line chart is the most suitable type of chart to compare the performance of two employees in the year 2016. This type of chart is effective for showing trends and changes over time. By plotting the performance data for each employee on a line chart, it becomes easy to compare their progress and identify any patterns or differences in their performance throughout the year. The line chart allows for a clear visualization of how each employee's performance fluctuates over time, making it an ideal choice for this comparison.

    Rate this question:

  • 28. 

    The __________ feature of MS Excel quickly completes a series of data

    • A.

      Auto complete

    • B.

      Auto fill

    • C.

      Fill handle

    • D.

      Sorting

    Correct Answer
    B. Auto fill
    Explanation
    The Auto fill feature of MS Excel quickly completes a series of data by automatically filling in the subsequent values based on the pattern established by the initial selection. This saves time and effort by eliminating the need to manually enter each value in the series.

    Rate this question:

  • 29. 

    What type of chart is useful for comparing parts of a whole?

    • A.

      Pie chart

    • B.

      Scatter chart

    • C.

      Line chart

    • D.

      Dot graph

    Correct Answer
    A. Pie chart
    Explanation
    A pie chart is useful for comparing parts of a whole because it visually represents the proportion of each category or component in relation to the whole. The chart is divided into slices, with each slice representing a specific category or component. The size of each slice is proportional to its percentage or share of the whole, making it easy to compare and understand the relative importance or distribution of each part. This type of chart is commonly used to show market share, budget allocation, or demographic distribution.

    Rate this question:

  • 30. 

    What type of chart will you use to compare performance of sales of two products

    • A.

      Column chart

    • B.

      Line chart

    • C.

      Pie chart

    • D.

      Both A and B

    Correct Answer
    A. Column chart
    Explanation
    A column chart is the most suitable type of chart to compare the performance of sales of two products. This chart displays the data in vertical columns, with each column representing a different product. The height of each column represents the sales performance, allowing for easy visual comparison between the two products. A line chart, on the other hand, is more suitable for showing trends over time, while a pie chart is used to represent proportions or percentages of a whole. Therefore, the correct answer is a column chart.

    Rate this question:

  • 31. 

    Strong Artificial Intelligence is __________

    • A.

      the embodiment of human intellectual capabilities within a computer

    • B.

      A set of computer programs that produce output that would be considered to reflect intelligence if it were generated by humans

    • C.

      the study of mental faculties through the use of mental models implemented on a computer

    • D.

      all of the mentioned

    Correct Answer
    A. the embodiment of human intellectual capabilities within a computer
    Explanation
    Strong Artificial Intelligence refers to the concept of creating a computer system or program that possesses the same intellectual capabilities as a human being. It involves the development of technology that can understand, learn, and reason like a human, and can perform tasks that would typically require human intelligence. This includes abilities such as problem-solving, decision-making, natural language processing, and even emotions. Strong AI aims to replicate and embody human intellectual capabilities within a computer system, enabling it to exhibit intelligent behavior and perform tasks that would be considered intelligent if done by a human.

    Rate this question:

  • 32. 

    Which of the following are examples of software development tools?

    • A.

      debuggers

    • B.

      Editors

    • C.

      assemblers, compilers and interpreters

    • D.

      All of the mentioned

    Correct Answer
    D. All of the mentioned
    Explanation
    The correct answer is "all of the mentioned". Debuggers, editors, assemblers, compilers, and interpreters are all examples of software development tools. Debuggers are used to identify and fix errors in code, editors are used to write and modify code, assemblers, compilers, and interpreters are used to convert high-level programming languages into machine code that can be executed by a computer. Therefore, all of these options are examples of tools that are commonly used in software development.

    Rate this question:

  • 33. 

    Which is the first AI programming language?

    • A.

      BASIC

    • B.

      FORTRAN

    • C.

      IPL(Inductive logic programming)

    • D.

      LISP

    Correct Answer
    D. LISP
    Explanation
    LISP is considered the first AI programming language because it was specifically designed for the development of artificial intelligence applications. It was developed in the late 1950s and early 1960s by John McCarthy and his team at MIT. LISP stands for "LISt Processing" and it introduced many concepts that are still used in AI programming today, such as symbolic processing, recursion, and dynamic typing. Its focus on symbolic manipulation and its ability to represent and manipulate knowledge made it a popular choice for AI research and development.

    Rate this question:

  • 34. 

    Which of the following algorithm is online search algorithm?

    • A.

      Breadth-first search algorithm

    • B.

      Depth-first search algorithm

    • C.

      Hill-climbing search algorithm

    • D.

      None of the mentioned

    Correct Answer
    C. Hill-climbing search algorithm
    Explanation
    Hill-climbing search algorithm is an online search algorithm because it explores the search space by continuously moving towards the goal state based on the heuristic evaluation of the current state. It does not require the entire search space to be known in advance and can make decisions based on the current state and the available information. This makes it suitable for real-time or dynamic environments where new information is constantly being received. In contrast, breadth-first search and depth-first search algorithms are offline search algorithms as they require the complete search space to be known beforehand.

    Rate this question:

  • 35. 

    How the new states are generated in genetic algorithm?

    • A.

      Composition

    • B.

      Mutation

    • C.

      Cross-over

    • D.

      Both Mutation & Cross-over

    Correct Answer
    D. Both Mutation & Cross-over
    Explanation
    In genetic algorithms, new states are generated through both mutation and cross-over. Mutation involves randomly altering certain genes in an individual's genetic code, introducing new variations. On the other hand, cross-over involves combining genetic material from two parent individuals to create offspring with a mix of their traits. By utilizing both mutation and cross-over, genetic algorithms can explore a wide range of possibilities and potentially find optimal solutions by gradually improving and evolving the population over successive generations.

    Rate this question:

  • 36. 

    What is the name for information sent from robot sensors to robot controllers?

    • A.

      temperature

    • B.

      Pressure

    • C.

      Feedback

    • D.

      Signal

    Correct Answer
    C. Feedback
    Explanation
    Feedback is the correct answer because it refers to the information that is sent from robot sensors to robot controllers. Feedback is crucial for the robot controllers to make necessary adjustments and decisions based on the input received from the sensors. It helps in monitoring and controlling the robot's actions and ensuring its proper functioning. Temperature, pressure, and signal are not specifically related to the information sent from sensors to controllers, making them incorrect options.

    Rate this question:

  • 37. 

    Which of the following terms refers to the rotational motion of a robot arm?

    • A.

      swivel

    • B.

      Axle

    • C.

      retrograde

    • D.

      Roll

    Correct Answer
    D. Roll
    Explanation
    Roll refers to the rotational motion of a robot arm. In robotics, roll is the movement of the arm around its longitudinal axis, similar to how a rolling pin moves. This motion allows the robot arm to rotate and reposition itself in a circular or rolling manner. Swivel refers to a horizontal rotation, axle refers to a shaft on which a wheel rotates, and retrograde refers to a backward or reverse motion.

    Rate this question:

  • 38. 

    What is the complex system of structured message?

    • A.

      Languages

    • B.

      Words

    • C.

      Signs

    • D.

      Speech

    Correct Answer
    A. Languages
    Explanation
    Languages are the complex systems of structured messages. They consist of a set of rules and symbols that allow individuals to communicate and convey meaning. Languages are composed of words, which are the basic units of meaning, and they can be expressed through signs or speech. Through languages, individuals can convey complex ideas, express emotions, and engage in social interactions. Therefore, languages are the most suitable option as the complex system of structured message.

    Rate this question:

  • 39. 

    Which of the following contains output segments of AI programming?

    • A.

      Printed language and synthesized

    • B.

      Manipulation of physical object

    • C.

      Locomotion

    • D.

      All of the mentioned

    Correct Answer
    D. All of the mentioned
    Explanation
    All of the mentioned options contain output segments of AI programming. Printed language and synthesized output can be used to communicate the results or information generated by AI systems. The manipulation of physical objects can also be an output of AI programming, where AI systems can control robots or machines to perform specific tasks. Locomotion, or the movement of objects or robots, can also be an output of AI programming where AI systems can control the motion of autonomous vehicles or robots. Therefore, all of the mentioned options contain output segments of AI programming.

    Rate this question:

  • 40. 

    When a top-level function is entered, the LISP processor does?

    • A.

      It reads the function entered

    • B.

      It prints the result returned by the function

    • C.

      Large memory and high-speed processor

    • D.

      All of the mentioned

    Correct Answer
    B. It prints the result returned by the function
    Explanation
    When a top-level function is entered, the LISP processor prints the result returned by the function. This means that when the function is executed, the output or result of the function is displayed or printed by the LISP processor. This allows the user to see the outcome of the function and understand the result of their code.

    Rate this question:

Quiz Review Timeline +

Our quizzes are rigorously reviewed, monitored and continuously updated by our expert board to maintain accuracy, relevance, and timeliness.

  • Current Version
  • Oct 07, 2024
    Quiz Edited by
    ProProfs Editorial Team
  • Feb 07, 2020
    Quiz Created by
    Isie
Back to Top Back to top
Advertisement
×

Wait!
Here's an interesting quiz for you.

We have other quizzes matching your interest.