1.
A data mart is a ... ?
Correct Answer
B. Small data store for individual business units.
Explanation
A data mart is a small data store for individual business units. This means that it is a database that is specifically designed to store and manage data for a specific department or team within an organization. Unlike a centralized store of business data, which would encompass all data for the entire organization, a data mart focuses on a smaller subset of data that is relevant to a particular business unit. This allows for more efficient and targeted data analysis and reporting for that specific department.
2.
Select two benefits of using a distributed architecture.
Correct Answer(s)
B. Scalability
D. High availability
Explanation
A distributed architecture allows for scalability by allowing the system to handle increased workload and user demand by distributing the processing across multiple servers. This allows for better performance and the ability to handle more users or data without overloading a single server. Additionally, a distributed architecture also provides high availability by distributing data and processing across multiple servers, ensuring that if one server fails, the system can continue to operate without interruption. This improves system reliability and minimizes downtime.
3.
What is a consideration for estimating CPU size
Correct Answer
D. All of the above
Explanation
All of the above options are considerations for estimating CPU size. The number of concurrent users accessing the system, the average size of queries being processed, and the maximum consumption rate of the CPU are all factors that need to be taken into account when estimating the required CPU size.
4.
Why would you choose a dedicated storage area network over direct attached storage?
Correct Answer
B. To simplify managing the configuration and adding future storage
Explanation
Choosing a dedicated storage area network (SAN) over direct attached storage (DAS) would simplify managing the configuration and adding future storage. With a SAN, storage devices are connected to a separate network, allowing for centralized management and easier scalability. This means that administrators can easily add or remove storage devices without impacting the servers directly connected to the SAN. In contrast, DAS requires individual management of each storage device, making it more complex and time-consuming to configure and expand the storage capacity.
5.
Fact tables are ... ?
Correct Answer
A. Related to business processes
Explanation
Fact tables are related to business processes. Fact tables are a key component of a data warehouse and are used to store the quantitative data or facts about a business process. They typically contain foreign keys that link to dimension tables, which provide context and descriptive attributes for the facts. By storing the facts related to business processes in a fact table, it becomes easier to analyze and gain insights into the performance and trends of the business.
6.
The three types of measures are; Additive, ________, Non-Additive
Correct Answer
Semi-Additive
Explanation
The given statement mentions three types of measures: Additive, [Blank], and Non-Additive. The correct answer provided is "Semi-Additive." This suggests that Semi-Additive is the missing type of measure that fits in the blank space. Therefore, the three types of measures mentioned in the statement are Additive, Semi-Additive, and Non-Additive.
7.
Why use database partitioning?
Correct Answer
C. Improved data load performance
Explanation
Database partitioning is a technique used to divide a large database into smaller, more manageable partitions. By doing so, it allows for improved data load performance. When data is partitioned, it can be distributed across multiple disks or servers, enabling parallel processing and faster data retrieval. This can significantly enhance the overall performance of data loading operations, especially in scenarios where large volumes of data need to be loaded into the database. Therefore, using database partitioning can help optimize data load performance and improve the efficiency of data processing tasks.
8.
Dimensions should always have indexes on the surrogate key column; true or false?
Correct Answer
A. True
Explanation
Indexes on the surrogate key column in dimensions can improve the performance of queries and joins in data warehousing environments. Surrogate keys are artificial keys assigned to each dimension record, and they are used as the primary key for the dimension table. By having indexes on the surrogate key column, the database can quickly locate and retrieve specific dimension records based on their surrogate key values, leading to faster data retrieval and improved query performance. Therefore, it is recommended to have indexes on the surrogate key column in dimensions.
9.
Clustered indexes are different than non-clustered indexes because?
Correct Answer
C. They pHysically sort the data in the files
Explanation
Clustered indexes are different than non-clustered indexes because they physically sort the data in the files. Unlike non-clustered indexes that create a separate structure to store the index data, clustered indexes dictate the physical order of the data rows in the table. This means that the data in a table with a clustered index is physically sorted based on the indexed column(s), which can improve query performance when accessing data in a specific order.
10.
Column store indexes are used primarily for tiny tables; true or false?
Correct Answer
B. False
Explanation
Column store indexes are not used primarily for tiny tables. In fact, they are specifically designed for large tables with a high number of columns. These indexes store and retrieve data by column rather than by row, which can significantly improve query performance for analytical workloads. Therefore, the correct answer is False.
11.
Select the benefits of columns store indexes.
Correct Answer(s)
C. Increased compression
D. Faster query times for aggregations
Explanation
Column store indexes offer several benefits, including increased compression and faster query times for aggregations. By storing data in a columnar format, column store indexes can achieve higher compression ratios compared to row indexes, resulting in reduced storage requirements. This increased compression also leads to improved query performance for aggregations, as the database engine can quickly scan and process the compressed columnar data. Therefore, the correct answer includes both increased compression and faster query times for aggregations. The other options, such as faster query times for small detail queries and faster build times, are not specifically associated with column store indexes.