Deck 14: Big Data, Data-Warehouses, and Business Intelligence Systems
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Unlock Deck
Sign up to unlock the cards in this deck!
Unlock Deck
Unlock Deck
1/108
Play
Full screen (f)
Deck 14: Big Data, Data-Warehouses, and Business Intelligence Systems
1
Operational databases store historical data.
False
2
Data warehouses also store the data warehouse metadata.
True
3
Data mining uses sophisticated statistical and mathematical techniques to perform what-if analyses,to make predictions,and to facilitate decision making.
True
4
Problematic data are called dirty data.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
5
Business Intelligence (BI)systems are information systems that help users analyze and use data.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
6
A data mart is a collection of data that addresses a particular component of a functional area of a business.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
7
Big Data is the name given to the enormous datasets generated by Web 2.0 applications.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
8
Dimensional databases are used for analytical data processing.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
9
Data warehouses often include data purchased from outside vendors.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
10
Business Intelligence (BI)reporting systems can analyze data using standard SQL.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
11
A data warehouse is a database system that has data,programs,and personnel specialized in Business Intelligence (BI)processing.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
12
Business Intelligence (BI)reporting systems are used to filter data,sort data,group data,and make simple calculations based on the data.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
13
Data warehouses are populated with data prepared by data extraction,transformation,and load (ETL)programs.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
14
Business Intelligence (BI)systems support operational activities.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
15
Report delivery is more important for data mining than it is for reporting systems.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
16
Dimensional databases use the star schema.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
17
Data warehouse data are frequently denormalized.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
18
Business Intelligence (BI)reporting systems summarize the current status of business activities and compare that status with past events,but not with predicted future activities.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
19
Business Intelligence (BI)systems obtain data in three different ways.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
20
Metadata about the data's source,format,assumptions,and constraints are kept in a data warehouse metadata database.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
21
In a common form of RFM analysis,a score of 1 is "high" or "good" while a score of 5 is "low" or "bad."
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
22
An OLAP cube is limited to three axes.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
23
In RFM analysis,R stands for "how recently."
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
24
Microsoft Excel 2013 allows us to connect directly to an SQL Server 2014 database when building a PivotTable.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
25
Operational databases contain a fact table.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
26
In a snowflake table,each dimension table is normalized.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
27
In RFM analysis,M stands for "how much money."
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
28
In a common form of RFM analysis,customers are sorted into five groups and given an associated score depending on their group.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
29
OLAP provides the ability to sum,count,average,and perform other simple arithmetic operations on groups of data.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
30
A star schema resembles a star,with a dimension table at the center and fact tables radiating out from the center.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
31
RFM analysis is a way of analyzing and ranking customers based on online survey data.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
32
Although Microsoft Excel 2013 will create a PivotTable report using SQL Server 2014 data,it does not have formatting tools that can be used with the report.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
33
To create an OLAP report for an SQL Server 2014 database,use the PivotTable tool in SQL Server 2014.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
34
In a common form of RFM analysis,customers with an R score of 5 are in the 20% of customers who have the most recent orders.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
35
The term drill down refers to the capability of seeing the data in smaller and smaller units.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
36
In RFM analysis,F stands for "how frequently."
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
37
Business Intelligence (BI)reporting systems are intended to create meaningful information from disparate data sources and to deliver that information to the proper users on a timely basis.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
38
In a common form of RFM analysis,an RFM score of {5 1 1} means that the customer orders frequently and orders items of high monetary value but has not ordered anything for some time.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
39
When creating an OLAP report based on SQL Server 2014 data,it is often a good idea to create a view to organize the data needed for the OLAP report.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
40
Microsoft Excel 2013 cannot import SQL Server 2014 data directly into a PivotTable report,but must first place the data into a worksheet.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
41
Data mining is the application of mathematical and statistical techniques to find patterns and relationships that can be used to classify and predict future outcomes.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
42
Facebook uses the Apache Software Foundation's Cassandra NoSQL database.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
43
In the MapReduce process,the Reduce step is followed by the Map step.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
44
Most of NoSQL nonrelational database methodologies are known as structured storage.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
45
Business Intelligence (BI)systems do which of the following?
A)Analyze current and past activities
B)Predict future events
C)Record and process transactions
D)Both A and B are correct
A)Analyze current and past activities
B)Predict future events
C)Record and process transactions
D)Both A and B are correct
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
46
NoSQL really stands of "Not only SQL."
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
47
Business Intelligence (BI)systems fall into which of the following categories?
A)Processing
B)Reporting
C)Data mining
D)Both B and C are correct
A)Processing
B)Reporting
C)Data mining
D)Both B and C are correct
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
48
Structured storage column families are indistinguishable from relational database tables.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
49
Business Intelligence (BI)reporting systems can do which of the following operations?
A)Filter data
B)Group data
C)Modify data
D)Both A and B are correct
A)Filter data
B)Group data
C)Modify data
D)Both A and B are correct
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
50
Business Intelligence (BI)systems obtain their data by all of the following means except ________.
A)read and process data from an operational database
B)process extracts from operational databases
C)process data purchased from data vendors
D)read and process data entered by BI system users
A)read and process data from an operational database
B)process extracts from operational databases
C)process data purchased from data vendors
D)read and process data entered by BI system users
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
51
The movement that uses different database methods than the relational model and/or SQL is called the NoSQL movement.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
52
One Business Intelligence (BI)reporting system that uses extensions to SQL is ________.
A)cluster analysis
B)OLAP
C)regression analysis
D)RFM analysis
A)cluster analysis
B)OLAP
C)regression analysis
D)RFM analysis
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
53
Amazon.com's Dynamo was an early example of structured storage.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
54
We have obtained access to the company's operational data.We examine 50 records for customers with phone numbers that should use the current area code of 345.Of these 50 records,we find 10 that still use an older area code of 567.This is an example of ________.
A)dirty data
B)inconsistent data
C)nonintegrated data
D)a "too much data" problem
A)dirty data
B)inconsistent data
C)nonintegrated data
D)a "too much data" problem
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
55
Data mining applications are used to accomplish all of the following tasks except ________.
A)perform what-if analysis
B)make predications
C)facilitate decision making
D)update the database
A)perform what-if analysis
B)make predications
C)facilitate decision making
D)update the database
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
56
Google's Bigtable was an early example of structured storage.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
57
Most data mining techniques are simple and easy to use.
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
58
We have obtained access to the company's operational data.In one record,we find that a customer's age has been recorded as "337." This is an example of ________.
A)dirty data
B)inconsistent data
C)nonintegrated data
D)a "wrong format" problem
A)dirty data
B)inconsistent data
C)nonintegrated data
D)a "wrong format" problem
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
59
Which of the following is not a reason that operational data are difficult to read?
A)Dirty data
B)Current data
C)Nonintegrated data
D)Missing values
A)Dirty data
B)Current data
C)Nonintegrated data
D)Missing values
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
60
Which of the following is (are)true about data mining applications?
A)They use sophisticated mathematical techniques.
B)They use sophisticated statistical techniques.
C)Their report delivery is more important than report delivery for reporting systems.
D)Both A and B are correct
A)They use sophisticated mathematical techniques.
B)They use sophisticated statistical techniques.
C)Their report delivery is more important than report delivery for reporting systems.
D)Both A and B are correct
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
61
An OLAP cube is called that because some products show OLAP displays on ________ axes.
A)one
B)two
C)three
D)four
A)one
B)two
C)three
D)four
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
62
OLAP stands for ________.
A)OnLine Analytical Processing
B)OffLine Analytical Processing
C)OnLine Analysis Process
D)Old,Lazy And Particular
A)OnLine Analytical Processing
B)OffLine Analytical Processing
C)OnLine Analysis Process
D)Old,Lazy And Particular
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
63
Dimensional databases are used to track historical data,and therefore must have a(n)________.
A)time dimension
B)customer dimension
C)sales dimension
D)order dimension
A)time dimension
B)customer dimension
C)sales dimension
D)order dimension
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
64
In OLAP,the data item of interest is called a ________.
A)level
B)dimension
C)measure
D)member
A)level
B)dimension
C)measure
D)member
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
65
We have obtained access to the company's operational data.We have been asked to produce a report with an item by item analysis of sales,but the only sales figure available is the total sale value for each order.This is an example of ________.
A)dirty data
B)inconsistent data
C)nonintegrated data
D)a "wrong format" problem
A)dirty data
B)inconsistent data
C)nonintegrated data
D)a "wrong format" problem
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
66
RFM analysis analyzes and ranks customers based on ________.
A)their purchasing patterns
B)their income status
C)their residential location
D)Both A and B are correct
A)their purchasing patterns
B)their income status
C)their residential location
D)Both A and B are correct
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
67
The term drill down means the user wants to ________.
A)summarize data
B)get older data
C)sort data
D)get more details
A)summarize data
B)get older data
C)sort data
D)get more details
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
68
The "M" in RFM analysis stands for ________.
A)money
B)mostly
C)modest
D)modern
A)money
B)mostly
C)modest
D)modern
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
69
Slowly changing dimensions are handled by a(n)________.
A)operational database
B)dimensional database
C)structured storage
D)object-relational data model
A)operational database
B)dimensional database
C)structured storage
D)object-relational data model
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
70
A data mart differs from a data warehouse in that ________.
A)it has a larger database
B)it deals with a particular component or functional area of the business
C)data mart users must have more data management expertise than data warehouse employees
D)it is updated more frequently by the data mart users
A)it has a larger database
B)it deals with a particular component or functional area of the business
C)data mart users must have more data management expertise than data warehouse employees
D)it is updated more frequently by the data mart users
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
71
A data warehouse database differs from an operational database because ________.
A)data warehouse data are not stored in tables
B)data warehouse databases do not have metadata
C)data warehouse data are often denormalized
D)Both B and C are correct
A)data warehouse data are not stored in tables
B)data warehouse databases do not have metadata
C)data warehouse data are often denormalized
D)Both B and C are correct
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
72
RFM scores commonly range from ________,with ________ being the "high" or "most desirable" (from the vendor's point of view)score.
A)0 to 5;0
B)0 to 5;5
C)1 to 5;1
D)1 to 5,5
A)0 to 5;0
B)0 to 5;5
C)1 to 5;1
D)1 to 5,5
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
73
Star schemas have a ________ at the center of the star.
A)fact table
B)dimension table
C)map table
D)reduce table
A)fact table
B)dimension table
C)map table
D)reduce table
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
74
A Business Intelligence (BI)reporting system ________.
A)creates meaningful information from disparate data sources
B)delivers information to users at the DBA's convenience
C)uses statistical procedures to predict future events
D)uses operational data
A)creates meaningful information from disparate data sources
B)delivers information to users at the DBA's convenience
C)uses statistical procedures to predict future events
D)uses operational data
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
75
Data warehouses use a(n)________.
A)operational database
B)dimensional database
C)structured storage
D)object-relational data model
A)operational database
B)dimensional database
C)structured storage
D)object-relational data model
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
76
In OLAP,the characteristic of a measure is called a ________.
A)level
B)dimension
C)slice
D)member
A)level
B)dimension
C)slice
D)member
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
77
Which of the following is not a component of a data warehouse?
A)Data extract,transform,and load (ETL)preparation programs
B)Data warehouse data
C)Operational database updates
D)Data warehouse metadata
A)Data extract,transform,and load (ETL)preparation programs
B)Data warehouse data
C)Operational database updates
D)Data warehouse metadata
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
78
We have done an RFM analysis on our customer data.John Smith has a score of {5 1 1}.This means that John ________.
A)has ordered recently,and orders a lot when he orders
B)hasn't ordered recently,but orders a lot when he orders
C)has ordered recently,but doesn't order a lot when he orders
D)hasn't ordered recently,and doesn't order a lot when he orders
A)has ordered recently,and orders a lot when he orders
B)hasn't ordered recently,but orders a lot when he orders
C)has ordered recently,but doesn't order a lot when he orders
D)hasn't ordered recently,and doesn't order a lot when he orders
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
79
The "R" in RFM analysis stands for ________.
A)rank
B)recent
C)relationship
D)readiness
A)rank
B)recent
C)relationship
D)readiness
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck
80
Snowflake schemas have normalized ________.
A)fact tables
B)dimension tables
C)map tables
D)reduce tables
A)fact tables
B)dimension tables
C)map tables
D)reduce tables
Unlock Deck
Unlock for access to all 108 flashcards in this deck.
Unlock Deck
k this deck