Deck 17: Data Mining

1
Clustering is considered a supervised data mining technique.
False
2
A facts table has:

A) few rows and many columns
B) many rows and many columns
C) many rows and few columns
D) few rows and few columns
C
3
Bridget has partitioned data into two subsets. The original file contains 300,000 observations. The subset she is currently working with has 60,000 observations. Which subset is she most likely to be using?

A) The training set
B) The original set
C) The testing set
D) The prediction set
C
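
The arithmetic behind this answer can be sketched in a few lines. The row counts come from the question; the 50% cutoff and the variable names are illustrative assumptions, based only on the convention that the training set is the larger partition.

```python
# Figures taken from question 3.
original_rows = 300_000
subset_rows = 60_000

# Fraction of the original file held by the subset Bridget is using.
share = subset_rows / original_rows

# Assumption: the training set is the larger partition, so a subset
# holding less than half of the records is read as the testing set.
likely_subset = "testing" if share < 0.5 else "training"

print(f"{share:.0%} of the records -> likely the {likely_subset} set")
```

A 20% share is consistent with a smaller hold-out partition, which matches answer C.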
4
Create a pivot table that illustrates something meaningful about the variables in the accompanying table.
5
If the regression coefficient estimate from a logistic regression is positive, the probability of the dependent variable taking on a value of 1:

A) decreases
B) approaches zero
C) increases
D) remains constant
6
A data mart is typically smaller than a data warehouse.
7
The predicted value from a logistic regression will be:

A) between 0 and 1
B) between -1 and 1
C) less than 0
D) greater than 1
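
As background for the logistic regression questions in this deck, here is a minimal sketch of the standard logistic function, which turns a linear score into a predicted probability. The function name `logistic` and the sample scores are hypothetical and do not come from the deck.

```python
import math


def logistic(score: float) -> float:
    """Map a linear score (b0 + b1*x1 + ...) to a predicted probability."""
    return 1.0 / (1.0 + math.exp(-score))


# Evaluate a handful of illustrative scores, from strongly negative
# to strongly positive.
for s in (-4.0, -1.0, 0.0, 1.0, 4.0):
    print(f"score={s:+.1f}  P(y=1)={logistic(s):.3f}")
```

The output rises with the score and, for moderate scores like these, stays strictly between 0 and 1. This is why a larger score corresponds to a higher predicted probability of category 1.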
8
Segmentation is also known as clustering, and involves trying to group entities into similar clusters.
9
If the estimate on the female variable is positive, what does this indicate about credit card usage?
10
What does the pivot table indicate about spending habits?
11
Suppose the odds of Team A winning are 5 to 1. Then, the odds ratio is:

A) 5/1
B) 1/5
C) 6/1
D) 1/6
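
For reference, the relationship between odds and probability can be sketched as follows. This assumes "5 to 1" means odds in favor (five expected wins for every loss), which is an interpretation and not something the deck states. The helper names are hypothetical.

```python
from fractions import Fraction


def odds_to_probability(wins: int, losses: int) -> Fraction:
    """Convert odds of `wins` to `losses` in favor into a probability."""
    return Fraction(wins, wins + losses)


def probability_to_odds(p: Fraction) -> Fraction:
    """Convert a probability back into odds in favor, p / (1 - p)."""
    return p / (1 - p)


# Assumption: "5 to 1" is read as odds in favor of Team A.
p_win = odds_to_probability(5, 1)
print(p_win, probability_to_odds(p_win))
```

Exact fractions are used here so that the round trip between odds and probability carries no floating-point error.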
12
Data mining is used to examine known, expected patterns and relationships among variables.
13
The higher the "score" for a particular member in logistic regression, the:

A) higher the likelihood that member is in category 1
B) lower the likelihood that member is in category 1
C) higher the likelihood that member is in category 0
D) higher the likelihood that member is not in a category

Amount Spent   Female   Credit Card
99             0        0
50             0        0
75             0        1
600            1        0
325            1        0
97             0        1
540            1        0
137            1        0
22             1        1
94             1        1
111            1        1
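
One possible pivot of this table is sketched below, using only the Python standard library. The layout (Female by Credit Card) and the use of the mean as the aggregate are assumptions. The deck leaves the design open, so this is one illustration and not the expected answer.

```python
from collections import defaultdict

# (amount_spent, female, credit_card) rows copied from the table above.
rows = [
    (99, 0, 0), (50, 0, 0), (75, 0, 1), (600, 1, 0),
    (325, 1, 0), (97, 0, 1), (540, 1, 0), (137, 1, 0),
    (22, 1, 1), (94, 1, 1), (111, 1, 1),
]

# Accumulate [running total, row count] for each (female, credit_card) cell.
totals = defaultdict(lambda: [0, 0])
for amount, female, card in rows:
    cell = totals[(female, card)]
    cell[0] += amount
    cell[1] += 1

# Pivot: mean amount spent per (female, credit_card) combination.
pivot = {key: total / count for key, (total, count) in totals.items()}

for (female, card), mean in sorted(pivot.items()):
    print(f"Female={female}  Credit Card={card}  mean spent={mean:.2f}")
```

With only eleven rows, each cell holds two to four observations, so any pattern in the means should be read as suggestive at most.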
14
In a facts table, a supermarket database is likely to have which item listed in rows?

A) The number of units sold
B) Revenue generated from a particular unit
C) The department in which the unit was purchased
D) The individual items purchased
15
Which methodology is used to group products that customers purchase together?

A) Market basket analysis
B) Prediction
C) Classification analysis
D) Forecasting
16
Megan is examining the likelihood of people riding the subway. The dependent variable takes on the value of 1 if the individual rides the subway and 0 otherwise. Therefore, she could use logistic regression to examine this question.
17
Mya is investigating the factors that impact soda consumption. She examines a host of variables that help explain the amount consumed. Which type of data mining methodology is she most likely to use?

A) Market basket analysis
B) Prediction
C) Classification analysis
D) Forecasting
18
The testing set in data partitioning is:

A) The first subset of data, which usually contains 70% of the records
B) The second subset of data, which usually contains less than 70% of the records
C) The initial dataset from which subsets are created
D) The first subset of data, which usually contains 30% of the records