Deck 4: Data Mining

ملء الشاشة (f)
exit full mode
سؤال
In the cancer research case study,data mining algorithms that predict cancer survivability with high predictive power are good replacements for medical professionals.
استخدم زر المسافة أو
up arrow
down arrow
لقلب البطاقة.
سؤال
The number of users of free/open source data mining software now exceeds that of users of commercial software versions.
سؤال
Interval data is a type of numerical data.
سؤال
When a problem has many attributes that impact the classification of different patterns,decision trees may be a useful approach.
سؤال
Statistics and data mining both look for data sets that are as large as possible.
سؤال
Ratio data is a type of categorical data.
سؤال
When training a data mining model,the testing dataset is always larger than the training dataset.
سؤال
Data mining requires specialized data analysts to ask ad hoc questions and obtain answers quickly from the system.
سؤال
During classification in data mining,a false positive is an occurrence classified as true by the algorithm while being false in reality.
سؤال
The cost of data storage has plummeted recently,making data mining feasible for more firms.
سؤال
Using data mining on data about imports and exports can help to detect tax avoidance and money laundering.
سؤال
In the 2degrees case study,the main effectiveness of the new analytics system was in dissuading potential churners from leaving the company.
سؤال
In the Memphis Police Department case study,predictive analytics helped to identify the best schedule for officers in order to pay the least overtime.
سؤال
The entire focus of the predictive analytics system in the Infinity P&C case was on detecting and handling fraudulent claims for the company's benefit.
سؤال
In data mining,classification models help in prediction.
سؤال
Market basket analysis is a useful and entertaining way to explain data mining to a technologically less savvy audience,but it has little business significance.
سؤال
If using a mining analogy,"knowledge mining" would be a more appropriate term than "data mining."
سؤال
In the Cabela's case study,the SAS/Teradata solution enabled the direct marketer to better identify likely customers and market to them based mostly on external data sources.
سؤال
Data that is collected,stored,and analyzed in data mining is often private and personal.There is no way to maintain individuals' privacy other than being very careful about physical data security.
سؤال
Data mining can be very useful in detecting patterns such as credit card fraud,but is of little help in improving sales.
سؤال
In the Cabela's case study,what types of models helped the company understand the value of customers,using a five-point scale?

A) reporting and association models
B) simulation and geographical models
C) simulation and regression models
D) clustering and association models
سؤال
What does the robustness of a data mining method refer to?

A) its ability to predict the outcome of a previously unknown data set accurately
B) its speed of computation and computational costs in using the mode
C) its ability to construct a prediction model efficiently given a large amount of data
D) its ability to overcome noisy data to make somewhat accurate predictions
سؤال
In the Target case study,why did Target send a teen maternity ads?

A) Target's analytic model confused her with an older woman with a similar name.
B) Target was sending ads to all women in a particular neighborhood.
C) Target's analytic model suggested she was pregnant based on her buying habits.
D) Target was using a special promotion that targeted all teens in her geographical area.
سؤال
The data field "salary" can be best described as

A) nominal data.
B) interval data.
C) ordinal data.
D) ratio data.
سؤال
Understanding customers better has helped Amazon and others become more successful.The understanding comes primarily from

A) collecting data about customers and transactions.
B) developing a philosophy that is data analytics-centric.
C) analyzing the vast data amounts routinely collected.
D) asking the customers what they want.
سؤال
Third party providers of publicly available datasets protect the anonymity of the individuals in the data set primarily by

A) asking data users to use the data ethically.
B) leaving in identifiers (e.g., name), but changing other variables.
C) removing identifiers such as names and social security numbers.
D) letting individuals in the data know their data is being accessed.
سؤال
The data field "ethnic group" can be best described as

A) nominal data.
B) interval data.
C) ordinal data.
D) ratio data.
سؤال
Prediction problems where the variables have numeric values are most accurately defined as

A) classifications.
B) regressions.
C) associations.
D) computations.
سؤال
What does the scalability of a data mining method refer to?

A) its ability to predict the outcome of a previously unknown data set accurately
B) its speed of computation and computational costs in using the mode
C) its ability to construct a prediction model efficiently given a large amount of data
D) its ability to overcome noisy data to make somewhat accurate predictions
سؤال
All of the following statements about data mining are true EXCEPT

A) the process aspect means that data mining should be a one-step process to results.
B) the novel aspect means that previously unknown patterns are discovered.
C) the potentially useful aspect means that results should lead to some business benefit.
D) the valid aspect means that the discovered patterns should hold true on new data.
سؤال
What is the main reason parallel processing is sometimes used for data mining?

A) because the hardware exists in most organizations and it is available to use
B) because the most of the algorithms used for data mining require it
C) because of the massive data amounts and search efforts involved
D) because any strategic application requires parallel processing
سؤال
Which broad area of data mining applications partitions a collection of objects into natural groupings with similar features?

A) associations
B) visualization
C) classification
D) clustering
سؤال
In data mining,finding an affinity of two products to be commonly together in a shopping cart is known as

A) association rule mining.
B) cluster analysis.
C) decision trees.
D) artificial neural networks.
سؤال
Which of the following is a data mining myth?

A) Data mining is a multistep process that requires deliberate, proactive design and use.
B) Data mining requires a separate, dedicated database.
C) The current state-of-the-art is ready to go for almost any business.
D) Newer Web-based tools enable managers of all educational levels to do data mining.
سؤال
In estimating the accuracy of data mining (or other)classification models,the true positive rate is

A) the ratio of correctly classified positives divided by the total positive count.
B) the ratio of correctly classified negatives divided by the total negative count.
C) the ratio of correctly classified positives divided by the sum of correctly classified positives and incorrectly classified positives.
D) the ratio of correctly classified positives divided by the sum of correctly classified positives and incorrectly classified negatives.
سؤال
The data mining algorithm type used for classification somewhat resembling the biological neural networks in the human brain is

A) association rule mining.
B) cluster analysis.
C) decision trees.
D) artificial neural networks.
سؤال
Identifying and preventing incorrect claim payments and fraudulent activities falls under which type of data mining applications?

A) insurance
B) retailing and logistics
C) customer relationship management
D) computer hardware and software
سؤال
Which data mining process/methodology is thought to be the most comprehensive,according to kdnuggets.com rankings?

A) SEMMA
B) proprietary organizational methodologies
C) KDD Process
D) CRISP-DM
سؤال
All of the following statements about data mining are true EXCEPT

A) understanding the business goal is critical.
B) understanding the data, e.g., the relevant variables, is critical to success.
C) building the model takes the most time and effort.
D) data is typically preprocessed and/or cleaned before use.
سؤال
Which broad area of data mining applications analyzes data,forming rules to distinguish between defined classes?

A) associations
B) visualization
C) classification
D) clustering
سؤال
The basic idea behind a ________ is that it recursively divides a training set until each division consists entirely or primarily of examples from one class.
سؤال
The data mining in cancer research case study explains that data mining methods are capable of extracting patterns and ________ hidden deep in large and complex medical databases.
سؤال
In the Memphis Police Department case study,shortly after all precincts embraced Blue CRUSH,________ became one of the most potent weapons in the Memphis police department's crime-fighting arsenal.
سؤال
In the opening vignette,Cabela's uses SAS data mining tools to create ________ models to optimize customer selection for all customer contacts.
سؤال
In the terrorist funding case study,an observed price ________ may be related to income tax avoidance/evasion,money laundering,or terrorist financing.
سؤال
While prediction is largely experience and opinion based,________ is data and model based.
سؤال
Because of its successful application to retail business problems,association rule mining is commonly called ________.
سؤال
In ________,a classification method,the complete data set is randomly split into mutually exclusive subsets of approximately equal size and tested multiple times on each left-out subset,using the others as a training set.
سؤال
Knowledge extraction,pattern analysis,data archaeology,information harvesting,pattern searching,and data dredging are all alternative names for ________.
سؤال
Fayyad et al.(1996)defined ________ in databases as a process of using data mining methods to find useful information and patterns in the data.
سؤال
Patterns have been manually ________ from data by humans for centuries,but the increasing volume of data in modern times has created a need for more automatic approaches.
سؤال
Whereas ________ starts with a well-defined proposition and hypothesis,data mining starts with a loosely defined discovery statement.
سؤال
There has been an increase in data mining to deal with global competition and customers' more sophisticated ________ and wants.
سؤال
Data preparation,the third step in the CRISP-DM data mining process,is more commonly known as ________.
سؤال
Customer ________ management extends traditional marketing by creating one-on-one relationships with customers.
سؤال
The ________ is the most commonly used algorithm to discover association rules.Given a set of itemsets,the algorithm attempts to find subsets that are common to at least a minimum number of the itemsets.
سؤال
Data are often buried deep within very large ________,which sometimes contain data from several years.
سؤال
One way to accomplish privacy and protection of individuals' rights when data mining is by ________ of the customer records prior to applying data mining applications,so that the records cannot be traced to an individual.
سؤال
As described in the 2degrees case study,a common problem in the mobile telecommunications industry is defined by the term ________,which means customers leaving.
سؤال
________ represent the labels of multiple classes used to divide a variable into specific groups,examples of which include race,sex,age group,and educational level.
سؤال
List five reasons for the growing popularity of data mining in the business world.
سؤال
List and briefly describe the six steps of the CRISP-DM data mining process.
سؤال
Describe the role of the simple split in estimating the accuracy of classification models.
سؤال
Briefly describe five techniques (or algorithms)that are used for classification modeling.
سؤال
List four myths associated with data mining.
سؤال
In the data mining in Hollywood case study,how successful were the models in predicting the success or failure of a Hollywood movie?
سؤال
Describe cluster analysis and some of its applications.
سؤال
List six common data mining mistakes.
سؤال
What are the differences between nominal,ordinal,interval and ratio data? Give examples.
سؤال
In lessons learned from the Target case,what legal warnings would you give another retailer using data mining for marketing?
فتح الحزمة
قم بالتسجيل لفتح البطاقات في هذه المجموعة!
Unlock Deck
Unlock Deck
1/70
auto play flashcards
العب
simple tutorial
ملء الشاشة (f)
exit full mode
Deck 4: Data Mining
1
In the cancer research case study,data mining algorithms that predict cancer survivability with high predictive power are good replacements for medical professionals.
False
2
The number of users of free/open source data mining software now exceeds that of users of commercial software versions.
True
3
Interval data is a type of numerical data.
True
4
When a problem has many attributes that impact the classification of different patterns,decision trees may be a useful approach.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
5
Statistics and data mining both look for data sets that are as large as possible.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
6
Ratio data is a type of categorical data.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
7
When training a data mining model,the testing dataset is always larger than the training dataset.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
8
Data mining requires specialized data analysts to ask ad hoc questions and obtain answers quickly from the system.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
9
During classification in data mining,a false positive is an occurrence classified as true by the algorithm while being false in reality.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
10
The cost of data storage has plummeted recently,making data mining feasible for more firms.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
11
Using data mining on data about imports and exports can help to detect tax avoidance and money laundering.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
12
In the 2degrees case study,the main effectiveness of the new analytics system was in dissuading potential churners from leaving the company.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
13
In the Memphis Police Department case study,predictive analytics helped to identify the best schedule for officers in order to pay the least overtime.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
14
The entire focus of the predictive analytics system in the Infinity P&C case was on detecting and handling fraudulent claims for the company's benefit.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
15
In data mining,classification models help in prediction.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
16
Market basket analysis is a useful and entertaining way to explain data mining to a technologically less savvy audience,but it has little business significance.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
17
If using a mining analogy,"knowledge mining" would be a more appropriate term than "data mining."
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
18
In the Cabela's case study,the SAS/Teradata solution enabled the direct marketer to better identify likely customers and market to them based mostly on external data sources.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
19
Data that is collected,stored,and analyzed in data mining is often private and personal.There is no way to maintain individuals' privacy other than being very careful about physical data security.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
20
Data mining can be very useful in detecting patterns such as credit card fraud,but is of little help in improving sales.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
21
In the Cabela's case study,what types of models helped the company understand the value of customers,using a five-point scale?

A) reporting and association models
B) simulation and geographical models
C) simulation and regression models
D) clustering and association models
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
22
What does the robustness of a data mining method refer to?

A) its ability to predict the outcome of a previously unknown data set accurately
B) its speed of computation and computational costs in using the mode
C) its ability to construct a prediction model efficiently given a large amount of data
D) its ability to overcome noisy data to make somewhat accurate predictions
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
23
In the Target case study,why did Target send a teen maternity ads?

A) Target's analytic model confused her with an older woman with a similar name.
B) Target was sending ads to all women in a particular neighborhood.
C) Target's analytic model suggested she was pregnant based on her buying habits.
D) Target was using a special promotion that targeted all teens in her geographical area.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
24
The data field "salary" can be best described as

A) nominal data.
B) interval data.
C) ordinal data.
D) ratio data.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
25
Understanding customers better has helped Amazon and others become more successful.The understanding comes primarily from

A) collecting data about customers and transactions.
B) developing a philosophy that is data analytics-centric.
C) analyzing the vast data amounts routinely collected.
D) asking the customers what they want.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
26
Third party providers of publicly available datasets protect the anonymity of the individuals in the data set primarily by

A) asking data users to use the data ethically.
B) leaving in identifiers (e.g., name), but changing other variables.
C) removing identifiers such as names and social security numbers.
D) letting individuals in the data know their data is being accessed.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
27
The data field "ethnic group" can be best described as

A) nominal data.
B) interval data.
C) ordinal data.
D) ratio data.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
28
Prediction problems where the variables have numeric values are most accurately defined as

A) classifications.
B) regressions.
C) associations.
D) computations.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
29
What does the scalability of a data mining method refer to?

A) its ability to predict the outcome of a previously unknown data set accurately
B) its speed of computation and computational costs in using the mode
C) its ability to construct a prediction model efficiently given a large amount of data
D) its ability to overcome noisy data to make somewhat accurate predictions
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
30
All of the following statements about data mining are true EXCEPT

A) the process aspect means that data mining should be a one-step process to results.
B) the novel aspect means that previously unknown patterns are discovered.
C) the potentially useful aspect means that results should lead to some business benefit.
D) the valid aspect means that the discovered patterns should hold true on new data.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
31
What is the main reason parallel processing is sometimes used for data mining?

A) because the hardware exists in most organizations and it is available to use
B) because the most of the algorithms used for data mining require it
C) because of the massive data amounts and search efforts involved
D) because any strategic application requires parallel processing
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
32
Which broad area of data mining applications partitions a collection of objects into natural groupings with similar features?

A) associations
B) visualization
C) classification
D) clustering
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
33
In data mining,finding an affinity of two products to be commonly together in a shopping cart is known as

A) association rule mining.
B) cluster analysis.
C) decision trees.
D) artificial neural networks.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
34
Which of the following is a data mining myth?

A) Data mining is a multistep process that requires deliberate, proactive design and use.
B) Data mining requires a separate, dedicated database.
C) The current state-of-the-art is ready to go for almost any business.
D) Newer Web-based tools enable managers of all educational levels to do data mining.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
35
In estimating the accuracy of data mining (or other)classification models,the true positive rate is

A) the ratio of correctly classified positives divided by the total positive count.
B) the ratio of correctly classified negatives divided by the total negative count.
C) the ratio of correctly classified positives divided by the sum of correctly classified positives and incorrectly classified positives.
D) the ratio of correctly classified positives divided by the sum of correctly classified positives and incorrectly classified negatives.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
36
The data mining algorithm type used for classification somewhat resembling the biological neural networks in the human brain is

A) association rule mining.
B) cluster analysis.
C) decision trees.
D) artificial neural networks.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
37
Identifying and preventing incorrect claim payments and fraudulent activities falls under which type of data mining applications?

A) insurance
B) retailing and logistics
C) customer relationship management
D) computer hardware and software
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
38
Which data mining process/methodology is thought to be the most comprehensive,according to kdnuggets.com rankings?

A) SEMMA
B) proprietary organizational methodologies
C) KDD Process
D) CRISP-DM
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
39
All of the following statements about data mining are true EXCEPT

A) understanding the business goal is critical.
B) understanding the data, e.g., the relevant variables, is critical to success.
C) building the model takes the most time and effort.
D) data is typically preprocessed and/or cleaned before use.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
40
Which broad area of data mining applications analyzes data,forming rules to distinguish between defined classes?

A) associations
B) visualization
C) classification
D) clustering
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
41
The basic idea behind a ________ is that it recursively divides a training set until each division consists entirely or primarily of examples from one class.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
42
The data mining in cancer research case study explains that data mining methods are capable of extracting patterns and ________ hidden deep in large and complex medical databases.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
43
In the Memphis Police Department case study,shortly after all precincts embraced Blue CRUSH,________ became one of the most potent weapons in the Memphis police department's crime-fighting arsenal.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
44
In the opening vignette,Cabela's uses SAS data mining tools to create ________ models to optimize customer selection for all customer contacts.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
45
In the terrorist funding case study,an observed price ________ may be related to income tax avoidance/evasion,money laundering,or terrorist financing.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
46
While prediction is largely experience and opinion based,________ is data and model based.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
47
Because of its successful application to retail business problems,association rule mining is commonly called ________.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
48
In ________,a classification method,the complete data set is randomly split into mutually exclusive subsets of approximately equal size and tested multiple times on each left-out subset,using the others as a training set.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
49
Knowledge extraction,pattern analysis,data archaeology,information harvesting,pattern searching,and data dredging are all alternative names for ________.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
50
Fayyad et al.(1996)defined ________ in databases as a process of using data mining methods to find useful information and patterns in the data.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
51
Patterns have been manually ________ from data by humans for centuries,but the increasing volume of data in modern times has created a need for more automatic approaches.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
52
Whereas ________ starts with a well-defined proposition and hypothesis,data mining starts with a loosely defined discovery statement.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
53
There has been an increase in data mining to deal with global competition and customers' more sophisticated ________ and wants.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
54
Data preparation,the third step in the CRISP-DM data mining process,is more commonly known as ________.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
55
Customer ________ management extends traditional marketing by creating one-on-one relationships with customers.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
56
The ________ is the most commonly used algorithm to discover association rules.Given a set of itemsets,the algorithm attempts to find subsets that are common to at least a minimum number of the itemsets.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
57
Data are often buried deep within very large ________,which sometimes contain data from several years.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
58
One way to accomplish privacy and protection of individuals' rights when data mining is by ________ of the customer records prior to applying data mining applications,so that the records cannot be traced to an individual.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
59
As described in the 2degrees case study,a common problem in the mobile telecommunications industry is defined by the term ________,which means customers leaving.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
60
________ represent the labels of multiple classes used to divide a variable into specific groups,examples of which include race,sex,age group,and educational level.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
61
List five reasons for the growing popularity of data mining in the business world.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
62
List and briefly describe the six steps of the CRISP-DM data mining process.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
63
Describe the role of the simple split in estimating the accuracy of classification models.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
64
Briefly describe five techniques (or algorithms)that are used for classification modeling.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
65
List four myths associated with data mining.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
66
In the data mining in Hollywood case study,how successful were the models in predicting the success or failure of a Hollywood movie?
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
67
Describe cluster analysis and some of its applications.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
68
List six common data mining mistakes.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
69
What are the differences between nominal,ordinal,interval and ratio data? Give examples.
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
70
In lessons learned from the Target case,what legal warnings would you give another retailer using data mining for marketing?
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.
فتح الحزمة
k this deck
locked card icon
فتح الحزمة
افتح القفل للوصول البطاقات البالغ عددها 70 في هذه المجموعة.