Services
Discover
Homeschooling
Ask a Question
Log in
Sign up
Filters
Done
Question type:
Essay
Multiple Choice
Short Answer
True False
Matching
Topic
Statistics
Study Set
Business Statistics
Quiz 5: Predictive Analytics I: Trees, K-Nearest Neighbors, Naive Bayes,
Path 4
Access For Free
Share
All types
Filters
Study Flashcards
Practice Exam
Learn
Question 1
True/False
For a sufficiently large value of k, the k-nearest neighbors classification approach will always result in a lower misclassification rate than the simple branch splitting approach of the classification tree.
Question 2
True/False
A quantitative variable which can have only the values of zero (0) or one (1) and which is used to represent a qualitative variable is known as a (1, 0) dummy variable.
Question 3
True/False
To predict a qualitative, or categorical, response variable we could use a classification tree.
Question 4
True/False
The optimal value of k to use for the k-nearest neighbors approach to predicting a quantitative response variable is the value of k that minimizes RMSE (the square root of the mean of the squared deviations of the predicted values from the observed values).
Question 5
True/False
To predict a quantitative response variable, we could use a regression tree.
Question 6
True/False
Naive Bayes' Theorem assumes that the events that the predictor variables take on the values x
1
, x
2
, …, x
k
are highly correlated for observations that fall into the particular category and statistically independent for observations that do not fall into the particular category.
Question 7
True/False
Because different trust levels may be appropriate for different techniques, ensemble estimates may use a weighted average of the different results given by the different techniques.
Question 8
True/False
The process of assigning items to prespecified categories is known as classification.
Question 9
True/False
The confusion matrix for a classification tree shows which combinations of predictor variables cannot be used to predict the response variable.
Question 10
True/False
The confusion matrix shows the number of observed response variables which are classified correctly.
Question 11
True/False
One approach to avoid overfitting a classification tree is to use a validation data set to identify valid splits and a training data set to train the classification tree on when to stop making splits.