Deck 4: Regression Models

Full screen (f)
exit full mode
Question
The null hypothesis in the F-test is that there is a linear relationship between the X and Y variables.
Use Space or
up arrow
down arrow
to flip the card.
Question
There is no relationship between variables unless the data points lie in a straight line.
Question
In regression, a dependent variable is sometimes called a predictor variable.
Question
The coefficient of determination gives the proportion of the variability in the dependent variable that is explained by the regression equation.
Question
The correlation coefficient has values between −1 and +1.
Question
Both the p-value for the F-test and r2 can be interpreted the same with multiple regression models as they are with simple linear models.
Question
Errors are also called residuals.
Question
The regression line minimizes the sum of the squared errors.
Question
When the significance level is small enough in the F-test, we can reject the null hypothesis that there is no linear relationship.
Question
An F-test is used to determine if there is a relationship between the dependent and independent variables.
Question
The coefficients of each independent variable in a multiple regression model represent slopes.
Question
If the significance level for the F-test is high enough, there is a relationship between the dependent and independent variables.
Question
For statistical tests of significance about the coefficients, the null hypothesis is that the slope is 1.
Question
The multiple regression model includes several dependent variables.
Question
Estimates of the slope, intercept, and error of a regression model are found from sample data.
Question
Summing the error values in a regression model is misleading because negative errors cancel out positive errors.
Question
In regression, there is random error that can be predicted.
Question
In regression, an independent variable is sometimes called a response variable.
Question
Often, a plot of the residuals will highlight any glaring violations of the assumptions.
Question
If the assumptions of regression have been met, errors plotted against the independent variable will typically show patterns.
Question
A dummy variable can be assigned up to three values.
Question
If multicollinearity exists, then individual interpretation of the variables is questionable, but the overall model is still good for prediction purposes.
Question
The random error in a regression equation

A)is the predicted error.
B)includes both positive and negative terms.
C)will sum to a large positive number.
D)is used to estimate the accuracy of the slope.
Question
A high correlation always implies that one variable is causing a change in the other variable.
Question
Multicollinearity exists when a variable is correlated to other variables.
Question
The adjusted r2 will always increase as additional variables are added to the model.
Question
The best model is a statistically significant model with a high r-square and few variables.
Question
Which of these statements is best?

A)In regression, an independent variable is sometimes called a response variable.
B)A scatter diagram is a graphical depiction of the relationship between the dependent and independent variables.
C)In a scatter diagram, the dependent variable is typically plotted on the horizontal axis.
D)There is no relationship between variables unless the data points lie in a straight line.
Question
Dummy variables for regression analysis can take on a value of either -1 or +1.
Question
Which of the following statements is true regarding a scatter diagram?

A)It provides very little information about the relationship between the regression variables.
B)It is a plot of the independent and dependent variables.
C)It is a line chart of the independent and dependent variables.
D)It has a value between -1 and +1.
Question
The value of r2 can never decrease when more variables are added to the model.
Question
The multiple regression model includes several intercept terms.
Question
In regression, a binary variable is also called an indicator variable.
Question
Transformations may be used when nonlinear relationships exist between variables.
Question
The number of dummy variables must equal 1 less than the number of categories of the qualitative variable.
Question
A variable should be added to the model regardless of the impact (increase or decrease)on the adjusted r2 value.
Question
Another name for a dummy variable is a binary variable.
Question
Which of these statements is best?

A)In regression, an independent variable is sometimes called a response variable.
B)One purpose of regression is to understand the relationship between variables.
C)One purpose of regression is to predict the value of one variable based on the standard deviation of that variable.
D)The variable to be predicted is the independent variable.
Question
A reference to the criterion used to select the regression line, to minimize the squared distances between the estimated straight line and the observed values is called

A)Mean square error.
B)Sum of Squares.
C)Maximum likelihood.
D)Least Squares.
Question
The multiple regression model includes multiple slope coefficients.
Question
Which of the following conditions can be detected from residual analysis?

A)nonconstant variance
B)multicollinearity
C)dummy variables
D)presence of transformations
Question
The problem of nonconstant error variance is detected in residual analysis by which of the following?

A)a cone pattern
B)an arched pattern
C)a random pattern
D)an increasing pattern
Question
Which of these statements is best?

A)The regression model assumes the error terms are dependent.
B)The regression model assumes the errors are normally distributed.
C)The errors in a regression model are assumed to have an increasing mean.
D)The errors in a regression model are assumed to have zero variance.
Question
The correlation coefficient resulting from a particular regression analysis was 0.25.What was the coefficient of determination?

A)0.5
B)-0.5
C)0.0625
D)There is insufficient information to answer the question.
Question
The coefficient of determination resulting from a particular regression analysis was 0.85.What was the slope of the regression line?

A)0.85
B)-0.85
C)0.922
D)There is insufficient information to answer the question.
Question
Which of the following statements is true about r2?

A)It is also called the coefficient of correlation.
B)It is also called the coefficient of determination.
C)It represents the percent of variation in X that is explained by Y.
D)It represents the percent of variation in the error that is explained by Y.
Question
Which of the following is not an assumption of the regression model?

A)The errors are independent.
B)The errors should have a standard deviation equal to one.
C)The errors are normally distributed.
D)The errors have constant variance.
Question
Which of these statements is best?

A)The SST measures the total variability in the dependent variable about the regression line.
B)The SSE measures the total variability in the independent variable about the regression line.
C)The SSR indicates how much of the total variability in the dependent variable is explained by the regression model.
D)The coefficient of determination takes on values between -1 and + 1.
Question
A dummy variable is also called a(n)

A)indicator variable.
B)dependent variable.
C)continuous variable.
D)response variable.
Question
In a good regression model the residual plot shows

A)a cone pattern.
B)an arched pattern.
C)a random pattern.
D)an increasing pattern.
Question
The diagram below illustrates data with a <strong>The diagram below illustrates data with a  </strong> A)negative correlation coefficient. B)zero correlation coefficient. C)positive correlation coefficient. D)correlation coefficient equal to +1. <div style=padding-top: 35px>

A)negative correlation coefficient.
B)zero correlation coefficient.
C)positive correlation coefficient.
D)correlation coefficient equal to +1.
Question
The coefficient of determination resulting from a particular regression analysis was 0.85.What was the correlation coefficient, assuming a positive linear relationship?

A)0.5
B)-0.5
C)0.922
D)There is insufficient information to answer the question.
Question
The sum of squared error (SSE)is

A)a measure of the total variation in Y about the mean.
B)a measure of the total variation in X about the mean.
C)a measure in the variation of Y about the regression line.
D)a measure in the variation of X about the regression line.
Question
Which of the following is an assumption of the regression model?

A)The errors are independent.
B)The errors are not normally distributed.
C)The errors have a standard deviation of zero.
D)The errors have an irregular variance.
Question
Which of the following statements is not true about regression models?

A)Estimates of the slope are found from sample data.
B)The regression line minimizes the sum of the squared errors.
C)The error is found by subtracting the actual data value from the predicted data value.
D)The dependent variable is the explanatory variable.
Question
Which of the following equalities is correct?

A)SST = SSR + SSE
B)SSR = SST + SSE
C)SSE = SSR + SST
D)SST = SSC + SSR
Question
Which of these statements is best?

A)If the assumptions of regression have been met, errors plotted against the independent variable will typically show patterns.
B)The error standard deviation is estimated by MSE.
C)Often, a plot of the residuals will highlight any glaring violations of the assumptions.
D)The standard error of the estimate is also called the variance of the regression.
Question
Which of these statements is best?

A)In any regression model, there is an implicit assumption that there is no preexisting relationship between the variables.
B)In regression, there is random error that can be predicted.
C)Estimates of the slope, intercept, and error of a regression model are found from sample data.
D)Error is the difference in the actual value and the predicted value.
Question
If computing a causal linear regression model of Y = a + bX and the resultant r2 is very near zero, then one would be able to conclude that

A)Y = a + bX is a good forecasting method.
B)Y = a + bX is not a good forecasting method.
C)a multiple linear regression model is a good forecasting method for the data.
D)a multiple linear regression model is not a good forecasting method for the data.
Question
The problem of a nonlinear relationship is detected in residual analysis by which of the following?

A)a cone pattern
B)an arched pattern
C)a random pattern
D)an increasing pattern
Question
When an independent variable is correlated with one other independent variable, the variables are said to be

A)collinear.
B)pairwise.
C)independent.
D)mutually exclusive.
Question
Which of the following is true regarding a regression model with multicollinearity, a high r2 value, and a low F-test significance level?

A)The model is not a good prediction model.
B)The high value of r2 is due to the multicollinearity.
C)The interpretation of the coefficients is valuable.
D)The significance level tests for the coefficients are not valid.
Question
If a qualitative variable has three categories, how many dummy variables are needed?

A)0
B)1
C)2
D)3
E)4
Question
A prediction equation for starting salaries (in $1,000s)and SAT scores was performed using simple linear regression.In the regression printout shown below, what can be said about the level of significance for the overall model? SUMMARY OUTPUT
<strong>A prediction equation for starting salaries (in $1,000s)and SAT scores was performed using simple linear regression.In the regression printout shown below, what can be said about the level of significance for the overall model? SUMMARY OUTPUT   ANOVA    </strong> A)SAT is not a good predictor for starting salary. B)The significance level for the intercept indicates the model is not valid. C)The significance level for SAT indicates the slope is equal to zero. D)The significance level for SAT indicates the slope is not equal to zero. <div style=padding-top: 35px> ANOVA
<strong>A prediction equation for starting salaries (in $1,000s)and SAT scores was performed using simple linear regression.In the regression printout shown below, what can be said about the level of significance for the overall model? SUMMARY OUTPUT   ANOVA    </strong> A)SAT is not a good predictor for starting salary. B)The significance level for the intercept indicates the model is not valid. C)The significance level for SAT indicates the slope is equal to zero. D)The significance level for SAT indicates the slope is not equal to zero. <div style=padding-top: 35px> <strong>A prediction equation for starting salaries (in $1,000s)and SAT scores was performed using simple linear regression.In the regression printout shown below, what can be said about the level of significance for the overall model? SUMMARY OUTPUT   ANOVA    </strong> A)SAT is not a good predictor for starting salary. B)The significance level for the intercept indicates the model is not valid. C)The significance level for SAT indicates the slope is equal to zero. D)The significance level for SAT indicates the slope is not equal to zero. <div style=padding-top: 35px>

A)SAT is not a good predictor for starting salary.
B)The significance level for the intercept indicates the model is not valid.
C)The significance level for SAT indicates the slope is equal to zero.
D)The significance level for SAT indicates the slope is not equal to zero.
Question
The sum of the squares total (SST)

A)measures the total variability in Y about the mean.
B)measures the total variability in X about the mean.
C)measures the variability in Y about the regression line.
D)measures the variability in X about the regression line.
Question
A healthcare executive is using regression to predict total revenues.She is deciding whether or not to include both patient length of stay and insurance type in her model.Her first regression model only included patient length of stay.The resulting r2 was .83, with an adjusted r2 of .82 and her level of significance was .003.In the second model, she included both patient length of stay and insurance type.The r2 was .84 and the adjusted r2 was .80 for the second model and the level of significance did not change.Which of the following statements is true?

A)The second model is a better model.
B)The first model is a better model.
C)The r2 increased when additional variables were added because these variables significantly contribute to the prediction of total revenues.
D)The adjusted r2 always increases when additional variables are added to the model.
Question
Suppose that you believe that a cubic relationship exists between the independent variable (of time)and the dependent variable Y.Which of the following would represent a valid linear regression model?

A)Y = b0 + b1 X, where X = time3
B)Y = b0 + b1 X3, where X = time
C)Y = b0 + 3b1 X, where X = time3
D)Y = b0 + 3b1 X, where X = time
Question
Which of these statements about multiple regression is best?

A)The p-value for the F-test may not be interpreted the same with multiple regression models as they are with simple linear models.
B)The multiple regression model includes several dependent variables.
C)The multiple regression model includes several intercept terms.
D)The multiple regression model includes multiple slope coefficients.
Question
In the model Y = β0 + β1X1 + β2X2 + ε, which of these parameters represents an independent variable?

A)the X1
B)the Y
C)the β1
D)the ε
Question
An automated process to systematically add or delete independent variables from a regression model is called

A)nonlinear regression.
B)linear regression.
C)residual analysis.
D)stepwise regression.
Question
Which of the following statements provides the best guidance for model building?

A)If the value of r2 increases as more variables are added to the model, the variables should remain in the model, regardless of the magnitude of increase.
B)If the value of the adjusted r2 increases as more variables are added to the model, the variables should remain in the model.
C)If the value of r2 increases as more variables are added to the model, the variables should not remain in the model, regardless of the magnitude of the increase.
D)If the value of the adjusted r2 increases as more variables are added to the model, the variables should not remain in the model.
Question
The condition of an independent variable being correlated to one or more other independent variables is referred to as

A)multicollinearity.
B)statistical significance.
C)linearity.
D)nonlinearity.
Question
The mean square error (MSE)is

A)denoted by s.
B)denoted by k.
C)the SSE divided by the number of observations.
D)the SSE divided by the degrees of freedom.
Question
Which of the following statements is false concerning the hypothesis testing procedure for a regression model?

A)The F-test statistic is used.
B)The null hypothesis is that the true slope coefficient is equal to zero.
C)The null hypothesis is rejected if the adjusted r2 is above the critical value.
D)An α level must be selected.
Question
A prediction equation for sales and payroll was performed using simple linear regression.In the regression printout shown below, which of the following statements is not true? SUMMARY OUTPUT
<strong>A prediction equation for sales and payroll was performed using simple linear regression.In the regression printout shown below, which of the following statements is not true? SUMMARY OUTPUT   ANOVA    </strong> A)Payroll is a good predictor of Sales based on α = 0.05. B)There is evidence of a positive linear relationship between Sales and Payroll based on α = 0.05. C)Payroll is not a good predictor of Sales based on α = 0.01. D)The coefficient of determination is equal to 0.833333. <div style=padding-top: 35px> ANOVA
<strong>A prediction equation for sales and payroll was performed using simple linear regression.In the regression printout shown below, which of the following statements is not true? SUMMARY OUTPUT   ANOVA    </strong> A)Payroll is a good predictor of Sales based on α = 0.05. B)There is evidence of a positive linear relationship between Sales and Payroll based on α = 0.05. C)Payroll is not a good predictor of Sales based on α = 0.01. D)The coefficient of determination is equal to 0.833333. <div style=padding-top: 35px> <strong>A prediction equation for sales and payroll was performed using simple linear regression.In the regression printout shown below, which of the following statements is not true? SUMMARY OUTPUT   ANOVA    </strong> A)Payroll is a good predictor of Sales based on α = 0.05. B)There is evidence of a positive linear relationship between Sales and Payroll based on α = 0.05. C)Payroll is not a good predictor of Sales based on α = 0.01. D)The coefficient of determination is equal to 0.833333. <div style=padding-top: 35px>

A)Payroll is a good predictor of Sales based on α = 0.05.
B)There is evidence of a positive linear relationship between Sales and Payroll based on α = 0.05.
C)Payroll is not a good predictor of Sales based on α = 0.01.
D)The coefficient of determination is equal to 0.833333.
Question
Which of the following represents the underlying linear model for hypothesis testing?

A)Y = b0 + b1 X + ε
B)Y = b0 + b1 X
C)Y = β0 + β1 X + ε
D)Y = β0 + β1 X
Question
The primary difference between r2 and the adjusted r2 is that

A)the adjusted r2 accounts for the total number of variables in the regression model.
B)the adjusted r2 accounts for the number of independent variables in the regression model.
C)the adjusted r2 accounts for the number of dependent variables in the regression model.
D)the adjusted r2 accounts for multicollinearity.
Question
Which of the following is not a common pitfall of regression?

A)If the assumptions are not met, the statistical tests may not be valid.
B)Nonlinear relationships cannot be incorporated.
C)Two variables may be highly correlated to one another but one is not causing the other to change.
D)Concluding that a statistically significant relationship implies practical value.
Question
A healthcare executive is using regression to predict total revenues.She has decided to include both patient length of stay and insurance type in her model.Insurance type can be grouped into the following categories: Medicare, Medicaid, Managed Care, Self-Pay, and Charity.Which of the following is true?

A)Insurance type will be represented in the regression model by five binary variables.
B)Insurance type will be represented in the regression model by six dummy variables.
C)Insurance type will be represented in the regression model by five dummy variables.
D)Insurance type will be represented in the regression model by four binary variables.
Question
A healthcare executive is using regression to predict total revenues.She has decided to include both patient length of stay and insurance type in her model.Insurance type can be grouped into three categories: Government-Funded, Private-Pay, and Other.Her model is

A)Y = b0.
B)Y = b0 + b1 X1.
C)Y = b0 + b1 X1 + b2 X2.
D)Y = b0 + b1 X1 + b2 X2 + b3 X3.
Unlock Deck
Sign up to unlock the cards in this deck!
Unlock Deck
Unlock Deck
1/129
auto play flashcards
Play
simple tutorial
Full screen (f)
exit full mode
Deck 4: Regression Models
1
The null hypothesis in the F-test is that there is a linear relationship between the X and Y variables.
False
2
There is no relationship between variables unless the data points lie in a straight line.
False
3
In regression, a dependent variable is sometimes called a predictor variable.
False
4
The coefficient of determination gives the proportion of the variability in the dependent variable that is explained by the regression equation.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
5
The correlation coefficient has values between −1 and +1.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
6
Both the p-value for the F-test and r2 can be interpreted the same with multiple regression models as they are with simple linear models.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
7
Errors are also called residuals.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
8
The regression line minimizes the sum of the squared errors.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
9
When the significance level is small enough in the F-test, we can reject the null hypothesis that there is no linear relationship.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
10
An F-test is used to determine if there is a relationship between the dependent and independent variables.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
11
The coefficients of each independent variable in a multiple regression model represent slopes.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
12
If the significance level for the F-test is high enough, there is a relationship between the dependent and independent variables.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
13
For statistical tests of significance about the coefficients, the null hypothesis is that the slope is 1.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
14
The multiple regression model includes several dependent variables.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
15
Estimates of the slope, intercept, and error of a regression model are found from sample data.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
16
Summing the error values in a regression model is misleading because negative errors cancel out positive errors.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
17
In regression, there is random error that can be predicted.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
18
In regression, an independent variable is sometimes called a response variable.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
19
Often, a plot of the residuals will highlight any glaring violations of the assumptions.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
20
If the assumptions of regression have been met, errors plotted against the independent variable will typically show patterns.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
21
A dummy variable can be assigned up to three values.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
22
If multicollinearity exists, then individual interpretation of the variables is questionable, but the overall model is still good for prediction purposes.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
23
The random error in a regression equation

A)is the predicted error.
B)includes both positive and negative terms.
C)will sum to a large positive number.
D)is used to estimate the accuracy of the slope.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
24
A high correlation always implies that one variable is causing a change in the other variable.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
25
Multicollinearity exists when a variable is correlated to other variables.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
26
The adjusted r2 will always increase as additional variables are added to the model.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
27
The best model is a statistically significant model with a high r-square and few variables.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
28
Which of these statements is best?

A)In regression, an independent variable is sometimes called a response variable.
B)A scatter diagram is a graphical depiction of the relationship between the dependent and independent variables.
C)In a scatter diagram, the dependent variable is typically plotted on the horizontal axis.
D)There is no relationship between variables unless the data points lie in a straight line.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
29
Dummy variables for regression analysis can take on a value of either -1 or +1.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
30
Which of the following statements is true regarding a scatter diagram?

A)It provides very little information about the relationship between the regression variables.
B)It is a plot of the independent and dependent variables.
C)It is a line chart of the independent and dependent variables.
D)It has a value between -1 and +1.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
31
The value of r2 can never decrease when more variables are added to the model.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
32
The multiple regression model includes several intercept terms.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
33
In regression, a binary variable is also called an indicator variable.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
34
Transformations may be used when nonlinear relationships exist between variables.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
35
The number of dummy variables must equal 1 less than the number of categories of the qualitative variable.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
36
A variable should be added to the model regardless of the impact (increase or decrease)on the adjusted r2 value.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
37
Another name for a dummy variable is a binary variable.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
38
Which of these statements is best?

A)In regression, an independent variable is sometimes called a response variable.
B)One purpose of regression is to understand the relationship between variables.
C)One purpose of regression is to predict the value of one variable based on the standard deviation of that variable.
D)The variable to be predicted is the independent variable.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
39
A reference to the criterion used to select the regression line, to minimize the squared distances between the estimated straight line and the observed values is called

A)Mean square error.
B)Sum of Squares.
C)Maximum likelihood.
D)Least Squares.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
40
The multiple regression model includes multiple slope coefficients.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
41
Which of the following conditions can be detected from residual analysis?

A)nonconstant variance
B)multicollinearity
C)dummy variables
D)presence of transformations
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
42
The problem of nonconstant error variance is detected in residual analysis by which of the following?

A)a cone pattern
B)an arched pattern
C)a random pattern
D)an increasing pattern
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
43
Which of these statements is best?

A)The regression model assumes the error terms are dependent.
B)The regression model assumes the errors are normally distributed.
C)The errors in a regression model are assumed to have an increasing mean.
D)The errors in a regression model are assumed to have zero variance.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
44
The correlation coefficient resulting from a particular regression analysis was 0.25.What was the coefficient of determination?

A)0.5
B)-0.5
C)0.0625
D)There is insufficient information to answer the question.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
45
The coefficient of determination resulting from a particular regression analysis was 0.85.What was the slope of the regression line?

A)0.85
B)-0.85
C)0.922
D)There is insufficient information to answer the question.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
46
Which of the following statements is true about r2?

A)It is also called the coefficient of correlation.
B)It is also called the coefficient of determination.
C)It represents the percent of variation in X that is explained by Y.
D)It represents the percent of variation in the error that is explained by Y.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
47
Which of the following is not an assumption of the regression model?

A)The errors are independent.
B)The errors should have a standard deviation equal to one.
C)The errors are normally distributed.
D)The errors have constant variance.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
48
Which of these statements is best?

A)The SST measures the total variability in the dependent variable about the regression line.
B)The SSE measures the total variability in the independent variable about the regression line.
C)The SSR indicates how much of the total variability in the dependent variable is explained by the regression model.
D)The coefficient of determination takes on values between -1 and + 1.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
49
A dummy variable is also called a(n)

A)indicator variable.
B)dependent variable.
C)continuous variable.
D)response variable.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
50
In a good regression model the residual plot shows

A)a cone pattern.
B)an arched pattern.
C)a random pattern.
D)an increasing pattern.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
51
The diagram below illustrates data with a <strong>The diagram below illustrates data with a  </strong> A)negative correlation coefficient. B)zero correlation coefficient. C)positive correlation coefficient. D)correlation coefficient equal to +1.

A)negative correlation coefficient.
B)zero correlation coefficient.
C)positive correlation coefficient.
D)correlation coefficient equal to +1.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
52
The coefficient of determination resulting from a particular regression analysis was 0.85.What was the correlation coefficient, assuming a positive linear relationship?

A)0.5
B)-0.5
C)0.922
D)There is insufficient information to answer the question.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
53
The sum of squared error (SSE)is

A)a measure of the total variation in Y about the mean.
B)a measure of the total variation in X about the mean.
C)a measure in the variation of Y about the regression line.
D)a measure in the variation of X about the regression line.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
54
Which of the following is an assumption of the regression model?

A)The errors are independent.
B)The errors are not normally distributed.
C)The errors have a standard deviation of zero.
D)The errors have an irregular variance.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
55
Which of the following statements is not true about regression models?

A)Estimates of the slope are found from sample data.
B)The regression line minimizes the sum of the squared errors.
C)The error is found by subtracting the actual data value from the predicted data value.
D)The dependent variable is the explanatory variable.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
56
Which of the following equalities is correct?

A)SST = SSR + SSE
B)SSR = SST + SSE
C)SSE = SSR + SST
D)SST = SSC + SSR
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
57
Which of these statements is best?

A)If the assumptions of regression have been met, errors plotted against the independent variable will typically show patterns.
B)The error standard deviation is estimated by MSE.
C)Often, a plot of the residuals will highlight any glaring violations of the assumptions.
D)The standard error of the estimate is also called the variance of the regression.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
58
Which of these statements is best?

A)In any regression model, there is an implicit assumption that there is no preexisting relationship between the variables.
B)In regression, there is random error that can be predicted.
C)Estimates of the slope, intercept, and error of a regression model are found from sample data.
D)Error is the difference in the actual value and the predicted value.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
59
If computing a causal linear regression model of Y = a + bX and the resultant r2 is very near zero, then one would be able to conclude that

A)Y = a + bX is a good forecasting method.
B)Y = a + bX is not a good forecasting method.
C)a multiple linear regression model is a good forecasting method for the data.
D)a multiple linear regression model is not a good forecasting method for the data.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
60
The problem of a nonlinear relationship is detected in residual analysis by which of the following?

A)a cone pattern
B)an arched pattern
C)a random pattern
D)an increasing pattern
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
61
When an independent variable is correlated with one other independent variable, the variables are said to be

A)collinear.
B)pairwise.
C)independent.
D)mutually exclusive.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
62
Which of the following is true regarding a regression model with multicollinearity, a high r2 value, and a low F-test significance level?

A)The model is not a good prediction model.
B)The high value of r2 is due to the multicollinearity.
C)The interpretation of the coefficients is valuable.
D)The significance level tests for the coefficients are not valid.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
63
If a qualitative variable has three categories, how many dummy variables are needed?

A)0
B)1
C)2
D)3
E)4
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
64
A prediction equation for starting salaries (in $1,000s)and SAT scores was performed using simple linear regression.In the regression printout shown below, what can be said about the level of significance for the overall model? SUMMARY OUTPUT
<strong>A prediction equation for starting salaries (in $1,000s)and SAT scores was performed using simple linear regression.In the regression printout shown below, what can be said about the level of significance for the overall model? SUMMARY OUTPUT   ANOVA    </strong> A)SAT is not a good predictor for starting salary. B)The significance level for the intercept indicates the model is not valid. C)The significance level for SAT indicates the slope is equal to zero. D)The significance level for SAT indicates the slope is not equal to zero. ANOVA
<strong>A prediction equation for starting salaries (in $1,000s)and SAT scores was performed using simple linear regression.In the regression printout shown below, what can be said about the level of significance for the overall model? SUMMARY OUTPUT   ANOVA    </strong> A)SAT is not a good predictor for starting salary. B)The significance level for the intercept indicates the model is not valid. C)The significance level for SAT indicates the slope is equal to zero. D)The significance level for SAT indicates the slope is not equal to zero. <strong>A prediction equation for starting salaries (in $1,000s)and SAT scores was performed using simple linear regression.In the regression printout shown below, what can be said about the level of significance for the overall model? SUMMARY OUTPUT   ANOVA    </strong> A)SAT is not a good predictor for starting salary. B)The significance level for the intercept indicates the model is not valid. C)The significance level for SAT indicates the slope is equal to zero. D)The significance level for SAT indicates the slope is not equal to zero.

A)SAT is not a good predictor for starting salary.
B)The significance level for the intercept indicates the model is not valid.
C)The significance level for SAT indicates the slope is equal to zero.
D)The significance level for SAT indicates the slope is not equal to zero.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
65
The sum of the squares total (SST)

A)measures the total variability in Y about the mean.
B)measures the total variability in X about the mean.
C)measures the variability in Y about the regression line.
D)measures the variability in X about the regression line.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
66
A healthcare executive is using regression to predict total revenues.She is deciding whether or not to include both patient length of stay and insurance type in her model.Her first regression model only included patient length of stay.The resulting r2 was .83, with an adjusted r2 of .82 and her level of significance was .003.In the second model, she included both patient length of stay and insurance type.The r2 was .84 and the adjusted r2 was .80 for the second model and the level of significance did not change.Which of the following statements is true?

A)The second model is a better model.
B)The first model is a better model.
C)The r2 increased when additional variables were added because these variables significantly contribute to the prediction of total revenues.
D)The adjusted r2 always increases when additional variables are added to the model.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
67
Suppose that you believe that a cubic relationship exists between the independent variable (of time)and the dependent variable Y.Which of the following would represent a valid linear regression model?

A)Y = b0 + b1 X, where X = time3
B)Y = b0 + b1 X3, where X = time
C)Y = b0 + 3b1 X, where X = time3
D)Y = b0 + 3b1 X, where X = time
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
68
Which of these statements about multiple regression is best?

A)The p-value for the F-test may not be interpreted the same with multiple regression models as they are with simple linear models.
B)The multiple regression model includes several dependent variables.
C)The multiple regression model includes several intercept terms.
D)The multiple regression model includes multiple slope coefficients.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
69
In the model Y = β0 + β1X1 + β2X2 + ε, which of these parameters represents an independent variable?

A)the X1
B)the Y
C)the β1
D)the ε
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
70
An automated process to systematically add or delete independent variables from a regression model is called

A)nonlinear regression.
B)linear regression.
C)residual analysis.
D)stepwise regression.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
71
Which of the following statements provides the best guidance for model building?

A)If the value of r2 increases as more variables are added to the model, the variables should remain in the model, regardless of the magnitude of increase.
B)If the value of the adjusted r2 increases as more variables are added to the model, the variables should remain in the model.
C)If the value of r2 increases as more variables are added to the model, the variables should not remain in the model, regardless of the magnitude of the increase.
D)If the value of the adjusted r2 increases as more variables are added to the model, the variables should not remain in the model.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
72
The condition of an independent variable being correlated to one or more other independent variables is referred to as

A)multicollinearity.
B)statistical significance.
C)linearity.
D)nonlinearity.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
73
The mean square error (MSE)is

A)denoted by s.
B)denoted by k.
C)the SSE divided by the number of observations.
D)the SSE divided by the degrees of freedom.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
74
Which of the following statements is false concerning the hypothesis testing procedure for a regression model?

A)The F-test statistic is used.
B)The null hypothesis is that the true slope coefficient is equal to zero.
C)The null hypothesis is rejected if the adjusted r2 is above the critical value.
D)An α level must be selected.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
75
A prediction equation for sales and payroll was performed using simple linear regression.In the regression printout shown below, which of the following statements is not true? SUMMARY OUTPUT
<strong>A prediction equation for sales and payroll was performed using simple linear regression.In the regression printout shown below, which of the following statements is not true? SUMMARY OUTPUT   ANOVA    </strong> A)Payroll is a good predictor of Sales based on α = 0.05. B)There is evidence of a positive linear relationship between Sales and Payroll based on α = 0.05. C)Payroll is not a good predictor of Sales based on α = 0.01. D)The coefficient of determination is equal to 0.833333. ANOVA
<strong>A prediction equation for sales and payroll was performed using simple linear regression.In the regression printout shown below, which of the following statements is not true? SUMMARY OUTPUT   ANOVA    </strong> A)Payroll is a good predictor of Sales based on α = 0.05. B)There is evidence of a positive linear relationship between Sales and Payroll based on α = 0.05. C)Payroll is not a good predictor of Sales based on α = 0.01. D)The coefficient of determination is equal to 0.833333. <strong>A prediction equation for sales and payroll was performed using simple linear regression.In the regression printout shown below, which of the following statements is not true? SUMMARY OUTPUT   ANOVA    </strong> A)Payroll is a good predictor of Sales based on α = 0.05. B)There is evidence of a positive linear relationship between Sales and Payroll based on α = 0.05. C)Payroll is not a good predictor of Sales based on α = 0.01. D)The coefficient of determination is equal to 0.833333.

A)Payroll is a good predictor of Sales based on α = 0.05.
B)There is evidence of a positive linear relationship between Sales and Payroll based on α = 0.05.
C)Payroll is not a good predictor of Sales based on α = 0.01.
D)The coefficient of determination is equal to 0.833333.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
76
Which of the following represents the underlying linear model for hypothesis testing?

A)Y = b0 + b1 X + ε
B)Y = b0 + b1 X
C)Y = β0 + β1 X + ε
D)Y = β0 + β1 X
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
77
The primary difference between r2 and the adjusted r2 is that

A)the adjusted r2 accounts for the total number of variables in the regression model.
B)the adjusted r2 accounts for the number of independent variables in the regression model.
C)the adjusted r2 accounts for the number of dependent variables in the regression model.
D)the adjusted r2 accounts for multicollinearity.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
78
Which of the following is not a common pitfall of regression?

A)If the assumptions are not met, the statistical tests may not be valid.
B)Nonlinear relationships cannot be incorporated.
C)Two variables may be highly correlated to one another but one is not causing the other to change.
D)Concluding that a statistically significant relationship implies practical value.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
79
A healthcare executive is using regression to predict total revenues.She has decided to include both patient length of stay and insurance type in her model.Insurance type can be grouped into the following categories: Medicare, Medicaid, Managed Care, Self-Pay, and Charity.Which of the following is true?

A)Insurance type will be represented in the regression model by five binary variables.
B)Insurance type will be represented in the regression model by six dummy variables.
C)Insurance type will be represented in the regression model by five dummy variables.
D)Insurance type will be represented in the regression model by four binary variables.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
80
A healthcare executive is using regression to predict total revenues.She has decided to include both patient length of stay and insurance type in her model.Insurance type can be grouped into three categories: Government-Funded, Private-Pay, and Other.Her model is

A)Y = b0.
B)Y = b0 + b1 X1.
C)Y = b0 + b1 X1 + b2 X2.
D)Y = b0 + b1 X1 + b2 X2 + b3 X3.
Unlock Deck
Unlock for access to all 129 flashcards in this deck.
Unlock Deck
k this deck
locked card icon
Unlock Deck
Unlock for access to all 129 flashcards in this deck.