Deck 13: Multiple Regression

Full screen (f)
exit full mode
Question
If the probability plot of residuals resembles a straight line,the residuals show a fairly good fit to the normal distribution.
Use Space or
up arrow
down arrow
to flip the card.
Question
The F statistic in a multiple regression is significant if at least one of the predictors has a significant t statistic at a given α.
Question
If R2 and R2adj differ greatly,we should probably add a few predictors to improve the fit.
Question
Confidence intervals for Y may be unreliable when the residuals are not normally distributed.
Question
A common misinterpretation of the principle of Occam's Razor is that a simple regression model (rather than a multiple regression model)is always best.
Question
R2adj can exceed R2 if there are several weak predictors.
Question
In a regression,the model with the best fit is preferred over all other models.
Question
For a certain firm,the regression equation Bonus = 2,000 + 257 Experience + 0.046 Salary describes employee bonuses with a standard error of 125.John has 10 years' experience,earns $50,000,and earned a bonus of $7,000.John is an outlier.
Question
A binary (categorical)predictor should not be used along with nonbinary (numerical)predictors.
Question
A negative estimated coefficient in a regression usually indicates a weak predictor.
Question
In a multiple regression with 3 predictors in a sample of 25 U.S.cities,we would use F3,21 in a test of overall significance.
Question
If a regression model's F test statistic is Fcalc = 43.82,we could say that the explained variance is approximately 44 percent.
Question
The model Y = β0 + β1X + β2X2 cannot be estimated by Excel because of the nonlinear term.
Question
The effect of a binary predictor is to shift the regression intercept.
Question
Evans' Rule says that if n = 50 you need at least 5 predictors to have a good model.
Question
In regression,the dependent variable is referred to as the response variable.
Question
There is one residual for each predictor in the regression model.
Question
A predictor whose pairwise correlation with Y is near zero can still have a significant t-value in a multiple regression when other predictors are included.
Question
The random error term in a regression model reflects all factors omitted from the model.
Question
A parsimonious model is one with many weak predictors but a few strong ones.
Question
Non-normality of the residuals from a regression can best be detected by looking at the residual plots against the fitted Y values.
Question
A multiple regression with 60 observations should not have 13 predictors.
Question
A high variance inflation factor (VIF)indicates a significant predictor in the regression.
Question
In a regression model of student grades,we would code the nine categories of business courses taken (ACC,FIN,ECN,MGT,MKT,MIS,ORG,POM,QMM)by including nine binary (0 or 1)predictors in the regression.
Question
A disadvantage of Excel's Data Analysis regression tool is that it expects the independent variables to be in a block of contiguous columns,so you must delete a column if you want to eliminate a predictor from the model.
Question
The ill effects of heteroscedasticity might be mitigated by redefining totals (e.g. ,total number of homicides)as relative values (e.g. ,homicide rate per 100,000 population).
Question
A widening pattern of residuals as X increases would suggest heteroscedasticity.
Question
Autocorrelation of the residuals may affect the reliability of the t-values for the estimated coefficients of the predictors X1,X2,... ,Xk.
Question
A regression of Y using four independent variables X1,X2,X3,X4 could also have up to four nonlinear terms (X2)and six simple interaction terms (XjXk)if you have enough observations to justify them.
Question
Plotting the residuals against a binary predictor (X = 0,1)reveals nothing about heteroscedasticity.
Question
When autocorrelation is present,the estimates of the coefficients will be unbiased.
Question
Statisticians who work with cross-sectional data generally do not anticipate autocorrelation.
Question
The regression equation Bonus = 2,812 + 27 Experience + 0.046 Salary says that Experience is the most significant predictor of Bonus.
Question
The first differences transformation might be tried if autocorrelation is found in a time-series data set.
Question
A disadvantage of Excel's regression is that it does not give as much accuracy in the estimated regression coefficients as a package like Minitab.
Question
Multicollinearity can be detected from t tests of the predictor variables.
Question
If the residuals in your regression are nonnormal,a larger sample size might help improve the reliability of confidence intervals for Y.
Question
The F statistic and its p-value give a global test of significance for a multiple regression.
Question
When multicollinearity is present,the regression model is of no use for making predictions.
Question
Autocorrelation may be detected by looking at a plot of the residuals against time.
Question
Heteroscedasticity exists when all the errors (residuals)have the same variance.
Question
In a multiple regression with six predictors in a sample of 67 U.S.cities,what would be the critical value for an F-test of overall significance at α = .05?

A)2.29
B)2.25
C)2.37
D)2.18
Question
A squared predictor is used to test for nonlinearity in the predictor's relationship to Y.
Question
The t test shows the ratio of an estimated coefficient to its standard error.
Question
Which of the following is not true of the standard error of the regression?

A)It is a measure of the accuracy of the prediction.
B)It is based on squared vertical deviations between the actual and predicted values of Y.
C)It would be negative when there is an inverse relationship in the model.
D)It is used in constructing confidence and prediction intervals for Y.
Question
If X2 is a binary predictor in Y = β0 + β1X1 + β2X2,then which statement is most nearly correct?

A)X2 = 1 should represent the most desirable condition.
B)X2 would be a significant predictor if β2 = 423.72.
C)X2 = 0,X2 = 1,X2 = 2 would be appropriate if three categories exist.
D)X2 will shift the estimated equation either by 0 units or by β2 units.
Question
Unlike other predictors,a binary predictor has a t-value that is either 0 or 1.
Question
The unexplained sum of squares measures variation in the dependent variable Y about the

A)mean of the Y values.
B)estimated Y values.
C)mean of the X values.
D)Y-intercept.
Question
Given that the fitted regression is Y = 76.40 − 6.388X1 + 0.870X2,the standard error of b1 is 1.453,and n = 63,at α = .05,we can conclude that X1 is a significant predictor of Y.
Question
Nonnormal residuals lead to biased estimates of the coefficients in a regression model.
Question
A test is conducted in 22 cities to see if giving away free transit system maps will increase the number of bus riders.In a regression analysis,the dependent variable Y is the increase in bus riders (in thousands of persons)from the start of the test until its conclusion.The independent variables are X1 = the number (in thousands)of free maps distributed and a binary variable X2 = 1 if the city has free downtown parking,0 otherwise.The estimated regression equation is Y = 1.32 + 0.0345X1 − 1.45X2.In city 3,the observed Y value is 7.3,X1 = 140,and X2 = 0.The residual for city 3 (in thousands)is

A)6.15
B)1.15
C)4.83
D)1.57
Question
When predictor variables are strongly related to each other,the ________ of the regression estimates is questionable.

A)logic
B)fit
C)parsimony
D)stability
Question
Nonnormality of residuals is not usually considered a major problem unless there are outliers.
Question
Multicollinearity refers to relationships among the independent variables.
Question
A large VIF (e.g. ,10 or more)would indicate multicollinearity.
Question
In a multiple regression with five predictors in a sample of 56 U.S.cities,what would be the critical value for an F-test of overall significance at α = .05?

A)2.45
B)2.37
C)2.40
D)2.56
Question
In a multiple regression with five predictors in a sample of 56 U.S.cities,we would use F5,50 in a test of overall significance.
Question
A multiple regression analysis with two independent variables yielded the following results in the ANOVA table: SS(Total)= 798,SS(Regression)= 738,SS(Error)= 60.The multiple correlation coefficient is

A)0.2742
B)0.0752
C)0.9248
D)0.9617
Question
In the fitted regression Y = 12 + 3X1 − 5X2 + 27X3 + 2X4 the most significant predictor is X3.
Question
A fitted multiple regression equation is Y = 12 + 3X1 - 5X2 + 7X3 + 2X4.When X1 increases 2 units and X2 increases 2 units as well,while X3 and X4 remain unchanged,what change would you expect in your estimate of Y?

A)Decrease by 2
B)Decrease by 4
C)Increase by 2
D)No change in Y
Question
A useful guideline in determining the extent of collinearity in a multiple regression model is

A)Sturges' Rule.
B)Klein's Rule.
C)Occam's Rule.
D)Pearson's Rule.
Question
Part of a regression output is provided below.Some of the information has been omitted.The approximate value of F is <strong>Part of a regression output is provided below.Some of the information has been omitted.The approximate value of F is   The approximate value of F is</strong> A)1605.7 B)0.9134 C)89.66 D)impossible to calculate with the given information. <div style=padding-top: 35px> The approximate value of F is

A)1605.7
B)0.9134
C)89.66
D)impossible to calculate with the given information.
Question
A realtor is trying to predict the selling price of houses in Greenville (in thousands of dollars)as a function of Size (measured in thousands of square feet)and whether or not there is a fireplace (FP is 0 if there is no fireplace,1 if there is a fireplace).Part of the regression output is provided below,based on a sample of 20 homes.Some of the information has been omitted. <strong>A realtor is trying to predict the selling price of houses in Greenville (in thousands of dollars)as a function of Size (measured in thousands of square feet)and whether or not there is a fireplace (FP is 0 if there is no fireplace,1 if there is a fireplace).Part of the regression output is provided below,based on a sample of 20 homes.Some of the information has been omitted.   Which statement is supported by the regression output?</strong> A)At α = .05,FP is not a significant predictor in a two-tailed test. B)A fireplace adds around $6,476 to the selling price of the average house. C)A large house with no fireplace will sell for more than a small house with a fireplace. D)FP is a more significant predictor than Size. <div style=padding-top: 35px> Which statement is supported by the regression output?

A)At α = .05,FP is not a significant predictor in a two-tailed test.
B)A fireplace adds around $6,476 to the selling price of the average house.
C)A large house with no fireplace will sell for more than a small house with a fireplace.
D)FP is a more significant predictor than Size.
Question
Which is not a standard criterion for assessing a regression model?

A)Logic of causation
B)Overall fit
C)Degree of collinearity
D)Binary predictors
Question
Refer to the following regression results.The dependent variable is Abort (the number of abortions per 1000 women of childbearing age).The regression was estimated using data for the 50 U.S.states with these predictors: EdSpend = public K − 12 school expenditure per capita,Age = median age of population,Unmar = percent of total births by unmarried women,Infmor = infant mortality rate in deaths per 1000 live births. <strong>Refer to the following regression results.The dependent variable is Abort (the number of abortions per 1000 women of childbearing age).The regression was estimated using data for the 50 U.S.states with these predictors: EdSpend = public K − 12 school expenditure per capita,Age = median age of population,Unmar = percent of total births by unmarried women,Infmor = infant mortality rate in deaths per 1000 live births.   Which statement is not supported by a two-tailed test?</strong> A)Unmar is a significant predictor at α = .01. B)EdSpend is a significant predictor at α = .20. C)Infmor is not a significant predictor at α = .05. D)Age is not a significant predictor at α = .05. <div style=padding-top: 35px> Which statement is not supported by a two-tailed test?

A)Unmar is a significant predictor at α = .01.
B)EdSpend is a significant predictor at α = .20.
C)Infmor is not a significant predictor at α = .05.
D)Age is not a significant predictor at α = .05.
Question
Using a sample of 63 observations,a dependent variable Y is regressed against two variables X1 and X2 to obtain the fitted regression equation Y = 76.40 − 6.388X1 + 0.870X2.The standard error of b1 is 3.453 and the standard error of b2 is 0.611.At α = .05,we could

A)conclude that both coefficients differ significantly from zero.
B)reject H0: β1 ≥ 0 and conclude H0: β1 < 0.
C)reject H0: β2 ≤ 0 and conclude H0: β1 > 0.
D)conclude that Evans' Rule has been violated.
Question
Part of a regression output is provided below.Some of the information has been omitted. <strong>Part of a regression output is provided below.Some of the information has been omitted.   The SS (residual)is</strong> A)3177.17 B)301.19 C)17.71 D)impossible to determine. <div style=padding-top: 35px> The SS (residual)is

A)3177.17
B)301.19
C)17.71
D)impossible to determine.
Question
A realtor is trying to predict the selling price of houses in Greenville (in thousands of dollars)as a function of Size (measured in thousands of square feet)and whether or not there is a fireplace (FP is 0 if there is no fireplace,1 if there is a fireplace).The regression output is provided below.Some of the information has been omitted. <strong>A realtor is trying to predict the selling price of houses in Greenville (in thousands of dollars)as a function of Size (measured in thousands of square feet)and whether or not there is a fireplace (FP is 0 if there is no fireplace,1 if there is a fireplace).The regression output is provided below.Some of the information has been omitted.   Which of the following conclusions can be made based on the F-test?</strong> A)The p-value on the F-test will be very high. B)At least one of the predictors is useful in explaining Y. C)The model is of no use in predicting the selling prices of houses. D)The estimates were based on a sample of 19 houses. <div style=padding-top: 35px> Which of the following conclusions can be made based on the F-test?

A)The p-value on the F-test will be very high.
B)At least one of the predictors is useful in explaining Y.
C)The model is of no use in predicting the selling prices of houses.
D)The estimates were based on a sample of 19 houses.
Question
A realtor is trying to predict the selling price of houses in Greenville (in thousands of dollars)as a function of Size (measured in thousands of square feet)and whether or not there is a fireplace (FP is 0 if there is no fireplace,1 if there is a fireplace).Part of the regression output is provided below,based on a sample of 20 homes.Some of the information has been omitted. <strong>A realtor is trying to predict the selling price of houses in Greenville (in thousands of dollars)as a function of Size (measured in thousands of square feet)and whether or not there is a fireplace (FP is 0 if there is no fireplace,1 if there is a fireplace).Part of the regression output is provided below,based on a sample of 20 homes.Some of the information has been omitted.   The estimated coefficient for Size is approximately</strong> A)9.5 B)13.8 C)122.5 D)1442.6 <div style=padding-top: 35px> The estimated coefficient for Size is approximately

A)9.5
B)13.8
C)122.5
D)1442.6
Question
Which is a characteristic of the variance inflation factor (VIF)?

A)It is insignificant unless the corresponding t statistic is significant.
B)It reveals collinearity rather than multicollinearity.
C)It measures the degree of significance of each predictor.
D)It indicates the predictor's degree of multicollinearity.
Question
If the standard error is 12,the width of a quick prediction interval for Y is

A)±15
B)±24
C)±19
D)impossible to determine without an F table.
Question
Which is not a name often given to an independent variable that takes on just two values (0 or 1)according to whether or not a given characteristic is absent or present?

A)Absent variable
B)Binary variable
C)Dummy variable
Question
A realtor is trying to predict the selling price of houses in Greenville (in thousands of dollars)as a function of Size (measured in thousands of square feet)and whether or not there is a fireplace (FP is 0 if there is no fireplace,1 if there is a fireplace).The regression output is provided below.Some of the information has been omitted. <strong>A realtor is trying to predict the selling price of houses in Greenville (in thousands of dollars)as a function of Size (measured in thousands of square feet)and whether or not there is a fireplace (FP is 0 if there is no fireplace,1 if there is a fireplace).The regression output is provided below.Some of the information has been omitted.   How many predictors (independent variables)were used in the regression?</strong> A)20 B)18 C)3 D)2 <div style=padding-top: 35px> How many predictors (independent variables)were used in the regression?

A)20
B)18
C)3
D)2
Question
Refer to the following correlation matrix that was part of a regression analysis.The dependent variable was Abort (the number of abortions per 1000 women of childbearing age).The regression was estimated using data for the 50 U.S.states with these predictors: EdSpend = public K−12 school expenditure per capita,Age = median age of population,Unmar = percent of total births by unmarried women,Infmor = infant mortality rate in deaths per 1000 live births. Correlation Matrix
<strong>Refer to the following correlation matrix that was part of a regression analysis.The dependent variable was Abort (the number of abortions per 1000 women of childbearing age).The regression was estimated using data for the 50 U.S.states with these predictors: EdSpend = public K−12 school expenditure per capita,Age = median age of population,Unmar = percent of total births by unmarried women,Infmor = infant mortality rate in deaths per 1000 live births. Correlation Matrix   Using a two-tailed correlation test,which statement is not accurate?</strong> A)Age and Infmor are not significantly correlated at α = .05. B)Abort and Unmar are significantly correlated at α = .05. C)Unmar and Infmor are significantly correlated at α = .05. D)The first column of the table shows evidence of multicollinearity. <div style=padding-top: 35px> Using a two-tailed correlation test,which statement is not accurate?

A)Age and Infmor are not significantly correlated at α = .05.
B)Abort and Unmar are significantly correlated at α = .05.
C)Unmar and Infmor are significantly correlated at α = .05.
D)The first column of the table shows evidence of multicollinearity.
Question
Refer to this ANOVA table from a regression: <strong>Refer to this ANOVA table from a regression:   For this regression,the R2 is</strong> A)0.3995 B)0.6005 C)0.6654 D)0.8822 <div style=padding-top: 35px> For this regression,the R2 is

A)0.3995
B)0.6005
C)0.6654
D)0.8822
Question
In a multiple regression,all of the following are true regarding residuals except

A)their sum always equals zero.
B)they are the differences between observed and predicted values of the response variable.
C)they may be used to detect multicollinearity.
D)they may be used to detect heteroscedasticity.
Question
The residual plot below suggests which violation(s)of regression assumptions? <strong>The residual plot below suggests which violation(s)of regression assumptions?  </strong> A)Autocorrelation B)Heteroscedasticity C)Nonnormality D)Multicollinearity <div style=padding-top: 35px>

A)Autocorrelation
B)Heteroscedasticity
C)Nonnormality
D)Multicollinearity
Question
Refer to this ANOVA table from a regression: <strong>Refer to this ANOVA table from a regression:   Which statement is not accurate?</strong> A)The F-test is significant at α = .05. B)There were 50 observations. C)There were 5 predictors. D)There would be 50 residuals. <div style=padding-top: 35px> Which statement is not accurate?

A)The F-test is significant at α = .05.
B)There were 50 observations.
C)There were 5 predictors.
D)There would be 50 residuals.
Question
A log transformation might be appropriate to alleviate which problem(s)?

A)Heteroscedastic residuals
B)Multicollinearity
C)Autocorrelated residuals
Question
A fitted multiple regression equation is Y = 28 + 5X1 - 4X2 + 7X3 + 2X4.When X1 increases 2 units and X2 increases 2 units as well,while X3 and X4 remain unchanged,what change would you expect in your estimate of Y?

A)Increase by 2
B)Decrease by 4
C)Increase by 4
D)No change in Y
Unlock Deck
Sign up to unlock the cards in this deck!
Unlock Deck
Unlock Deck
1/133
auto play flashcards
Play
simple tutorial
Full screen (f)
exit full mode
Deck 13: Multiple Regression
1
If the probability plot of residuals resembles a straight line,the residuals show a fairly good fit to the normal distribution.
True
2
The F statistic in a multiple regression is significant if at least one of the predictors has a significant t statistic at a given α.
True
3
If R2 and R2adj differ greatly,we should probably add a few predictors to improve the fit.
False
4
Confidence intervals for Y may be unreliable when the residuals are not normally distributed.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
5
A common misinterpretation of the principle of Occam's Razor is that a simple regression model (rather than a multiple regression model)is always best.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
6
R2adj can exceed R2 if there are several weak predictors.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
7
In a regression,the model with the best fit is preferred over all other models.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
8
For a certain firm,the regression equation Bonus = 2,000 + 257 Experience + 0.046 Salary describes employee bonuses with a standard error of 125.John has 10 years' experience,earns $50,000,and earned a bonus of $7,000.John is an outlier.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
9
A binary (categorical)predictor should not be used along with nonbinary (numerical)predictors.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
10
A negative estimated coefficient in a regression usually indicates a weak predictor.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
11
In a multiple regression with 3 predictors in a sample of 25 U.S.cities,we would use F3,21 in a test of overall significance.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
12
If a regression model's F test statistic is Fcalc = 43.82,we could say that the explained variance is approximately 44 percent.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
13
The model Y = β0 + β1X + β2X2 cannot be estimated by Excel because of the nonlinear term.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
14
The effect of a binary predictor is to shift the regression intercept.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
15
Evans' Rule says that if n = 50 you need at least 5 predictors to have a good model.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
16
In regression,the dependent variable is referred to as the response variable.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
17
There is one residual for each predictor in the regression model.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
18
A predictor whose pairwise correlation with Y is near zero can still have a significant t-value in a multiple regression when other predictors are included.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
19
The random error term in a regression model reflects all factors omitted from the model.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
20
A parsimonious model is one with many weak predictors but a few strong ones.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
21
Non-normality of the residuals from a regression can best be detected by looking at the residual plots against the fitted Y values.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
22
A multiple regression with 60 observations should not have 13 predictors.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
23
A high variance inflation factor (VIF)indicates a significant predictor in the regression.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
24
In a regression model of student grades,we would code the nine categories of business courses taken (ACC,FIN,ECN,MGT,MKT,MIS,ORG,POM,QMM)by including nine binary (0 or 1)predictors in the regression.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
25
A disadvantage of Excel's Data Analysis regression tool is that it expects the independent variables to be in a block of contiguous columns,so you must delete a column if you want to eliminate a predictor from the model.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
26
The ill effects of heteroscedasticity might be mitigated by redefining totals (e.g. ,total number of homicides)as relative values (e.g. ,homicide rate per 100,000 population).
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
27
A widening pattern of residuals as X increases would suggest heteroscedasticity.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
28
Autocorrelation of the residuals may affect the reliability of the t-values for the estimated coefficients of the predictors X1,X2,... ,Xk.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
29
A regression of Y using four independent variables X1,X2,X3,X4 could also have up to four nonlinear terms (X2)and six simple interaction terms (XjXk)if you have enough observations to justify them.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
30
Plotting the residuals against a binary predictor (X = 0,1)reveals nothing about heteroscedasticity.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
31
When autocorrelation is present,the estimates of the coefficients will be unbiased.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
32
Statisticians who work with cross-sectional data generally do not anticipate autocorrelation.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
33
The regression equation Bonus = 2,812 + 27 Experience + 0.046 Salary says that Experience is the most significant predictor of Bonus.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
34
The first differences transformation might be tried if autocorrelation is found in a time-series data set.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
35
A disadvantage of Excel's regression is that it does not give as much accuracy in the estimated regression coefficients as a package like Minitab.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
36
Multicollinearity can be detected from t tests of the predictor variables.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
37
If the residuals in your regression are nonnormal,a larger sample size might help improve the reliability of confidence intervals for Y.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
38
The F statistic and its p-value give a global test of significance for a multiple regression.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
39
When multicollinearity is present,the regression model is of no use for making predictions.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
40
Autocorrelation may be detected by looking at a plot of the residuals against time.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
41
Heteroscedasticity exists when all the errors (residuals)have the same variance.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
42
In a multiple regression with six predictors in a sample of 67 U.S.cities,what would be the critical value for an F-test of overall significance at α = .05?

A)2.29
B)2.25
C)2.37
D)2.18
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
43
A squared predictor is used to test for nonlinearity in the predictor's relationship to Y.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
44
The t test shows the ratio of an estimated coefficient to its standard error.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
45
Which of the following is not true of the standard error of the regression?

A)It is a measure of the accuracy of the prediction.
B)It is based on squared vertical deviations between the actual and predicted values of Y.
C)It would be negative when there is an inverse relationship in the model.
D)It is used in constructing confidence and prediction intervals for Y.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
46
If X2 is a binary predictor in Y = β0 + β1X1 + β2X2,then which statement is most nearly correct?

A)X2 = 1 should represent the most desirable condition.
B)X2 would be a significant predictor if β2 = 423.72.
C)X2 = 0,X2 = 1,X2 = 2 would be appropriate if three categories exist.
D)X2 will shift the estimated equation either by 0 units or by β2 units.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
47
Unlike other predictors,a binary predictor has a t-value that is either 0 or 1.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
48
The unexplained sum of squares measures variation in the dependent variable Y about the

A)mean of the Y values.
B)estimated Y values.
C)mean of the X values.
D)Y-intercept.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
49
Given that the fitted regression is Y = 76.40 − 6.388X1 + 0.870X2,the standard error of b1 is 1.453,and n = 63,at α = .05,we can conclude that X1 is a significant predictor of Y.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
50
Nonnormal residuals lead to biased estimates of the coefficients in a regression model.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
51
A test is conducted in 22 cities to see if giving away free transit system maps will increase the number of bus riders.In a regression analysis,the dependent variable Y is the increase in bus riders (in thousands of persons)from the start of the test until its conclusion.The independent variables are X1 = the number (in thousands)of free maps distributed and a binary variable X2 = 1 if the city has free downtown parking,0 otherwise.The estimated regression equation is Y = 1.32 + 0.0345X1 − 1.45X2.In city 3,the observed Y value is 7.3,X1 = 140,and X2 = 0.The residual for city 3 (in thousands)is

A)6.15
B)1.15
C)4.83
D)1.57
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
52
When predictor variables are strongly related to each other,the ________ of the regression estimates is questionable.

A)logic
B)fit
C)parsimony
D)stability
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
53
Nonnormality of residuals is not usually considered a major problem unless there are outliers.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
54
Multicollinearity refers to relationships among the independent variables.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
55
A large VIF (e.g. ,10 or more)would indicate multicollinearity.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
56
In a multiple regression with five predictors in a sample of 56 U.S.cities,what would be the critical value for an F-test of overall significance at α = .05?

A)2.45
B)2.37
C)2.40
D)2.56
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
57
In a multiple regression with five predictors in a sample of 56 U.S.cities,we would use F5,50 in a test of overall significance.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
58
A multiple regression analysis with two independent variables yielded the following results in the ANOVA table: SS(Total)= 798,SS(Regression)= 738,SS(Error)= 60.The multiple correlation coefficient is

A)0.2742
B)0.0752
C)0.9248
D)0.9617
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
59
In the fitted regression Y = 12 + 3X1 − 5X2 + 27X3 + 2X4 the most significant predictor is X3.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
60
A fitted multiple regression equation is Y = 12 + 3X1 - 5X2 + 7X3 + 2X4.When X1 increases 2 units and X2 increases 2 units as well,while X3 and X4 remain unchanged,what change would you expect in your estimate of Y?

A)Decrease by 2
B)Decrease by 4
C)Increase by 2
D)No change in Y
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
61
A useful guideline in determining the extent of collinearity in a multiple regression model is

A)Sturges' Rule.
B)Klein's Rule.
C)Occam's Rule.
D)Pearson's Rule.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
62
Part of a regression output is provided below.Some of the information has been omitted.The approximate value of F is <strong>Part of a regression output is provided below.Some of the information has been omitted.The approximate value of F is   The approximate value of F is</strong> A)1605.7 B)0.9134 C)89.66 D)impossible to calculate with the given information. The approximate value of F is

A)1605.7
B)0.9134
C)89.66
D)impossible to calculate with the given information.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
63
A realtor is trying to predict the selling price of houses in Greenville (in thousands of dollars)as a function of Size (measured in thousands of square feet)and whether or not there is a fireplace (FP is 0 if there is no fireplace,1 if there is a fireplace).Part of the regression output is provided below,based on a sample of 20 homes.Some of the information has been omitted. <strong>A realtor is trying to predict the selling price of houses in Greenville (in thousands of dollars)as a function of Size (measured in thousands of square feet)and whether or not there is a fireplace (FP is 0 if there is no fireplace,1 if there is a fireplace).Part of the regression output is provided below,based on a sample of 20 homes.Some of the information has been omitted.   Which statement is supported by the regression output?</strong> A)At α = .05,FP is not a significant predictor in a two-tailed test. B)A fireplace adds around $6,476 to the selling price of the average house. C)A large house with no fireplace will sell for more than a small house with a fireplace. D)FP is a more significant predictor than Size. Which statement is supported by the regression output?

A)At α = .05,FP is not a significant predictor in a two-tailed test.
B)A fireplace adds around $6,476 to the selling price of the average house.
C)A large house with no fireplace will sell for more than a small house with a fireplace.
D)FP is a more significant predictor than Size.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
64
Which is not a standard criterion for assessing a regression model?

A)Logic of causation
B)Overall fit
C)Degree of collinearity
D)Binary predictors
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
65
Refer to the following regression results.The dependent variable is Abort (the number of abortions per 1000 women of childbearing age).The regression was estimated using data for the 50 U.S.states with these predictors: EdSpend = public K − 12 school expenditure per capita,Age = median age of population,Unmar = percent of total births by unmarried women,Infmor = infant mortality rate in deaths per 1000 live births. <strong>Refer to the following regression results.The dependent variable is Abort (the number of abortions per 1000 women of childbearing age).The regression was estimated using data for the 50 U.S.states with these predictors: EdSpend = public K − 12 school expenditure per capita,Age = median age of population,Unmar = percent of total births by unmarried women,Infmor = infant mortality rate in deaths per 1000 live births.   Which statement is not supported by a two-tailed test?</strong> A)Unmar is a significant predictor at α = .01. B)EdSpend is a significant predictor at α = .20. C)Infmor is not a significant predictor at α = .05. D)Age is not a significant predictor at α = .05. Which statement is not supported by a two-tailed test?

A)Unmar is a significant predictor at α = .01.
B)EdSpend is a significant predictor at α = .20.
C)Infmor is not a significant predictor at α = .05.
D)Age is not a significant predictor at α = .05.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
66
Using a sample of 63 observations,a dependent variable Y is regressed against two variables X1 and X2 to obtain the fitted regression equation Y = 76.40 − 6.388X1 + 0.870X2.The standard error of b1 is 3.453 and the standard error of b2 is 0.611.At α = .05,we could

A)conclude that both coefficients differ significantly from zero.
B)reject H0: β1 ≥ 0 and conclude H0: β1 < 0.
C)reject H0: β2 ≤ 0 and conclude H0: β1 > 0.
D)conclude that Evans' Rule has been violated.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
67
Part of a regression output is provided below.Some of the information has been omitted. <strong>Part of a regression output is provided below.Some of the information has been omitted.   The SS (residual)is</strong> A)3177.17 B)301.19 C)17.71 D)impossible to determine. The SS (residual)is

A)3177.17
B)301.19
C)17.71
D)impossible to determine.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
68
A realtor is trying to predict the selling price of houses in Greenville (in thousands of dollars)as a function of Size (measured in thousands of square feet)and whether or not there is a fireplace (FP is 0 if there is no fireplace,1 if there is a fireplace).The regression output is provided below.Some of the information has been omitted. <strong>A realtor is trying to predict the selling price of houses in Greenville (in thousands of dollars)as a function of Size (measured in thousands of square feet)and whether or not there is a fireplace (FP is 0 if there is no fireplace,1 if there is a fireplace).The regression output is provided below.Some of the information has been omitted.   Which of the following conclusions can be made based on the F-test?</strong> A)The p-value on the F-test will be very high. B)At least one of the predictors is useful in explaining Y. C)The model is of no use in predicting the selling prices of houses. D)The estimates were based on a sample of 19 houses. Which of the following conclusions can be made based on the F-test?

A)The p-value on the F-test will be very high.
B)At least one of the predictors is useful in explaining Y.
C)The model is of no use in predicting the selling prices of houses.
D)The estimates were based on a sample of 19 houses.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
69
A realtor is trying to predict the selling price of houses in Greenville (in thousands of dollars)as a function of Size (measured in thousands of square feet)and whether or not there is a fireplace (FP is 0 if there is no fireplace,1 if there is a fireplace).Part of the regression output is provided below,based on a sample of 20 homes.Some of the information has been omitted. <strong>A realtor is trying to predict the selling price of houses in Greenville (in thousands of dollars)as a function of Size (measured in thousands of square feet)and whether or not there is a fireplace (FP is 0 if there is no fireplace,1 if there is a fireplace).Part of the regression output is provided below,based on a sample of 20 homes.Some of the information has been omitted.   The estimated coefficient for Size is approximately</strong> A)9.5 B)13.8 C)122.5 D)1442.6 The estimated coefficient for Size is approximately

A)9.5
B)13.8
C)122.5
D)1442.6
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
70
Which is a characteristic of the variance inflation factor (VIF)?

A)It is insignificant unless the corresponding t statistic is significant.
B)It reveals collinearity rather than multicollinearity.
C)It measures the degree of significance of each predictor.
D)It indicates the predictor's degree of multicollinearity.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
71
If the standard error is 12,the width of a quick prediction interval for Y is

A)±15
B)±24
C)±19
D)impossible to determine without an F table.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
72
Which is not a name often given to an independent variable that takes on just two values (0 or 1)according to whether or not a given characteristic is absent or present?

A)Absent variable
B)Binary variable
C)Dummy variable
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
73
A realtor is trying to predict the selling price of houses in Greenville (in thousands of dollars)as a function of Size (measured in thousands of square feet)and whether or not there is a fireplace (FP is 0 if there is no fireplace,1 if there is a fireplace).The regression output is provided below.Some of the information has been omitted. <strong>A realtor is trying to predict the selling price of houses in Greenville (in thousands of dollars)as a function of Size (measured in thousands of square feet)and whether or not there is a fireplace (FP is 0 if there is no fireplace,1 if there is a fireplace).The regression output is provided below.Some of the information has been omitted.   How many predictors (independent variables)were used in the regression?</strong> A)20 B)18 C)3 D)2 How many predictors (independent variables)were used in the regression?

A)20
B)18
C)3
D)2
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
74
Refer to the following correlation matrix that was part of a regression analysis.The dependent variable was Abort (the number of abortions per 1000 women of childbearing age).The regression was estimated using data for the 50 U.S.states with these predictors: EdSpend = public K−12 school expenditure per capita,Age = median age of population,Unmar = percent of total births by unmarried women,Infmor = infant mortality rate in deaths per 1000 live births. Correlation Matrix
<strong>Refer to the following correlation matrix that was part of a regression analysis.The dependent variable was Abort (the number of abortions per 1000 women of childbearing age).The regression was estimated using data for the 50 U.S.states with these predictors: EdSpend = public K−12 school expenditure per capita,Age = median age of population,Unmar = percent of total births by unmarried women,Infmor = infant mortality rate in deaths per 1000 live births. Correlation Matrix   Using a two-tailed correlation test,which statement is not accurate?</strong> A)Age and Infmor are not significantly correlated at α = .05. B)Abort and Unmar are significantly correlated at α = .05. C)Unmar and Infmor are significantly correlated at α = .05. D)The first column of the table shows evidence of multicollinearity. Using a two-tailed correlation test,which statement is not accurate?

A)Age and Infmor are not significantly correlated at α = .05.
B)Abort and Unmar are significantly correlated at α = .05.
C)Unmar and Infmor are significantly correlated at α = .05.
D)The first column of the table shows evidence of multicollinearity.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
75
Refer to this ANOVA table from a regression: <strong>Refer to this ANOVA table from a regression:   For this regression,the R2 is</strong> A)0.3995 B)0.6005 C)0.6654 D)0.8822 For this regression,the R2 is

A)0.3995
B)0.6005
C)0.6654
D)0.8822
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
76
In a multiple regression,all of the following are true regarding residuals except

A)their sum always equals zero.
B)they are the differences between observed and predicted values of the response variable.
C)they may be used to detect multicollinearity.
D)they may be used to detect heteroscedasticity.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
77
The residual plot below suggests which violation(s)of regression assumptions? <strong>The residual plot below suggests which violation(s)of regression assumptions?  </strong> A)Autocorrelation B)Heteroscedasticity C)Nonnormality D)Multicollinearity

A)Autocorrelation
B)Heteroscedasticity
C)Nonnormality
D)Multicollinearity
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
78
Refer to this ANOVA table from a regression: <strong>Refer to this ANOVA table from a regression:   Which statement is not accurate?</strong> A)The F-test is significant at α = .05. B)There were 50 observations. C)There were 5 predictors. D)There would be 50 residuals. Which statement is not accurate?

A)The F-test is significant at α = .05.
B)There were 50 observations.
C)There were 5 predictors.
D)There would be 50 residuals.
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
79
A log transformation might be appropriate to alleviate which problem(s)?

A)Heteroscedastic residuals
B)Multicollinearity
C)Autocorrelated residuals
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
80
A fitted multiple regression equation is Y = 28 + 5X1 - 4X2 + 7X3 + 2X4.When X1 increases 2 units and X2 increases 2 units as well,while X3 and X4 remain unchanged,what change would you expect in your estimate of Y?

A)Increase by 2
B)Decrease by 4
C)Increase by 4
D)No change in Y
Unlock Deck
Unlock for access to all 133 flashcards in this deck.
Unlock Deck
k this deck
locked card icon
Unlock Deck
Unlock for access to all 133 flashcards in this deck.