Question 1

One method of dealing with heteroscedasticity is to try a logarithmic transformation of the data.

Accepted Answer

Logarithmic transformation can help to reduce the impact of heteroscedasticity by stabilizing the variance in the data.

Question 2

One of the potential characteristics of an outlier is that the value of the dependent variable is much larger or smaller than predicted by the regression line.

Accepted Answer

This is a correct statement about outliers. Outliers have extreme values that are different from the majority of the data points, and they can affect the slope and intercept of the regression line. If an outlier has a value of the dependent variable that is significantly higher or lower than what would be expected based on the independent variable(s), it can distort the fit of the line and reduce the accuracy of the predictions.

Question 3

In order to estimate with 90% confidence a particular value of Y for a given value of X in a simple linear regression problem,a random sample of 20 observations is taken.The appropriate t-value that would be used is 1.734.

Accepted Answer

This is true based on the given information. In order to construct a 90% confidence interval for a single value of Y in a simple linear regression problem with a sample size of 20, the appropriate t-value to use is 1.734.

Question 4

In a multiple regression problem involving 30 observations and four explanatory variables,SST = 800 and SSE = 240.The value of the F-statistic for testing the significance of this model is 14.583.

Accepted Answer

The F-statistic for testing the significance of the model is calculated as F = (SST - SSE) / p / (SSE / (n - p - 1)), where p is the number of explanatory variables and n is the number of observations. Substituting the given values, we get F = (800 - 240) / 4 / (240 / 25) = 14.583. This matches the given value, so the answer is A.

Question 5

In time series data,errors are often not probabilistically independent.

Accepted Answer

In time series data, errors can be correlated over time, meaning that the error term at one point in time can be influenced by the error terms at other points in time. This violates the assumption of independent errors, which is often made in classical statistical models.

Question 6

In multiple regression with k explanatory variables,the t-tests of the individual coefficients allows us to determine whether

B _ { i } \neq 0

(for i = 1,2,…. ,k),which tells us whether a linear relationship exists between

x

and Y.

Accepted Answer

The t-tests for individual coefficients in multiple regression assess if each coefficient is significantly different from zero, indicating a potential linear relationship between the predictor and the response variable.

Question 7

If exact multicollinearlity exists,that means that there is redundancy in the data.

Accepted Answer

Exact multicollinearity means that there is a perfect linear relationship between two or more independent variables, which leads to redundancy in the data. This can cause problems in statistical analysis, and it is important to detect and handle multicollinearity in data before drawing conclusions.

Question 8

Multicollinearity is a situation in which two or more of the explanatory variables are highly correlated with each other.

Accepted Answer

Multicollinearity occurs when there is a high correlation between two or more explanatory variables, which can cause issues in regression analysis such as unstable coefficients and inaccurate predictions.

Question 9

Suppose that one equation has 3 explanatory variables and an F-ratio of 49.Another equation has 5 explanatory variables and an F-ratio of 38.The first equation will always be considered a better model.

Accepted Answer

The F-ratio only indicates the overall significance of the model, it does not necessarily determine which model is better. It is important to consider other factors such as the adjusted R-squared, AIC/BIC, and the significance of individual coefficients to determine the best model. Therefore, we cannot conclude that the first equation with 3 explanatory variables and an F-ratio of 49 is always a better model than the second equation with 5 explanatory variables and an F-ratio of 38.

Question 10

In simple linear regression,if the error variable

\varepsilon

is normally distributed,the test statistic for testing

H _ { 0 } : B _ { 1 } = 0

is t-distributed with n - 2 degrees of freedom.

Accepted Answer

In simple linear regression, the test statistic for the slope $B_1$ follows a t-distribution with $n-2$ degrees of freedom when the error term $\varepsilon$ is normally distributed.

Question 11

In order to test the significance of a multiple regression model involving 4 explanatory variables and 40 observations,the numerator and denominator degrees of freedom for the critical value of F are 4 and 35,respectively.

Accepted Answer

The answer of In order to test the significance of...

Question 12

In regression analysis,the total variation in the dependent variable Y,measured by

\sum \left( y _ { i } - \bar { y } \right) ^ { 2 }

and referred to as SST,can be decomposed into two parts: the explained variation,measured by SSR,and the unexplained variation,measured by SSE.

Accepted Answer

The answer of In regression analysis,the total variation in the...

Question 13

In multiple regression,the problem of multicollinearity affects the t-tests of the individual coefficients as well as the F-test in the analysis of variance for regression,since the F-test combines these t-tests into a single test.

Accepted Answer

The answer of In multiple regression,the problem of multicollinearity affects...

Question 14

In a multiple regression analysis involving 4 explanatory variables and 40 data points,the degrees of freedom associated with the sum of squared errors,SSE,is 35.

Accepted Answer

The answer of In a multiple regression analysis involving 4...

Question 15

In multiple regression,if there is multicollinearity between independent variables,the t-tests of the individual coefficients may indicate that some variables are not linearly related to the dependent variable,when in fact they are.

Accepted Answer

The answer of In multiple regression,if there is multicollinearity between...

Question 16

In a simple linear regression problem,if the standard error of estimate $S _ { e }$ &#10;= 15 and n = 8,then the sum of squares for error,SSE,is 1,350.

Accepted Answer

The answer of In a simple linear regression problem,if the...

Question 17

A multiple regression model involves 40 observations and 4 explanatory variables produces SST = 1000 and SSR = 804.The value of MSE is 5.6.

Accepted Answer

The answer of A multiple regression model involves 40 observations...

Question 18

Multiple regression represents an improvement over simple regression because it allows any number of response variables to be included in the analysis.

Accepted Answer

The answer of Multiple regression represents an improvement over simple...

Question 19

Heteroscedasticity means that the variability of Y values is larger for some X values&#10;than for others.

Accepted Answer

The answer of Heteroscedasticity means that the variability of Y...

Question 20

When there is a group of explanatory variables that are in some sense logically related,all of them must be included in the regression equation.

Accepted Answer

The answer of When there is a group of explanatory...

Question 21

In a simple linear regression model,testing whether the slope

\beta _ { 1 }

of the population regression line could be zero is the same as testing whether or not the linear relationship between the response variable Y and the explanatory variable X is significant.

Accepted Answer

The answer of In a simple linear regression model,testing whether...

Question 22

One method of diagnosing heteroscedasticity is to plot the residuals against the predicted values of Y,then look for a change in the spread of the plotted values.

Accepted Answer

The answer of One method of diagnosing heteroscedasticity is to...

Question 23

In regression analysis,homoscedasticity refers to constant error variance.

Accepted Answer

The answer of In regression analysis,homoscedasticity refers to constant error...

Question 24

In testing the overall fit of a multiple regression model in which there are three explanatory variables,the null hypothesis is $H _ { 0 } : B _ { 1 } = B _ { 2 } = B _ { 3 }$ &#10;.

Accepted Answer

The answer of In testing the overall fit of a...

Question 25

The residuals are observations of the error variable $\varepsilon$ &#10;.Consequently,the minimized sum of squared deviations is called the sum of squared error,labeled SSE.

Accepted Answer

The answer of The residuals are observations of the error...

Question 26

The Durbin-Watson statistic can be used to measure of autocorrelation.

Accepted Answer

The answer of The Durbin-Watson statistic can be used to...

Question 27

The value of the sum of squares due to regression,SSR,can never be larger than the value of the sum of squares total,SST.

Accepted Answer

The answer of The value of the sum of squares...

Question 28

Homoscedasticity means that the variability of Y values is the same for all X values.

Accepted Answer

The answer of Homoscedasticity means that the variability of Y...

Question 29

A confidence interval constructed around a point prediction from a regression model is called a prediction interval,because the actual point being estimated is not a population parameter

Accepted Answer

The answer of A confidence interval constructed around a point...

Question 30

Which of the following would be considered a definition of an outlier?&#10;A) An extreme value for one or more variables&#10;B) A value whose residual is abnormally large in magnitude&#10;C) Values for individual explanatory variables that fall outside the general pattern of the other observations&#10;D) All of these options

Accepted Answer

The answer of Which of the following would be considered...

Question 31

When determining whether to include or exclude a variable in regression analysis,if the p-value associated with the variable's t-value is above some accepted significance value,such as 0.05,then the variable:

A) is a candidate for inclusion
B) is a candidate for exclusion
C) is redundant
D) not fit the guidelines of parsimony

Accepted Answer

The answer of When determining whether to include or exclude...

Question 32

The assumptions of regression are: 1)there is a population regression line,2)the dependent variable is normally distributed,3)the standard deviation of the response variable remains constant as the explanatory variables increase,and 4)the errors are probabilistically independent.

Accepted Answer

The answer of The assumptions of regression are: 1)there is...

Question 33

Which of the following is not one of the guidelines for including/excluding variables in a regression equation?&#10;A) Look at t-value and associated p-value&#10;B) Check whether t-value is less than or greater than 1.0&#10;C) Variables are logically related to one another&#10;D) Use economic or physical theory to make decision&#10;E) All of these options are guidelines

Accepted Answer

The answer of Which of the following is not one...

Question 34

A backward procedure is a type of equation building procedure that begins with all potential explanatory variables in the regression equation and deletes them two at a time until further deletion would reduce the percentage of variation explained to a value less than 0.50.

Accepted Answer

The answer of A backward procedure is a type of...

Question 35

A scatterplot that exhibits a &#34;fan&#34; shape (the variation of Y increases as X increases)is an example of:&#10;A) homoscedasticity&#10;B) heteroscedasticity&#10;C) autocorrelation&#10;D) multicollinearity

Accepted Answer

The answer of A scatterplot that exhibits a &#34;fan&#34; shape...

Question 36

Suppose you run a regression of a person's height on his/her right and left foot sizes,and you suspect that there may be multicollinearity between the foot sizes.What types of problems might you see if your suspicions are true?

A) "Wrong" values for the coefficients for the left and right foot size
B) Large p-values for the coefficients for the left and right foot size
C) Small t-values for the coefficients for the left and right foot size
D) All of these options

Accepted Answer

The answer of Suppose you run a regression of a...

Question 37

In multiple regressions,a large value of the test statistic F indicates that most of the variation in Y is unexplained by the regression equation and that the model is useless.A small value of F indicates that most of the variation in Y is explained by the regression equation and that the model is useful.

Accepted Answer

The answer of In multiple regressions,a large value of the...

Question 38

The can be used to test for autocorrelation.&#10;A) regression coefficient&#10;B) correlation coefficient&#10;C) Durbin-Watson statistic&#10;D) F-test or t-test

Accepted Answer

The answer of The can be used to test for...

Question 39

In regression analysis,the unexplained part of the total variation in the response variable Y is referred to as sum of squares due to regression,SSR.

Accepted Answer

The answer of In regression analysis,the unexplained part of the...

Question 40

A forward procedure is a type of equation building procedure that begins with only one explanatory variable in the regression equation and successively adds one variable at a time until no remaining variables make a significant contribution.

Accepted Answer

The answer of A forward procedure is a type of...

Question 41

Another term for constant error variance is:&#10;A) homoscedasticity&#10;B) heteroscedasticity&#10;C) autocorrelation&#10;D) multicollinearity

Accepted Answer

The answer of Another term for constant error variance is:&#10;A)...

Question 42

The t-value for testing $H _ { 0 } : B _ { i } = 0$
Is calculated using which of the following equations:

A) n - k - 1
B)

\sum \left( X _ { i } / Y _ { i } \right)

C)

B _ { j } / s _ { i }

D)

b _ { i } / s _ { b _ { i } }

Accepted Answer

The answer of The t-value for testing \(H _ {...

Question 43

The appropriate hypothesis test for a regression coefficient is:&#10;A)&#10;$H _ { 0 } : B \neq 0 , H _ { \alpha } : B = 0$&#10;B)&#10;$H _ { 0 } : B = 0 , H _ { \alpha } : B \neq 0$&#10;C)&#10;$H _ { 0 } : B = 1 , H _ { \alpha } : B \neq 1$&#10;D) None of these options

Accepted Answer

The answer of The appropriate hypothesis test for a regression...

Question 44

The objective typically used in the tree types of equation-building procedures are to: A) find the equation with a small s_e B) find the equation with a large R² C) find the equation with a small s_e and a large R² D) find the equation with the largest F-statistic

Accepted Answer

The answer of The objective typically used in the tree...

Question 45

The value k in the number of degrees of freedom,n-k-1,for the sampling distribution of the regression coefficients represents:&#10;A) the sample size&#10;B) the population size&#10;C) the number of coefficients in the regression equation,including the constant&#10;D) the number of independent variables included in the equation

Accepted Answer

The answer of The value k in the number of...

Question 46

In the standardized value $\left( b _ { i } - B _ { i } \right) / s _ { b _ { j } }$
,the symbol $s _ { b _ { 1 } }$
Represents the:

A) mean of

b _ { j }

B) variance of

b _ { j }

C) standard error of

b _ { j }

D) degrees of freedom of

b _ { j }

Accepted Answer

The answer of In the standardized value \(\left( b _...

Question 47

The appropriate hypothesis test for an ANOVA test is:&#10;A)&#10;$H _ { 0 } : \text { all } B \neq 0 , H _ { \alpha } : \text { at least one } B = 0$&#10;B)&#10;$H _ { 0 } \text { : all } B = 0 , H _ { \alpha } \text { : at least one } B \neq 0$&#10;C)&#10;$H _ { 0 } \text { : at least on } B \neq 0 , H _ { a } : \text { all } B = 0$&#10;D)&#10;$H _ { 0 } \text { : at least one } B = 0 , H _ { \alpha } \text { : all } B \neq 0$

Accepted Answer

The answer of The appropriate hypothesis test for an ANOVA...

Question 48

Suppose you forecast the values of all of the independent variables and insert them into a multiple regression equation and obtain a point prediction for the dependent variable.You could then use the standard error of the estimate to obtain an approximate

A) confidence interval
B) prediction interval
C) hypothesis test
D) independence test

Accepted Answer

The answer of Suppose you forecast the values of all...

Question 49

Which of the following is the relevant sampling distribution for regression coefficients?&#10;A) Normal distribution&#10;B) t-distribution with n-1 degrees of freedom&#10;C) t-distribution with n-1-k degrees of freedom&#10;D) F-distribution with n-1-k degrees of freedom

Accepted Answer

The answer of Which of the following is the relevant...

Question 50

The ANOVA table splits the total variation into two parts.They are the&#10;A) acceptable and unacceptable variation&#10;B) adequate and inadequate variation&#10;C) resolved and unresolved variation&#10;D) explained and unexplained variation

Accepted Answer

The answer of The ANOVA table splits the total variation...

Question 51

Which of the following is not one of the assumptions of regression?&#10;A) There is a population regression line&#10;B) The response variable is normally distributed&#10;C) The standard deviation of the response variable increases as the explanatory variables increase&#10;D) The errors are probabilistically independent

Accepted Answer

The answer of Which of the following is not one...

Question 52

A point that &#34;tilts&#34; the regression line toward it,is referred to as a(n):&#10;A) magnetic point&#10;B) influential point&#10;C) extreme point&#10;D) explanatory point

Accepted Answer

The answer of A point that &#34;tilts&#34; the regression line...

Question 53

Determining which variables to include in regression analysis by estimating a series of regression equations by successively adding or deleting variables according to prescribed rules is referred to as:

A) elimination regression
B) forward regression
C) backward regression
D) stepwise regression

Accepted Answer

The answer of Determining which variables to include in regression...

Question 54

The test statistic in an ANOVA analysis is:&#10;A) the t-statistic&#10;B) the z-statistic&#10;C) the F-statistic&#10;D) the Chi-square statistic

Accepted Answer

The answer of The test statistic in an ANOVA analysis...

Question 55

In regression analysis,extrapolation is performed when you:&#10;A) attempt to predict beyond the limits of the sample&#10;B) have to estimate some of the explanatory variable values&#10;C) have to use a lag variable as an explanatory variable in the model&#10;D) don't have observations for every period in the sample

Accepted Answer

The answer of In regression analysis,extrapolation is performed when you:&#10;A)...

Question 56

The term autocorrelation refers to:&#10;A) analyzed data refers to itself&#10;B) sample is related too closely to the population&#10;C) data are in a loop (values repeat themselves)&#10;D) time series variables are usually related to their own past values

Accepted Answer

The answer of The term autocorrelation refers to:&#10;A) analyzed data...

Question 57

In regression analysis,multicollinearity refers to:&#10;A) the response variables being highly correlated&#10;B) the explanatory variables being highly correlated&#10;C) the response variable(s)and the explanatory variable(s)are highly correlated with one another&#10;D) the response variables are highly correlated over time.

Accepted Answer

The answer of In regression analysis,multicollinearity refers to:&#10;A) the response...

Question 58

Time series data often exhibits which of the following characteristics?&#10;A) homoscedasticity&#10;B) heteroscedasticity&#10;C) autocorrelation&#10;D) multicollinearity

Accepted Answer

The answer of Time series data often exhibits which of...

Question 59

Many statistical packages have three types of equation-building procedures.They are:&#10;A) forward,linear and non-linear&#10;B) forward,backward and stepwise&#10;C) simple,complex and stepwise&#10;D) inclusion,exclusion and linear

Accepted Answer

The answer of Many statistical packages have three types of...

Question 60

The error term represents the vertical distance from any point to the&#10;A) estimated regression line&#10;B) population regression line&#10;C) value of the Y's&#10;D) mean value of the X's

Accepted Answer

The answer of The error term represents the vertical distance...

Question 61

A researcher can check whether the errors are normally distributed by using:&#10;A) a t-test or an F-test&#10;B) the Durbin-Watson statistic&#10;C) a frequency distribution or the value of the regression coefficient&#10;D) a histogram or a Q-Q plot

Accepted Answer

The answer of A researcher can check whether the errors...

Question 62

In regression analysis,the ANOVA table analyzes:&#10;A) the variation of the response variable Y&#10;B) the variation of the explanatory variable X&#10;C) the total variation of all variables&#10;D) All of these options

Accepted Answer

The answer of In regression analysis,the ANOVA table analyzes:&#10;A) the...

Question 63

If you can determine that the outlier is not really a member of the relevant population,then it is appropriate and probably best to:&#10;A) average it&#10;B) reduce it&#10;C) delete it&#10;D) leave it

Accepted Answer

The answer of If you can determine that the outlier...

Question 64

Which of the following definitions best describes parsimony?&#10;A) Explaining the most with the least&#10;B) Explaining the least with the most&#10;C) Being able to explain all of the change in the response variable&#10;D) Being able to predict the value of the response variable far into the future

Accepted Answer

The answer of Which of the following definitions best describes...

Deck 11: Regression Analysis: Statistical Inference