| Term 
 | Definition 
 
        | Analysis of the equational relationship between X and Y. Regression SS/Total SS |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | has a t distribution which standardizes its value to see if it is significantly different from 0. When the p-value of the slope is greater than the level of significance, one should assume the correlation coefficient will e close to 0. |  | 
        |  | 
        
        | Term 
 | Definition 
 | 
        |  | 
        
        | Term 
 | Definition 
 
        | R, indicates nature and strength of the linear relationship variables |  | 
        |  | 
        
        | Term 
 
        | Coefficient of determination |  | Definition 
 
        | R2, the ration of explained variation in Y to the total variation. |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | minimizes the squared vertical distances between the points and the regression line resulting in the line of best fit. |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | will be smaller for better predictive equation. if the override value of y varies widely about the regression line, the standard error of the slope will be large. Square root of the MS residual
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | is the study of the nature and degree of the relationship between variables. A correlation coefficient of +1 or -1 means x and y are perfectly, linearly related. An r value of 0 indicates absolutely no relationship |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | is using Xs beyond the range of the given Xs to predict Y. THis can cause large errors in prediction. Relationship of slope to the correlation coefficient. signs are the same. |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | when Xs are highly correlated-this gives redundant information. |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | non-constant variance in the residuals |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | constant variance in the residuals |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | atypical values in a data set (anomalies) |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | CURVILINEAR patterns or LOGARITHMIC relationships |  | 
        |  | 
        
        | Term 
 
        | Multiple regression analysis includes |  | Definition 
 
        | one dependent variable and more than one independent |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | tries all combinations of variables and produces the best predictors in order of their predictive power. |  | 
        |  | 
        
        | Term 
 
        | Artificially inflated R-squared occurs when.. |  | Definition 
 
        | there are too many predictors and not enough samples. |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | you should have at least 10 times the number  of observations as predictor variables. |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | should produce a nearly straight line without outliers. |  | 
        |  | 
        
        | Term 
 
        | T distribution vs. F Distribution |  | Definition 
 
        | T is usedto test the individual coefficients where F tests the overall or "global" model. |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | are the differences in the observed value of Y at a given X and the predicted value. Absolute values between 2 and 3 are usually just suspicious while those over the absolute value of 3 are severe. |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | should fall within +/-3 in order to be considered normal values. |  | 
        |  | 
        
        | Term 
 
        | Transform Y and/or X when... |  | Definition 
 
        | any of the assumptions are violated |  | 
        |  | 
        
        | Term 
 
        | In simple linear regression the use of regression lines is to ... |  | Definition 
 
        | predict the average value of y that can be expected to occur at a given value of x. |  | 
        |  | 
        
        | Term 
 
        | A high correlation between x and y.. |  | Definition 
 
        | does NOT prove that x causes y |  | 
        |  | 
        
        | Term 
 
        | dependent variable plotted.. independent variable plotted...
 |  | Definition 
 
        | vertical axis horizontal axis
 |  | 
        |  | 
        
        | Term 
 
        | If the confidence interval on the slope contains 0... |  | Definition 
 
        | there is no significant relationship between x and y |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | you CANNOT assume that the slope is also positive. |  | 
        |  | 
        
        | Term 
 
        | The slope of the regression line represents... |  | Definition 
 
        | the amount of change that is expected to take place in y when x increases by one unit. |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | using values beyond the range of the given Xs to predict Y |  | 
        |  | 
        
        | Term 
 
        | if null hypothesis is rejected... |  | Definition 
 
        | there is a relationship between x and y. |  | 
        |  | 
        
        | Term 
 
        | if no correlation between two variables... |  | Definition 
 
        | the regression line will be horizontal |  | 
        |  | 
        
        | Term 
 
        | A large value for the slope does not necessarily imply a large value for the... |  | Definition 
 | 
        |  | 
        
        | Term 
 
        | Test the individual coefficients to see which Xs are good predictors. |  | Definition 
 
        | only test these if the overall model had at least one good predictor. |  | 
        |  | 
        
        | Term 
 
        | A we add more predictors... |  | Definition 
 | 
        |  | 
        
        | Term 
 
        | When you re-run a model after taking out the poor predictor variables... |  | Definition 
 
        | you have reduced the model |  | 
        |  | 
        
        | Term 
 
        | When choosing between two models, both with good predictors for y... |  | Definition 
 
        | choose the one with the smallest standard error. |  | 
        |  | 
        
        | Term 
 
        | Check the correlation matrix to make sure the X variables... |  | Definition 
 
        | are not correlated with each other |  | 
        |  | 
        
        | Term 
 
        | check the signs of the coefficients... |  | Definition 
 
        | to make sure they are logical. |  | 
        |  | 
        
        | Term 
 
        | Never say x causes y unless it was... |  | Definition 
 | 
        |  | 
        
        | Term 
 
        | Qualitative variables in multiple regression are called.. |  | Definition 
 
        | dummy variables. do not interpret their coefficients |  | 
        |  | 
        
        | Term 
 
        | If there is a curve in the scatter diagram for any x,y chart or the residuals use... |  | Definition 
 
        | a quadratic equation... use x and x^2 |  | 
        |  | 
        
        | Term 
 
        | If you think two x variables may work together at different levels to affect y... |  | Definition 
 
        | then try an interaction term. |  | 
        |  | 
        
        | Term 
 
        | Only interpret the coefficients of... |  | Definition 
 
        | good predictors and first order terms. First order terms are linear terms. |  | 
        |  | 
        
        | Term 
 
        | Squared Xs and interacted Xs are called... |  | Definition 
 | 
        |  | 
        
        | Term 
 
        | The us of regression lines is to.. |  | Definition 
 
        | predict the average value of y that can be expected to occur at a given value of x. |  | 
        |  | 
        
        | Term 
 
        | The study of the equational relationship between variables is called... |  | Definition 
 | 
        |  |