 Simple Linear Regression
 is the analysis of the linear equation that relates one independent variable, X, to a dependent variable, Y.
 Slope, β1
 is tested with a t statistic: the estimate divided by its standard error, which follows a t distribution and shows whether the slope is significantly different from 0.
 Y-Intercept
 β0, is the predicted Y when X = 0; it is often left uninterpreted, particularly when X = 0 lies outside the observed data.
 Correlation Coefficient
 r, indicates the direction and strength of the LINEAR relationship between X and Y.
 Coefficient of Determination
 R², is the ratio of explained variation in Y to the total variation in Y.
 Standard Error
 of the estimate measures the typical size of the residuals; it will be smaller for better predictive equations.
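
The quantities defined so far (slope, intercept, r, R², standard error, and the t statistic for the slope) can be computed by hand from a small data set. The following is a minimal sketch in plain Python; the function name `simple_linear_regression` and the five data points are made up for illustration, not taken from any particular textbook or package.

```python
import math

def simple_linear_regression(x, y):
    """Least-squares fit of y = b0 + b1*x (hypothetical helper for illustration)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of squares and cross-products around the means
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)   # total variation in Y
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))

    b1 = sxy / sxx                  # slope
    b0 = mean_y - b1 * mean_x       # y-intercept
    r = sxy / math.sqrt(sxx * syy)  # correlation coefficient

    # Unexplained variation: squared residuals around the fitted line
    sse = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))
    r_squared = 1 - sse / syy       # explained / total variation

    std_error = math.sqrt(sse / (n - 2))            # standard error of the estimate
    t_slope = b1 / (std_error / math.sqrt(sxx))     # t statistic for H0: slope = 0

    return {"slope": b1, "intercept": b0, "r": r,
            "r_squared": r_squared, "std_error": std_error, "t_slope": t_slope}

# Made-up example data
result = simple_linear_regression([1, 2, 3, 4, 5], [2, 4, 5, 4, 5])
for name, value in result.items():
    print(f"{name}: {value:.4f}")
```

The t statistic would be compared against a t distribution with n - 2 degrees of freedom. In simple regression, r squared equals R², and r carries the same sign as the slope.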
 Correlation analysis
 is the study of the nature and degree of the relationship between variables.
 Extrapolation
 is using Xs beyond the range of the given Xs to predict Y. This can cause large errors in prediction.
 Relationship of slope to the correlation coefficient
 the slope and the correlation coefficient always have the same sign.
 Multicollinearity
 occurs when the Xs are highly correlated with one another, so they give redundant information.
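
One common way to check for multicollinearity is to look at the correlation between predictors and the variance inflation factor (VIF). The sketch below covers only the two-predictor case, where the VIF reduces to 1 / (1 - r²) with r the correlation between the two Xs. The helper names and data are hypothetical, and the "VIF above 10" cutoff in the comment is a widely used rule of thumb rather than a fixed standard.

```python
import math

def pearson_r(a, b):
    """Pearson correlation between two equal-length lists."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    saa = sum((ai - mean_a) ** 2 for ai in a)
    sbb = sum((bi - mean_b) ** 2 for bi in b)
    sab = sum((ai - mean_a) * (bi - mean_b) for ai, bi in zip(a, b))
    return sab / math.sqrt(saa * sbb)

def vif_two_predictors(x1, x2):
    """VIF when the model has exactly two predictors: 1 / (1 - r^2)."""
    r = pearson_r(x1, x2)
    return 1.0 / (1.0 - r ** 2)

# Made-up predictors: x2 is nearly a multiple of x1, so it adds little new information
x1 = [1, 2, 3, 4, 5]
x2 = [2.1, 3.9, 6.2, 7.8, 10.1]

print(f"r between predictors: {pearson_r(x1, x2):.4f}")
print(f"VIF: {vif_two_predictors(x1, x2):.1f}")  # rule of thumb: above 10 is a concern
```

With more than two predictors, each X would instead be regressed on all the others and the R² from that regression used in place of r².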
 Heteroscedasticity
 is non-constant variance in the residuals.
 Outlier
 atypical values in a data set.
 Non-linear relationships
 include curvilinear patterns or logarithmic relationships.
 Multiple regression analysis
 includes one dependent variable and more than one independent variable.
 Stepwise
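 adds or removes predictors one at a time according to their statistical contribution, rather than trying every combination of variables, and reports the retained predictors in the order they entered the model.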
 Artificially Inflated R-squared
 occurs when there are too many predictors for the number of observations; R² never decreases when a predictor is added, even a useless one.
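
A standard correction for this inflation is adjusted R², which penalizes each added predictor: adjusted R² = 1 - (1 - R²)(n - 1)/(n - k - 1), where n is the number of observations and k the number of predictors. The sketch below uses a hypothetical function name and made-up values to show how the same raw R² looks very different once the predictor count is taken into account.

```python
def adjusted_r_squared(r_squared, n, k):
    """Adjusted R^2 for n observations and k predictors (requires n > k + 1)."""
    if n <= k + 1:
        raise ValueError("need more observations than predictors plus one")
    return 1 - (1 - r_squared) * (n - 1) / (n - k - 1)

# Made-up scenario: the same raw R^2 of 0.90 from 12 observations
few = adjusted_r_squared(0.90, n=12, k=2)    # 2 predictors
many = adjusted_r_squared(0.90, n=12, k=8)   # 8 predictors

print(f"2 predictors: adjusted R^2 = {few:.4f}")
print(f"8 predictors: adjusted R^2 = {many:.4f}")
```

A large gap between R² and adjusted R² is one hint that the model has too many predictors for the data available.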
 Normal plots
 of the residuals should produce a nearly straight line without outliers.
 T distribution vs. F distribution
 t is used to test the individual coefficients, whereas F tests the overall model.
 Residuals
 are the differences between the observed value of Y at a given X and the predicted value.
 Studentized Residuals
 should fall within ±3 to be considered typical values; anything outside that range suggests an outlier.
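
Residuals and studentized residuals can be computed directly for a simple regression. The sketch below uses the internally studentized form, e_i / (s * sqrt(1 - h_i)), where h_i is the leverage of observation i and s is the standard error of the estimate; statistical packages may report a slightly different (externally studentized) version. The function name and data are made up for illustration.

```python
import math

def studentized_residuals(x, y):
    """Return (residuals, internally studentized residuals) for y = b0 + b1*x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    b1 = sxy / sxx
    b0 = mean_y - b1 * mean_x

    # Residual = observed Y minus predicted Y at the same X
    residuals = [yi - (b0 + b1 * xi) for xi, yi in zip(x, y)]
    s = math.sqrt(sum(e ** 2 for e in residuals) / (n - 2))

    # Leverage: how far each X sits from the mean of the Xs
    leverage = [1 / n + (xi - mean_x) ** 2 / sxx for xi in x]
    studentized = [e / (s * math.sqrt(1 - h)) for e, h in zip(residuals, leverage)]
    return residuals, studentized

# Made-up example data
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
residuals, studentized = studentized_residuals(x, y)

for xi, e, t in zip(x, residuals, studentized):
    flag = "possible outlier" if abs(t) > 3 else "ok"
    print(f"x={xi}: residual={e:+.2f}, studentized={t:+.3f} ({flag})")
```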
