Term
|
Definition
| How we use the theory and data from economics, business, and social sciences, along with statistics to answer "how much" questions. |
|
|
Term
|
Definition
| Science of collecting, organizing, presenting, analyzing, and interpreting numerical data to assist in making more effective decisions |
|
|
Term
|
Definition
Some characteristic of a population of a sample. (values vary across the population) |
|
|
Term
|
Definition
1. A random variable 2. A discrete Variable 3. A continuous variable |
|
|
Term
|
Definition
| Variable whose value is unknown until it is observed |
|
|
Term
|
Definition
| when a random variable can assume only a finite number of distinct values |
|
|
Term
|
Definition
| when a random variable can take an infinite number of possible values |
|
|
Term
| Levels of measurement of Data |
|
Definition
Descriptive and inferential statistics influenced by the levels of data measurement
1. Nominal Data (Age, brand, sex) 2. Ordinal Data (Satisfaction Rating) |
|
|
Term
|
Definition
1. A measure of asymmetry in the data 2. A measure of Peakedness in the data |
|
|
Term
1. Arithmetic Mean 2. Geometric mean 3. Variance |
|
Definition
1. Measure of location and shows the central value of the data 2. Uses multiplication instead of addition to summarize data 3. Variant or dispersion, spread around the mean |
|
|
Term
|
Definition
Measures relative variability in the data (Standard Dev/MeanX100) |
|
|
Term
| Joint Probability Density Function |
|
Definition
| Probability of two events occurring simultaneously |
|
|
Term
|
Definition
| Ability to obtain the probability distribution of an individual random variable |
|
|
Term
|
Definition
| Probability of the outcome X=2 given that Y=1 has occured |
|
|
Term
| The Chi Squared Distribution |
|
Definition
Formed by summing the squared values of the independent standard normal variables (as degrees of freedom get larger, distribution becomes more symmetrical) (Deg of Freedom value determines the shape of the chi squared distribution) |
|
|
Term
|
Definition
| Variable formed by the ratio of two independent Chi squared random variables that have been divided by the degrees of freedom |
|
|
Term
|
Definition
For normal distributions, empirical rule tells us the proportion of the data we can expect surrounding the mean. (For any set of observations, the minimum proportion of the values that lie within "K" standard deviations of the mean. K is any constant greater than 1) |
|
|
Term
|
Definition
| Assuming the population was normally distributed, sample means would also be normally distributed |
|
|
Term
| Best Linear Unbiased Estimator (BLUE) |
|
Definition
| Estimator is unbiased, linear, has the smallest variance as "N" increases |
|
|
Term
|
Definition
| Range of values that may contain the true population mean |
|
|
Term
| Width of the Confidence interval depends on 3 things |
|
Definition
1. Sample Size - Larger "N" means narrower 2. Level of Confidence - higher level returns wider interval 3. Variability - larger value of O returns a wider interval |
|
|
Term
| 5 Components of Hypothesis Testing |
|
Definition
1. Null hypothesis denoted as Ho 2. Alternative hypothesis denoted as H1 3. A test Statistic 4. A level of significance/rejection region 5. Decision/Conclusion about hypothesis |
|
|
Term
|
Definition
| Computed based on knowledge of the relevant probability distribution for the variable under consideration |
|
|
Term
| Level of Significance/Rejection region |
|
Definition
| rules set to decide whether you accept or reject the null |
|
|
Term
| One sample mean comparison test |
|
Definition
| Test whether the sample mean value equals some mean value |
|
|
Term
|
Definition
| Rejection region if the decision rule the researcher adopts for rejecting or not rejecting the null (set %) |
|
|
Term
|
Definition
| Compare the computed "t" statistic w/ a critical value from the t tables w/ the relevant degrees of freedom and a particular level of significance |
|
|
Term
|
Definition
| Null hypothesis is true and we decide to reject it |
|
|
Term
|
Definition
| The null hypothesis is false and we decide not to reject it |
|
|
Term
1. Time Series form 2. Cross Section Form 3. Panel Data Form |
|
Definition
1. data collected over discrete periods of time 2. data collected over different units at 1 point in time 3. data collected over sample units over period of time |
|
|
Term
1. Quantitative Data 2. Qualitative Data |
|
Definition
1. Outcome expressed as numbers 2. outcome expressed as "either or" situations |
|
|
Term
|
Definition
Statistical process for estimating the relationships between variables -Implies Y depends on X -Line of best fit has smaller sum of squared residuals |
|
|
Term
| Assumptions in regression model |
|
Definition
1. Regression model is correctly specified 2. regression model is linear in parameters 3. X is not random 4. Values of X vary across observations 5. Zero covariance between the error term and the X values |
|
|
Term
|
Definition
Null hypothesis w/ multiple conjectures, expressed w/ more than one equal sign -Full model is called restricted model |
|
|
Term
| 3 Types of Log linear Models |
|
Definition
1. Log-log Model- all variables are in log 2. Log-Linear model- Y variable in log, X variable not in log 3. Linear-Log model- Y variable not in log, X variable in log |
|
|
Term
|
Definition
| Let you estimate separate intercepts and slopes for different demographic groups |
|
|
Term
|
Definition
| Another type of specification that frequently occurs in economics |
|
|
Term
1. Multicollinearity 2. perfect Multicollinearity |
|
Definition
1. When two or more explanatory variables are very highly correlated w/ each other 2. Cannot estimate all coefficients, individual coefficients have high standard errors |
|
|
Term
|
Definition
Involves observations on economic units of varying sizes -As size of unit increases, more certainty associated with outcomes -If errors don't have constant variance they are heteroskedastic |
|
|
Term
|
Definition
| One way to detect heteroskedasticity is to estimate your model using OLS, predict the residuals and examine graphs |
|
|
Term
| The White Test and Breusch Pagan Test |
|
Definition
| Tests for continuous changes in variance |
|
|
Term
|
Definition
| Test for changes in the variance across discrete subgroups |
|
|
Term
| 5 Steps of the White Test |
|
Definition
1. Regress Y variable by your explanators using OLS 2. Compute OLS residuals 3. Regress residual variable by a constant, all explanators, the squares of explanators, and all possible interactions between explanators 4. Compute R2 form step 3 5. Compare nR2 to the critical value from the Chi squared distribution with P degrees of freedom and decide if you should reject null |
|
|
Term
|
Definition
| Econometrician selects which explanators to include |
|
|
Term
| Steps in Goldfeld Quantd Test |
|
Definition
1. Divide the N of observations into H group sizes 2. Choose two groups and form the null hypothesis 3. Estimate model for group 1 4. Estimate model for group 2 5. Relate the two groups 6. Compare G to the Critical value for the F statistic and if G is greater then reject the null |
|
|
Term
| Generalized Least Squares |
|
Definition
Want to transform data so that it is homoskedastic, then we can apply OLS -if nR2 is larger than the critical value we reject the null |
|
|
Term
|
Definition
| Refers to a sequence of observations over time |
|
|
Term
1. Static Model 2. Dynamic Model |
|
Definition
1. A change in income at time (t) will cause a change in consumption at time (t) only 2. Past Effects the present, a change in consumption not only at time (t), but also in following periods |
|
|
Term
| Dynamic Models-Autoregressive Model |
|
Definition
| Model where a variable is related to its past values |
|
|
Term
|
Definition
Third way dynamics can enter a regression relationship is the error term -Variable is said to be autocorrelated or serial correlated if it exhibits correlation over time |
|
|
Term
| Testing for Serial Correlation |
|
Definition
a test of serial correlation would be if the null hypoth. equaled 0 and the Alt hypoth. didn't equal 0 - If we reject the null then we are saying serial correlation exists |
|
|
Term
|
Definition
1. Estimate the model 2. Obtain the estimated residuals from the regression 3. Use the residual to compute the DW test statistic -DW Test cannot accommodate time series model with lagged Y variables |
|
|
Term
| 2 Tests used for serial correlation |
|
Definition
1. Durbin Watson - Focuses on first order serial correlation 2. Breusch Godfrey- takes into account correlations among disturbances lagged more than once |
|
|
Term
|
Definition
| Variance of error terms are not constant. Includes economic units of varying sizes |
|
|
Term
| Consequences of OLS estimator under Heteroskedasticity |
|
Definition
OLS is unbiased OLS is no longer efficient Other linear estimators will have smaller variance Estimated standard errors are incorrect Confidence intervals will be incorrect |
|
|
Term
|
Definition
| Measures the relationship between the probability that an event occurs and its determinants |
|
|
Term
|
Definition
| checks to see if the time series variables have a unit root and if its data is stationary |
|
|
Term
|
Definition
| A stochastic trend that is a systematic pattern that is unpredictable |
|
|
Term
| 2 ways of measuring success in Logit Models |
|
Definition
Count R2 as the measure of predicting if the even will occur. Use Pseudo R2 as Log-Likelihood of the model |
|
|
Term
|
Definition
| Data obtained is not always convenient for presentation in a table in a regression |
|
|
Term
| Serial correlation Solutions |
|
Definition
Newey-West procedure to correct estimated standard errors Transform the data and use GLS if errors are AR(1). Use Feasible GLS and estimate ρ using residuals |
|
|
Term
|
Definition
| convert the nonstationary series to its stationary counterpart by taking first differences |
|
|
Term
|
Definition
| either convert the nonstationary series to its stationary counterpart, or estimate a regression relationship that includes a trend variable |
|
|
Term
|
Definition
| any differences across economic agents can be captured by shifts in the intercept term of a standard OLS regression |
|
|
Term
| two-way fixed effects model |
|
Definition
| equivalent dummy variable approach if there are time specific effects |
|
|
Term
|
Definition
| Using OLS to estimate a model in which the dependent variable is a binary variable |
|
|
Term
|
Definition
| Degree of similarity between a given time series and a lagged version of itself over successive time intervals |
|
|