Term
| Standardization sample, AKA normative sample |
|
Definition
| Define your target population and obtain a representative sample from each of the target groups. May be appropriate to have multiple standardization samples, considering age, race, geographic regions, etc. |
|
|
Term
|
Definition
| Simple random sampling, systematic sampling, stratified random sampling, cluster sampling |
|
|
Term
| Sampling techniques- simple random |
|
Definition
| every member of the population has an equal chance of being sampled |
|
|
Term
| Sampling techniques- systematic sampling |
|
Definition
| pick every 5th, 10th, 20th, etc |
|
|
Term
| Sampling techniques- stratified random sampling |
|
Definition
| population is divided into subgroups/strata based on demographics |
|
|
Term
| Sampling techniques- cluster sampling |
|
Definition
| Used when target population is large and when it is not feasible to list all individuals. Clusters are selected and participants are selected from each cluster (difference from stratified is that this is regional, not demographic) |
|
|
Term
|
Definition
| Developmental and within group norms |
|
|
Term
|
Definition
| age norms, grade norms, etc. This allows us to determine whether an individual's test score is similar to, below, or above the average of others at the same age or grade level |
|
|
Term
|
Definition
| make comparison for one person against the rest of the group. Use standardized/derived scores to provide meaning (percentile, z scores, t scores, deviation IQ, SAT GRE |
|
|
Term
|
Definition
(B/N)x100 B=number of observations with lower values N= total number of observations |
|
|
Term
|
Definition
In psychology, reliability refers to stability of scores over time and consistency of a test over time. Is my score at time one the same as time two? Are all of my items assessing the same construct? A reliable test will measure each person in approximately the same way every time. Higher reliability= more we are confident that scores are due to true differences |
|
|
Term
| Sources of random error that could affect reliability |
|
Definition
| testing conditions, energy level of test taker, motivation of test taker, etc |
|
|
Term
| 3 main measures of reliability to know |
|
Definition
| test-retest, equivalent forms, internal consistency (split half, inter item) |
|
|
Term
|
Definition
Gives coefficient of stability. Correlation of scores on the same test taken on two separate occasions. Assume that true ability doesnt change, but errors do. Advantages- good for measuring stable constructs Disadvantages- People may change from time one to time two. Carryover effects. Requires two administrations |
|
|
Term
| Equivalent forms reliability |
|
Definition
Gives coefficient of equivalence, also known as parallel forms reliability. Accomplished by administering one test, waiting, and giving alternate form to same group. Advantages- less time between administrations. Fewer carryover effects, because we are not using the same Q's. Disadvantages- must develop two tests instead of one. Still requires two test administrations |
|
|
Term
| Internal consistency reliability |
|
Definition
| gives us information on whether items on test are measuring the same thing. |
|
|
Term
| Internal consistency reliability- split halves |
|
Definition
give the test to one group, then split the items up and correlate scores on both halves Advantages- only one administration solves carryover problem Disadvantages- not confident that even and odd items are equal. by splitting test in half, we lose information. Doesnt tell us about stability of test scores over time. Our reliability coefficient here is underestimated, therefore we correct with the Spearman Brown formula |
|
|
Term
| Internal consistency reliability- inter item correlation |
|
Definition
Instead of splitting the test in half, we treat each item as its own equivalent form. More exact than split halves and no correction is needed. Two different ways to calculate- coefficient alpha and KR-20 Adv- same as split halves Dis- doesnt tell us about stability of scores over time |
|
|
Term
| Reliability coefficient of .90 |
|
Definition
| 90% of variance observed is due to true ability |
|
|
Term
| Acceptable reliability level |
|
Definition
|
|