Shared Flashcard Set

Details

Title

Statistical Concepts & Reliability

Description

Midterm: Measurement & Statistical Concepts/ Reliability

Total Cards

Subject

Psychology

Level

Graduate

Created

10/05/2011

Click here to study/print these flashcards.

Create your own flash cards! Sign up here.

Additional Psychology Flashcards

Cards Return to Set Details

Term

Measurement Scales

Definition

Nominal

Ordinal: ranking, no indication of magnitude

Interval: equal intervals, unit is arbitrary

Ration: absolute measure, equal intervals (absolute 0)

Term

Required test properties

Definition

Reliability- consistency and stability

Validity- measurement fits purpose

Term

standard deviation & variance equations

Definition

[image]

[image]

Term

Z score formula

Definition

[image]

Term

Pearson's r formula

Definition

[image]

Y = a + b1X1 + b2X2 + b3X3….

Term

SEM formula

Definition

SEM=SD*√(1-r_xy)

if reliability increases then standard error of measurement decreases

Term

Mediation mechanisms

Definition

-studies mechanism underlying effect

-looks at experimental design & proximal outcomes

-eg theory of reasoned action:attitudes-intention-behavior

Term

Moderated mediation

Definition

when the mediator has a different effect on the DV depending on the level of a moderator

-mediation differs for subgroups of participants

Term

mediated moderation

Definition

A mediator is intermediate in the causal sequence from an interaction effect to the DV

Term

Philosophy of science (4)

Definition

realism/utility (can you make factual statements abou unobservable things such as theories)

scientific reasoning

demarcation of falsification (falsifiable=scientific)

socio-political paradigm (academic traditions, scientific fads, funding, rewards/incentives)

Term

Construct change : maturation of literature

Definition

-nomological network

-specificity

-level of analysis

-situation/context

-over time

Term

Specific construct changes

Definition

Alpha change: difference in presense of construct between time 1 and time 2 (pre/post)

Beta change:change in scale measurement(interpretation of performance, score might be good before taking test and bad after)

Gamma change: change in meaning of construct itself

Term

Construct level of analysis

Definition

Unit of measure: should match theory of construct and level of analysis (individual, group, etc)

Composition models should be:

-direct

-additive or referent shift

-dispersion (separation, variety, disparity)

-interactive/process

Term

Reliability definition

Definition

-consistency and stability of test scores

-tests measure stable attributes, consistency reflects stability of an attribute

Term

estimation of error

Definition

-inconsistency that affects the score but has nothing to do with the attribute

Term

Methods of Estimating Reliability

Definition

Test-retest

alternative/equivalent forms

internal consistency

inter-rater

Term

test-retest stat & limitations

Definition

Coefficient of reliability: dependent measures t-test, pearson's correlation

Limitations

-reactivity

-memory effects

-more than 2 observations

-construct overlap in stability

Term

equivalent forms stat & limitations

Definition

Coefficient of equivalence: pearson for interval ratio, kappa for nominal/categorical

limitations

-difficult to create parallel forms that are exactly alike

-construct overlap

-more than 2 observations

Term

Split-half stats

Definition

-assesses internal consistency

-widely used (easier bc you don't need 2 measures or 2 observers)

Kuder-Richardson 20 (KR-20): looks at relation of each item w/ every other item for dichotomous items only

Cronbach's Alpha: continuous data, average of all possible splits

Term

Agreement

(models)

Definition

-Sensitive to extremes and levels

Consensus models (degree of shared view)

-Direct consensus: what percent agrees?

-reference-shift consensus: do group members share common view?

Dispersion model (degree of variability in view)

Term

Multiple Observations: Group-level constructs

Definition

-measuring a construct that is representative of the group

Types of Construction:

-global (e.g. grp function, size, location)

-shared (latent perception/attitudes, norm, climate)

-configural (pattern of dispersion/diversity in group e.g. group consensus)

Term

Estimating Multiple Observations

Definition

Inter-rater reliability: most common when objectifying a qualitiative measure

kappa (nominal/ordinal) or correlation coefficients (interval/ratio): shows consistency but not agreement

Interclass correlation coefficient (ICC)

Rwg (within rater agreement)

Term

Things to look at with Rwg

Definition

Rwg=within rater agreemetn

degree all judges agree

different from how consistent each judge is

judges can respond differently but very consistently (one always low one always high) which creates high kappa or coefficients but low Rwg

Term

Inter-class correlations (ICC)

Definition

ICC looks across raters and between groups of raters

EG...6 olympic judges, 3 US & 3 Canadian

-calculate overall agreement

-calculate b/ agreement (between US & Canadian)

-raters within country more likely to agree if rating athlete from same country

-raters judging athletes from other counteries rated lower and had less agreement

Term

Reliability levels

Definition

-high is better but must be balanced with practicality

-high necessary for important final decisions or finding small individual differences

-low is okay for preliminary decisions distinguishing large factors

Term

Reliability myths

Definition

Bigger is always better: don't ask the same ?, subscales should be more homogenous than FS

Alpha measures only internal consistency of scale: influenced by N of items (more than 20)

Alpha is a fixed property of a scale

Alpha ranges between 0 & 1: can be negative when items need to be reverse coded

Term

Appropriate levels of reliability for social sciences

Definition

.70 early stages

.80 basic research

.90 minimally tolerable estimate

.95 clinical purposes

(Nunnally, 1978)

if alpha is too high it shows redundency

Term

Triangulation

Definition

-composite test scores typically more reliable with the true score than individual tests alone

-less variance attributed to systematic error

-scale length increases reliability

-each subpart provides its own limitation but corrects for others so composite is more reliable

Term

Theory of generalizability

Definition

-upon understanding the sources of consistency and inconsistency in measurement, under some cicrcumstancers we can safely generalize findings from one setting to another

Term

Ways of estimating mediation

Definition

Methods of estimating

-random/double random designs; complete/partial

-causal steps

-goodman; sobel tests of standard error

Term

Split-half limitations

Definition

Limitations

-function of number of items

-function of average inter-item correlations (lots of really similar items aritificially inflates alpha)

-measures correlation between items, not really consistency

Term

Alpha is a fixed property of a scale

Definition

characterstic of test scores not test itself

depends on total score var

more heterogenous populations have higher reliability

reliability is an estimate interpreted considering sample, context, purpose

Flashcard Machine - create, study and share online flash cards

Shared Flashcard Set

Details

Additional Psychology Flashcards

Cards Return to Set Details

My Flashcards

Flashcard Library

Browse

About

Help

Mobile