1Cademy - Test-Retest Correlation

Learn Before

Test-Retest Reliability
Assessing Test-Retest Reliability

Concept

Test-Retest Correlation

Test-retest correlation is the correlation coefficient (typically Pearson's $r$) computed between two sets of scores obtained by administering the same measure to the same group of people at two different times. For constructs expected to be stable over time, a test-retest correlation of $+.80$ or greater is generally considered to indicate good reliability. For example, when the Rosenberg Self-Esteem Scale was administered to students twice, a week apart, the test-retest correlation was $+.95$, indicating very high consistency over time.
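The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the procedure used in the referenced textbook: it assumes Pearson's $r$ is the coefficient of interest, the helper name `pearson_r` and the score lists `time1` and `time2` are hypothetical, and the scores are invented for demonstration rather than taken from any real study.

```python
from math import sqrt


def pearson_r(x, y):
    """Return Pearson's r between two equal-length lists of scores."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two score lists of equal length (at least 2 scores)")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-products and sums of squared deviations from the means.
    cross = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("correlation is undefined when one set of scores is constant")
    return cross / sqrt(ss_x * ss_y)


# Hypothetical scores: the same eight participants, tested a week apart.
time1 = [22, 25, 18, 30, 27, 15, 24, 20]
time2 = [23, 24, 17, 29, 28, 16, 22, 21]

r = pearson_r(time1, time2)
print(f"test-retest correlation: {r:+.2f}")
# The +.80 benchmark applies to constructs expected to be stable over time.
print("meets the +.80 benchmark:", r >= 0.80)
```

Each position in `time1` and `time2` must refer to the same participant; the coefficient is only meaningful when the scores are paired this way.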

Updated 2026-06-07

References

KPU Research Methods in Psychology - 4th American Edition

Learn After

A researcher administers a newly developed questionnaire measuring a stable personality trait to a group of participants on two separate occasions, a week apart. They then calculate the statistical coefficient between the two sets of scores to evaluate consistency over time. What is this coefficient called, and what value would generally indicate that the questionnaire has good reliability?
If a researcher calculates a test-retest correlation of +.85 for a questionnaire measuring a stable trait, this statistical coefficient indicates that the measurement tool has demonstrated good reliability across different administrations.
A researcher is evaluating the test-retest reliability of several new psychological scales designed to measure stable personality traits. Match each calculated correlation coefficient with the most appropriate interpretation of that scale's reliability.
Examine the consistency shown in the scatterplot (Figure 4.2). If a researcher analyzed a different measure and found that the data points were widely dispersed from the regression line rather than tightly clustered, the resulting ______ ______ (specific term) would likely be below the $+.80$ threshold often used as a benchmark for stable constructs.
A researcher is critiquing the reliability evidence for four new psychometric scales intended to measure stable personality traits. Based on the standard psychological benchmark for 'good' reliability, rank the following test-retest correlation results from the finding that provides the most defensible evidence of consistency to the finding that provides the least defensible evidence.
Match each term or value related to test-retest correlation with its correct definition or benchmark based on standard psychological research practices.
As illustrated by the clustering of data points in Figure 4.2, what does a high test-retest correlation coefficient (such as $+.80$ or higher) signify about the results of a psychological measure administered at two different times?
A researcher is evaluating a newly developed psychometric questionnaire designed to measure 'grit' (a stable personality trait) in adolescents. Arrange the steps in the correct chronological sequence to calculate and evaluate the test-retest correlation for this new scale.
A researcher administers a newly developed questionnaire designed to measure 'current state mood' to a sample of participants on two occasions, a week apart, and calculates a test-retest correlation of $+.25$. Because this correlation is far below the standard benchmark of $+.80$, the researcher must conclude that the questionnaire is poorly designed and lacks reliability.
Researcher A evaluates a scale measuring a stable personality trait and obtains a test-retest correlation coefficient of $+.82$. Researcher B evaluates a different scale measuring the same stable personality trait and obtains a test-retest correlation coefficient of $+.75$. Applying the standard benchmark of $+.80$ for stable constructs, which researcher's scale has demonstrated an acceptable level of reliability? Enter only the letter (A or B).
