1Cademy - Types of Reliability

Learn Before

Reliability

Classification

Types of Reliability

Psychologists consider three primary types of consistency when evaluating a measure: consistency over time (test-retest reliability), consistency across different items on a multiple-item measure (internal consistency), and consistency across different researchers or observers (inter-rater reliability).

Updated 2026-06-15

Contributors are:

Who are from:

References

OpenStax Psychology (2nd ed.) Textbook
KPU Research Methods in Psychology - 4th American Edition

Learn After

Inter-rater Reliability
Test-Retest Reliability
Internal Consistency
Match each type of measurement reliability with the aspect of consistency it evaluates.
A researcher is developing a new 15-item scale to measure 'subjective well-being.' To evaluate the measure, the researcher checks whether participants who agree with one item (e.g., 'I am happy with my life') also tend to agree with other items on the same scale (e.g., 'My life is close to my ideal'). Which type of reliability is the researcher primarily assessing?
A researcher is developing a new coding system to measure 'prosocial behavior' in toddlers by watching video recordings of their play. To ensure the measure is consistent, she has two different research assistants code the same set of videos. If their observations are highly similar, the researcher has established high test-retest reliability.
A psychologist developing a new behavioral observation tool for 'classroom aggression' finds that her three research assistants all report the same number of aggressive acts for each child. However, when the same children are observed again under identical conditions one week later, their aggression scores have changed dramatically. This pattern suggests the tool has high inter-rater reliability but low ________ reliability.
A psychologist is evaluating the overall reliability of a new behavioral observation scale designed to measure a stable personality trait. Rank the following reliability profiles from the one that provides the 'strongest' evidence of a scientifically sound measure to the one that provides the 'weakest' evidence.
You are tasked with creating a validation protocol for a new psychological instrument that measures 'Academic Persistence' using a combination of a $15$ -item questionnaire and a timed behavioral task. To ensure your design accounts for consistency over time, consistency within the questionnaire items, and consistency between different researchers, which of the following sets of procedures must you synthesize into your research plan?
In psychological research, the consistency of a measure's results across different researchers or observers is referred to as test-retest reliability.
Match each research scenario to the specific type of measurement reliability it best demonstrates.
A researcher wants to formally evaluate the test-retest reliability of a newly developed questionnaire measuring 'mindfulness.' Arrange the following methodological steps in the correct chronological order to appropriately assess this specific type of consistency.
A research team evaluates a new $30$ -item questionnaire designed to measure a stable personality trait. They find that participants who take the survey on a Monday and then retake it one month later receive nearly identical total scores. However, upon closer inspection of a single administration, the researchers notice that how a participant answers the first half of the questions does not correspond at all to how they answer the second half. This pattern indicates that the questionnaire has e
Match each type of reliability with its corresponding description of consistency.
A clinical psychologist develops a new 10-item questionnaire to assess anxiety. To ensure the questionnaire is a reliable measure, they analyze whether participants' responses to the first five questions closely correlate with their responses to the last five questions. Which type of reliability is the psychologist primarily evaluating?
A researcher wants to ensure their new 20-item survey measuring sleep quality is reliable. They administer the survey to a group of participants and calculate the correlation between the scores on the odd-numbered items and the even-numbered items. This procedure is used to establish the survey's test-retest reliability.
A psychological measure's reliability must often be evaluated across multiple dimensions. Analyze the following research procedures and arrange them into this specific logical sequence: first, the procedure that tests internal consistency; second, the procedure that tests test-retest reliability; and finally, the procedure that tests inter-rater reliability.
A research committee is evaluating the quality of a newly proposed 50-item questionnaire designed to assess academic burnout. Upon reviewing the pilot data, they discover that participants' scores on the first 25 items are completely uncorrelated with their scores on the last 25 items. The committee determines that the questionnaire is fundamentally flawed and must be rewritten because it fails to demonstrate adequate ____ consistency.
When evaluating a psychological measure, which type of reliability specifically refers to its consistency across different researchers or observers?
A psychological measure demonstrates strong test-retest reliability if two independent researchers use it to evaluate the same participant and record highly similar scores.
A research team is developing a new observational coding system to measure childhood aggression. Match each type of reliability with the specific research procedure applied to evaluate it.
A developmental psychology lab measures toddler attachment using a parent survey and an observational task. The researchers find that parents' survey scores are highly correlated when completed at age 2 and again at age 3, indicating strong consistency over time. However, when examining the observational task data from those exact same sessions, the two lab assistants evaluating the toddlers' behaviors record completely different attachment scores for the same children. Analyzing this methodolog
A research committee is evaluating three newly developed psychological measures to determine which should be approved for a large-scale clinical trial. Evaluate the reliability profiles of each measure and arrange them in order of their demonstrated methodological quality, from the MOST reliable (demonstrating all three primary types of reliability) to the LEAST reliable (demonstrating zero reliability).

Learn Before

Related

Learn After