Correlation Coefficient
A numerical index, commonly represented as Pearson's , that measures the direction and strength of a linear relationship between two quantitative variables. The coefficient is bounded between and , where values further from zero indicate a tighter, more predictable association, and values near zero indicate a negligible relationship. In research, this coefficient frequently serves as an effect size measure to evaluate how strongly variables are connected in the population.
0
1
Contributors are:
Who are from:
References
OpenStax Psychology (2nd ed.) Textbook
OpenStax Psychology (2nd ed.) Textbook
KPU Research Methods in Psychology - 4th American Edition
KPU Research Methods in Psychology - 4th American Edition
OpenStax Psychology (2nd ed.) Textbook
KPU Research Methods in Psychology - 4th American Edition
KPU Research Methods in Psychology - 4th American Edition
KPU Research Methods in Psychology - 4th American Edition
Tags
Clinical Practice of Psychology
Psychology
Social Science
Empirical Science
Science
OpenStax
Psychology @ OpenStax
Ch.2 Psychological Research - Psychology @ OpenStax
Ch.1 Introduction to Psychology - Psychology @ OpenStax
Introduction to Psychology @ OpenStax Course
OpenStax Psychology (2nd ed.) Textbook
KPU
Research Methods in Psychology - 4th American Edition @ KPU
Related
Correlational Research Techniques
Comparison of Correlational and Experimental Research in Establishing Causation
Correlation Coefficient
The Fundamental Limitation of Correlational Research: Inability to Establish Causation
Illusory Correlation: Perceiving Nonexistent Relationships
A researcher conducts a study and finds that cities with a higher number of parks per capita also have a lower rate of reported respiratory illnesses. The researcher concludes that the presence of parks causes a decrease in respiratory illness. What is the primary flaw in this conclusion based on the research method described?
A researcher conducts a large-scale survey and finds a strong positive relationship between the amount of time people spend watching news coverage of disasters and their reported levels of anxiety. Based only on this finding, which of the following is the most valid conclusion?
Example of Correlational Research: Self-Esteem and School Achievement
Using Correlation to Establish Measurement Reliability and Validity
Example of Correlational Research: Cannabis Use and Memory
Misconception About Variables in Correlational Research
Predictive Value of Correlation
Data Collection in Correlational Research
Complex Correlational Research
Dichotomizing Skewed Variables
Factor Analysis
Line Graphs in Correlational Research
Scatterplots
Usefulness of Correlational Research
Establishing Causality via Experiments
Confounding Variable
External Validity of Correlational Research
Correlational Research as Converging Evidence
Example of Correlational Research: Need for Cognition and Occupation
Example of Distinguishing Correlational and Experimental Research
Directionality Problem
Third-Variable Problem
Which of the following best describes the primary approach of correlational research?
Match each core component of correlational research with the scenario that best illustrates its role or function in a psychological study.
A researcher is investigating the relationship between 'exposure to parental conflict' and 'childhood emotional regulation.' Because it is unethical to intentionally increase conflict in a family to observe its effects on children, the researcher should apply a correlational research design to measure these variables as they naturally occur.
A researcher discovers a strong correlation between 'daily exercise' and 'mental wellbeing.' Arrange the following steps in the logical sequence of a correlational analysis, moving from the initial description of the relationship to a critical evaluation of its limits and its final application for prediction.
In correlational research, researchers can utilize a statistical technique called regression to predict scores on one variable based on the scores of another variable.
Which of the following statements best explains why a researcher would choose a correlational research design instead of an experimental design?
A researcher wants to study the potential relationship between prenatal alcohol exposure and cognitive development in children. Because it would be unethical to intentionally require pregnant participants to consume alcohol, the researcher cannot manipulate an independent variable. Given this ethical constraint, the most appropriate design is a/an _____ research design, which allows measurement of both variables and prediction of outcomes without experimental manipulation.
A researcher is designing studies on human behavior. Match each study description to the correlational research principle it best illustrates.
A student reads a published study that measured participants' daily screen time and their self-reported life satisfaction scores in a naturalistic setting. The researchers computed a correlation coefficient but never assigned participants to any screen-time condition, nor did they hold constant any outside factors. Analyzing these features, the student correctly classifies the study as non-experimental because the researchers made little to no effort to control _____ variables—the same characteristic that prevents the authors from concluding that screen time causes changes in life satisfaction.
A research team is deciding whether to use a correlational design to study the relationship between childhood poverty and adult educational attainment. Order the following evaluative criteria from the first consideration the team should address to the final justified conclusion about design choice.
Define correlational research and identify its key characteristics regarding variable manipulation and the control of extraneous variables. In addition, state the two primary scientific goals that this research strategy allows investigators to achieve, and name the specific statistical technique used to make predictions from one variable to another.
Explain why the psychologist chose a correlational research design instead of an experimental design for this study. In your explanation, identify the nature of the variables in this scenario and explain how this design helps achieve the goals of description and prediction.
A researcher wants to study the relationship between the number of hours college students work at part-time jobs and their overall GPA. Apply the principles of correlational research to explain how the researcher should collect data on these variables, and specify the statistical technique they should use to predict a student's GPA from their work hours.
Direction of Correlations
A city analyst observes that neighborhoods with a higher number of public libraries also report, on average, a higher median household income. Based solely on this observed statistical association, which of the following is the most accurate conclusion?
Analyzing a Faulty Conclusion
Evaluating a Public Health Proposal
Interpreting Economic Data
A study reveals a strong positive statistical association between the monthly sales of ice cream and the number of drowning incidents at a beach resort. This finding proves that eating ice cream before swimming increases the risk of drowning.
Match each statistical concept with the real-world scenario that best illustrates it.
An economist observes that in years when the average price of coffee beans increases, the average quantity of tea sold in cafes also increases. What type of statistical relationship does this observation describe between the price of coffee and the quantity of tea sold?
Evaluating a Causal Claim
An automotive industry analyst observes that as the age of a specific model of car increases, its average resale value consistently decreases. This statistical relationship, where an increase in one variable is associated with a decrease in another, is described as a ______ correlation.
A researcher studying elementary school children finds a strong positive statistical association between their shoe size and their scores on a standardized reading test. As shoe size increases, reading scores tend to be higher. Which of the following statements provides the most logical analysis of this finding?
Correlation Coefficient
Predictive Value of Correlation
Figure 5.1: Inflation and US Presidential Election Outcomes (1912–2020)
Strong Correlation Between Exchange Rate Depreciation and Inflation (2009-2019)
Negative Correlation Between Central Bank Independence and Inflation (OECD, 1962-1990) [Figure E5.1]
Empirical Link Between Relatively Stable Exchange Rates and Low Inflation
Nonlinear Relationship
Distinguishing Correlation from Causation: Television and Exams
Scatterplots
Linear Relationship
What does a negative correlation indicate about the relationship between two variables?
Match each psychological research finding with the term that best describes the relationship between the variables based on the study's results.
A health psychologist finds that as the number of hours individuals spend practicing mindfulness meditation increases, their self-reported anxiety scores tend to decrease. In this scenario, mindfulness practice and anxiety are positively correlated.
A research team discovers a strong correlation between 'frequent social media use' and 'low self-esteem' among teenagers. To critically evaluate whether this finding justifies the causal claim that 'social media use decreases self-esteem,' arrange the following analytical steps in the logical order of priority required to establish a causal relationship.
A researcher is constructing a results summary for a study on 'daily mindfulness practice' (), 'perceived stress' (), and 'attention span' (). To design a finding that illustrates a negative correlation between mindfulness and stress and a positive correlation between mindfulness and attention span, which of the following statements should be synthesized for the final report?
If a strong statistical correlation is observed between variable and variable , it guarantees that changes in variable directly cause changes in variable .
A psychologist finds a statistical association between variable (daily screen time) and variable (sleep duration). While this indicates a correlation between the two variables, correlation does not imply _____, meaning we cannot assume that changes in directly cause changes in .
A study finds a strong positive correlation between the number of hours children spend reading for pleasure and their scores on a standardized empathy test. A researcher analyzing this relationship suggests that 'parental temperament' might independently influence both a child's interest in reading and their development of empathy. In this analysis, 'parental temperament' is being identified as a _____ variable.
A research methods student is reviewing four study summaries. Analyze each description of a variable relationship and match it to the correct correlation interpretation.
A psychologist publishes this conclusion: 'Teenagers who eat breakfast daily have significantly higher GPAs than those who skip breakfast. This proves that eating breakfast causes better academic performance.' Evaluate the soundness of this causal claim by ordering the following critical steps from first (1) to last (5).
Define what a correlation is, explain the difference in how variables behave in both positive and negative correlations, and state the critical limitation of correlation regarding causation.
Diagnose the error in the researcher's causal conclusion. Explain how the variables behave in this specific correlation, classify the direction of this relationship, and justify why the observed statistical association does not support the researcher's claim of causation.
A researcher measures daily study hours () and exam scores () in a group of psychology students. The results show that students with high values of daily study hours () tend to have high values of exam scores (). Apply the concepts of correlation to classify this specific relationship and state what conclusion the researcher can make about whether daily study hours cause changes in exam scores.
Correlation Coefficient
Sample Statistic
Descriptive Statistics in Experimental Research
Descriptive Statistics in Non-Experimental Research
Inferential Statistics
Understanding Descriptive Statistics
Scatterplot
Measures of Central Tendency
Measures of Dispersion
Forms of Statistical Relationship
Mean
Standard Deviation
Bar Graph
Line Graph
What is the primary purpose of using descriptive statistics in psychological research?
Match each category of descriptive statistics with the specific goal it accomplishes when a researcher is summarizing their data.
A researcher finds that a distribution of scores on a memory task is negatively skewed, meaning there are a few extremely low scores that pull the tail of the distribution to the left. Arrange the following measures of central tendency in order from the lowest numerical value to the highest numerical value based on this distribution's shape.
In a psychological study where a distribution of scores is highly skewed by a single extreme outlier, the mean is a more valid descriptive statistic than the median for evaluating the typical performance of the sample.
Measures of dispersion, such as the standard deviation and variance, are descriptive statistics used to describe the average scores within a research sample.
Tables in Research Reports
A psychologist studies the sleep patterns of college students and wants to summarize the collected data. Why must the psychologist report both a measure of central tendency (such as the mean) and a measure of dispersion (such as the standard deviation) to provide a complete descriptive summary of the sample's sleep duration?
A researcher records the number of errors made by five participants on a memory task: 3, 8, 2, 5, and 7. The range for this sample is _____.
A research team is summarizing data from a psychology study. Match each descriptive statistic on the left to the research situation on the right where it would be the most appropriate single summary to report.
A researcher reports that two groups of participants completed an identical mood-rating scale (scored 0–100). Group A had a mean of 60 and a standard deviation of 4, while Group B had a mean of 60 and a standard deviation of 22. Although both groups share the same measure of central tendency, a student analyzing these results should conclude that Group B's scores show substantially greater _____ than Group A's scores.
A researcher has collected scores on a stress questionnaire from 80 undergraduate participants and must decide which descriptive statistics to select and report. Arrange the following evaluative steps in the order that best supports an accurate, justified statistical summary of the data.
Define descriptive statistics and identify the two primary categories of quantitative techniques described in the text, including the specific measures associated with each.
Based on the case context, decide how the researcher should apply descriptive statistics to describe their data. Explain the two different types of measures the researcher needs to calculate and specify what each type of measure will communicate about the sample's performance.
A psychological researcher reports that a sample of participants completed a stress assessment, resulting in a mean score of and a standard deviation of . Explain what these specific descriptive statistics indicate about the scores of the participants in this sample.
Cohen's
Effect Size in Studies Lacking Power
Correlation Coefficient
What does an effect size estimate in a statistical analysis?
Even when a relationship is found to be statistically significant, an effect size should be reported because a p-value alone cannot convey the actual magnitude or importance of that relationship.
Match each hypothetical research outcome with the interpretation that accurately describes its practical and statistical significance based on the reported effect size and p-value.
A psychology researcher reports the following statistical outcomes from four separate studies. Arrange these results in order from the weakest relationship magnitude to the strongest relationship magnitude in the population, regardless of their statistical significance levels.
A psychology researcher is synthesizing a formal 'Discussion' section for a study that found a statistically significant difference between groups () but a very low magnitude of relationship (). Arrange the following statements in the correct logical sequence to construct a scientifically rigorous interpretation that prioritizes the actual strength of the finding over its probability.
Why are psychology researchers strongly encouraged to report an effect size (such as Cohen's or Pearson's ) in addition to a -value from a null hypothesis significance test?
A quantitative measure, such as Cohen's or Pearson's , that is used to estimate the strength of a statistical relationship within a population is known as an _____.
A researcher is evaluating two separate studies to decide which memory-enhancement technique to recommend for a clinical trial. Study 1 reports a significant result with and an effect size of . Study 2 reports a significant result with and an effect size of . If the researcher evaluates these findings based on the strength of the relationship in the population rather than just the probability of the results occurring by chance, they should conclude that Study _____ provides more compelling evidence of a powerful technique.
A researcher runs two independent studies on the same phenomenon. Study A () yields and . Study B () yields and . Analyzing only the statistical significance of these outcomes, a peer reviewer would correctly conclude that Study A provides stronger evidence for a meaningful population-level relationship than Study B.
A journal reviewer is evaluating four researcher statements about statistical results. Match each statement to the critique that best assesses its reporting quality.
Based on the provided statistical guidelines, define "effect size" and list the two specific quantitative measures of effect size mentioned in the text. Finally, state why researchers are encouraged to report these measures alongside null hypothesis tests.
Explain why the researcher's conclusion about the importance and power of the study aid is misleading. In your explanation, describe how the sample size of 15,000 students affected the -value, and explain what Cohen's reveals about the actual magnitude of the relationship.
You are writing the discussion section for a psychology experiment with a sample size of 40 participants. You found a statistically significant result (). Apply the statistical reporting recommendations from the text to justify why you should report an effect size measure (such as Pearson's ) alongside this -value.
Cohen's
Learn After
Types of Correlation
A researcher is examining the statistical association between several pairs of variables. Which of the following numerical values represents the strongest association between two variables?
Correlation Matrix
Pearson's r Formula
Restriction of Range
Identifying the Weakest Correlation
Statistical Regression
Test of the Correlation Coefficient
Example of Scatterplots for Pearson's r Values
Cohen's Guidelines for Interpreting Pearson's r
Which of the following best describes the function of a correlation coefficient in psychological research?
A researcher studying stress and academic performance obtains a correlation coefficient of r = −0.82. Because the value is negative, this result indicates a weak relationship between the two variables.
A psychological research team has conducted four different correlational studies. Match each obtained Pearson’s correlation coefficient with the research finding that best represents that specific relationship.
A psychological researcher is evaluating four separate studies to determine which relationship allows for the most accurate predictions between two variables. Arrange the following Pearson’s values in order from the coefficient representing the least predictable linear association to the one representing the most predictable linear association.
A researcher is designing a study to demonstrate a 'strong negative linear association' between two variables to serve as a meaningful effect size. Which of the following sets of research parameters must be synthesized to correctly achieve this specific statistical outcome?
The correlation coefficient, commonly represented as Pearson's , is bounded between the values of and .
A student researcher evaluates a study result where Pearson's and concludes it represents a 'weak' relationship because the value is negative. To critique this evaluation, a supervisor would explain that the _____ of the correlation coefficient, rather than its sign, indicates the strength of the linear association.
A research methods instructor presents four properties or values of Pearson's and asks students to match each one to the correct interpretation. Match each property or value of the correlation coefficient to what it specifically tells a researcher about the relationship between two quantitative variables.
A researcher reports Pearson's between daily stress ratings and quality of sleep. A classmate argues this is a weak relationship because the coefficient is negative. To correct this misanalysis, a student explains that the strength of a linear relationship is determined by the _____ of Pearson's , not by its sign.
A student is critically evaluating whether a researcher correctly applied and interpreted Pearson's as an effect size measure in a published correlational study. Arrange the following steps in the order a critical evaluator should complete them.
State the definition of the correlation coefficient, specify its common representation, and recall its numerical bounds. Briefly outline how its direction and strength are interpreted, and describe its function as an effect size in research.
Explain why the student researcher's comprehension of the correlation coefficient is incorrect. Using the properties of Pearson's mentioned in the case, justify which of the two relationships is actually stronger and more predictable.
A clinical psychologist is investigating the linear relationship between patient treatment attendance (number of sessions) and symptom severity ratings. Explain how the researcher would apply Pearson's to evaluate the connection between these two quantitative variables as an effect size in their population.