How to Run a Pearson’s Correlation in SPSS

  • 24/7 Support
  • 100+ Subjects
  • 500+ PhD statisticians

Pearson’s correlation coefficient is one of the most widely used statistical tests. It is mainly used to determine the strength and direction of the linear relationship between two continuous variables. This article provides an overview of Pearson’s correlation analysis, its assumptions, how to run Pearson’s correlation in SPSS, and how to interpret Pearson’s Correlation SPSS Outputs.

Quick Steps

  1. From the SPSS Main Menu, Click Analyze –> Correlate –> Bivariate
  2. Transfer the variables Hours_studied and Exam_score into the Variables: box.
  3. Make sure the Pearson is Checked under Correlation Coefficient
  4. Click OK.

What is Pearson’s Correlation?

Pearson’s correlation coefficient also known as Pearson Correlation is a parametric statistical test that is used to determine the strength and direction of a linear relationship between two continuous variables. Pearson’s correlation coefficient value generally lies between -1 and +1, with values closer to 1 indicating strong relationship and those close to 0 indicating weak relationships.

Below is a summary of how to determine the strength and direction of linear relationships based on Pearson’s correlation coefficient values:

Strength of relationshipPositive CorrelationNegative Correlation
Zero00
Weak0.1 to 0.3-0.1 to -0.3
Moderate0.3 to 0.7-0.3 to -0.7
Strong0.7 to 1.0-0.7 to -1.0
Perfect1-1
Correlation Coefficient and Strength of Relationship

Correlation can be categorized into 3:

  • Positive Correlation – The correlation coefficient value is positive. This suggests that as one variable increases, the other variable also tends to increase.
  • Negative Correlation – The correlation coefficient value is negative. This suggests that as one variable increases, the other variable tends to decrease.
  • No Correlation – The correlation coefficient value is 0. This suggests that there is no linear relationship between the two variables.

Pearson’s Correlation Assumptions

Before you start performing Pearson’s correlation analysis in SPSS, it is always advisable to confirm whether the assumptions are met. The main assumptions include:

  • Assumption #1: The two variables should be measured on a continuous scale (they should be measured at the interval or ratio level).  Examples of continuous variables are, exam scores measured as a percentage and the number of hours spent studying.
  • Assumption #2: There should be a linear relationship between the two variables. This assumption can be checked using SPSS by creating a scatterplot. The scatterplot can resemble one of these:  
Scatter plots showing possible relationships

If after creating a scatterplot there is a non-linear relationship, then you need to use a non-parametric equivalent of Pearson’s Correlation Coefficient. Specifically, you’ll need to run a Spearman’s Correlation instead. Alternatively, you can consider transforming your variables to see if there is a linear relationship between the transformed variables.

  • Assumption #3: There should be no significant outliers. An outlier is a single observation that lies far away from other data points. Pearson’s correlation coefficient is highly influenced by outliers. Thus, it is necessary to remove any significant outlier before running Pearson’s correlation in SPSS. The figure below demonstrates the effects of outliers on Pearson’s correlation coefficient value.

The figure above shows that Pearson’s correlation coefficient, r, is highly influenced by outliers. In other words, Pearson’s correlation coefficient is sensitive to outliers and this can have significant effects on the line of best fit. Specifically, a scatter plot with an outlier yielded a correlation coefficient value of 0.4. However, when the outlier is removed, the coefficient increases to 0.7. Therefore, it is necessary to examine whether there are significant outliers before performing Pearson’s correlation in SPSS.

Note. If the above assumptions are not met, then performing a Pearson’s correlation will yield invalid and unreliable results.

How to Perform Pearson’s Correlation in SPSS

Example Pearson’s Correlation Test

A university is interested in understanding how study habits influence academic performance. Specifically, they want to investigate whether the number of hours students study per week is correlated with their final exam scores. To address this question, data were collected from 30 university students. Each student was asked to report the number of hours they studied per week and their final exam scores. The data obtained were as shown below:

The complete dataset can be downloaded below.

The above data is in Excel format (.xlsx). Check out our detailed guide on how to import Excel data into SPSS.

Pearson’s Correlation in SPSS: Test Procedure

To run a Pearson’s correlation analysis in SPSS, just follow these 4 simple steps:

  • Step 1. From the SPSS Main Menu, Click Analyze –> Correlate –> Bivariate
how to run Pearson's Correlation in SPSS

You will be presented with the following bivariate dialog box

How to perform pearson's correlation in spss
  • Step 2. Transfer the variables Hours_studied and Exam_score into the Variables: box. This can be done either by dragging and dropping them or by clicking on them and then clicking on the arrow button pointing to the Variables: box.

You should have a similar screen display as shown below;

Pearson's correlation analysis using SPSS
  • Step 3: Make sure the Pearson is Checked under Correlation Coefficient
  • Step 4: Click OK.

You’ll have the Pearson’s Correlation SPSS Outputs as shown below;

Pearson's Correlation SPSS Outputs

Still struggling with your data analysis project? Get Started Now & Unlock 15% Off Your First Project