Use the matrix plot to examine the relationships between two continuous variables. Also, look for outliers in the relationships. Outliers can heavily influence the results for the Pearson correlation coefficient.
Determine whether the relationships are linear, monotonic, or neither. The following are examples of the types of forms that the correlation coefficients describe. The Pearson correlation coefficient is appropriate for linear forms. Spearman's correlation coefficient is appropriate for monotonic forms.
The points fall randomly on the plot, which indicates that there is no linear relationship between the variables.
Some points are close to the line but other points are far from it, which indicates only a moderate linear relationship between the variables.
The points fall close to the line, which indicates that there is a strong linear relationship between the variables. The relationship is positive because as one variable increases, the other variable also increases.
The points fall close to the line, which indicates that there is a strong negative relationship between the variables. The relationship is negative because, as one variable increases, the other variable decreases.
In a monotonic relationship, the variables tend to move in the same relative direction, but not necessarily at a constant rate. In a linear relationship, the variables move in the same direction at a constant rate. This plot shows both variables increasing concurrently, but not at the same rate. This relationship is monotonic, but not linear. The Pearson correlation coefficient for these data is 0.843, but the Spearman correlation is higher, 0.948.
This example shows a curved relationship. Even though the relationship between the variables is strong, the correlation coefficient would be close to zero. The relationship is neither linear nor monotonic.
Use the Pearson correlation coefficient to examine the strength and direction of the linear relationship between two continuous variables.
The correlation coefficient can range in value from −1 to +1. The larger the absolute value of the coefficient, the stronger the relationship between the variables.
For the Pearson correlation, an absolute value of 1 indicates a perfect linear relationship. A correlation close to 0 indicates no linear relationship between the variables.The sign of the coefficient indicates the direction of the relationship. If both variables tend to increase or decrease together, the coefficient is positive, and the line that represents the correlation slopes upward. If one variable tends to increase as the other decreases, the coefficient is negative, and the line that represents the correlation slopes downward.
Correlation type | Pearson |
---|---|
Number of rows used | 30 |
Age | Residence | Employ | Savings | Debt | |
---|---|---|---|---|---|
Residence | 0.838 | ||||
Employ | 0.848 | 0.952 | |||
Savings | 0.552 | 0.570 | 0.539 | ||
Debt | 0.032 | 0.186 | 0.247 | -0.393 | |
Credit cards | -0.130 | 0.053 | 0.023 | -0.410 | 0.474 |