Interpret the key results for Simple Correspondence Analysis

Complete the following steps to interpret a simple correspondence analysis. Key output includes principal components, inertia, proportion of inertia, quality, mass, and several graphs.

Step 1: Determine the number of principal components

Use the proportion of inertia to determine the minimum number of principal components, also called principal axes, that account for most of the deviation from the expected values in the data. Retain the principal components that explain an acceptable proportion of total inertia. The acceptable level depends on your application. Ideally, the first one, two, or three components account for most of the total inertia.

If the minimum number of principal components needed does not match the number of components that you entered for the analysis, repeat the analysis using the appropriate number of components.

Analysis of Contingency Table

AxisInertiaProportionCumulative
10.03910.47200.4720
20.03040.36660.8385
30.01090.13110.9697
40.00250.03031.0000
Total0.0829   
Key Results: Axes, Proportion, Cumulative

These results show the decomposition of total inertia of a 10 x 5 contingency table into 4 components (axes). The total inertia explained by the four components is 0.0829. Of the total inertia, the first component accounts for 47.2% of the inertia (Proportion = 0.4720) and the second component accounts for 36.66% of the inertia (Proportion = 0.3666). Together, these 2 components account for 83.85% of the total inertia (Cumulative = 0.8385). Therefore, specifying 2 components for the analysis may be sufficient.

Step 2: Interpret the principal components

Use the quality values to determine the proportion of the row inertia or column inertia represented by the components. Quality is always a number between 0 and 1. Larger quality values indicate that the row or column is well represented by the components. Lower values indicate poorer representation. The quality values for the rows and columns can help you interpret the components.

Use the contribution values for the rows and/or columns assess which column and row categories contribute most to the inertia of each component. To visually interpret the components, use a row or column plot.

Row Contributions






Component  1Component  2
IDNameQualMassInertCoordCorrContrCoordCorrContr
1Geology0.9160.1070.137-0.0760.0550.016-0.3030.8610.322
2Biochemistry0.8810.0360.119-0.1800.1190.0300.4550.7620.248
3Chemistry0.6440.1630.021-0.0380.1340.006-0.0730.5100.029
4Zoology0.9290.1510.2300.3270.8460.413-0.1020.0830.052
5Physics0.8860.1430.196-0.3160.8800.365-0.0270.0060.003
6Engineering0.8700.1110.1520.1170.1210.0390.2920.7490.310
7Microbiology0.6800.0460.010-0.0130.0090.0000.1100.6710.018
8Botany0.6540.1080.0670.1790.6250.0880.0390.0290.005
9Statistics0.5610.0360.012-0.1250.5540.014-0.0140.0070.000
10Mathematics0.3190.0980.056-0.1070.2400.0290.0610.0790.012
Key Results: Qual, Contr, Row/Column plot

In this analysis, Minitab calculates two principal components. In the Row Contributions table, highest quality values occur for Zoology (0.929) and Geology (0.916). Therefore, these two rows are best represented by the two components. Mathematics has the poorest representation, with a quality value of 0.319.

Zoology (0.413) and Physics (0.365) contribute the most to the inertia of Component 1. Geology (0.322), Engineering (0.310, and Biochemistry (0.248) contribute the most to the inertia of Component 2.

The row plot shows the row principal coordinates. Component 1, which best explains Zoology and Physics, shows these two fields farthest from the origin, but with opposite signs. Therefore, component 1 contrasts the biological sciences Zoology and Botany with Physics. Component 2 contrasts Biochemistry and Engineering with Geology.

Step 3: Examine relationships among categories

Examine calculated inertia values for the row and column categories and look for possible associations. Categories with stronger associations have a higher inertia value, which indicates they contribute more to the total chi-squared value.

You can also use an asymmetric row or column plot to visually examine possible relationships. For a row plot, the closer a row profile is to a column vertex, the higher the row profile is with respect to the column category. For a column plot, the closer a column profile is to a row vertex, the higher the column profile is with respect to the row category.

Row Contributions






Component  1Component  2
IDNameQualMassInertCoordCorrContrCoordCorrContr
1Geology0.9160.1070.137-0.0760.0550.016-0.3030.8610.322
2Biochemistry0.8810.0360.119-0.1800.1190.0300.4550.7620.248
3Chemistry0.6440.1630.021-0.0380.1340.006-0.0730.5100.029
4Zoology0.9290.1510.2300.3270.8460.413-0.1020.0830.052
5Physics0.8860.1430.196-0.3160.8800.365-0.0270.0060.003
6Engineering0.8700.1110.1520.1170.1210.0390.2920.7490.310
7Microbiology0.6800.0460.010-0.0130.0090.0000.1100.6710.018
8Botany0.6540.1080.0670.1790.6250.0880.0390.0290.005
9Statistics0.5610.0360.012-0.1250.5540.014-0.0140.0070.000
10Mathematics0.3190.0980.056-0.1070.2400.0290.0610.0790.012
Key Results: Inert, Asymmetric Row/Column plot

In the Row Contributions table, the column labeled Inert is the proportion of the total inertia contributed by each row. Thus, Geology contributes 13.7% to the total chi-squared statistic. Zoology has the highest inertia value (0.230). Therefore Zoology contributes 23% to the total chi-squared value and has a stronger association with the column categories (funding) than the other row categories.

In the asymmetric row plot, the row points represent academic disciplines and the column points represent funding levels (A is the highest level of funding and D is the lowest. E indicates no funding). Biochemistry is closest to column category E, implying that biochemistry as a discipline has the highest percentage of unfunded researchers in this study.