Example of Multiple Correspondence Analysis

A researcher wants to explore how different categories of automobile accidents relate to each other. The automobile accidents are classified according to type, severity, car size, and driver ejection.

The researcher performs multiple correspondence analysis to examine how the categories in the four-way table relate to each other.

  1. Open the sample data set, AutomobileAccident.MTW.
  2. Choose Stat > Multivariate > Multiple Correspondence Analysis.
  3. Select Categorical variables and enter CarWt DrEject AccType AccSever.
  4. In Category names, enter AccNames.
  5. In Number of components, enter 2.
  6. Click Graphs and select Display column plot.
  7. Click OK in each dialog box.

Interpret the results

The Analysis of Indicator Matrix summarizes the decomposition of the variables and shows the inertia accounted for by each component. Of the total inertia of 1, approximately 40.3% is accounted for by component (axis) 1, 25.2% is accounted for by component (axis) 2, 19% is accounted for by component (axis) 3, and 15.5% is accounted for by component (axis) 4. Cumulatively, approximately 84.5% of the total inertia is accounted for by the first 3 components.

Using the Column Contributions table and the column plot, the researcher can interpret the components in relation to the categories. In the table, the quality value (Qual) indicates the proportion of the column inertia represented by the calculated components. In this example, the car-size categories Small and Standard are best represented by the two components (Qual = 0.965). The ejection categories are the least represented by the two components (Qual = 0.474). The column labeled Mass is the proportion of the class in the whole data set. Eject (0.037) and Rollover (0.057) are relatively rare.

The Coord column provides the column coordinates, which are displayed in the column plot. Correlation (Corr) represents the contribution of the respective component to the inertia of each category. For example, component 1 accounts for 61% of the inertia of the accident type categories (Collis and Rollover). Component 2 accounts for 93.6% of the inertia of the car size categories Small and Standard.

Contr, the contribution of the row to the axis inertia, shows that Eject (Contr = 0.250) and Rollover (Contr = 0.291 contribute the most to component 1. The car sizes Small (0.771) and Standard (0.158) contribute most to component 2.

In the column plot, Eject and Rollover are most distant from the origin along the horizontal axis for component 1. This corresponds with the relatively high contribution (Contr) for these categories for component 1. Because Eject and No Eject, as well as Severe and NoSevere, are on opposite sides of the origin, component 1 contrasts these category values. Component 2 is shown on the vertical axis. The Small car size located far from the other categories on one side of the vertical axis. Therefore, component 2 contrasts the Small car size with other categories.

However, these results should be interpreted with caution, because two components may not adequately explain the variability of these data.

Analysis of Indicator Matrix

AxisInertiaProportionCumulativeHistogram
10.40320.40320.4032******************************
20.25200.25200.6552******************
30.18990.18990.8451**************
40.15490.15491.0000***********
Total1.0000     

Column Contributions






Component 1Component 2
IDNameQualMassInertCoordCorrContrCoordCorrContr
1Small0.96550.04240.20760.38140.02970.0153-2.13940.93570.7707
2Standard0.96550.20760.0424-0.07800.02970.00310.43740.93570.1576
3NoEject0.47390.21340.0366-0.28440.47170.0428-0.01970.00230.0003
4Eject0.47390.03660.21341.65870.47170.24970.11510.00230.0019
5Collis0.61330.19260.0574-0.42640.60950.08680.03380.00380.0009
6Rollover0.61330.05740.19261.42940.60950.2911-0.11330.00380.0029
7NoSevere0.56800.13530.1147-0.65230.50180.1428-0.23710.06630.0302
8Severe0.56800.11470.13530.76920.50180.16840.27950.06630.0356