Example of Binned Scatterplot with the mean of a third variable

A research team for a large jewelry retailer wants to investigate the relationship between carat, color, and price of various diamonds. A scatterplot is difficult to interpret because the data set is so large. Thus, the team decides to create a binned scatterplot to make it easier to interpret the graph.

Note

Usually, the color of a diamond is represented with a letter. Because all the variables must be numeric, the researchers code the color of the diamond to a numeric scale that ranges from 0-6. The researchers gives colorless diamonds a higher number and gives light yellow diamonds a lower number.

  1. Open the sample data, Diamonds.MTW.
  2. Choose Graph > Binned Scatterplot.
  3. In Y-variable, enter Price.
  4. In X-variable, enter Carat.
  5. Select Mean of variable and enter Color in the box.
  6. Click OK.

Interpret the results

Minitab groups the data into rectangular bins. Minitab uses dark blue to dark red for the bins based on the color of the diamond. Darker shades of red correspond to higher values of the color variable. Darker shades of blue correspond to lower values of the color variable. The research team wants to determine how the carat and color affect the price. The price of the diamond increases as the number of carats increases. However, there is still a large variation in prices for diamonds with the same number of carats. The color of the diamond explains some of this variation. For diamonds with the same number of carats, a higher color number corresponds to a higher price.