In this example, a compliance team is concerned about fraud detection accuracy in the
automotive industry; however, the data need prep before analysis can begin. Follow
these steps to prepare
insurance_fraud_data.csv for further analysis. To make
these modifications, select the column and open
Data Prep Options to access the column cleanup options.
- Open Insurance Fraud Data in the Minitab Data Center.
- For claim_number, change the data type from numeric to text.
- For claim_number, prepend # to the column values.
- For age_of_driver, filter to only include drivers that are less than or
equal to 100 years old.
- In gender, change M to male and F to
female.
- For annual_income, filter to only include drivers that make more than 1.
- For address_change, change the data type from numeric to text.
- In address_change, change 1 to yes and 0 to
no.
- For zip code, change the data type from numeric to text.
- Use Advanced Sort to sort by fraud, injury claim, and ZIP code.