Each deployment has a single champion model that is used to produce the predictions for the production environment. The deployment also allows up to two challenger models that can run the same data and provide performance metrics to your team. If and when a challenger model becomes the best model, you can easily promote it to be the new champion model.
The Models tab contains a list of the deployment’s competing models. Use the following steps to add up to two challenger models.
Use the model performance metrics and graphs on the Performance tab to evaluate the champion and challenger models. Only the champion model is live at any given time. All the prediction requests are used by the current champion model in real time. Once a day, the same prediction requests are replayed against the challengers for analytical purposes.
By assessing the challenger models with the same production data as the champion model, you can easily compare the available models in the deployment.
Review the performance metrics to determine whether one of the challenger models has better performance than the champion model. If a challenger model has better performance than the champion model, you can promote it to be the new champion model.