Assess each model's calibration, discrimination (AUC), and overall accuracy (Brier score). - Use proper scoring rules (see Chapter 6). - Evaluate on a held-out test set or via cross-validation.