
Table 4. Confusion matrix for the binary conversion of the likelihood of ARS estimated by the environmental NPMR model for the building set (n = 1444) and the validation set (n = 364), using the cutoff value that maximized the true skill statistic (TSSmax). FPR is the false positive rate, FNR is the false negative rate, TPR is the true positive rate, and TNR is the true negative rate. The second part of the table reports a set of performance metrics for this binary conversion, including prevalence, accuracy, precision, the area under the receiver operating characteristic curve (AUC, range: 0 to 1, with larger numbers indicating a better fit), the root-mean-square error (RMSE, range: 0 to infinity, with smaller numbers indicating a better fit), and the Brier score (range: 0 to 1, with lower scores indicating better calibration of the predictions).

From: Ecological correlates of blue whale movement behavior and its predictability in the California Current Ecosystem during the summer-fall feeding season

Confusion matrix:

| Observations | Observed class | Predicted absence | Predicted presence | Classification error |
|---|---|---|---|---|
| Building set | Absence | 149 | 77 | 0.34 (FPR) |
| Building set | Presence | 401 | 665 | 0.38 (FNR) |
| Validation set | Absence | 17 | 47 | 0.73 (FPR) |
| Validation set | Presence | 12 | 112 | 0.09 (FNR) |
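
As a reading aid, the classification errors can be recomputed from the four counts of each data set. The sketch below is illustrative only: the `Confusion` class and its field names are hypothetical, and the formula TSS = TPR + TNR - 1 is the standard definition of the true skill statistic, which the table does not spell out. It uses the counts exactly as printed, which sum to 1292 and 188 rather than to the n values given in the caption. Recomputed rates agree with the reported ones to within about 0.01, presumably because of rounding.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Confusion:
    """Counts for one data set (rows: observations, columns: predictions)."""
    tn: int  # observed absence, predicted absence
    fp: int  # observed absence, predicted presence
    fn: int  # observed presence, predicted absence
    tp: int  # observed presence, predicted presence

    def rates(self) -> dict:
        fpr = self.fp / (self.fp + self.tn)  # error rate among observed absences
        fnr = self.fn / (self.fn + self.tp)  # error rate among observed presences
        tnr = 1 - fpr
        tpr = 1 - fnr
        # Standard TSS definition (assumed, not stated in the table).
        return {"FPR": fpr, "FNR": fnr, "TNR": tnr, "TPR": tpr,
                "TSS": tpr + tnr - 1}


# Counts copied from the confusion matrix.
building = Confusion(tn=149, fp=77, fn=401, tp=665)
validation = Confusion(tn=17, fp=47, fn=12, tp=112)

for name, cm in (("building", building), ("validation", validation)):
    print(name, {k: round(v, 2) for k, v in cm.rates().items()})
```
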
Performance metrics:

| Metric | Building set | Validation set |
|---|---|---|
| TSSmax | 0.28 | 0.18 |
| Cutoff | 0.84 | 0.61 |
| Observed prevalence (a) | 0.83 | 0.66 |
| Predicted prevalence (a) | 0.57 | 0.85 |
| TNR (1-FPR) | 0.66 | 0.27 |
| TPR (1-FNR) | 0.63 | 0.91 |
| Accuracy (b) | 0.63 | 0.69 |
| Precision (c) | 0.90 | 0.70 |
| AUC | 0.69 | 0.57 |
| RMSE | 0.36 | 0.47 |
| Brier score (d) | 0.13 | 0.22 |
  a. Prevalence is estimated as: presences/total
  b. Accuracy is estimated as: (true positives + true negatives)/(observed presences + observed absences)
  c. Precision is estimated as: true positives/(true positives + false positives)
  d. The Brier score is computed as the mean of the squared residuals
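
The footnote formulas can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' code: all function names are hypothetical, the `observed` and `predicted` lists are made-up toy values (the per-observation likelihoods are not given here), a prediction is assumed to count as a presence when its likelihood is greater than or equal to the cutoff, and the cutoff search is a plain scan over a grid of candidate values. The RMSE is taken as the square root of the Brier score, which is consistent with the reported pairs (0.36 and 0.13, 0.47 and 0.22) but is likewise an assumption.

```python
import math


def prevalence(presences, total):
    """Footnote a: presences / total."""
    return presences / total


def accuracy(tp, tn, fp, fn):
    """Footnote b: correct classifications over all observations."""
    return (tp + tn) / (tp + fn + tn + fp)


def precision(tp, fp):
    """Footnote c: true positives over all predicted presences."""
    return tp / (tp + fp)


def brier_score(observed, predicted):
    """Footnote d: mean squared residual between 0/1 outcomes and likelihoods."""
    return sum((p - o) ** 2 for o, p in zip(observed, predicted)) / len(observed)


def tss(observed, predicted, cutoff):
    """TSS = TPR + TNR - 1, calling a presence when likelihood >= cutoff (assumed)."""
    tp = sum(1 for o, p in zip(observed, predicted) if o == 1 and p >= cutoff)
    tn = sum(1 for o, p in zip(observed, predicted) if o == 0 and p < cutoff)
    n_pres = sum(observed)
    n_abs = len(observed) - n_pres
    return tp / n_pres + tn / n_abs - 1


def tss_max_cutoff(observed, predicted, cutoffs):
    """Return the first candidate cutoff that maximizes TSS."""
    return max(cutoffs, key=lambda c: tss(observed, predicted, c))


# Count-based metrics for the building set, using the counts in the table.
tp, fn, tn, fp = 665, 401, 149, 77
total = tp + fn + tn + fp
print(round(prevalence(tp + fn, total), 2))  # observed prevalence
print(round(prevalence(tp + fp, total), 2))  # predicted prevalence
print(round(accuracy(tp, tn, fp, fn), 2))
print(round(precision(tp, fp), 2))

# Toy data (hypothetical) for the likelihood-based metrics.
observed = [0, 0, 1, 1, 1, 0, 1, 1]
predicted = [0.2, 0.6, 0.7, 0.9, 0.4, 0.3, 0.8, 0.95]
grid = [i / 100 for i in range(1, 100)]
best = tss_max_cutoff(observed, predicted, grid)
bs = brier_score(observed, predicted)
print(best, round(tss(observed, predicted, best), 2), round(bs, 3), round(math.sqrt(bs), 3))
```

Because the cutoff is tuned on the building set, a drop in TSS on the validation set, as reported above, is the expected direction of change.
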