The accuracies of the seven individual CNN models for the two-class problem, based on corneal anterior and posterior EC, anterior and posterior elevation EL, anterior and posterior SAG, and CT maps for detecting KCN were 97.6%, 96.9%, 95.7%, 97.9%, 98.3%, 95.2%, and 95.5%, respectively. Accordingly, for the three-class problem, the obtained accuracies were 75.5%, 72.1%,76.6%, 75.3%, 80.8%, 72%, and 73.8%, respectively. Based on the independent validation subset, for the two-class problem, the AUC, F1 score, and accuracy were 0.99, 0.92, and 92%, respectively; for the three-class problem, the AUC, F1 score, and accuracy were 0.81, 0.69, and 68.7%, respectively. Based on the merged development subset (542 cases) and independent validation subset (150 eyes), and randomly splitting the whole set into 50%/25%/25% training/validation/testing, for the two-class problem the AUC, F1 score, and accuracy were 0.99, 0.98, and 97.7%, respectively; for the three-class problem, the AUC, F1 score, and accuracy were 0.96, 0.85, and 84.4%, respectively.
Table 1 illustrates a summary of the AUC, F1 score, and accuracy metrics.