The initial measurements of the participants in the dataset are listed in
Table 1. The final ortho-K lens parameters obtained after completing the lens fitting process are listed in
Table 2. Of the 297 patients, 121 were male and 176 were female. Among the 547 eyes analyzed, an OAD of 10.5 mm was prescribed for 368 eyes and an OAD of 11.0 mm was prescribed for 179 eyes.
The fivefold cross-validation results for the binary classification problems (toric option and OAD prediction) in the training dataset are shown in
Figure 3. Extensive performance data are presented in
Supplementary Table S1. Toric prediction determines whether an astigmatic lens (CRT dual-axis) is required. In this task, among the classifiers evaluated, the CatBoost classifier demonstrated the best performance, achieving accuracy, precision, recall, and F1 scores of 0.940, 0.941, 0.940, and 0.940, respectively. In addition, for the OAD classification model, the Extra Trees classifier outperformed the others, achieving accuracy, precision, recall, and F1 scores of 0.872, 0.895, 0.872, and 0.875, respectively.
As shown in
Figure 4, these algorithms exhibited similar performance on the test dataset. The detailed metrics are listed in
Table 3. Performance was evaluated using a test set containing 110 eyes, and the CatBoost classifier showed an accuracy of 0.927, precision of 0.931, recall of 0.927, and F1 of 0.929. In the confusion matrix, 86 of the 91 cases in which the final lens was non-toric (CRT) in the test dataset, and 16 of the 19 cases in which the final lens was CRT dual-axis (toric) were correctly classified. For the OAD classification in the test dataset, the Extra Trees classifier showed accuracy, precision, recall, and F1 scores of 0.864, 0.868, 0.864, and 0.865, respectively. The receiver operating characteristic (ROC) curves for the binary classification problems are shown in
Supplementary Figure S3. The area under the ROC curves showed 0.970 and 0.920 for the toric option and OAD prediction tasks, respectively.
Figure 5 presents the performance of the regression tasks (BC, RZD1, RZD2, and LZA) via fivefold cross-validation of the training dataset. The detailed performance data are shown in
Supplementary Table S2. For BC prediction, the least-angle regression showed the highest performance, with MAE, RMSE, and
R2 metrics of 0.054 mm, 0.083 mm, and 0.948, respectively. In the RZD1 prediction, the least-angle regression was the most accurate, with an MAE of 3.031 µm, RMSE of 8.574 µm, and
R2 of 0.708. The best model for the RZD2 prediction was the CatBoost regressor, which recorded an MAE of 3.775 µm, RMSE of 11.703 µm, and
R2 of 0.791. For the LZA prediction, the CatBoost regressor showed an MAE of 0.121°, RMSE of 0.346°, and
R2 of 0.798. In the case of LensSag, Extra Trees exhibited the best performance, with MAE, RMSE, and
R2 values of 4.372 µm, 6.008 µm, and 0.921, respectively.
As shown in
Table 4, these regression algorithms generally showed slightly lower performance in the test dataset compared to that in the fivefold cross-validation. For BC prediction using least-angle regression, the MAE, RMSE, and
R2 metrics were 0.052 mm, 0.105 mm, and 0.943, respectively. In the RZD1 prediction using least-angle regression, the MAE, RMSE, and
R2 metrics were 2.727 µm, 8.257 µm, and 0.718, respectively. For the RZD2 prediction, CatBoost showed MAE, RMSE, and R
2 values of 7.045 µm, 13.693 µm, and 0.704, respectively. For the LZA prediction, CatBoost showed MAE, RMSE, and
R2 values of 0.118°, 0.344°, and 0.723, respectively. The LensSag predictions obtained using Extra Trees showed MAE, RMSE, and
R2 values of 5.215 µm, 6.875 µm, and 0.921, respectively.
The developed methods were compared with the ILS based on manifest refraction and keratometry, using a test dataset consisting of 110 cases (
Figure 6). Regarding BC prediction, there was no significant difference between the machine-learning prediction and the achieved lens BCs (
P = 0.817). However, the ILS-based BC calculation was significantly lower than the values ascertained by both the machine-learning prediction (
P < 0.001) and the achieved lens BCs (
P < 0.001). Similar results were observed for the prediction of RZD1. The machine-learning prediction and the achieved lens RZD1 showed no significant difference, but the ILS-based calculations showed significant differences. In the LZA prediction, there were no significant differences among the machine-learning prediction, ILS-based calculation, and achieved lens LZAs.
Figure 7 shows the feature importance and two-way partial dependence plots for each prediction task. Toric option prediction is primarily affected by cylindrical refraction and keratometric astigmatism. WTW and ACD were the most important predictors of OAD. Spherical and cylindrical refraction, flattest and steepest keratometry, and flat e values were the major factors predicting BC, RZD1, RZD2, and LZA. Finally, a machine-learning calculator was developed for the web-based interface (
https://visuworks-dev.github.io/ParagonCRT-Calculator/) based on this analysis. A screenshot of the developed calculator is shown in
Figure 8.