Using datasets from a large school-based myopia study, we developed and validated several ML models for predicting cycloplegic SER and myopia status based on the noncycloplegic clinical data. When the cycloplegic refractive error cannot be obtained, ML models have the potential to be used to estimate cycloplegic refractive error of an individual child as well as to determine the myopia prevalence rate in a specific population. In both the training and validation datasets, we found that ML models performed very well in predicting cycloplegic SER and myopia status. The XGBoost model performed best for predicting cycloplegic SER and the random forest model performed best or predicting myopia status. With high sensitivity (>91%) and specificity (>96%), these best-performing models accurately predicted myopia prevalence rate in all age groups.
Using the same datasets as this study, we previously developed and validated a traditional regression-based prediction model for predicting cycloplegic SER.
29 The model had a sensitivity of 87% and specificity of 98% in the training dataset and sensitivity of 84% and specificity of 98% in the validation dataset for detecting myopia. The combination of predicted cycloplegic SER from the traditional regression-based prediction model and UCVA (i.e., defining myopia positive as SER ≤−0.5 D or UCVA 20/40 or worse in either eye) increased sensitivity and specificity, though the ML models discussed here outperformed our previous regression-based prediction models. This improved prediction performance may be due to the power of ML models to accommodate complex nonlinear relationships between predictors and refractive error outcomes. These findings support that ML models can be useful in epidemiological studies of myopia in children, particularly when administering cycloplegic agents is not feasible.
For our ML models, we considered ocular biometric measures that can be reliably measured without cycloplegia, demographics, and other easily obtainable measures such as UCVA. Based on the feature importance analysis, we found that the noncycloplegic SER, AL, AL/CR ratio and UCVA were consistently among the top four important features for predicting cycloplegic SER. These measures are well known to be associated with cycloplegic refractive error and have been used for predicting cycloplegic refractive error in prior studies.
12,19,20,28,29
Several studies have attempted to detect myopia in children without the application of cycloplegic eyedrops using statistical prediction models.
15–22 These statistical models explored the prediction of cycloplegic refractive error using various predictors including ocular biometric measures obtained under noncycloplegic conditions,
19 noncycloplegic refractive error and UCVA,
21 a combination of ocular biometric measures, noncycloplegic refractive error and UCVA.
29 Given the variability in study design, previous studies yielded mixed results, with
R2 ranging from 0.26 to 0.93. In particular, Sankaridurg et al.
21 reported a regression-based prediction model that used age, UCVA, and noncycloplegic refractive error for predicting cycloplegic refractive error in a total of 6825 Chinese children aged four to 15 years. The model yielded
R2 of 0.91, a sensitivity of 89.3% and a specificity of 97.6% for predicting myopia. The regression model developed specifically for children with UCVA worse than 6/6 yielded
R2 of 0.93 with a sensitivity of 89.5% and a specificity of 97.4%.
21 In comparison to previous studies using regression-based predictions, our study is the first to apply modern ML models that considered comprehensive predictors which are all readily obtainable without cycloplegia. Our ML models yielded better performance than previous models. Because our ML models were developed in a large sample and independently validated in another large sample of children of many ages (5 to 18 years old) with varied refractive error status (from −14.1 to 8.4 D), our model has great potential to be applicable to population-based myopia research, when measuring cycloplegic refractive error in all children is not feasible.
In this study, we developed ML models for predicting both cycloplegic refractive error (a continuous variable) and myopia status (a binary variable). Predicting cycloplegic refractive error can be useful both for detecting myopia and quantifying myopia severity, because the predicted cycloplegic SER (a continuous measure) can be later used to define the presence of myopia (e.g., predicted cycloplegic SER ≤−0.5 D as myopia) and to determine the severity of myopia based on the magnitude of the predicted cycloplegic SER. Among the 6 ML models for predicting cycloplegic SER, we found the XGBoost performed best when considering its performance in the validation dataset. This model not only performed well for predicting cycloplegic SER (R2 = 0.974, MAE = 0.256 D in training; R2 = 0.935, MAE = 0.393 D in validation), it also yielded high sensitivity (94% in training, 91% in validation) and specificity (99% in training, 97% in validation) for detecting myopia.
Estimating the prevalence rate of myopia is the primary interest of many large epidemiological studies where defining myopia using cycloplegic refractive error may not be feasible. To address this, we evaluated 4 ML models for directly predicting myopia status and found that the random forest model performed best. The random forest model provided higher sensitivity (with similar specificity) and more accurate prediction of myopia prevalence rate than using the XGBoost predicted cycloplegic SER. The myopia predicted prevalence rates from both XGBoost and random forest were very close to the observed myopia prevalence rate, supporting their potential use in the future epidemiological studies for estimating myopia rate. Because refractive error changes with age, we evaluated the performance of ML models for each age group and found that ML models performed consistently well across all groups five to 18 years, with small differences between predicted and observed myopia rate. These robust results support the use of ML models for predicting refractive error or myopia status in children of various ages.
The generalizability of our study may be slightly limited because of its design of using 0.5% tropicamide for cycloplegia, a NIDEK autorefractor, and an all-Chinese participant population. Thus the findings from this study may not be directly generalizable to other cycloplegic agents, types of autorefractors, or other races/ethnicities. These ML prediction models would likely require additional training before being employed in other settings. In this cross-sectional study, we were only able to evaluate the ML models for predicting the cycloplegic refractive error at a single time point. Future longitudinal studies are needed to evaluate how these ML models perform for monitoring the progression of refractive error over time or other cycloplegic medications.
In conclusion, we developed and validated several ML models for predicting cycloplegic refractive error and myopia status using readily available demographics and noncycloplegic measurements. These ML models (particularly the XGBoost and random forest) may help address epidemiological concerns about study attrition due to refusal of participation with cycloplegic eyedrops. The application of our ML models may provide more accurate estimates of myopia prevalence rate, severity of myopia and better determination of risk factors of myopia than noncycloplegic refractive error.