August 2024
Volume 13, Issue 8
Open Access
Refractive Intervention  |   August 2024
Machine Learning Models for Predicting Cycloplegic Refractive Error and Myopia Status Based on Non-Cycloplegic Data in Chinese Students
Author Affiliations & Notes
  • Bole Ying
    Lower Merion High School, Ardmore, PA, USA
  • Rajat S. Chandra
    University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
  • Jianyong Wang
    Department of Ophthalmology, The First Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou, P. R. China
  • Hongguang Cui
    Department of Ophthalmology, The First Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou, P. R. China
  • Julius T. Oatts
    Department of Ophthalmology, University of California San Francisco, San Francisco, CA, USA
  • Correspondence: Julius T. Oatts, Department of Ophthalmology, University of California at San Francisco, San Francisco, CA 94115, USA. e-mail: julius.oatts@ucsf.edu 
  • Hongguang Cui, 79 Qingchun Road, The First Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou 310003, P. R. China. e-mail: 1189002@zju.edu.cn 
  • Footnotes
     BY and RSC contributed equally to this work.
Translational Vision Science & Technology August 2024, Vol.13, 16. doi:https://doi.org/10.1167/tvst.13.8.16
  • Views
  • PDF
  • Share
  • Tools
    • Alerts
      ×
      This feature is available to authenticated users only.
      Sign In or Create an Account ×
    • Get Citation

      Bole Ying, Rajat S. Chandra, Jianyong Wang, Hongguang Cui, Julius T. Oatts; Machine Learning Models for Predicting Cycloplegic Refractive Error and Myopia Status Based on Non-Cycloplegic Data in Chinese Students. Trans. Vis. Sci. Tech. 2024;13(8):16. https://doi.org/10.1167/tvst.13.8.16.

      Download citation file:


      © ARVO (1962-2015); The Authors (2016-present)

      ×
  • Supplements
Abstract

Purpose: To develop and validate machine learning (ML) models for predicting cycloplegic refractive error and myopia status using noncycloplegic refractive error and biometric data.

Methods: Cross-sectional study of children aged five to 18 years who underwent biometry and autorefraction before and after cycloplegia. Myopia was defined as cycloplegic spherical equivalent refraction (SER) ≤−0.5 Diopter (D). Models were evaluated for predicting SER using R2 and mean absolute error (MAE) and myopia status using area under the receiver operating characteristic (ROC) curve (AUC). Best-performing models were further evaluated using sensitivity/specificity and comparison of observed versus predicted myopia prevalence rate overall and in each age group. Independent data sets were used for training (n = 1938) and validation (n = 1476).

Results: In the validation dataset, ML models predicted cycloplegic SER with high R2 (0.913–0.935) and low MAE (0.393–0.480 D). The AUC for predicting myopia was high (0.984–0.987). The best-performing model for SER (XGBoost) had high sensitivity and specificity (91.1% and 97.2%). Random forest (RF), the best-performing model for myopia, had high sensitivity and specificity (92.2% and 96.9%). Within each age group, difference between predicted and actual myopia prevalence was within 4%.

Conclusions: Using noncycloplegic refractive error and ocular biometric data, ML models performed well for predicting cycloplegic SER and myopia status. When measuring cycloplegic SER is not feasible, ML may provide a useful tool for estimating cycloplegic SER and myopia prevalence rate in epidemiological studies.

Translational Relevance: Using ML to predict cycloplegic refraction based on noncycloplegic data is a powerful tool for large, population-based studies of refractive error.

Introduction
The prevalence of myopia is growing worldwide, and is already at epidemic levels in some countries, particularly among the younger generations of East and Southeast Asia.1 The global number of individuals with myopia is predicted to reach 4.8 billion by 2050.1 In children, the prevalence of myopia increases with age until age 18, with young adult myopia rates as high as 70% to 90% in some parts of China.2 Myopia can be associated with several vision-threatening conditions including myopic maculopathy, optic neuropathy, and retinal detachment.3 Thus closely monitoring myopia prevalence, severity, and progression to high myopia is warranted at both individual and population levels to reduce the burden of myopia. 
Refractive error measured after cycloplegia is the gold standard for detecting myopia and quantifying myopia severity.4 However, measuring cycloplegic refractive error can be challenging in children, particularly in large population-based epidemiology studies, which require enormous time and resources. Non-cycloplegic refractive error is still commonly used for determining the presence or severity of myopia,58 despite its well-documented tendency to overestimate myopia prevalence and severity. Previous studies consistently demonstrated substantial differences between cycloplegic and noncycloplegic refractive error measurements, with mean spherical equivalent differences ranging from 0.60 to 1.23 D,914 suggesting noncycloplegic refractive error overestimating the prevalence and severity of myopia in the pediatric population. Thus strategies to improve myopia detection and the classification of myopia severity under noncycloplegic conditions are needed. 
Attempts have been made to develop prediction models for cycloplegic refractive error using measurements obtained without cycloplegia.1522 These prediction models are based on traditional linear regression or logistic regression models that use various predictors including demographics, ocular biometric measures, noncycloplegic refractive error and uncorrected visual acuity (UCVA). These prediction models have been variably successful (R2 ranging from 0.26 to 0.93), likely because of variations in the selected predictors, children's ages, and their refractive error.1521 
In recent years, machine learning (ML) models, including random forest, lasso, and gradient boosting models, have demonstrated great potential for application in ophthalmology and vision research,2325 including our previous works for predicting treatment burden and vision response to anti-vascular endothelial growth factor (anti-VEGF) treatment for neovascular age-related macular degeneration.26,27 In contrast to traditional regression models that usually assume the linear relationship between predictors and outcome, ML models can accommodate complex and nonlinear relationships between predictors and outcome and thus have great potential to improve prediction performance. 
In this study, we aim to develop and validate ML models for predicting cycloplegic refractive error and myopia status using easily obtainable measures under noncycloplegic conditions in large number of Chinese school students aged five to 18 years from two cities in China. The models included demographics, noncycloplegic refractive error, UCVA, and ocular biometric measurements. We hypothesized that ML models can accurately predict cycloplegic refractive error and myopia status, thus providing a useful tool to determine cycloplegic refractive error and myopia status for a child, and myopia prevalence rate in populations where cycloplegic refractive error cannot be obtained. 
Methods
This is a secondary analysis of data from a cross-sectional school-based study of myopia conducted in two cities (Jinyun, Hangzhou) of Zhejiang province, China. Details of the study have been reported in previous publications.12,28,29 In brief, from October 2020 to January 2021, students aged five to 18 years were enrolled from schools in Jinyun (rural; n = 1938) and Hangzhou (urban; n = 1498). Human subject research approval was obtained from Zhejiang University and the local Administration of the Education and School Board. Written informed consent was obtained from each participant's legal parent or guardian. The study followed the tenets of the Declaration of Helsinki. 
Eye Examinations and Ocular Biometry
All study participants underwent a comprehensive eye examination by an optometrist or ophthalmologist. This included UCVA testing with a retro-illuminated logMAR chart with tumbling-E optotypes and ocular biometry using the NIDEK A-scan (Nidek, Tokyo, Japan) under noncycloplegic conditions (biometric parameters recorded included AL, CR, central corneal thickness, and anterior chamber depth). Participants also underwent autorefraction before and after cycloplegia using a table-mounted NIDEK autorefractor (Model ARK-510A; Nidek). For cycloplegic autorefraction, one drop of 0.5% tropicamide was instilled in each eye every five minutes for four administrations and cycloplegic refractive error was taken 30 minutes after the last administration. For both noncycloplegic and cycloplegic refractive error, three readings were taken from each eye. If the difference between any of two readings of sphere or cylinder from an eye was greater than 0.5 D, the refractive error measurement was repeated. For each eye, the average of three readings of refractive error was used for statistical analysis. 
Machine Learning Models
We applied ML models for predicting two outcome measures of cycloplegic refractive error in each eye including (1) cycloplegic spherical equivalent refraction (SER) calculated as sphere + 0.5 * cylinder; (2) myopia defined as cycloplegic SER ≤−0.5 D considering the negative sign (i.e., −0.5 D or worse). The predictors for the ML models were: demographics (age, gender), wearing refractive correction (yes/no), eye laterality (left eye, right eye), noncycloplegic SER, biometric measures (AL, CR, AL/CR ratio, central corneal thickness, and anterior chamber depth), intraocular pressure and UCVA (in logMAR). To improve the generalizability of ML model performances, two large independent datasets were used. The dataset from 1938 students in Jinyun was used for training models through cross-validation, and the dataset from 1476 students in Hangzhou was used for independently validating models (i.e., for testing models). 
We evaluated six ML models for predicting cycloplegic SER: (1) support-vector machine (SVM), (2) random forest, (3) extreme gradient boosting (XGBoost), (4) multilayer perceptron (MLP) neural network, (5) linear regression, and (6) lasso regression. For predicting myopia status, we evaluated the same ML models excluding linear regression and lasso regression because these models are not as applicable for direct classification analysis. We implemented these ML models because they are among the most commonly used and, with the exception of the linear and lasso models, are all appropriate for making predictions for both regression (e.g., for continuous outcome) and classification (e.g., for discrete outcome) tasks. We have previously applied these models for predicting the number of pro re nata anti-VEGF injections and vision responses to anti-VEGF treatment for neovascular age-related macular degeneration.26,27 
The SVM model determines a separation between different classes and predicts a continuous outcome with high generalization ability by transforming data to high-dimensional spaces.30 Random forest models make predictions by combining the results of a collection of tree predictors while avoiding overfitting even with the addition of more trees.31 The XGBoost model is similar to the random forest model in that it uses an ensemble of tree predictors, but uses gradient tree boosting and is trained in an additive manner.32 The MLP model is a supervised neural network that starts with an input layer consisting of the given predictors as a set of nodes, ultimately producing an output for classification or regression after multiple transformations of values from previous layers with a weighted linear summation and nonlinear activation function.33 We used the linear regression and lasso regression models only for the regression analysis, as they are relatively simple, with lasso regression serving as an interpretable model because it can reduce the number of non-zero coefficients.34 
We tuned the hyperparameters for each of the models through 10-fold cross-validation using the training dataset, which helps avoid overfitting as the ML models learn how to predict the desired outcomes. We tuned hyperparameters of the regression models for predicting cycloplegic SER by optimizing R2 (a measure for quantifying the amount of variation in the outcome explained by the predictors). For the classification models for predicting myopia status, we tuned the hyperparameters by optimizing the F1 score (the harmonic mean of recall and precision). After tuning the hyperparameters through 10-fold cross-validation, the ML model was fit on the entire training dataset using these hyperparameters. Once trained, the ML models were evaluated on the validation dataset. The primary measures for evaluating model performance in the training and validation datasets were R2 and mean absolute error (MAE) for predicting cycloplegic SER of an eye, and accuracy and the area under the receiver operating characteristic (ROC) curve (AUC) for predicting myopia status of an eye. To quantify the importance of each feature used in the predictions, we used the permutation importance, defined by the decrease in model's R2 for predicting cycloplegic SER and the decrease in the model's AUC for predicting myopia status after shuffling each feature.31 Feature importance was assessed in both the training and validation datasets. 
We used Python 3.9 and its open-source package scikit-learn version 1.1.3 to implement the ML models.35 The code for our ML analysis can be provided upon reasonable request to the authors. We selected the best-performing ML models (based on the model performance in the validation dataset) for further statistical analysis. Using the prediction output from the best-performing ML model for predicting cycloplegic SER and the best-performing ML model for predicting myopia status, we calculated (1) the difference between predicted and observed cycloplegic SER overall, by age group, and by level of cycloplegic SER; (2) the predicted and observed myopia prevalence rate overall and by age group; and (3) sensitivity and specificity and their 95% confidence intervals (95% CI) for predicting myopia status. In these analyses, predicted person-level myopia positive was defined as predicted cycloplegic SER −0.5 D or worse in either eye from the ML model for predicting cycloplegic SER, or myopia positive in either eye from the ML model for predicting myopia status. These statistical analyses were performed in SAS v9.4 (SAS Institute Inc, Cary, NC, USA). 
Results
The training dataset included 3876 eyes from 1938 students in Jinyun. The validation dataset included 2951 eyes from 1476 students in Hangzhou, after excluding 22 students with missing data. Demographic and ocular characteristics of students in the training and validation datasets are summarized in Table 1. The age of the students ranged from five to 18 years, with a mean (SD) of 9.6 (3.6) years in the training dataset and 9.9 (3.6) years in the validation dataset. Females accounted for 51.5% in the training dataset and 49.6% in the validation dataset. Mean cycloplegic SER was −0.07 (2.11) D in the training dataset and −0.35 (2.26) D in the validation dataset. The percent of eyes with myopia was 29.6% in the training dataset and 36.6% in the validation dataset. The mean noncycloplegic SER was −1.1 (1.92) D in the training dataset and −1.2 (2.04) D in the validation dataset, with 51.1% and 53.7% of eyes having noncycloplegic SER ≤−0.5 D, respectively. UCVA of 20/200 or worse was found in 5.3% of eyes in the training dataset and 8.1% of eyes in the validation dataset, whereas 59.7% eyes in the training dataset and 36.4% eyes in the validation dataset had UCVA of 20/20 or better. Ocular biometric measurements were very similar between the training and validation datasets (respectively): AL 23.5 (1.3) mm and 23.6 (1.3) mm, CR 7.83 (0.25) mm and 7.84 (0.26) mm, AL/CR ratio 3.00 (0.14) and 3.01 (0.16), central corneal thickness 555 (31) µm and 549 (31) µm, anterior chamber depth 3.61 (0.32) mm and 3.56 (0.30) mm, intraocular pressure 17.2 (2.8) mm Hg and 17.6 (2.9) mm Hg. 
Table 1.
 
Characteristics of Study Participants in the Training and Validation Datasets
Table 1.
 
Characteristics of Study Participants in the Training and Validation Datasets
Machine Learning Models for Predicting Cycloplegic SER
We evaluated six ML models for predicting cycloplegic SER in each eye. In the training dataset, all six ML models predicted cycloplegic SER well (Table 2, online Supplementary Figure S1.1) with R2 ranging from 0.922 (lasso) to 0.994 (random forest), and MAE of 0.121 D (random forest) to 0.431 D (lasso). In the training dataset, the random forest model performed best (R2 = 0.994, MAE = 0.121 D) and XGBoost performed second best (R2 = 0.974, MAE = 0.256 D). In the validation dataset (Table 2, online Supplementary Figure S1.2), the XGBoost model performed best (R2 = 0.935, MAE = 0.393 D), SVM was second best (R2 = 0.934, MAE = 0.402 D), and the lasso model performed worst (R2 = 0.913, MAE = 0.480 D). The importance of features in each ML model in the training dataset and validation dataset is shown in online Supplementary Figures S2.1 and S2.2. Noncycloplegic SER, AL, AL/CR ratio and UCVA were consistently among the top four most important features for predicting cycloplegic SER in all six models except linear regression model where AL was replaced by CR as one of the top four most important features. 
Table 2.
 
Performance of Machine Learning Models for Predicting Cycloplegic Refractive Error in the Training and Validation Datasets
Table 2.
 
Performance of Machine Learning Models for Predicting Cycloplegic Refractive Error in the Training and Validation Datasets
Machine Learning Models for Predicting Myopia Status
We evaluated four ML models for predicting myopia status in each eye. Their performance results in the training and validation datasets are reported in Table 3. In the training dataset, the 4 ML models performed excellently for predicting myopia status with accuracy ranging from 0.965 (MLP) to 1.000 (XGBoost) and AUC ranging from 0.994 (SVM and MLP) to 1.000 (XGBoost) (Table 3, online Supplementary Figure S3.1). The XGBoost model performed best (accuracy and AUC both equal to 1), and the random forest model performed second best (accuracy of 0.980 and AUC of 0.998). In the validation dataset, the random forest model performed best with accuracy of 0.953 and AUC of 0.987, and the XGBoost model performed second best with an accuracy of 0.952 and with AUC of 0.984 (Table 3, online Supplementary Figure S3.2). The importance of features identified in each of these 4 ML models is shown in online Supplementary Figures S4.1 and S4.2. Noncycloplegic SER, AL, AL/CR ratio and UCVA were again consistently among the top four features for predicting myopia status. 
Table 3.
 
Performance of Machine Learning Models for Predicting Myopia Status in the Training and Validation Datasets
Table 3.
 
Performance of Machine Learning Models for Predicting Myopia Status in the Training and Validation Datasets
The Best-Performing Model for Predicting Cycloplegic SER by Age Group and Refractive Error Group
We further assessed model performance for XGBoost, the best-performing ML model for predicting cycloplegic SER, by evaluating its performance for subsets based on age and magnitude of cycloplegic SER. In the training dataset, the overall mean difference (SD) between predicted and observed cycloplegic SER was 0.00 (0.34) D (95% limits of agreement of −0.66 D to 0.66 D, Table 4). For all age group strata, the mean difference between predicted and observed cycloplegic SER was within 0.07 D and the mean absolute difference within 0.29 D for each age group. When evaluating differences between predicted and observed based on magnitude of cycloplegic SER, the model did best with moderate myopia and worst with high hyperopia, although all mean absolute differences were within 0.38 D. Similar trends were seen in the validation dataset (Table 4). 
Table 4.
 
Difference and Absolute Difference Between Predicted and Observed Cycloplegic Refractive Error by Age for the Best Performing Machine Learning Model XGBoost in the Training Dataset and Validation Dataset
Table 4.
 
Difference and Absolute Difference Between Predicted and Observed Cycloplegic Refractive Error by Age for the Best Performing Machine Learning Model XGBoost in the Training Dataset and Validation Dataset
The Best-Performing Models for Predicting Myopia Prevalence Rate by Age Group
We further evaluated the best-performing models (XGBoost, random forest) for predicting the myopia prevalence rate overall and by age group. The XGBoost model was selected because it performed best for predicting cycloplegic SER, and the predicted cycloplegic SER can be used to determine myopia status of a student. The random forest was selected because it performed best for predicting myopia status. 
Using the XGBoost predicted cycloplegic SER ≤−0.5 D in either eye to define myopia, the predicted overall myopia prevalence rate was 32.1% which was similar to the observed myopia prevalence rate of 33.6% in the training dataset (Table 5). The predicted myopia prevalence rate was similar to the observed rate in each age group, with differences ≤6% (Fig.). Random forest model provided even better prediction than XGBoost with predicted myopia prevalence rate closer to the observed myopia prevalence rate overall (32.8% predicted vs. 33.6% observed), and in each age group (differences in myopia prevalence rate all ≤3%, Fig.). Similar trends were seen in the validation dataset (Table 5Fig.). 
Table 5.
 
Predicted Myopia Prevalence Rate by Age Based on the Best-Performing Model for Cycloplegic SER and the Best-Performing Model for Predicting Myopia Status in Training and Validation Datasets
Table 5.
 
Predicted Myopia Prevalence Rate by Age Based on the Best-Performing Model for Cycloplegic SER and the Best-Performing Model for Predicting Myopia Status in Training and Validation Datasets
Figure.
 
The observed and predicted prevalence rate of myopia from best performing machine learning models in each age group in the training dataset (A), and in the validation dataset (B).
Figure.
 
The observed and predicted prevalence rate of myopia from best performing machine learning models in each age group in the training dataset (A), and in the validation dataset (B).
Sensitivity and Specificity of the Best-Performing Models for Predicting Myopia Status
The XGBoost had sensitivity of 94.2% (95% CI, 92.1%–95.8%) and specificity of 99.3% (95% CI, 98.8%–99.8%) in the training dataset, and similarly high in the validation dataset: sensitivity 91.1% (95% CI, 88.5%–93.2%), specificity 97.2% (95% CI, 95.9%–98.2%; Table 6). When the performance of the XGBoost was evaluated for predicting the severity of myopia, the XGBoost had a sensitivity of 95.0% (95% CI, 83.1%–99.4%) for high myopia (≤−6.0 D), 93.3% (95% CI, 91.3%–95.3%) for mild to moderate myopia (>−6.0, ≤−0.5 D), and a specificity of 99.3% (95% CI, 98.8%-99.8%) in the training dataset. In the validation dataset, the XGBoost had a sensitivity of 89.6% (95% CI, 77.3%–99.5%) for high myopia, 90.0% (87.2%–92.3%) for mild to moderate hyperopia, and a specificity of 97.2% (95% CI, 95.9%–98.2%) in the validation dataset. The random forest model yielded sensitivity of 95.7% (95% CI, 93.9%–97.1%) and specificity of 99.1% (95% CI, 98.5%–99.6%) in the training dataset and similarly high in the validation dataset: sensitivity 92.2% (95% CI, 89.9%–94.2%), specificity 96.9% (95% CI, 95.5%–98.0%; Table 6). When stratified by age group, the sensitivity and specificity remained high in both training and validation datasets (online Supplementary Table S7). 
Table 6.
 
Sensitivity and Specificity for Predicting Myopia*
Table 6.
 
Sensitivity and Specificity for Predicting Myopia*
Discussion
Using datasets from a large school-based myopia study, we developed and validated several ML models for predicting cycloplegic SER and myopia status based on the noncycloplegic clinical data. When the cycloplegic refractive error cannot be obtained, ML models have the potential to be used to estimate cycloplegic refractive error of an individual child as well as to determine the myopia prevalence rate in a specific population. In both the training and validation datasets, we found that ML models performed very well in predicting cycloplegic SER and myopia status. The XGBoost model performed best for predicting cycloplegic SER and the random forest model performed best or predicting myopia status. With high sensitivity (>91%) and specificity (>96%), these best-performing models accurately predicted myopia prevalence rate in all age groups. 
Using the same datasets as this study, we previously developed and validated a traditional regression-based prediction model for predicting cycloplegic SER.29 The model had a sensitivity of 87% and specificity of 98% in the training dataset and sensitivity of 84% and specificity of 98% in the validation dataset for detecting myopia. The combination of predicted cycloplegic SER from the traditional regression-based prediction model and UCVA (i.e., defining myopia positive as SER ≤−0.5 D or UCVA 20/40 or worse in either eye) increased sensitivity and specificity, though the ML models discussed here outperformed our previous regression-based prediction models. This improved prediction performance may be due to the power of ML models to accommodate complex nonlinear relationships between predictors and refractive error outcomes. These findings support that ML models can be useful in epidemiological studies of myopia in children, particularly when administering cycloplegic agents is not feasible. 
For our ML models, we considered ocular biometric measures that can be reliably measured without cycloplegia, demographics, and other easily obtainable measures such as UCVA. Based on the feature importance analysis, we found that the noncycloplegic SER, AL, AL/CR ratio and UCVA were consistently among the top four important features for predicting cycloplegic SER. These measures are well known to be associated with cycloplegic refractive error and have been used for predicting cycloplegic refractive error in prior studies.12,19,20,28,29 
Several studies have attempted to detect myopia in children without the application of cycloplegic eyedrops using statistical prediction models.1522 These statistical models explored the prediction of cycloplegic refractive error using various predictors including ocular biometric measures obtained under noncycloplegic conditions,19 noncycloplegic refractive error and UCVA,21 a combination of ocular biometric measures, noncycloplegic refractive error and UCVA.29 Given the variability in study design, previous studies yielded mixed results, with R2 ranging from 0.26 to 0.93. In particular, Sankaridurg et al.21 reported a regression-based prediction model that used age, UCVA, and noncycloplegic refractive error for predicting cycloplegic refractive error in a total of 6825 Chinese children aged four to 15 years. The model yielded R2 of 0.91, a sensitivity of 89.3% and a specificity of 97.6% for predicting myopia. The regression model developed specifically for children with UCVA worse than 6/6 yielded R2 of 0.93 with a sensitivity of 89.5% and a specificity of 97.4%.21 In comparison to previous studies using regression-based predictions, our study is the first to apply modern ML models that considered comprehensive predictors which are all readily obtainable without cycloplegia. Our ML models yielded better performance than previous models. Because our ML models were developed in a large sample and independently validated in another large sample of children of many ages (5 to 18 years old) with varied refractive error status (from −14.1 to 8.4 D), our model has great potential to be applicable to population-based myopia research, when measuring cycloplegic refractive error in all children is not feasible. 
In this study, we developed ML models for predicting both cycloplegic refractive error (a continuous variable) and myopia status (a binary variable). Predicting cycloplegic refractive error can be useful both for detecting myopia and quantifying myopia severity, because the predicted cycloplegic SER (a continuous measure) can be later used to define the presence of myopia (e.g., predicted cycloplegic SER ≤−0.5 D as myopia) and to determine the severity of myopia based on the magnitude of the predicted cycloplegic SER. Among the 6 ML models for predicting cycloplegic SER, we found the XGBoost performed best when considering its performance in the validation dataset. This model not only performed well for predicting cycloplegic SER (R2 = 0.974, MAE = 0.256 D in training; R2 = 0.935, MAE = 0.393 D in validation), it also yielded high sensitivity (94% in training, 91% in validation) and specificity (99% in training, 97% in validation) for detecting myopia. 
Estimating the prevalence rate of myopia is the primary interest of many large epidemiological studies where defining myopia using cycloplegic refractive error may not be feasible. To address this, we evaluated 4 ML models for directly predicting myopia status and found that the random forest model performed best. The random forest model provided higher sensitivity (with similar specificity) and more accurate prediction of myopia prevalence rate than using the XGBoost predicted cycloplegic SER. The myopia predicted prevalence rates from both XGBoost and random forest were very close to the observed myopia prevalence rate, supporting their potential use in the future epidemiological studies for estimating myopia rate. Because refractive error changes with age, we evaluated the performance of ML models for each age group and found that ML models performed consistently well across all groups five to 18 years, with small differences between predicted and observed myopia rate. These robust results support the use of ML models for predicting refractive error or myopia status in children of various ages. 
The generalizability of our study may be slightly limited because of its design of using 0.5% tropicamide for cycloplegia, a NIDEK autorefractor, and an all-Chinese participant population. Thus the findings from this study may not be directly generalizable to other cycloplegic agents, types of autorefractors, or other races/ethnicities. These ML prediction models would likely require additional training before being employed in other settings. In this cross-sectional study, we were only able to evaluate the ML models for predicting the cycloplegic refractive error at a single time point. Future longitudinal studies are needed to evaluate how these ML models perform for monitoring the progression of refractive error over time or other cycloplegic medications. 
In conclusion, we developed and validated several ML models for predicting cycloplegic refractive error and myopia status using readily available demographics and noncycloplegic measurements. These ML models (particularly the XGBoost and random forest) may help address epidemiological concerns about study attrition due to refusal of participation with cycloplegic eyedrops. The application of our ML models may provide more accurate estimates of myopia prevalence rate, severity of myopia and better determination of risk factors of myopia than noncycloplegic refractive error. 
Acknowledgments
Supported by Zhejiang Province Key Research Grant (grant number: 2021C03032) and NEI K23EY034893. 
Disclosure: B. Ying, None; R.S. Chandra, None; J. Wang, None; H. Cui, None; J.T. Oatts, None 
References
Sankaridurg P, Tahhan N, Kandel H, et al. IMI impact of myopia. Invest Ophthalmol Vis Sci. 2021; 62(5): 2. [CrossRef] [PubMed]
Morgan IG, Wu PC, Ostrin LA, et al. IMI risk factors for myopia. Invest Ophthalmol Vis Sci. 2021; 62(5): 3. [CrossRef] [PubMed]
Cho BJ, Shin JY, Yu HG. Complications of pathologic myopia. Eye Contact Lens. 2016; 42(1): 9–15. [CrossRef] [PubMed]
Flitcroft DI, He M, Jonas JB, et al. IMI—defining and classifying myopia: a proposed set of standards for clinical and epidemiologic studies. Invest Ophthalmol Vis Sci. 2019; 60(3): M20–M30. [CrossRef] [PubMed]
Koh V, Tan C, Nah G, et al. Correlation of structural and electrophysiological changes in the retina of young high myopes. Ophthalmic Physiol Opt. 2014; 34: 658–666. [CrossRef] [PubMed]
Wu HM, Seet B, Yap EP, Saw SM, Lim TH, Chia KS. Does education explain ethnic differences in myopia prevalence? A population-based study of young adult males in Singapore. Optom Vis Sci. 2001; 78: 234–239. [CrossRef] [PubMed]
Yotsukura E, Torii H, Inokuchi M, et al. Current prevalence of myopia and association of myopia with environmental factors among schoolchildren in Japan. JAMA Ophthalmol. 2019; 137: 1233–1239. [CrossRef] [PubMed]
Yang Y, Liao H, Zhao L, et al. Green space morphology and school myopia in China. JAMA Ophthalmol. 2024; 142: 115–122. [CrossRef] [PubMed]
Zhao J, Mao J, Luo R, Li F, Pokharel GP, Ellwein LB. Accuracy of noncycloplegic autorefraction in school-age children in China. Optom Vis Sci. 2004; 81: 49–55. [CrossRef] [PubMed]
Lin LLK, Shih YF, Hsiao CH, Su TC, Chen CJ, Hung PT. The cycloplegic effects of cyclopentolate and tropicamide on myopic children. J Ocul Pharmacol Th. 1998; 14: 331–335. [CrossRef]
Hu YY, Wu JF, Lu TL, et al. Effect of cycloplegia on the refractive status of children: the Shandong children eye study. PLoS One. 2015; 10(2): e0117482. [CrossRef] [PubMed]
Gu F, Gao HM, Zheng X, et al. Effect of cycloplegia on refractive error measure in Chinese school students. Ophthal Epidemiol. 2021.
Fotedar R, Rochtchina E, Morgan I, Wang JJ, Mitchell P, Rose KA. Necessity of cycloplegia for assessing refractive error in 12-year-old children: a population-based study. Am J Ophthalmol. 2007; 144: 307–309. [CrossRef] [PubMed]
Choong YF, Chen AH, Goh PP. A comparison of autorefraction and subjective refraction with and without cycloplegia in primary school children. Am J Ophthalmol. 2006; 142: 68–74. [CrossRef] [PubMed]
Foo VH, Verkicharla PK, Ikram MK, et al. Axial length/corneal radius of curvature ratio and myopia in 3-year-old children. Transl Vis Sci Technol. 2016; 5(1): 5. [CrossRef] [PubMed]
He X, Zou H, Lu L, et al. Axial length/corneal radius ratio: association with refractive state and role on myopia detection combined with visual acuity in Chinese schoolchildren. PLoS One. 2015; 10(2): e0111766. [CrossRef] [PubMed]
Ip JM, Huynh SC, Kifley A, et al. Variation of the contribution from axial length and other oculometric parameters to refraction by age and ethnicity. Invest Ophthalmol Vis Sci. 2007; 48(10): 4846–4853. [CrossRef] [PubMed]
Kimura S, Hasebe S, Miyata M, Hamasaki I, Ohtsuki H. Axial length measurement using partial coherence interferometry in myopic children: repeatability of the measurement and comparison with refractive components. Jpn J Ophthalmol. 2007; 51(2): 105–110. [CrossRef] [PubMed]
Magome K, Morishige N, Ueno A, Matsui TA, Uchio E. Prediction of cycloplegic refraction for noninvasive screening of children for refractive error. PLoS One. 2021; 16(3): e0248494. [CrossRef] [PubMed]
Ojaimi E, Rose KA, Morgan IG, et al. Distribution of ocular biometric parameters and refraction in a population-based study of Australian children. Invest Ophthalmol Vis Sci. 2005; 46(8): 2748–2754. [CrossRef] [PubMed]
Sankaridurg P, He X, Naduvilath T, et al. Comparison of noncycloplegic and cycloplegic autorefraction in categorizing refractive error data in children. Acta Ophthalmol. 2017; 95(7): e633–e640. [CrossRef] [PubMed]
Wang JY, Wang XY, Gao HM, et al. Prediction for cycloplegic refractive error in Chinese school students: model development and validation. Transl Vis Sci Techn. 2022; 11(1).
Wu JH, Liu TYA, Hsu WT, Ho JH, Lee CC. Performance and limitation of machine learning algorithms for diabetic retinopathy screening: meta-analysis. J Med Internet Res. 2021; 23(7): e23863. [CrossRef] [PubMed]
Cao K, Verspoor K, Sahebjada S, Baird PN. Accuracy of machine learning assisted detection of keratoconus: a systematic review and meta-analysis. J Clin Med. 2022; 11(3): 478. [CrossRef] [PubMed]
Wu JH, Nishida T, Weinreb RN, Lin JW. Performances of machine learning in detecting glaucoma using fundus and retinal optical coherence tomography images: a meta-analysis. Am J Ophthalmol. 2022; 237: 1–12. [CrossRef] [PubMed]
Chandra RS, Ying GS. Evaluation of multiple machine learning models for predicting number of anti-VEGF injections in the comparison of AMD treatment trials (CATT). Transl Vis Sci Technol. 2023; 12(1): 18. [CrossRef] [PubMed]
Chandra RS, Ying GS. Predicting visual acuity responses to anti-VEGF treatment in the comparison of AMD treatment trials (CATT) using machine learning. Ophthalmol Retina. 2023.
Zhao E, Wang X, Zhang H, et al. Ocular biometrics and uncorrected visual acuity for detecting myopia in Chinese school students. Sci Rep. 2022; 12(1): 18644. [CrossRef] [PubMed]
Wang J, Wang X, Gao HM, et al. Prediction for cycloplegic refractive error in Chinese school students: model development and validation. Transl Vis Sci Technol. 2022; 11(1): 15. [CrossRef]
Cortes C, Vapnik V. Support-vector networks. Machine Learning. 1995; 20: 273–297.
Breiman L . Random forest. Mach Learn. 2001; 45: 5–32. [CrossRef]
Chen T, Guestrin C. XGBoost: a scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2016: 785–794.
Abu-Mostafa Y, Magdon-Ismail M, Lin H-T. Neural Networks. Nairobi, Kenya: AML Publishers; 2012.
Tibshirani R. Regression shrinkage and selection via the lasso. J R Stat Soc B. 1996; 1(58): 267–288.
Pedregosa F, Varoquaux G, Gramfort A, et al. Scikit-learn: machine learning in Python. J Mach Learn Res. 2011; 12: 2825–2830.
Figure.
 
The observed and predicted prevalence rate of myopia from best performing machine learning models in each age group in the training dataset (A), and in the validation dataset (B).
Figure.
 
The observed and predicted prevalence rate of myopia from best performing machine learning models in each age group in the training dataset (A), and in the validation dataset (B).
Table 1.
 
Characteristics of Study Participants in the Training and Validation Datasets
Table 1.
 
Characteristics of Study Participants in the Training and Validation Datasets
Table 2.
 
Performance of Machine Learning Models for Predicting Cycloplegic Refractive Error in the Training and Validation Datasets
Table 2.
 
Performance of Machine Learning Models for Predicting Cycloplegic Refractive Error in the Training and Validation Datasets
Table 3.
 
Performance of Machine Learning Models for Predicting Myopia Status in the Training and Validation Datasets
Table 3.
 
Performance of Machine Learning Models for Predicting Myopia Status in the Training and Validation Datasets
Table 4.
 
Difference and Absolute Difference Between Predicted and Observed Cycloplegic Refractive Error by Age for the Best Performing Machine Learning Model XGBoost in the Training Dataset and Validation Dataset
Table 4.
 
Difference and Absolute Difference Between Predicted and Observed Cycloplegic Refractive Error by Age for the Best Performing Machine Learning Model XGBoost in the Training Dataset and Validation Dataset
Table 5.
 
Predicted Myopia Prevalence Rate by Age Based on the Best-Performing Model for Cycloplegic SER and the Best-Performing Model for Predicting Myopia Status in Training and Validation Datasets
Table 5.
 
Predicted Myopia Prevalence Rate by Age Based on the Best-Performing Model for Cycloplegic SER and the Best-Performing Model for Predicting Myopia Status in Training and Validation Datasets
Table 6.
 
Sensitivity and Specificity for Predicting Myopia*
Table 6.
 
Sensitivity and Specificity for Predicting Myopia*
×
×

This PDF is available to Subscribers Only

Sign in or purchase a subscription to access this content. ×

You must be signed into an individual account to use this feature.

×