We developed a robust predictive AI model for ICL surgery using local surgical data and an open-source code-development tool. Learning coding syntax has been a barrier for many medical researchers in developing machine learning models using surgical data from their clinics. Many cloud platform–based services are limited by an unintuitive interface design that requires a fee. Orange has overcome these shortcomings and provides researchers with an easy-to-learn and free interface. It helps researchers without coding experience build fast and clear machine learning models and provide validation results. To our knowledge, this is the first study to develop and externally validate a prediction model for the postoperative vault in ICL implantation using Pentacam equipment.
In our study, we confirmed that the algorithms developed using machine learning performed better than linear regression analysis–based methods. Existing widely used methods, such as the CASIA2 KS and NK formulas,
18,19 are based on linear regression models. In individual studies,
2,20 machine learning showed higher performance than these models, but this may be the result of overfitting to individual institution data.
7 By developing machine learning models for each individual institution using no-code tools, it is possible to develop an optimal lens sizing algorithm for each individual institution without a significant development burden.
To achieve accurate and safe ICL implantation, it is important to develop a nomogram based on operator-specific clinical data because the ICL size nomogram provided by the manufacturer is inaccurate (especially for East Asians),
2 and there is no standardized sizing method.
7 Because there is significant bias in measurement data between devices,
21,22 a nomogram must be developed individually for each device to ensure accurate surgery. Additionally, because measurements of the crystalline lens, iris, and anterior chamber angle are significantly affected by lighting,
23 the nomogram can vary depending on the measurement protocol used at each center. Therefore, although many nomograms for ICL sizing have been developed, limitations in their performance are well known.
24 As the number of variables used as input to the nomogram model increases, the accuracy of the development data improves, but the robustness of the model decreases and accuracy in external sets cannot be guaranteed.
25 If a customized nomogram is developed based on machine learning tailored to the biometry measurement environment and racial characteristics of the patients for each center, a more accurate surgery will be possible. The customized machine learning model can further increase the accuracy of phakic IOL surgery by providing vault values for each IOL size to surgeons at each institution (
Supplementary Fig. S1).
The Pentacam is a non-invasive Scheimpflug imaging system that is commonly used in ophthalmology clinics to measure the anterior segment of the eye.
26 It can measure the corneal topography, iris, anterior chamber angle, and front part of the crystalline lens. AS-OCT and the Pentacam have been reported to have comparable performance in measuring the anterior segment.
27 However, they have so far been neglected in developing models for ICL surgery. If our machine learning model, which is based on Pentacam measurements, can be applied in clinics, it will be possible to accurately perform ICL implantation without expensive AS-OCT equipment.
The postoperative vault prediction model developed in this study showed good performance compared with previously published studies. Our random forest model achieved MAEs of 124.8 µm and 152.4 µm in the internal and external validations, respectively, using Pentacam HR data without any coding experience. Previous studies that analyzed AS-OCT data using machine learning algorithms have also shown similar results, with MAE values between 100 and 150 µm in internal validation and decreased performance in external validation.
6,28 In these studies, the random forest or similar tree-based algorithms were the best performing algorithms and were developed based on the Scikit-learn library running on Python. In a previous study using the Pentacam, an MAE of 149.0 was reported for internal validation using an extra tree algorithm developed using the PyCaret library running on Python with a large GPU workstation. In terms of development efficiency, Orange software surpasses the previously used console-based machine learning development methods. Similar to Scikit-learn, the final trained model from the Orange software can be exported and used in the application.
In our study, there was little difference in performance between the Google code-free model and the machine learning model developed using the Orange software. Furthermore, the model developed using the Orange software was superior in the external validation set. Orange provides visualization tools for data flow and model development, allowing clinicians without development experience to easily train and use machine learning–based ICL sizing models. Compared with Google Cloud's no-code machine learning tool AutoML (currently renamed Vertex AI), Orange software offers many advantages. In Orange, researchers can experiment by tuning various models and data explorations using flow-based tools.
14 Because AutoML performs many optimizations internally, it is difficult to check the hyperparameters of the resulting model. Additionally, AutoML operates in a cloud environment, which makes it less accessible to researchers.
This study had several limitations. First, model development and validation were conducted through retrospective data collection. Therefore, the developed model should be tested prospectively to confirm its clinical applicability. Second, this study was trained on East Asian population data, so the applicability of the model is limited to other regional races.
29 Particularly, in the case of phakic IOL implantation, the IOL size distribution between Asians and Westerners has been reported to be significantly different, and biometric information by race is also significantly different. Third, the amount of data used in this study was relatively small for the development of the machine learning models. Because the number of input variables in the analysis data is large, small amounts of data can cause overfitting, and more data are required to create a more generalized regression model. High-performance machine learning models can be developed by collecting additional data from ICL surgeries. Fourth, the model was developed based on Pentacam data and was not compared with data obtained from the KS and NK formulas. Because the KS and NK formulas are based on the CASIA2 or UBM,
18,30 objective comparison was not possible with the current study design. The institution that performed the external validation of this study did not perform preoperative AS-OCT; therefore, it was difficult to confirm the superiority of the developed machine learning algorithm over existing methods.