Abstract
Purpose:
We developed a method to automatically locate and quantify graft detachment after Descemet's membrane endothelial keratoplasty (DMEK) in anterior segment optical coherence tomography (AS-OCT) scans.
Methods:
A total of 1280 AS-OCT B-scans were annotated by a DMEK expert. Using the annotations, a deep learning pipeline was developed to localize scleral spur, center the AS-OCT B-scans and segment the detached graft sections. Detachment segmentation model performance was evaluated per B-scan by comparing (1) length of detachment and (2) horizontal projection of the detached sections with the expert annotations. Horizontal projections were used to construct graft detachment maps. All final evaluations were done on a test set that was set apart during training of the models. A second DMEK expert annotated the test set to determine interrater performance.
Results:
Mean scleral spur localization error was 0.155 mm, whereas the interrater difference was 0.090 mm. The estimated graft detachment lengths were in 69% of the cases within a 10-pixel (∼150 µm) difference from the ground truth (77% for the second DMEK expert). Dice scores for the horizontal projections of all B-scans with detachments were 0.896 and 0.880 for our model and the second DMEK expert, respectively.
Conclusions:
Our deep learning model can be used to automatically and instantly localize graft detachment in AS-OCT B-scans. Horizontal detachment projections can be determined with the same accuracy as a human DMEK expert, allowing for the construction of accurate graft detachment maps.
Translational Relevance:
Automated localization and quantification of graft detachment can support DMEK research and standardize clinical decision-making.
Our results demonstrate that a deep convolutional neural network can accurately and automatically identify DMEK graft detachment. We believe that our deep learning pipeline has the potential to improve and standardize clinical decision making and can similarly be used as an objective and operator-independent outcome to improve DMEK research and reporting.
The number of DMEK procedures performed is rising rapidly driven by the superior visual results.
25,26 In 2018, 41.2% more DMEKs were performed in the United States, while the total number of endothelial keratoplasty procedures increased only 4.6% in comparison with the year before.
27 This invariably increases the need for DMEK detachment management, such as the decision to await spontaneous clearance, rebubble, or perform repeat DMEK.
28–30 Studies agree that management depends on the degree of detachment yet report diverging opinions for when to rebubble and varying definitions of partial detachment, including
visually significant graft detachment,
31 20% detached area,
32 more or less than one-third detached.
28 In current practice, the amount of detachment after DMEK is estimated by a clinician/surgeon over a succession of scans on a screen, rather than measured objectively. Thus the surgeon has to make a decision with regard to treatment without seeing all detached areas in a single image or accurately being able to quantify the total amount of detachment.
The high Dice scores for the projection results are similar to a human DMEK expert and indicate that accurate detachment maps can be constructed. Visual evaluation of some examples of these maps (
Fig. 6) indeed shows a strong similarity with the expert annotations. Since the center of the detachment map corresponds with the center of the cornea, the severity of the detached sections can be evaluated with their respective distance to the center. Moreover, follow-up OCT scans can be overlaid to assess detachment progression.
The Bland-Altman plot in
Figure 3 also indicates that the segmentation model works well for most individual B-scans. Examples 1 to 3 in
Figure 4 represent the results for the majority of the segmentations and show high segmentation accuracy, even when a graft is torn (Example 2). For some cases, the predicted detachment length differed substantially from the expert annotations (
Fig. 5). Part of the disagreement could originate in the inherent uncertainty of some graft sections that are difficult to annotate. Indeed, the DMEK experts do not always agree, but the 95% confidence interval for the interrater study is roughly half the size of the model prediction confidence interval. Moreover, we also found that the model makes a few substantial mistakes that are obvious to the human observer (e.g., Example 6). After visual inspection of the outliers, we noticed that most of the sizeable underestimations were B-slices of one specific OCT scan with a lot of detachment in the center. These errors are likely due to the lack of training examples with a large central detachment. The model might confuse some large center detachments for intraocular gas, which is only present in scans directly after surgery or rebubbling. Although the effect of these type of mistakes might be limited since they are obvious and will easily be spotted by the ophthalmologist, it could be addressed by adding more training data encompassing more variations, especially cases with center detachments. Furthermore, the current segmentation model did not take into account information from neighboring B-slices, as the DMEK expert did. Finally, some inaccuracy might result from the loss of information due to down-sampling the B-slices by a factor two before processing the scans with the graft segmentation model. Given the horizontal line-like structure of the graft and the down-sample factor, the horizontally most distant pixels could be misclassified. However, this error will be small compared to the whole length of the graft detachment.
Apart from missegmenting some cases with a lot of detachment in the center, it was sometimes challenging to distinguish remnant host Descemet's membrane from the DMEK graft. Furthermore, the presented models were trained and evaluated on a single data set collected with one type of AS-OCT device. For generalization toward multiple-sources, the models have to be retrained either with some images from other scanner types or with the use of other domain generalization techniques.
33
Prior image analysis work within the realm of DMEK detachment has only focused on binary classification; i.e. whether detachment is present or not
34 and whether rebubbling was performed.
35 We believe our detachment model is of clinical value as it provides quantitative measures about length and location of graft detachment. The segmentation accurately locates detachment in most AS-OCT B-scans and is much faster than a human rater. In clinical practice an ophthalmologist would not have time to annotate the detachment regions in detail, whereas our deep learning pipeline could provide an instant evaluation aiding the decision.
Although our aim was to develop a model for quantifying DMEK detachment, we also developed a scleral spur locating model as an intermediate step. Having this scleral spur localization model aided our AS-OCT B-scan preprocessing by cropping all images uniformly prior to the DMEK detachment model evaluation. This cropping step also provided practical benefits, as we did not have to reduce the standard U-net model size or the resolution of the B-scans further to fit within Graphics Processing Unit (GPU) memory. However, locating the scleral spur is valuable in and of itself. Potential applications include determining limbal chamber depth parameters such as angle-opening distance and trabecular-iris space area, relevant in glaucoma. Furthermore, it may also be a valuable tool for aligning AS-OCT scans between patient visits (e.g., to compare pachymetry map changes).
The refinement of the scleral spur estimates by fitting an ellipse resulted in a slightly bigger localization error. However, we could only evaluate for scleral spur points that were well discernable, since those were annotated by both experts. Our model also outputs an estimate for the scleral spur when the region itself is not visible (e.g., hidden behind the eyelid).
13 We believe that the ellipse fit step makes the localization more robust for these cases and reduces outliers. Whether our scleral spur model can be applied to other disease entities, such as acute angle-closure glaucoma, is a topic of future research.
In summary, we have introduced a deep learning pipeline based on AS-OCT that allows automatic and accurate quantification of graft detachment after DMEK. Our future research efforts will focus on evaluating the value of our algorithm for improving clinical decision making and clinical outcomes after DMEK.
This research is financially supported by the Dutch Research Council (NWO) TTW Perspectief program and Philips Research.
Disclosure: F.G. Heslinga, None; M. Alberti, None; J.P.W. Pluim, None; J. Cabrerizo, None; M. Veta, None