The direct comparison of the trained segmentation models to other state-of-the-art models was not in the focus of this work, since it could be shown in
Relation to Clinical Grading that the resulting model Acc yielded no significant influence for the redness extraction when compared to the ground truth. Still, a comparison with a state-of-the-art U-Net segmentation model, which was trained on the small dataset and evaluated on the test dataset, was made as described in
Appendix D. The U-Net performed very similarly for the segmentation task, resulting in IoUs of 0.9636 and 0.9793, accuracies of 0.9903 and 0.9899, and F1-scores of 0.9814 and 0.9889 for the iris and ocular surface, respectively. In addition, the performance of models from related works was investigated for comparison and listed in the following. Here, it showed that owing to the popularity increase of eye tracking applications in the last few years, the number of works dealing with the segmentation of the iris increased.
37–39 Also, approaches aiming at segmenting the ocular surface or sclera were found in literature.
19,40–42 For example in the related work of Sardar et al.,
38 a mean TP rate of 0.983 with a mean error rate of 0.261 could be achieved for iris segmentation from publicly available iris datasets by using an interactive deep learning approach. A residual encoder and decoder network was developed in the work of Naqvi et al.,
41 which achieved an equal error rate and mean F1-score of 0.009 and 96.242 for segmenting the sclera. A mobile monitoring application for ocular redness was implemented by Li et al.,
29 where an Acc of 0.992, an IoU of 0.977 and an F1-score of 0.982 for the segmentation of the sclera using a U-Net architecture could be achieved. These corresponding works show similar performance scores for different imaging modalities like a different image resolution ranging from 400 × 300 pixels to 3,000 × 1,700 pixels, different gaze directions and distances to the eye. In addition, the number of images used for training in these works exceeded one thousand going up to several thousand images, while the subjects are either healthy or show ocular surface pathologies like conjunctivitis and subconjunctival hemorrhage.