Meibomian gland dysfunction (MGD) is the most common underlying cause of dry eye syndrome where Meibomian glands (MGs) do not secrete enough lipids into the tears. The transillumination and infrared light are used to appreciate MG characteristics (i.e. measuring the percent of MG atrophy defined as the ratio of MG loss area to the total tarsal plate area) for MGD diagnosis.
1,2 Standardized MG atrophy grading scales have been developed to assess the severity of MG atrophy.
3,4
In recent years, artificial intelligence (AI) in computer vision has arisen with deep convolutional neural networks (CNNs), which learned predicted features via supervised learning on a large dataset of labeled images.
5,6 AI has shown huge progress in the field of medicine, including cancer diagnosis, lung segmentation, and tumor detection,
7–9 especially in the ophthalmic domain. For example, AI has been applied to build models to detect subclinical Keratoconus,
10,11 which is the leading cause of corneal transplantation. Different AI systems were developed to detect the cases of glaucoma and have achieved promising performance.
12,13 AI has also benefited the MG atrophy evaluation from meibography images and have shown significantly improved performance.
14 However, it is costly, or sometimes even impossible for training CNNs on large labeled data sets because most of them have imbalanced label classes (i.e. one class accounts for almost 90% of the data, whereas other classes have far fewer samples). Additionally, vision data sets may contain labeling errors, leading to training issues for CNN models, especially for the class with a few samples.
Unsupervised representation learning aims to learn a robust embedding space from data without human annotation. Recently, discriminative approaches especially contrastive learning-based approaches, such as (nonparametric instance discrimination [NPID],
15 MoCo,
16 SimCLR,
17 etc) have gained most ground and achieved the state-of-the-art on standard large-scale image classification benchmarks with increasingly more computation and data augmentations. Based on our experience from extensive experimentation (cross-level discrimination [CLD]
18), NPID remains competitive, especially on small data sets.
Furthermore, some unsupervised methods could be extended to the semisupervised learning (i.e. LLP,
19 and CPC version 2),
20 by first learning in an unsupervised way and then fine-tuning with few labeled data. Note that more details are provided in the Discussion section.
In this paper, NPID
15 was applied for image analysis of MG from meibography to investigate MG features based on visual phenotypes. Furthermore, the visualization and hierarchical clustering algorithms were applied to show the feature clustering of meibography images. Whereas completely ignoring class labels, this unsupervised network discriminates between individual instances (e.g. meibography images) and automatically learns the similarity between instances, as shown in
Figure 1. This approach automatically measures MG atrophy severity from meibography images, as well as discovers subtle relationships between meibography images according to visual similarity. Additionally, an extensive experimental design was implemented to assess performance of evaluating MG atrophy by comparing the results obtained by the unsupervised learning method with those from a team of clinicians as well as a supervised learning method.