In recent years, deep learning technology has substantial development and applications in the field of medical imaging. Deep learning
8 uses multilayer neural networks including convolutional layers to gain high accuracy and powerful learning performance in image classification.
9 Many neural networks were developed to assess the histopathological images of breast cancer,
10 malignant mesothelioma,
11 and coronary artery fibrous plaque detection.
12 For OCT images, deep learning models have also been studied, especially in the field of macular vision function. For example, Lee et al.
13 combined OCT images and electronic medical records to predict the occurrence of macular lesions in OCT images. Hwang et al.
14 constructed a deep learning model to predict outcomes and suggest further treatments for age-related macular degeneration (AMD), a specific type of disease of the macula. In addition to image classification, deep learning models were also developed for contour detection and layer segmentation of OCT images. For example, Orlando et al.
15 used a segmentation model to identify photoreceptor alteration in macular diseases. Recently, researchers developed models that can both classify the diseases and highlight the lesion areas on the images. For example, Fang et al.
16 designed the Lesion-Aware network, a neural network with attention modules to classify and highlight macular lesions. Although recent developments turned the spotlight on the automatic diagnosis of various eye conditions with improved performance, the majority of studies only focused on disease classification without highlighting the lesion regions from OCT images, which may not provide sufficient information for diagnosis. In contrast, the interpretability of deep learning model is also important for ophthalmologists to exam and affirm the prediction results. Although some studies used class activation map (CAM)-based technology
17 to visualize the model attention, this method does not classify or label different lesion types if they coexist on the same image. To help with clinical diagnosis, a better way to interpret the prediction results via highlighting the lesion regions along with type classification is preferred.