With the development and progression of glaucoma, the optic nerve head (ONH) and the macula typically exhibits complex neural- and connective-tissue structural changes including, but not limited to, thinning of the retinal nerve fiber layer (RNFL) and the macular ganglion cell complex layer
1,2; changes in the lamina cribrosa (LC) shape, depth, and curvature
3,4; and posterior bowing of the peripapillary sclera.
5,6 Clinically, optical coherence tomography (OCT) is the mainstay of imaging to observe such changes
7; however, signal interpretation (by humans or machines) for glaucoma diagnosis and prognosis remains a challenge.
Recently, a growing number of artificial intelligence (AI) studies have proposed to use deep learning algorithms to provide a robust glaucoma diagnosis from a single OCT scan of the ONH or of the macula. Such applications could have excellent clinical value, for example, by decreasing the number of tests needed to confirm a glaucoma diagnosis. Some of these algorithms were directly applied to the raw OCT scans,
8–10 whereas others first simplified the images to only a few classes (or colors) by highlighting relevant tissue structures.
11 Most algorithms achieved good to excellent performance. However, such algorithms need to be able to handle a considerable amount of information (e.g., voxel intensities distributed on three-dimensional [3D] grids) that could be heavily corrupted by noise, image artifacts, and 3D image orientation issues, thus limiting their ease of use and deployability.
To this end, a family of algorithms fitting under the category of geometric deep learning has been proposed to solve classification problems from structures represented as 3D point clouds (such as those in medical imaging),
12 with excellent performance. Geometric deep learning
13 is an emerging field of AI that proposes inductive biases and network architectures that can efficiently process data structures such as grids, graphs, and cloud of points while respecting their intrinsic symmetries and invariances.
14 In this study, we have leveraged the recently proposed PointNet neural architecture.
15 This deep neural network has been especially designed to process point clouds, that is, an unordered set of points. A PointNet takes a point cloud as input and provides a unified architecture for applications ranging from object classification, part segmentation, to scene semantic parsing. Although simple, it has been demonstrated empirically that PointNets exhibit a performance on par, or even better, than the state of the art.
15 In addition, because structural point clouds do not need to be dense and falling onto regular grids, this intrinsically means the amount of information needed to make, for example, a diagnosis could be significantly decreased, thus decreasing the black box element of AI. For our glaucoma diagnosis application, the ONH can simply be thought of as a complex 3D structure that can be represented by a cloud of points, as has been routinely performed in 3D histomorphometric and finite element studies.
16–18
In this study, we aimed to apply a geometric deep learning solution (PointNet) to provide a robust glaucoma diagnosis from a single OCT scan of the ONH. Each OCT scan was first preprocessed and each ONH was represented as a 3D point cloud, thus limiting the amount of information to be processed by several orders of magnitude. Our approach was compared with a 3D convolutional neural network (CNN), and its performance compared with that from RNFL thickness alone (the current gold standard).