Abstract
Purpose:
To investigate whether fractal dimension (FD), a retinal trait relating to vascular complexity and a potential “oculomics” biomarker for systemic disease, is applicable to a mixed-age, primary-care population.
Methods:
We used cross-sectional data (96 individuals; 183 eyes; ages 18–81 years) from a university-based optometry clinic in Glasgow, Scotland, to study the association between FD and systemic health. We computed FD from color fundus images using Deep Approximation of Retinal Traits (DART), an artificial intelligence–based method designed to be more robust to poor image quality.
Results:
Despite DART being designed to be more robust, a significant association (P < 0.001) between image quality and FD remained. Consistent with previous literature, age was associated with lower FD (P < 0.001 univariate and when adjusting for image quality). However, FD variance was higher in older patients, and some patients over 60 had FD comparable to those of patients in their 20s. Prevalent systemic conditions were significantly (P = 0.037) associated with lower FD when adjusting for image quality and age.
Conclusions:
Our work suggests that FD as a biomarker for systemic health extends to mixed-age, primary-care populations. FD decreases with age but might not substantially decrease in everyone. This should be further investigated using longitudinal data. Finally, image quality was associated with FD, but it is unclear whether this finding is measurement error caused by image quality or confounded by age and health. Future work should investigate this to clarify whether adjusting for image quality is appropriate.
Translational Relevance:
FD could potentially be used in regular screening settings, but questions around image quality remain.
Clinical records from eye examination appointments that took place between 2017 and 2022 at the Vision Centre at Glasgow Caledonian University (GCU), Glasgow, Scotland, United Kingdom, were analyzed. As part of the eye examinations, a record card was completed, and color fundus images were captured for each patient. The study was undertaken in accordance with the tenets of the Declaration of Helsinki. Ethical approval was obtained from the GCU School of Health Sciences Ethical Committee prior to the commencement of data collection (HLS/LS/A22/003). Participants provided written consent to have their anonymized clinical records used for research purposes. Retinal images were obtained using swept-source optical computed tomography (DRI OCT Triton Plus; Topcon, Tokyo, Japan).
The record cards were either digitized if they were in physical paper form or extracted from the patient management system. All of the images were available digitally on the device and exported in JPG format at a resolution of 2000 × 1312 pixels or greater and linked to the record cards. The following information was then collated in a consistent format: age at visit, sex, and information about the general health status from the “history and symptoms” field in the record card.
We had access to 183 images linked to 96 records belonging to 58 female and 38 male patients. In nine cases, only a single image was available on the device, presumably because only one eye was imaged during the examination. For consistency, we used the most recent visit for each individual, and all available images were included in our analysis. The data extraction process was labor intensive; thus, we had access to only one visit per patient. During data extraction, we prioritized extracting data for as many patients as possible at the cost of not extracting longitudinal data. The median age at visit was 61.80 years (interquartile range [IQR], 33.20) with the youngest and oldest patients being 18 and 81 years, respectively. Notably, 22 patients were under the age of 30, an age group that is not available in UK Biobank or AlzEye where the youngest subjects are 37
10 and 40,
9 respectively.
We used information about systemic health from the “history and symptoms” field of the record cards to analyze the relationship between systemic health and FD. This information was recorded in the context of an optometric examination and thus was generally coarse grained with varying levels of detail, primarily consisting of very short descriptions and commonly used clinical abbreviations (e.g., “Good, no problems,” “diabetes”, “HBP”). We stratified individuals into two groups based on this information: those described as having any systemic health condition (i.e., any non-ocular health condition) and those with no mention of any such condition.
Table 1 gives an overview of the two groups.
Table 1. Population Characteristics Stratified by Systemic Health Status
Table 1. Population Characteristics Stratified by Systemic Health Status
Although the level of detail for the systemic health information is limited at times, this approach should have high positive predictive value; that is, individuals in the “prevalent systemic conditions” group would be very likely to actually have systemic conditions. Sensitivity, on the other hand, would be imperfect. Prevalent conditions might not have been mentioned by the patient, might not have been deemed sufficiently relevant to record, or might have been undiagnosed at the time of visit. This should lead to lower apparent effect sizes. Another limitation is that we do not have information about the severity or duration of the conditions. This should likewise lead to lower apparent effect sizes than if we could account for severity. Thus, we expect that the limitations of this variable biased our analysis to be more conservative, and any apparent differences between the two groups are likely to be true differences but with underestimated effect sizes; in other words, the risk of a type 1 error was low and that of a type 2 error was high.
The mean FD in our data was 1.4904, with a standard deviation (SD) of 0.0391. An increase in image quality issues by one level was associated with a decrease in FD by 0.026 (95% confidence interval [CI], 0.031–0.021) in absolute terms, or by 0.656 SD (95% CI, 0.788–0.524; P < 0.001). This suggests that, although DART is designed to be more robust to image quality issues, there might still be an effect that must be adjusted for.
Figure 1 shows scatterplots of FD versus age for the raw data, as well as when FD was being adjusted for image quality in a mixed-effects model. Increasing age was associated with a significant decrease in FD. In a univariable model, an additional decade of age was associated with a decrease in FD by 0.232 SD (95% CI, 0.323–0.141;
P < 0.001). When adjusting for image quality issues, this changed to a decrease by 0.172 SD per decade (95% CI, 0.235–0.109;
P < 0.001), which is consistent in direction and magnitude with what has been reported in the literature.
20 Furthermore, although the effect of age on FD did decrease after adjusting for image quality issues, the direction was the same and the magnitude comparable. This suggests that, although the effect of image quality is significant, it does not preclude discovering meaningful associations in our data.
In addition to the linear association between age and FD, we also observed that the FD variance was higher in older patients. Some patients over 60 had FD comparable to those of patients in their 20s (
Fig. 1a). This suggests that patients could follow characteristically different trajectories, with only some seeing their FD decrease substantially as they age. The findings persisted when adjusting for image quality using the coefficients from the linear mixed-effects model (
Fig. 1b) and when taking the mean of both eyes per patient (
Fig. 1c). Sex showed no significant association with FD in a univariable model (
P = 0.252), when adjusted for image quality (
P = 0.156) and when adjusted for image quality and age (
P = 0.377).
We observed a significant association between prevalent systemic conditions and FD, even when controlling for age and image quality. This confirms previous findings that FD might be a biomarker for systemic health. This is further corroborated by the observation that FD had higher variance in older patients. Importantly, our data was collected during clinical practice in a primary-care setting and included patients from 18 to 81 years of age, and no images were excluded from analysis based on quality. Thus, our work suggests that FD can be used in an even more challenging setting that is closer to clinical reality than what previous work considered.
A major limitation is the coarseness of the information about prevalent systemic conditions and the lack of information about incident systemic conditions. However, as we argued in the Methods section, we expect that the variable for prevalent systemic conditions will have high positive predictive value; thus, the observed effect is likely to be a true effect and the effect size will be underestimated. Future work should further explore FD as a biomarker for systemic health in primary-care data, ideally with linkage to other medical records for more detailed information about systemic health.
We adjusted for image quality due to its strong association with FD, which is commonly done in the literature.
4 The underlying assumption is that image quality issues affect the computation of FD itself and thus must be adjusted for to recover retinal vascular complexity. This notion initially appears convincing, but upon reflection is potentially problematic.
Retinal vascular complexity is supposed to be a proxy for systemic health; however, age and poorer health have been found to be associated with not just vascular complexity but also image quality itself.
12 Plausibly, frailer individuals could be more difficult to image (e.g., because of difficulty sitting still). Furthermore, age-related changes to the eye, such as miosis and cataract, also decrease expected image quality.
Figure 2 shows a simplified causal diagram for FD measurements, with a potential direct effect of age on retinal vascular complexity (dashed pink arrow). To our knowledge, this effect has not been conclusively established in the literature but could plausibly exist. The dashed orange arrow indicates a potential effect of image quality on FD, which is the reason why image quality is commonly adjusted for.
However, observe that, even if we had a fully robust method where this arrow would not be present, we would expect to observe a significant association between FD and image quality, confounded by systemic health. In that case, adjusting for image quality would be inappropriate and would bias the association between FD and systemic health toward the null. Even if our method is not perfectly robust and there is an effect of image quality on FD (i.e., poor-quality images lead not only to measurement noise but also to a systematic bias in FD), then controlling for image quality could likewise bias our analysis.
Thus, we argue that adjusting for image quality is potentially problematic and should be more critically considered in oculomics research. This issue further highlights the need for robust methods for computing retinal traits, as more robust methods would reduce the need to adjust for image quality in the first place.
Note that this issue cannot be side-stepped by only examining high-quality images and excluding the rest. As mentioned in the Introduction, these exclusions introduce selection bias and exacerbate inequalities in healthcare research,
12 in addition to reducing sample sizes and statistical power. Future work should look at the repeatability and robustness to image quality of DART and traditional approaches to add empirical evidence to the question of whether adjusting for image quality can be avoided.
Supported by a grant from UK Research and Innovation (EP/S02431X/1 to JE) as part of the Centre of Doctoral Training in Biomedical AI at the School of Informatics, University of Edinburgh; Fondation Leducq Transatlantic Network of Excellence (17 CVD 03 to MOB); a grant from the Engineering and Physical Sciences Research Council (EP/X025705/1); British Heart Foundation and The Alan Turing Institute Cardiovascular Data Science Award (C-10180357); and Diabetes UK (20/0006221). MOB and NS acknowledge funding from Fight for Sight (5137/5138); SCONe projects funded by the Chief Scientist Office, Edinburgh & Lothians Health Foundation, Sight Scotland, and the Royal College of Surgeons of Edinburgh; RS Macdonald Charitable Trust.
Disclosure: J. Engelmann, None; S. Kearney, None; A. McTrusty, None; G. McKinlay, None; M.O. Bernabeu, None; N. Strang, None