November 2023
Volume 12, Issue 11
Open Access
Retina  |   November 2023
Evaluation of an Automated Choroid Segmentation Algorithm in a Longitudinal Kidney Donor and Recipient Cohort
Author Affiliations & Notes
  • Jamie Burke
    School of Mathematics, University of Edinburgh, College of Science and Engineering, Edinburgh, UK
  • Dan Pugh
    University/BHF Centre for Cardiovascular Science, The Queen's Medical Research Institute, University of Edinburgh, Edinburgh, UK
  • Tariq Farrah
    University/BHF Centre for Cardiovascular Science, The Queen's Medical Research Institute, University of Edinburgh, Edinburgh, UK
  • Charlene Hamid
    Imaging Facility, University of Edinburgh, The Queen's Medical Research Institute, Edinburgh, UK
  • Emily Godden
    Emergency Department, Royal Infirmary of Edinburgh, Edinburgh, UK
  • Thomas J. MacGillivray
    Centre for Clinical Brain Sciences, University of Edinburgh, Edinburgh, UK
  • Neeraj Dhaun
    University/BHF Centre for Cardiovascular Science, The Queen's Medical Research Institute, University of Edinburgh, Edinburgh, UK
  • J. Kenneth Baillie
    Deanery of Clinical Sciences, University of Edinburgh, College of Medicine and Veterinary Medicine, Edinburgh, UK
  • Stuart King
    School of Mathematics, University of Edinburgh, College of Science and Engineering, Edinburgh, UK
  • Ian J. C. MacCormick
    Centre for Inflammation Research, The Queen's Medical Research Institute, University of Edinburgh, Edinburgh, UK
    Institute for Adaptive and Neural Computation, School of Informatics, University of Edinburgh, Edinburgh, UK
  • Correspondence: Jamie Burke, School of Mathematics, University of Edinburgh, College of Science and Engineering, Edinburgh, UK. e-mail: jamie.burke@ed.ac.uk 
Translational Vision Science & Technology November 2023, Vol.12, 19. doi:https://doi.org/10.1167/tvst.12.11.19
  • Views
  • PDF
  • Share
  • Tools
    • Alerts
      ×
      This feature is available to authenticated users only.
      Sign In or Create an Account ×
    • Get Citation

      Jamie Burke, Dan Pugh, Tariq Farrah, Charlene Hamid, Emily Godden, Thomas J. MacGillivray, Neeraj Dhaun, J. Kenneth Baillie, Stuart King, Ian J. C. MacCormick; Evaluation of an Automated Choroid Segmentation Algorithm in a Longitudinal Kidney Donor and Recipient Cohort. Trans. Vis. Sci. Tech. 2023;12(11):19. https://doi.org/10.1167/tvst.12.11.19.

      Download citation file:


      © ARVO (1962-2015); The Authors (2016-present)

      ×
  • Supplements
Abstract

Purpose: To evaluate the performance of an automated choroid segmentation algorithm in optical coherence tomography (OCT) data using a longitudinal kidney donor and recipient cohort.

Methods: We assessed 22 donors and 23 patients requiring renal transplantation over up to 1 year posttransplant. We measured choroidal thickness (CT) and area and compared our automated CT measurements to manual ones at the same locations. We estimated associations between choroidal measurements and markers of renal function (estimated glomerular filtration rate [eGFR], serum creatinine, and urea) using correlation and linear mixed-effects (LME) modeling.

Results: There was good agreement between manual and automated CT. Automated measures were more precise because of smaller measurement error over time. External adjudication of major discrepancies was in favor of automated measures. Significant differences were observed in the choroid pre- and posttransplant in both cohorts, and LME modeling revealed significant linear associations observed between choroidal measures and renal function in recipients. Significant associations were mostly stronger with automated CT (eGFR, P < 0.001; creatinine, P = 0.004; urea, P = 0.04) compared to manual CT (eGFR, P = 0.002; creatinine, P = 0.01; urea, P = 0.03).

Conclusions: Our automated approach has greater precision than human-performed manual measurements, which may explain stronger associations with renal function compared to manual measurements. To improve detection of meaningful associations with clinical endpoints in longitudinal studies of OCT, reducing measurement error should be a priority, and automated measurements help achieve this.

Translational Relevance: We introduce a novel choroid segmentation algorithm that can replace manual grading for studying the choroid in renal disease and other clinical conditions.

Introduction
Chronic kidney disease (CKD) is a major cause of morbidity and mortality, affecting over 800 million people worldwide.1 CKD is a gradual loss of kidney function, resulting in a failure to filter blood effectively and is associated with alterations in microvascular structure and function.2 Assessment of microvascular function is therefore helpful in the diagnosis, prognosis, and treatment of renal disease. Current clinical evaluation is based on blood and urine testing and invasive kidney biopsy. 
The eye and the kidney show a striking resemblance in anatomy, physiology, and response to disease,3 and in particular, the choroidal microcirculation reflects the renal microcirculation.4 For example, they have an analogous vascular endothelium with similarly sized fenestrated vessel walls, allowing subretinal fluid exchange in the choroid and blood filtration in the kidneys. Moreover, the circulatory systems in the choroid and renal cortex have similar proportions of blood flow in contrast to their retinal and renal medullary counterparts.5 
Choroidal thickness (CT) correlates strongly with renal dysfunction in patients with CKD compared with sex- and age-matched healthy and hypertensive individuals.6 This suggests that choroidal thinning may relate to systemic microvascular injury in general and renal injury in particular. An overactive sympathetic drive may contribute to disease progression in CKD7 and could explain choroidal thinning since choroidal perfusion is influenced by its autonomic supply.6 These observations suggest the potential clinical utility of choroidal biomarkers in renal disease. There is already a growing interest in choroidal biomarkers derived from optical coherence tomography (OCT) in nonocular pathology such as sepsis,8,9 diabetes and CKD,6,10,11 and neurodegenerative disease.1214 
Standard OCT imaging focuses on imaging the retina while enhanced depth imaging OCT (EDI-OCT) provides a deeper and stronger signal, visualizing the choroid–scleral (C-S) junction with micron resolution. Measuring the choroid manually from OCT images is a time-consuming and subjective process. Automated image analysis to extract choroidal measures as potential biomarkers could have substantial clinical utility if it is sufficiently reliable, robust, and reproducible. Consequently, there have been many attempts to validate automated algorithms for quantifying the choroid.1518 However, only a handful of these proposed algorithms demonstrate their clinical utility in disease settings.19,20 To the best of our knowledge, such automated algorithms have rarely been applied to longitudinal image sets. This is important as accurate assessment of change over time is essential for tracking disease progression. 
We have previously developed an automated approach to choroid region segmentation,21 and here we present our evaluation of its performance and clinical utility in a longitudinal cohort of individuals with end-stage CKD who underwent renal transplant and healthy donors who underwent unilateral nephrectomy. Our primary objective was to compare our automated CT measurements with the current gold standard of manual measurement of CT, as quantitative validation across longitudinal data is an ideal approach to evaluate novel image-processing methodologies in the medical domain. 
As a secondary objective, we validate our segmentation algorithm clinically through estimating associations between the choroid and clinical biomarkers of renal function in both cohorts. In order to limit processing time, the choroid is typically measured manually using only thickness. Our approach permits automatic segmentation and subsequent calculation of choroidal thickness and area (CA), so we provide two metrics of the choroid to estimate associations with. 
Method
Study Population
We prospectively analyzed a longitudinal cohort of healthy kidney donors and patients with end-stage kidney disease undergoing living donor kidney transplantation (NCT0213274).22 Data collection and subsequent analyses were conducted after ethical approval from the South East Scotland Research Ethics Committee, in accordance with the principles of the Declaration of Helsinki, and all participants gave informed consent to recruitment. Eligibility criteria for recruitment were (1) donors must be living and healthy throughout the period of analysis, (2) recipients with end-stage CKD have a functional kidney transplant, and (3) participants must be aged 18 years or over. To prevent ocular issues confounding our results, our exclusion criteria were (1) any ocular pathology pretransplant, (2) any previous eye surgery, (3) a refractive error exceeding ±6 diopters, or (4) a diagnosis of diabetes mellitus. We judged image quality by the OSCAR-IB criteria23 and excluded images with B-scan signal quality ≤15, indistinguishable C-S junction due to speckle noise, or partial image cropping of choroid. 
Supplementary Figure S1 shows a flowchart on how the population was selected for this study. At the time of data extraction, 22 donors and 23 recipients were eligible for analysis, after excluding a total of 75 recipients and 5 donors. Most exclusions were from no eye scans being performed for these individuals. This was because, once recruited, they were scheduled for transplantation and then immediately returned to their referring institution after transplant away from Edinburgh, thus making it impractical to retain in the study and rescan. Each participant was assessed on the day of transplant, and then attempts were made at seven other time periods over the following year (eight time points in total). 
Donors and recipients were organized into two data sets for performance evaluation and clinical evaluation. We used the entire analysis cohort for performance evaluation of the automated approach against manual grading, resulting in 483 CT measurements for comparison. After excluding two donors and seven recipients due to missing clinical measurements/eye scans posttransplant or poor-quality EDI-OCT images, we used 20 donors and 16 recipients for clinical evaluation (i.e., association estimation with clinical variables related to renal function). Clinical data at each time point included standard biomarkers of renal function, including high sensitivity C-reactive protein, serum urea and creatinine, estimated glomerular filtration rate (eGFR), and urine protein to creatinine ratio. Table 1 shows the population statistics for the entire analysis cohort. 
Table 1.
 
Population Statistics for All Donors and Recipients, Rounded to the Nearest Integer
Table 1.
 
Population Statistics for All Donors and Recipients, Rounded to the Nearest Integer
Image Acquisition
We imaged each participant's right eye using the Heidelberg spectral domain OCT Spectralis Standard Module (Heidelberg Engineering, Heidelberg, Germany). Patients were examined between 9 am and 5 pm and were endeavored to be followed up at around the same time of day at each time point. The average (and standard deviation) daytime in which scans were taken at each time point is listed in Table 1. During each patient's examination, a horizontal line, EDI-OCT B-scan centered at the foveal pit was taken. Each B-scan covered a 30° (8.7 mm) region and was extracted as a 768 × 768 (pixel height × width) high-resolution image for downstream image processing. 
Each scan was captured using active eye tracking with Automatic Real Time (ART) software built into the Spectralis device, averaging up to a maximum of 100 scans to generate a single high-resolution image. EDI-OCT images include the B-scan, alongside an en face view of the fundus with a horizontal line representing the B-scan's location on the macula. Although refractive error and axial length were not collected for this population, the built-in scan focus parameter (an approximate value for refractive error) was extracted from the acquisition metadata for each scan and was used for downstream clinical evaluation (listed in Table 1). Figure 1 shows an example EDI-OCT scan with results from manual and automated assessment. At each time point, only one EDI-OCT B-scan of each individual's right eye was selected for analysis, and no repeated scans were taken during the same examination. 
Figure 1.
 
EDI-OCT scan of a donor's right eye at baseline. (A) En face view of the fundus with the location of the B-scan in green and markers for measuring CT and CA in red. (B) B-scan showing chorioretinal structures with manual and automated assessment shown.
Figure 1.
 
EDI-OCT scan of a donor's right eye at baseline. (A) En face view of the fundus with the location of the B-scan in green and markers for measuring CT and CA in red. (B) B-scan showing chorioretinal structures with manual and automated assessment shown.
Manual Assessment of Choroidal Thickness
Manual grading for measuring CT was performed by a single, trained operator across all image sessions for each participant. Manual grading of the choroid is described in the original study carried out by Balmforth et al.6 Briefly, this was achieved using the caliper tool in HeyEx software (version 1.10.4.0; Heidelberg Engineering). Choroid thickness was measured at three locations across the macula—at the fovea and 2000 microns nasal and temporal to the fovea. The choroid region in each B-scan was defined as the area between the lower surface of the junction between the hyperreflective retinal pigment epithelium (RPE) layer and Bruch's membrane complex (RPE-C junction), and the upper surface of the junction between the choroid and sclera (C-S junction). 
Manual grading used the caliper tools on the software to mark the center of the fovea at the level of the photoreceptor outer segment and mark 2000 microns temporal and nasal to this point, along the RPE-C junction. Choroid thickness was defined as the straight-line distance between the RPE-C and C-S junctions, measured locally perpendicular to the RPE-C junction. Figure 1 shows three CT measurements provided by the manual grader in green, including the reference lines measuring approximately 2000 microns temporal and nasal to the foveal pit along the RPE-C junction. 
Automated Assessment of Choroidal Thickness
Our automated choroid segmentation algorithm performed individual edge tracing to obtain the upper and lower choroid boundaries for each B-scan. The algorithm modeled each boundary using Gaussian process regression, building a posterior predictive function through a recursive Bayesian scheme and image gradient scoring mechanism to detect edge pixels.21 Figure 2 presents a schematic diagram of the image analysis pipeline of the automated approach. Preprocessing was done using median filter denoising and contrast enhancement using contrast-limited adaptive histogram equalization. Approximate edge maps were then computed using discrete derivative and morphologic erosion and dilation filters. 
Figure 2.
 
Schematic diagram of the automated approach segmenting the C-S junction of the choroid on an EDI-OCT B-scan.21 GP, Gaussian process; GPR, Gaussian process regressor; KDE, kernel density estimate.
Figure 2.
 
Schematic diagram of the automated approach segmenting the C-S junction of the choroid on an EDI-OCT B-scan.21 GP, Gaussian process; GPR, Gaussian process regressor; KDE, kernel density estimate.
Automated CT was measured in two different ways, yielding perpendicular CT measurements and parallel CT measurements. Perpendicular automated CT measured locally perpendicular to the RPE-C junction, while parallel automated CT measured parallel to the manual measurements to mimic any potential angular error made by the manual grader. See, for example, the nasal measurement in Figure 5C, where perpendicular automated CT is shown in red, parallel automated CT in blue, and manual CT in green. We measured CT in the same subfoveal, temporal, and nasal locations as manual grading. We measured CA by computing the area of pixels within a 3000-micron radius centered at the foveal pit, in accordance with the Early Treatment Diabetic Retinopathy Study area of 6000 × 6000 microns.24 
Statistical Analysis
Performance Evaluation
We measured agreement between approaches by comparing (parallel) automated CT to manual CT and performed Passing–Bablok analysis25 to determine any systematic or proportional differences between the two approaches. We also investigated residuals, calculated by subtracting manual CT from automated CT (automated – manual), using a Bland–Altman plot.26 Mean absolute error (MAE), Pearson correlation and intra-class correlation were also computed between both approaches, stratified by cohort and macular location. To compare the consistency of measurements across longitudinal data, we investigated the angular deviation made by the manual grader. This was done by computing angles between the caliper measurements made from manual grading against the perpendicular automated CT measurements. Automated assessment of the choroid and quantitative analyses comparing manual and automated measurements were done in Python (version 3.11.3). 
Rahman et al.27 found an unsigned residual of 32 µm as a threshold suggested to exceed interobserver variability between graders of EDI-OCT images. Therefore, we defined major discrepancies between automated and manual CT to be any residual greater than 32 µm in absolute value. These were selected for external adjudication by a clinical ophthalmologist, I.M. For each discrepancy, I.M. was shown two CT measurements displayed on the corresponding choroid and was blinded to which approach was used in each case. He was asked to rate each measurement and the overall visibility of the C-S junction using an ordinal scale: “bad,” “okay,” and “good.” I.M. was also asked to select the approach that was preferred—options for both or neither approaches were also available. 
Clinical Evaluation
We investigated how CT and CA measurements changed over time in donors and recipients, and estimated linear associations with clinical variables related to renal function using Pearson correlations—specifically eGFR, serum creatinine, and serum urea. Other clinical measures were too incomplete and inconsistently collected to be analyzed with the current sample size. CT was averaged across macular locations, resulting in a single value of thickness for each scan. We examined the difference in CT and CA between measurements at baseline and 1 year posttransplant, testing for difference in their means longitudinally via the Student's t-test for dependent, paired samples. We specified a P value of 0.05 as the threshold for statistical significance. 
We also looked for significant linear associations between markers of renal function and the choroid. We assumed a nested structure to the data and used linear mixed-effects (LME) models, setting an individual random effect to model the variation among individuals, accounting for age, sex, approximate refractive error (measured in diopters), and the time of day the scan was taken, relative to the time of day their baseline scan was taken (measured in decimal hours). For each clinical measure of renal function, we fitted an LME model for each choroidal image measure in turn as one of the associated independent variables. LME modeling was carried out using R (version 4.2.2).28 We identify independent variables as statistically significant predictors if their associated 95% confidence interval excludes 0. 
Results
Performance Evaluation With Choroidal Thickness
Performance results between automated and manual CT measurements are shown in Table 2. There was good agreement between manual and automated measures of CT, with an average residual of 1.8 ± 22.0 µm and MAE of 14.1 ± 16.9 µm across all 483 CT measurements. Moreover, Pearson and intraclass correlation show excellent correlations between both approaches. The largest residuals were at the subfoveal macular location. 
Table 2.
 
Performance Evaluation Between Automated and Manual CT, Stratified by Cohort and Macular Location
Table 2.
 
Performance Evaluation Between Automated and Manual CT, Stratified by Cohort and Macular Location
Passing–Bablok analysis computed a slope value of 1.02 (95% confidence interval [CI], 0.99–1.04) and an intercept of −3.64 µm (95% CI, −10.78 to 3.50 µm) with an R2 value of 0.93 (Fig. 3A). These 95% confidence intervals include 1 and 0, respectively, suggesting that the automated and manual approaches are equivalent.25,29 See Supplementary Figures S2 and S3 for correlation plots stratified by macular location and cohort. Bland–Altman analysis in Figure 3B computed an average residual of 1.84 µm with limits of agreement of −44.98 to 41.30 µm. Only 10.78% of all 483 CT measurements (52/483) exceeded an unsigned residual of 32 µm.27 
Figure 3.
 
(A) Correlation plot of manual and (parallel) automated CT measurements. (B) Bland–Altman plot showing residual values, with 52 major discrepancies highlighted as red crosses.
Figure 3.
 
(A) Correlation plot of manual and (parallel) automated CT measurements. (B) Bland–Altman plot showing residual values, with 52 major discrepancies highlighted as red crosses.
Table 3 presents the results from the external adjudication of all 52 major discrepancies. Overall, the automated CT measurements scored higher in terms of preference, where 79% (41/52) of the automated thicknesses were preferred compared to 58% (30/52) for manual thicknesses. Moreover, the automated approach scored higher in terms of quality, with only 1 measurement judged as “bad” versus 10 measurements judged “bad” for the manual approach. Figure 5 shows four major discrepancies from four different choroids. Figure 5A shows the largest observed discrepancies—temporal (97 µm) and subfoveal (114 µm)—with adjudicator I.M. preferring the automated measurements. 
Table 3.
 
Numerical Score of Both Approaches from Masked Adjudication of 52 Major Discrepancies, Stratified by the Visibility of the C-S Junction, and Qualitative Preference Scores for Both Approaches
Table 3.
 
Numerical Score of Both Approaches from Masked Adjudication of 52 Major Discrepancies, Stratified by the Visibility of the C-S Junction, and Qualitative Preference Scores for Both Approaches
Figure 4.
 
(A) Distribution of angular errors from manual CT measurements (blue). Average unsigned difference between manual and perpendicular automated CT (red). (B) Unsigned, longitudinal within-patient angular deviations, relative to baseline measurements. Colored lines represent distinct individuals using manual measurements.
Figure 4.
 
(A) Distribution of angular errors from manual CT measurements (blue). Average unsigned difference between manual and perpendicular automated CT (red). (B) Unsigned, longitudinal within-patient angular deviations, relative to baseline measurements. Colored lines represent distinct individuals using manual measurements.
We found that deviation in the angle of manual measurement from perpendicular led to a disproportionate error between perpendicular automated CT and manual CT. This is illustrated in Figure 4A, which shows the distribution (mean 0.5 and standard deviation 2.5) of angular errors made from manual grading. Absolute differences between manual CT and perpendicular automated CT grew quadratically as the angular error deviated from 0. We also found large deviation in manual grading when tracking the longitudinal, within-patient angular difference from baseline (Fig. 4B). Distributions at each time point were estimated through empirical mean and standard deviation values. Note that the mean fluctuation in manual grading was always above 0 in comparison to the mean fluctuation from the automated approach, which was constant at 0. 
Figure 5.
 
(A) A selection of major discrepancies between automated and manual CT measurements. Red lines represent perpendicular automated CT, blue represent parallel automated CT, and green represent manual CT. The lines in red are shown in (A, C) to demonstrate the observed angular deviation between perpendicular automated CT and manual CT. (A) Temporal and subfoveal major discrepancies from a donor, residuals 97 µm and 114 µm, respectively. Nasal angular deviation (5.4°) resulted in perpendicular and parallel residuals as 22 µm and 2 µm, respectively. (B) Subfoveal major discrepancy from a recipient, residual –47 µm. (C) Nasal angular deviation (8.1°) shows perpendicular and parallel residuals as 106 µm and 79 µm, respectively. (D) Temporal and nasal major discrepancy from a donor, residuals 66 µm and 65 µm, respectively.
Figure 5.
 
(A) A selection of major discrepancies between automated and manual CT measurements. Red lines represent perpendicular automated CT, blue represent parallel automated CT, and green represent manual CT. The lines in red are shown in (A, C) to demonstrate the observed angular deviation between perpendicular automated CT and manual CT. (A) Temporal and subfoveal major discrepancies from a donor, residuals 97 µm and 114 µm, respectively. Nasal angular deviation (5.4°) resulted in perpendicular and parallel residuals as 22 µm and 2 µm, respectively. (B) Subfoveal major discrepancy from a recipient, residual –47 µm. (C) Nasal angular deviation (8.1°) shows perpendicular and parallel residuals as 106 µm and 79 µm, respectively. (D) Temporal and nasal major discrepancy from a donor, residuals 66 µm and 65 µm, respectively.
Choroidal Association With Renal Function
All measures of renal function and choroidal measurements were significantly different 1 year posttransplant compared to baseline measurements in both cohorts (Table 4). Figure 6 shows the average percentage change in choroidal measurements from baseline measurements in both cohorts. In transplant recipients, there was a significant increase in choroidal thickness of 12.8% ± 4.8% (median, 11.4%) from baseline after 4 weeks posttransplant using automated CT (P = 0.02). After 1 year posttransplant, we observed a significant increase of 14.1% ± 11.4% (median, 10.7%) from baseline (P = 0.01). Interestingly, we observed a significant increase of 5.1% ± 5.6% (median, 5.4%) 1 week posttransplant in the choroid of donors (P = 0.01). The average choroid remained inflated relative to baseline measurements, and at some point between 4 and 8 weeks posttransplant, the choroid deflated. After 1 year posttransplant, we observed a significant decrease of −4.9% ± 7.9% (median, –3.5%) in automated CT (P = 0.01). 
Table 4.
 
Choroidal Measurements at Baseline and 1 Year Posttransplant (PT)
Table 4.
 
Choroidal Measurements at Baseline and 1 Year Posttransplant (PT)
Figure 6.
 
Longitudinal change in the choroid, measured relative to baseline measurements. Donors are shown as dashed lines and recipients as solid lines, with standard errors plotted at each time point.
Figure 6.
 
Longitudinal change in the choroid, measured relative to baseline measurements. Donors are shown as dashed lines and recipients as solid lines, with standard errors plotted at each time point.
In the recipient cohort, all choroidal measurements linearly correlated with eGFR, serum creatinine, and serum urea with statistical significance. All Pearson correlation coefficients with renal function were stronger in automated choroid measurements than in manual ones (Table 5). There were poor and insignificant linear correlations found between the choroid and renal function in the donor cohort, as seen in Supplementary Table S1, which also shows the corresponding P values for all correlation analyses performed. 
Table 5.
 
Pearson Correlation Coefficients Between Choroidal Measurements and eGFR, Serum Creatinine, and Serum Urea in the Recipient Cohort
Table 5.
 
Pearson Correlation Coefficients Between Choroidal Measurements and eGFR, Serum Creatinine, and Serum Urea in the Recipient Cohort
Using all choroidal measurements, in transplant recipients, LME modeling showed evidence of a statistically significant, positive linear association with eGFR and negative associations with serum creatinine and urea levels. Table 6 shows the model summary's for predicting eGFR, serum creatinine, and serum urea using each choroidal measurement in turn. Models using automated measures had greater conditional R2 values—a measure analogous to the R2 value for linear models but taking into account the random- and fixed-effects variables—and their respective independent variables showed stronger association to predicting renal function. Interestingly, the time difference between follow-up and baseline image acquisition was a statistically significant predictor of all markers of renal functions for models using all choroidal metrics. 
Table 6.
 
LME Summary Tables for Predicting eGFR, Serum Creatinine, and Serum Urea in Transplant Recipients Using Each Choroidal Measurement
Table 6.
 
LME Summary Tables for Predicting eGFR, Serum Creatinine, and Serum Urea in Transplant Recipients Using Each Choroidal Measurement
There were no statistically significant associations found between any choroidal measurement and markers of renal function in the donor cohort. However, sex was associated with predicting serum creatinine using automated choroidal measurements, with weak statistical significance. These LME results can be found in Supplementary Table S2
Discussion
We found that automated CT measurements agreed well with manual ones in general but have greater precision, especially when applied to a longitudinal series of images. Much of the superior precision from automated CT results from eliminating inconsistent measurement angle, since the automated method is always able to measure CT locally perpendicular at the exact same location across all time points for each patient—this is an important attribute for measurement protocol so that we can account for choroidal curvature, or if the choroid does not appear predominantly horizontal in the image. In contrast, error in the angle of manual measurement had a disproportionate impact. 
The RPE-C junction is almost never parallel to the C-S junction, making it a challenge for humans to measure perpendicular angles by eye across repeated measures, which is a major source of error. Furthermore, the different axial and lateral resolutions in each B-scan can result in the slightest change in marking a lateral position during manual grading to misrepresent the true CT. Figures 5A and 5C represent two exemplar cases, where the residuals between manual CT and perpendicular/parallel automated CT measurements are very different. 
Importantly, the superior precision of the automated method translated into stronger associations with renal function, at least in the recipient cohort, where statistical significance was observed. This illustrates the necessity for reliable and reproducible measurements in longitudinal studies and their subsequent impact on predicting clinical outcomes. 
Much of the discrepancy between manual and automated CT arose from choroidal features that are inherently difficult to measure. For example, the C-S junctions of larger choroids (Figs. 5A, 5C, 5D) are prone to poor-quality image acquisition due to a higher incidence of poor signal penetration. However, the external adjudicator preferred the automated measurements over the manual grader for these major discrepancies. Furthermore, the C-S junction may be obscured in choroids whose posterior of Haller's layer is low contrast (Fig. 5B)—here, the external adjudicator preferred the manual grader's measurement. The choroid in Figure 5A represents a source of major disagreement between the manual grader and the automated approach. The manual grader has defined the C-S junction as the boundary below the most visible posterior vessels (green), while the automated approach has successfully identified vessels with much lower visibility further below the clearer vessels (red), of which the external adjudicator agreed with. 
The choroids in Figure 5 represent just over half (51.9%) of all major discrepancies lying outside the threshold suggested to exceed interobserver variability between graders of EDI-OCT images.27 That is, most outliers came from the same four choroids at different time points. Moreover, of the 28 thickness measurements from choroids described as having poor visibility, 20 of them came from the larger choroids (Figs. 5A, 5C, 5D). 
From both correlation analysis and LME modeling, automated choroidal measurements were found to correspond with markers of renal function over time better than manual CT, probably because of reduced measurement error. All choroidal measurements changed substantially over time for all study participants, as did eGFR, serum creatinine, and serum urea. This is consistent with the choroid reflecting renal function during treatment of CKD. The choroid, and indeed CT/CA, does vary naturally during the day due to diurnal variation. Tan et al.30 describe the average change in CT across daytime hours to be approximately 8.5% ± 5.2%. The change in automated CT in transplant recipients 1 year posttransplant was 14.1% ± 11.4% (14.7% ± 11.2% for CA). This suggests that choroidal inflation in transplant recipients cannot be explained fully through diurnal variation but potentially through improved renal function.31 However, the same statement for healthy kidney donors cannot be made as confidently, where we only observed an average decrease in CT of −4.9% ± 7.9% (−5.3% ± 7.7% for CA). 
We must note that big fluid shifts occur at the time of transplantation, although we have not been able to account for any change in body fluid volume posttransplant and what impact that may have had on the choroid. However, these fluid shifts would only impact the choroid in the short term, and our follow-up data are of sufficient time course to exclude any potential long-term effect from body fluid volume on the significant changes we see in the choroid. 
In each LME model, we accounted for age, sex, approximate refractive error, and the intraparticipant relative-to-baseline daytime each scan was taken (relative daytime). The latter two measurements were used to assess the impact of myopia and diurnal variation in both cohorts for predicting renal function using the choroid. Relative daytime proved to be a significant predictor variable for markers of renal function alongside all choroidal measurements, while approximate refractive error did not. Note, however, that the standardized model coefficients are almost always larger in absolute value for the choroidal measurements versus relative daytime. 
In every model, there was an opposite effect of relative daytime compared with the choroid, suggesting that the size of the choroid is negatively offset such that the CT and CA in a follow-up scan taken later in the day (relative to the baseline acquisition time) are decreased in absolute value—and vice versa for follow-up scans taken earlier than the original baseline scan. This makes sense given we know that the choroid decreases marginally through the course of the day.30 
This suggests that while choroidal measurements still play a significant role in predicting markers of renal function for transplant recipients, it is important that daytime be recorded for downstream analysis, and image acquisition try be performed around the same time of day so as not to diminish any longitudinal signal present in the data. Note, however, that removing relative daytime still resulted in the choroid as a statistically significant predictor variable, but the overall model fit was marginally worse (data not shown). 
It may be that the immediate inflation of the choroid in the first 4 weeks after kidney donation in donors could be a result of a prompt systemic, cardiovascular response to the unilateral nephrectomy. While from 8 weeks posttransplant, we observed choroidal thinning from all measurements, consistent with previous literature,32 it is within the first month in a donor's recovery period that compensatory renal hypertrophy takes place.33,34 These observations align with Choi and Kim,31 who cautiously suggest that a decrease in subfoveal CT is associated with a decline in renal function, but its increase is associated with renal hypertrophy, albeit in patients with diabetic retinopathy. 
Low signal quality, vessel shadowing, lack of signal penetration, and the potential for extravascular fluid to obscure and occlude the boundary all contribute to the challenging task of tracing the C-S junction in OCT images. However, the automated segmentation methodology has the ability to overcome many of these problems due to its tunable hyperparameters and model uncertainty quantification.21 For example, the Bayesian recursive scheme allows uncertainty in obscured or occluded regions to be quantified and correctly interpolated. This feature is advantageous for choroid region segmentation. Moreover, the methodology provides an analytical, functional form for the edge that is explainable, interpretable, and reproducible—features that tend to be absent from many deep learning algorithms in medical image segmentation.35 
There were a number of limitations in this study. First, our two cohorts were limited in sample size, and not all participants or clinical variables were collected at every time point. Moreover, of the 20 donors and 16 recipients included for clinical evaluation, only 11 donors and 8 recipients were followed up 1 year posttransplant, which potentially introduced bias into our clinical evaluation and contributed to a reduction of statistical power to which we could draw our conclusions with. Furthermore, using a subcohort for clinical evaluation could potentially introduce selection bias when estimating associations between the choroid and renal function. Therefore, further studies with a larger and more complete data set would permit a higher confidence regarding the significance of our analyses. However, while the results of the clinical evaluation may suffer from a lack of complete image or clinical data, our primary objective of comparing manual CT with automated CT remains undiluted by these limitations. 
Another limitation is that the automated approach does require some, albeit minimal, form of manual intervention to select the edge endpoint pixels before tracing each boundary of the choroid in a single B-scan. Another limitation is the possibility of underapproximating the C-S junction for choroids such as the one in Figure 5B. Finally, while we have provided an analysis on regional quantities of the choroid and their potential links to microvascular injury and renal function, further work would aim to quantify any relationships with the vasculature within the choroid. This could be done, for example, by computing the choroid vasculature index, which has recently become a popular and more robust way to characterize the choroid for ocular and nonocular pathology.11,3638 
Conclusions
Our automated segmentation methodology can replace human measures for studying the choroid in renal disease and potentially other clinical conditions. We observed no systematic or proportional differences when comparing manual CT to automated CT measurements. While the automated approach agreed strongly with manual ones, automated CT eliminated angular error and therefore had greater precision when assessing within-patient longitudinal data. The error in manual measurement ultimately led to a lack of strength and significance in their clinical associations. High precision and reproducibility of image measures are critical for longitudinal studies of the choroid where repeated measurements are made using highly sensitive retinal imaging modalities for predicting clinical outcomes. 
We observed significant choroidal thickening in recipients 1 year posttransplant, and the choroid corresponded to improved eGFR, serum creatinine, and urea levels. In donors, we observed choroidal inflation in the first 4 weeks after the unilateral nephrectomy, followed by significant thinning 1 year posttransplant. Initial inflation of the choroid could possibly be related to compensatory renal hypertrophy, while long-term thinning may be linked to long-term impairment of renal function. Future work should aim at a fuller analysis with a larger and more complete data set in order to increase statistical power. 
Our general-purpose segmentation methodology is freely available on GitHub,39 and we are working on producing an automated, end-to-end, and open source framework for choroidal image analysis in OCT data. Automated tools to assist clinical prognosis and treatment will strengthen future clinical studies, enabling the delivery of robust, reproducible, and responsible research in areas that have previously been at risk of human error. 
Acknowledgments
The authors thank all participants in the study as well as all staff at the Edinburgh Imaging Facility who contributed to image acquisition for this study. 
Supported by the Medical Research Council (grant MR/N013166/1), as part of the Precision Medicine Doctoral Training Programme with the University of Edinburgh. 
Disclosure: J. Burke, None; D. Pugh, None; T. Farrah, None; C. Hamid, None; E. Godden, None; T.J. MacGillivray, None; N. Dhaun, None; J.K. Baillie, None; S. King, None; I.J.C. MacCormick, None 
References
Kovesdy CP . Epidemiology of chronic kidney disease: an update 2022. Kidney Int Suppl. 2022; 12: 7–11. [CrossRef]
Houben AJHM, Martens RJH, Stehouwer CDA. Assessing microvascular function in humans from a chronic disease perspective. J Am Soc Nephrol. 2017; 28: 3461–3472. [CrossRef] [PubMed]
Wong CW, Wong TY, Cheng C-Y, Sabanayagam C. Kidney and eye diseases: common risk factors, etiological mechanisms, and pathways. Kidney Int. 2014; 85: 1290–1302. [CrossRef] [PubMed]
Farrah TE, Dhillon B, Keane PA, Webb DJ, Dhaun N. The eye, the kidney, and cardiovascular disease: old concepts, better tools, and new horizons. Kidney Int. 2020; 98: 323–342. [CrossRef] [PubMed]
Nickla DL, Wallman J. The multifunctional choroid. Prog Retin Eye Res. 2010; 29: 144–168. [CrossRef] [PubMed]
Balmforth C, van Bragt JJMH, Ruijs T, et al. Chorioretinal thinning in chronic kidney disease links to inflammation and endothelial dysfunction. JCI Insight. 2016; 1: e89173. [CrossRef] [PubMed]
Kaur J, Young BE, Fadel PJ. Sympathetic overactivity in chronic kidney disease: consequences and mechanisms. Int J Mol Sci. 2017; 18: 1682. [CrossRef] [PubMed]
Courtie E, Veenith T, Logan A, Denniston AK, Blanch RJ. Retinal blood flow in critical illness and systemic disease: a review. Ann Intens Care. 2020; 10: 1–18. [CrossRef]
Erikson K, Liisanantti JH, Hautala N, et al. Retinal arterial blood flow and retinal changes in patients with sepsis: preliminary study using fluorescein angiography. Crit Care. 2017; 21: 1–8. [CrossRef] [PubMed]
Ishibazawa A, Nagaoka T, Minami Y, Kitahara M, Yamashita T, Yoshida A. Choroidal thickness evaluation before and after hemodialysis in patients with and without diabetes. Invest Ophthalmol Vis Sci. 2015; 56: 6534–6541. [CrossRef] [PubMed]
Nakano H, Hasebe H, Murakami K, et al. Choroid structure analysis following initiation of hemodialysis by using swept-source optical coherence tomography in patients with and without diabetes. PLoS One. 2020; 15(9): e0239072. [CrossRef] [PubMed]
Di Staso F, Ciancaglini M, Abdolrahimzadeh S, D'apolito F, Scuderi G. Optical coherence tomography of choroid in common neurological diseases. In Vivo. 2019; 33: 1403–1409. [CrossRef] [PubMed]
Garcia-Martin E, Jarauta L, Pablo LE, et al. Changes in peripapillary choroidal thickness in patients with multiple sclerosis. Acta Ophthalmol. 2019; 97: e77–e83. [CrossRef] [PubMed]
Bulut M, Yaman A, Erol MK, et al. Choroidal thickness in patients with mild cognitive impairment and Alzheimer's type dementia. J Ophthalmol. 2016; 2016: 7291257. [CrossRef] [PubMed]
Mishra Z, Ganegoda A, Selicha J, Wang Z, Sadda SR, Hu Z. Automated retinal layer segmentation using graph-based algorithm incorporating deep-learning-derived information. Sci Rep. 2020; 10: 1–8. [PubMed]
Zhang L, Buitendijk GHS, Lee K, et al. Validity of automated choroidal segmentation in SS-OCT and SD-OCT. Invest Ophthalmol Vis Sci. 2015; 56: 3202–3211. [CrossRef] [PubMed]
Masood S, Fang R, Li P, et al. Automatic choroid layer segmentation from optical coherence tomography images using deep learning. Sci Rep. 2019; 9: 1–18. [PubMed]
Zahavi O, Domı́nguez-Vicent A, Brautaset R, Venkataraman AP. Evaluation of automated segmentation algorithm for macular volumetric measurements of eight individual retinal layer thickness. Appl Sci. 2021; 11: 1250. [CrossRef]
Chen H-J, Huang Y-L, Tse S-L, et al. Application of artificial intelligence and deep learning for choroid segmentation in myopia. Transl Vis Sci Technol. 2022; 11: 38–38. [CrossRef]
Dolz-Marco R, Andreu-Fenoll M, Hernández-Martı́nez P, Pinazo-Durán MD, Gallego-Pinazo R. Automated macular choroidal thickness measurement by swept-source optical coherence tomography in pseudoxanthoma elasticum. Int J Retina Vitreous. 2016; 2: 1–6. [CrossRef] [PubMed]
Burke J, King S. Edge tracing using Gaussian process regression. IEEE Trans Image Process. 2021; 31: 138–148. [CrossRef] [PubMed]
Dhaun N . Optical Coherence Tomography And NEphropathy: the OCTANE study. Updated April 14, 2021. ClinicalTrials.gov. Accessed June 29, 2022.
Tewarie P, Balk L, Costello F, et al. The OSCAR-IB consensus criteria for retinal OCT quality assessment. PLoS One. 2012; 7: e34823. [CrossRef] [PubMed]
Early Treatment Diabetic Retinopathy Study design and baseline patient characteristics: ETDRS report number 7. Ophthalmology. 1991; 98: 741–756. [CrossRef] [PubMed]
Passing H, Bablok W. A new biometrical procedure for testing the equality of measurements from two different analytical methods. Application of linear regression procedures for method comparison studies in clinical chemistry, Part I. J Clin Chem Clin Biochem. 1983; 21(11): 709–720, doi:10.1515/cclm.1983.21.11.709.
Bland JM, Altman D. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986; 327: 307–310. [CrossRef]
Rahman W, Chen FK, Yeoh J, Patel P, Tufail A, Da Cruz L. Repeatability of manual subfoveal choroidal thickness measurements in healthy subjects using the technique of enhanced depth imaging optical coherence tomography. Invest Ophthalmol Vis Sci. 2011; 52: 2267–2271. [CrossRef] [PubMed]
Bates D, Mächler M, Bolker B, Walker S. Fitting linear mixed-effects models using lme4. J Stat Softw. 2015; 67: 1–48. [CrossRef]
Ranganathan P, Pramesh CS, Aggarwal R. Common pitfalls in statistical analysis: measures of agreement. Perspect Clin Res. 2017; 8: 187. [CrossRef] [PubMed]
Tan CS, Ouyang Y, Ruiz H, Sadda SR. Diurnal variation of choroidal thickness in normal, healthy subjects measured by spectral domain optical coherence tomography. Invest Ophthalmol Vis Sci. 2012; 53: 261–266. [CrossRef] [PubMed]
Choi MG, Kim JT. Strong correlation of renal function with choroidal thickness in patients with type 2 diabetes: retrospective cross-sectional study. J Clin Med. 2020; 9: 2171. [CrossRef] [PubMed]
Mulè G, Vadalà M, La Blasca T, et al. Association between early-stage chronic kidney disease and reduced choroidal thickness in essential hypertensive patients. Hypertens Res. 2019; 42: 990–1000. [CrossRef] [PubMed]
Anderson RG, Bueschen AJ, Lloyd LK, Dubovsky EV, Burns JR. Short-term and long-term changes in renal function after donor nephrectomy. J Urol. 1991; 145: 11–13. [CrossRef] [PubMed]
Rojas-Canales DM, Li JY, Makuei L, Gleadle JM. Compensatory renal hypertrophy following nephrectomy: when and how? Nephrology. 2019; 24: 1225–1232. [CrossRef] [PubMed]
Renard F, Guedria S, Palma ND, Vuillerme N. Variability and reproducibility in deep learning for medical image segmentation. Sci Rep. 2020; 10: 1–16. [CrossRef] [PubMed]
Iovino C, Pellegrini M, Bernabei F, et al. Choroidal vascularity index: an in-depth analysis of this novel optical coherence tomography parameter. J Clin Med. 2020; 9: 595. [CrossRef] [PubMed]
Betzler BK, Ding J, Wei X, et al. Choroidal vascularity index: a step towards software as a medical device. Br J Ophthalmol. 2022; 106: 149–155. [CrossRef] [PubMed]
Han X, Du N-N, Li S, Diao Z-L, Fu L, Liu W-H. Choroidal vascularity index assessment: a potential noninvasive technique for diagnosing diabetic nephropathy. J Ophthalmol. 2022; 2022: 3124746. [PubMed]
Burke J . Edge tracing using Gaussian process regression. GitHub. 2021, https://github.com/jaburke166/gaussian_process_edge_trace. Accessed December 15, 2022.
Figure 1.
 
EDI-OCT scan of a donor's right eye at baseline. (A) En face view of the fundus with the location of the B-scan in green and markers for measuring CT and CA in red. (B) B-scan showing chorioretinal structures with manual and automated assessment shown.
Figure 1.
 
EDI-OCT scan of a donor's right eye at baseline. (A) En face view of the fundus with the location of the B-scan in green and markers for measuring CT and CA in red. (B) B-scan showing chorioretinal structures with manual and automated assessment shown.
Figure 2.
 
Schematic diagram of the automated approach segmenting the C-S junction of the choroid on an EDI-OCT B-scan.21 GP, Gaussian process; GPR, Gaussian process regressor; KDE, kernel density estimate.
Figure 2.
 
Schematic diagram of the automated approach segmenting the C-S junction of the choroid on an EDI-OCT B-scan.21 GP, Gaussian process; GPR, Gaussian process regressor; KDE, kernel density estimate.
Figure 3.
 
(A) Correlation plot of manual and (parallel) automated CT measurements. (B) Bland–Altman plot showing residual values, with 52 major discrepancies highlighted as red crosses.
Figure 3.
 
(A) Correlation plot of manual and (parallel) automated CT measurements. (B) Bland–Altman plot showing residual values, with 52 major discrepancies highlighted as red crosses.
Figure 4.
 
(A) Distribution of angular errors from manual CT measurements (blue). Average unsigned difference between manual and perpendicular automated CT (red). (B) Unsigned, longitudinal within-patient angular deviations, relative to baseline measurements. Colored lines represent distinct individuals using manual measurements.
Figure 4.
 
(A) Distribution of angular errors from manual CT measurements (blue). Average unsigned difference between manual and perpendicular automated CT (red). (B) Unsigned, longitudinal within-patient angular deviations, relative to baseline measurements. Colored lines represent distinct individuals using manual measurements.
Figure 5.
 
(A) A selection of major discrepancies between automated and manual CT measurements. Red lines represent perpendicular automated CT, blue represent parallel automated CT, and green represent manual CT. The lines in red are shown in (A, C) to demonstrate the observed angular deviation between perpendicular automated CT and manual CT. (A) Temporal and subfoveal major discrepancies from a donor, residuals 97 µm and 114 µm, respectively. Nasal angular deviation (5.4°) resulted in perpendicular and parallel residuals as 22 µm and 2 µm, respectively. (B) Subfoveal major discrepancy from a recipient, residual –47 µm. (C) Nasal angular deviation (8.1°) shows perpendicular and parallel residuals as 106 µm and 79 µm, respectively. (D) Temporal and nasal major discrepancy from a donor, residuals 66 µm and 65 µm, respectively.
Figure 5.
 
(A) A selection of major discrepancies between automated and manual CT measurements. Red lines represent perpendicular automated CT, blue represent parallel automated CT, and green represent manual CT. The lines in red are shown in (A, C) to demonstrate the observed angular deviation between perpendicular automated CT and manual CT. (A) Temporal and subfoveal major discrepancies from a donor, residuals 97 µm and 114 µm, respectively. Nasal angular deviation (5.4°) resulted in perpendicular and parallel residuals as 22 µm and 2 µm, respectively. (B) Subfoveal major discrepancy from a recipient, residual –47 µm. (C) Nasal angular deviation (8.1°) shows perpendicular and parallel residuals as 106 µm and 79 µm, respectively. (D) Temporal and nasal major discrepancy from a donor, residuals 66 µm and 65 µm, respectively.
Figure 6.
 
Longitudinal change in the choroid, measured relative to baseline measurements. Donors are shown as dashed lines and recipients as solid lines, with standard errors plotted at each time point.
Figure 6.
 
Longitudinal change in the choroid, measured relative to baseline measurements. Donors are shown as dashed lines and recipients as solid lines, with standard errors plotted at each time point.
Table 1.
 
Population Statistics for All Donors and Recipients, Rounded to the Nearest Integer
Table 1.
 
Population Statistics for All Donors and Recipients, Rounded to the Nearest Integer
Table 2.
 
Performance Evaluation Between Automated and Manual CT, Stratified by Cohort and Macular Location
Table 2.
 
Performance Evaluation Between Automated and Manual CT, Stratified by Cohort and Macular Location
Table 3.
 
Numerical Score of Both Approaches from Masked Adjudication of 52 Major Discrepancies, Stratified by the Visibility of the C-S Junction, and Qualitative Preference Scores for Both Approaches
Table 3.
 
Numerical Score of Both Approaches from Masked Adjudication of 52 Major Discrepancies, Stratified by the Visibility of the C-S Junction, and Qualitative Preference Scores for Both Approaches
Table 4.
 
Choroidal Measurements at Baseline and 1 Year Posttransplant (PT)
Table 4.
 
Choroidal Measurements at Baseline and 1 Year Posttransplant (PT)
Table 5.
 
Pearson Correlation Coefficients Between Choroidal Measurements and eGFR, Serum Creatinine, and Serum Urea in the Recipient Cohort
Table 5.
 
Pearson Correlation Coefficients Between Choroidal Measurements and eGFR, Serum Creatinine, and Serum Urea in the Recipient Cohort
Table 6.
 
LME Summary Tables for Predicting eGFR, Serum Creatinine, and Serum Urea in Transplant Recipients Using Each Choroidal Measurement
Table 6.
 
LME Summary Tables for Predicting eGFR, Serum Creatinine, and Serum Urea in Transplant Recipients Using Each Choroidal Measurement
×
×

This PDF is available to Subscribers Only

Sign in or purchase a subscription to access this content. ×

You must be signed into an individual account to use this feature.

×