Abstract
Purpose:
It has been suggested that the detection of visual field progression can be improved by modeling statistical properties of the data such as the increasing retest variability and the spatial correlation among visual field locations. We compared a method that models those properties, Analysis with Non-Stationary Weibull Error Regression and Spatial Enhancement (ANSWERS), against a simpler one that does not, Permutation of Pointwise Linear Regression (PoPLR).
Methods:
Visual field series from three independent longitudinal studies in patients with glaucoma were used to compare the positive rate of PoPLR and ANSWERS. To estimate the false-positive rate, the same visual field series were randomly re-ordered in time. The first dataset consisted of series of 7 visual fields from 101 eyes, the second consisted of series of 9 visual fields from 150 eyes, and the third consisted of series of more than 9 visual fields (17.5 on average) from 139 eyes.
Results:
For a statistical significance of 0.05, the false-positive rates for ANSWERS were about 3 times greater than expected at 15%, 17%, and 16%, respectively, whereas for PoPLR they were 7%, 3%, and 6%. After equating the specificities at 0.05 for both models, positive rates for ANSWERS were 16%, 25%, and 38%, whereas for PoPLR they were 12%, 33%, and 49%, or about 5% greater on average (95% confidence interval = −1% to 11%).
Conclusions:
Despite being simpler and less computationally demanding, PoPLR was at least as sensitive to deterioration as ANSWERS once the specificities were equated.
Translational Relevance:
Close control of false-positive rates is key when visual fields of patients are analyzed for change in both clinical practice and clinical trials.
First, for the three datasets, we calculated the positive rates of PoPLR and ANSWERS at significance levels (α) ranging from P < 0.001 to P < 0.15; that is, we obtained the proportion of series for which the P value derived by PoPLR and ANSWERS was lower than α.
Second, we assessed whether the P values derived by PoPLR and ANSWERS were accurate. For this, we computed the positive rates for each dataset as in the first analysis, but after randomly re-arranging the time order of the visual fields in each series once. Because the re-ordering is at random, this process is expected to reduce any systematic change in the original series to chance levels. Therefore, the progression rate measured in a sample of re-ordered series equals the false-positive rate, within chance variation. If the P values returned by PoPLR and ANSWERS are accurate, the false-positive rate should equal the nominal significance level α within sampling error. That is, for α = 0.05, the empirically calculated false-positive rate with the re-ordered series should be approximately 5%; for α = 0.15, it should be approximately 15%. Because the computation of P values with ANSWERS is computationally demanding, we derived the false-positive rates from only one random permutation of each series.
Finally, because the false-positive rate estimates were based on relatively small sample sizes, we carried out an alternative assessment of the accuracy of the P values derived by ANSWERS. We derived individualized P values with ANSWERS in an approach similar to PoPLR. More precisely, the P values were derived by establishing the null distribution of ANSWERS’ S-statistic from 1000 random permutations of each original visual field series. By design, this approach ensures close control over the false-positive rate and the accuracy of the P values. Positive rates were then obtained for this modified ANSWERS model as in the first analysis. Because ANSWERS is computationally highly demanding this became practicable only through use of the high-performance computing facilities at the University of Melbourne.
Our implementation of ANSWERS failed to converge in 2 out of the 101 eyes of the P3 dataset and 6 out of the 139 eyes from the Rotterdam dataset. The eight eyes that could not be analyzed with ANSWERS were removed from the study. Thus, the sample sizes for the subsequent analyses were 99 eyes for the P3 dataset, 150 eyes for the DIGS-ADAGES dataset, and 133 eyes for the Rotterdam dataset.
The upper panel of
Figure 1 shows the positive rates for PoPLR and ANSWERS for the three datasets. The lower panel of
Figure 1 shows the false-positive rates calculated after randomly re-ordering the visual fields in each series.
The positive rates for ANSWERS were clearly greater than for PoPLR, as shown in the upper panels of
Figure 1. But, as shown in the lower panel, so were the false-positive rates. To correct for this disparity in false-positive rates and thus allow for a fair comparison of the positive rates obtained with the models,
Figure 2 shows the positive rate (
y-axes in the uppers panel of
Fig. 1) as a function of the false-positive rates (
y-axes in the lower panels of
Fig. 1).
For all three datasets, positive rates for PoPLR were similar or greater than those for ANSWERS.
To confirm these findings, we recomputed
P values using 1000 random visual field permutations with ANSWERS.
Figure 3 shows the positive rates obtained for random permutation with this modified version of ANSWERS for the Rotterdam dataset. For comparison,
Figure 3 also shows the positive rate obtained for PoPLR and for ANSWERS as a function of the false-positive rate (the black and red curves in the right panel of
Fig. 2).
The positive rates obtained for random permutation with the modified ANSWERS (blue curve) matched those for ANSWERS expressed as a function of its false-positive rates (red curve). Positive rates were similar to each other and always clearly lower than for PoPLR.
Supported by National Institutes of Health (NIH) grant EY025756 (to L.R.), Computational Optometry (IMF), and an unrestricted grant from Research to Prevent Blindness. The DIGS and ADAGES studies were supported by NIH grant numbers P30EY022589, EY021818, EY11008, U10EY14267, and EY019869; Eyesight Foundation of Alabama; Alcon Laboratories, Inc.; Allergan, Inc.; Pfizer, Inc.; Merck, Inc.; Santen, Inc.; Edith C. Blum Research Fund of the New York Glaucoma Research Institute (New York, NY, USA); and an unrestricted grant from Research to Prevent Blindness (New York, NY, USA). The P3 study was supported by NIH grant numbers EY19674 and EY020922; and an unrestricted grant from the Good Samaritan Foundation.
Disclosure: I. Marín-Franch, None; P.H. Artes, None; A. Turpin, None; L. Racette, None