The SZEST procedure, described in full below, represents a ZEST procedure with a prior probability mass function (
pmf) centered on an estimate of threshold. This is in contrast to the ZEST procedures in the perimetry literature, where the
pmf is typically derived from population data.
3,16 Figure 1 shows a comparison of threshold estimates and number of presentations made by SZEST with four Gaussian prior
pmfs to the exact same procedure with a population-derived
pmf (a standard ZEST procedure, the
pmf used is shown in Turpin et al.,
3 Fig. 2). As would be expected,
Figure 1 shows that using a prior
pmf weighted close to the true threshold of an observer improves procedure performance compared with a generic prior
pmf. However, using a prior
pmf weighted away from true threshold can increase error and number of presentations above the population-based ZEST procedure. In this study, therefore, we investigate how good the prior
pmf of SZEST must be in order to provide gains in clinical applications, where we consider SITA Standard to represent current clinical standards.
As the exact details of SITA Standard are not available, we evaluate SZEST against FT as a surrogate for SITA Standard. FT is a staircase procedure that has largely been superseded in clinical use by the SITA procedures.
4 The full details of the SITA algorithms are not publically available, so we use FT as a surrogate as it has similar test–retest characteristics to SITA Standard.
2,3,17–20 SITA Standard is on average 1 to 1.5 presentations faster than FT at each location, thus, we make sure that SZEST is also faster than FT to allow our indirect comparison between SZEST and SITA to be fair.
FT presents stimuli in 4 dB increments until a response reversal occurs, and in 2 dB increments thereafter. The output threshold is the intensity of the “last seen” stimulus after two response reversals. For primary locations of the 24-2 pattern (±9°, ±9° tested first), FT begins at 25 dB, so this is what we used as the first presentation for simulations of single locations (Experiment One). If a patient does not respond to the brightest stimulus (0 dB) twice then 0 dB is returned as the output threshold. In our FT implementation, if the output threshold was more than 4 dB away from the starting point, then a second staircase was started, identical to the first except starting from the initial estimate. This second estimate was taken as the output threshold, and presentations made in both staircases were counted.
For full-field simulations (Experiment Two), FT used a growth pattern, whereby after sensitivity was estimated at the four primary locations, thresholds at neighboring locations were estimated with the procedure starting from a decibel value equal to the neighboring location's estimate plus a correction for eccentricity. The growth pattern used in this study was identical to that illustrated in Turpin et al.,
3 figure 1.
SZEST is a modified ZEST procedure in which the prior
pmf is a Gaussian distribution centered on a sensitivity prediction made by a structural measure. We chose a Bayesian procedure because Bayesian-like procedures are well established in perimetry and easily incorporate different sources of prior information or different desired outcomes. For example, the weighting applied to prior information can be adjusted according to the predictive power of the information and the consequences of erroneous predictions. Staircase procedures such as FT can also be modified to incorporate prior information by altering the start point of the staircase, but this has little effect on procedure bias when the starting guess is within ±10 dB of the true value.
21 Finally, Bayesian procedures have been shown in previous simulation studies to estimate threshold with less bias than FT.
3
ZEST procedure performance is influenced by its prior
pmf, likelihood function and termination criteria.
5,22 We were primarily interested in the effect of altering the prior
pmf, so used the same likelihood function as in previous simulations (shown in Turpin et al.,
3 figure 2) whose slope is derived from frequency-of-seeing curves of healthy observers to perimetric stimuli.
23 We chose to use dynamic termination criteria as these reduce test variability.
22 We trialled many combinations of termination criteria and prior
pmf SDs (data not shown), and settled on a prior
pmf with SD 5 dB, and a termination criterion of posterior
pmf SD less than 1.5 dB. This represented the best trade-off between accuracy and test duration in initial simulations where prior information perfectly predicted sensitivity. The procedure was implemented as a standard ZEST procedure.
3,6,22 Once the procedure terminated, the expectation of the final posterior
pmf was rounded to the nearest integer decibel value and reported as the final threshold estimate. The simulated perimeter had a range of stimulus intensities from 0 to 40 dB, but
pmfs were calculated over a range extended by 10 dB either side of this so that thresholds close to the range limits of the perimeter were achievable. For full-field simulations using SZEST each location was tested independently; no growth pattern was used.