The steps used by the MRF were scaled to range from 0 to 30 dB over seven discrete levels. Adopting seven levels retains estimates within the capacity of the 8-bit output and can be justified in terms of sampling efficiency. Optimizing sampling efficiency recognizes that threshold variability increases with reduced sensitivity, so step size needs to increase. The conclusion is that seven to eight steps are optimal to sample over a range of 30 dB.
12 We have adopted a three-presentation binary Baysean protocol to yield eight steps (2
3) across the 30-dB range (Zippy Estimation by Sequential Testing, ZEST). Our thresholding approach is a binary logic that commences at a level (probability density function [PDF]; see Fig. 1 of Vingrys and Pianta
13) that is easily visible to 97.5% of normal observers (17 dB). This starting value has the advantage that it provides reinforcement of the task in regions of normal sensitivity, because it is easily seen, and can be used as a screening level for regions with abnormal sensitivity. The rest of the decision tree sequence is predetermined by a modified ZEST procedure
14 (see Fig. 3 of Vingrys and Pianta
12). For the MRF implementation, we have assumed that our observers were ‘reliable', knowing full well that they are not. We believe that doing so is justified by the small number of false responses returned by the majority of patients, which in practice is less than 5% (see later where we find no change in 97% of retested values) and that, this approach requires fewer presentations than does one that continues testing after polling a false response early in a test sequence in an otherwise reliable person (see Fig. 2 of Phipps et al.
15). We deal with cases of lapses in concentration, by retesting points found to be removed from their neighbors. In doing so, we acknowledge that some people will be unreliable during testing but we believe that such people will frustrate endpoint estimates from any test logic. However, one benefit of our approach is that it is rapid requiring three steps for an endpoint, so retesting unreliable locations or unreliable patients becomes a preferred option to continuing with the thresholding processes for a long time in an effort to rectify false response.
15 The modified ZEST used in our implementation varies the slope of the likelihood function with sensitivity rather than using a fixed slope.
14 The population PDF and the likelihood functions associated with a response matrix were derived by reanalysis of 17,390 threshold determinations reported by Vingrys and Pianta
13 scaled to 30 dB. The ZEST logic was then applied to yield a binary decision tree, which gave eight potential end-points (2
3) that end up being spaced by approximately 4 to 6 dB across the available range of 0 to 30 dB. The MRF implementation has adopted seven of these eight outcomes by collapsing the lower three into two levels, as these were found to be closely spaced. These levels are: 0, 6, 12, 17, 22, 26, and 30 dB.