The GST was developed by Turpin et al.
11 with the aim of enabling detection of VF abnormalities in less than a minute, with high sensitivity and specificity towards glaucoma diagnosis. Although the GST was developed by computer simulation on the basis of VF data of a large population comprising subjects with glaucoma and normal controls, it has yet to be validated in a prospective clinical study. The present findings demonstrated that the duration of the GST in normal controls and patients with early glaucoma was less than a minute, while that in patients with moderate and advanced glaucoma was between 1 and 1.5 minutes. The sensitivity and specificity of the GST were 73% to 100% and 98% to 100%, respectively. These findings support the results obtained by computer simulation in the previous study.
11 In addition, in the present study, evaluation of diagnostic ability and repeatability of the GST relative to the G-Dynamic threshold revealed that, while the AUC of the GST
1st in the early glaucoma group was significantly lower than that of the G-Dynamic test, there were no significant differences in AUC at any other glaucoma stage among the three tests.
In the early glaucoma group (MD >−3 dB), the AUC and sensitivity (at 90% specificity) values of the GST
1st and GST
2nd measured by SAP were 0.864 to 0.889 and 72.7% to 78.8%, respectively; the corresponding values in the moderate glaucoma group (MD, −6 to −3 dB) were 0.968 to 0.970 and 93.9%, respectively. Previous studies on diagnosis of early glaucoma (MD > −6 dB) by specific perimetry — which involves stimulation of the magnocellular or koniocellular layer of the lateral geniculate body — reported AUC and sensitivity (at 90% specificity) values of 0.690 to 0.990
9,21–24 and 32.1% to 97.0%, respectively.
9,23,24 Previous studies on diagnosis of early glaucoma (MD > −6 dB) on the basis of total and quadrant thicknesses of the retinal nerve fiber layer observed on OCT images reported AUC and sensitivity (at 90% specificity) values of 0.610 to 0.947
25–28 and 22.0% to 86.8%, respectively.
25–28 Thus, the values of AUC and sensitivity at 90% specificity of the GST obtained by SAP in the present study are equivalent to or higher than those obtained by specific perimetry and OCT in previous studies. Because specific perimetry measures the neural pathway of retinal ganglion cells with low redundancy, the variability of specific perimetry findings have been reported to be higher than those of SAP findings not only in patients with glaucoma but also in normal controls.
15–17 Nevertheless, because of its high specificity and good repeatability, the GST still is able to discriminate between normal presentation and early glaucoma as well as the threshold strategy, specific perimetry, or optical coherence tomography (OCT).
In the present study, the GST exhibited moderate to complete repeatability for glaucoma diagnosis and each test point. However, the repeatability of each test point was slightly lower than that of glaucoma diagnosis, especially in the early and moderate glaucoma groups. The best cutoff criterion for diagnostic decision with GST in early to advanced stages of glaucoma was the presence of 1 or 2 abnormal points. The diagnostic decision became easier with the progression of glaucoma, and its repeatability was high even when each abnormal point did not completely correspond with GST
1st and GST
2nd. In contrast, upon evaluating the repeatability of each test point to determine if it indicated normal or abnormal VF, the κ value of each test point was lower than that of the diagnostic decision. Additionally, in early and moderate glaucoma, patients exhibit normally sensitive as well as relatively defective areas rather than absolutely defective areas. A previous study on the test–retest variability of each test point reported high variability not only at the relatively defective points but also at the normal points.
29 In addition, other studies have reported fixation behavior of 0.43° to 2.9° during SAP.
30,31 It is thought that fixation behavior during VF measurement affects the repeatability of each test point in patients with early and moderate glaucoma, who exhibit normal and relatively defective areas.
In the present study, the sensitivity and specificity of the optimal cutoff value for the number of abnormal points for differentiating between normal and abnormal VF were the highest when one or two locations remained unseen. For conventional SAP measurement with the HFA, the optimal cutoff value for the number of abnormal points on the pattern deviation probability plot for differentiating between normal and abnormal VF was defined by the number of points on the pattern deviation probability plot with
P < 5%, with more than three of the points being contiguous and one point having <1% probability.
1 Additionally, the optimal cutoff values at the highest sensitivity and specificity were determined to be one or two points in the FDT N-30 test,
23,32 two points in the FDT Matrix 30-2 test,
32 and five points in the Pulsar T30W test.
23 Despite the discrepancy in number of test points between the present and previous studies, the present findings indicating an optimal cutoff value of one or two points are closest to those obtained with FDT in terms of number of test points.
23,32 In a computer simulation, the sensitivity and specificity were reported to be the highest when three locations remained unseen in the HFA and the Wills dataset.
11 The slight difference between the present and previous findings might be attributable to the GST points being based on the G program. In addition, the control subjects in the present study were slightly younger than the patients in each of the glaucoma groups. Therefore, the present method likely is better to discriminate between normal and glaucomatous VF defects than the previous method.
The present study has a few limitations. First, subjects with cataract were excluded from the glaucoma and control groups. The GST was administered at a stimulus intensity <5% of the age-corrected normal limit, because of which, participants with overall decreased sensitivity due to ocular media opacity might have a received false-positive diagnosis. Second, to avoid the learning effect of perimetry, which is a well-established phenomenon in clinical practice, perimetric novices were not included in the present study. Third, the present study had a small sample size. Although the sample size was adequate for ROC analysis, it was far too small for comparison of AUC. These limitations must be corrected for evaluation of consecutive participants.
In conclusion, in the present study, the duration of the GST in the control and early glaucoma groups was less than 1 minute, while that in the moderate and advanced glaucoma groups was within 1.5 minutes. The sensitivity and specificity of the GST were 73% to 100% and 98% to 100%, respectively. The present findings support the previously reported results of computer simulation.
11 The newly developed suprathreshold GST for early to advanced glaucoma exhibits high diagnostic ability and moderate to strong repeatability, indicating that it would be an effective clinical method for glaucoma screening by SAP.