This study demonstrated that the ratio between PhNR PT amplitude and the B-wave amplitude (defined as the a-wave trough to b-wave peak; PT/B) was the measure that exhibited the lowest magnitude of test-retest variability (and was henceforth referred to as the PhNR/B ratio as it was the primary outcome measure used in this study). This study also determined that the PhNR/B ratio did not exhibit systematic changes over the seven measurements obtained at each session, and that the magnitude of its repeatability did not vary significantly under different testing conditions (both between sessions or between electrode placement by different examiners) using a handheld, nonmydriatic ERG system with self-adhering skin electrodes. It also demonstrated that improved repeatability could be achieved by increasing the number of recordings obtained.
In this study, we observed that the optimal measure for the PhNR was the PT/B ratio, producing the lowest level of test-retest variability relative to its EDR. Although we had previously reported the PT amplitude as the most repeatable measure,
23 reanalysis of the data from that study after normalizing the different measures confirmed that the PT/B ratio also exhibited the lowest level of test-retest variability in that study. Normalization to the EDR has also been performed for both standard automated perimetry (SAP)
26 and retinal nerve fiber layer (RNFL) thickness on optical coherence tomography (OCT)
27 in eyes with glaucoma, when seeking to compare the number of discriminable steps (or SDs from the normal to floor values based on measurement variability) between different stimulus parameters and devices, respectively. For instance, the global RNFL thickness ranged from having between 8.3 to 10.6 discriminable steps for the three different devices.
27 For the PhNR/B ratio in this study, there would be 6.5 discriminable steps from normal to floor values based on its CoR of 30% of the EDR (using a similar method of calculation). Although these findings may suggest that the PhNR may be less useful for monitoring disease progression than the RNFL thickness, its usefulness may lie in its potential prognostic value
17 and relevance as a biomarker for neuroretinal function that is reversible,
32 although future studies are required to investigate this further.
We also confirmed in this study that there were no systematic changes between the first two PhNR/B ratio measurements within each session. This is consistent with previous findings that there were no significant interocular differences for PhNR measurements within the same session for normal participants (suggesting that there were no significant intrasession changes, if both eyes are assumed to be equal).
24 We sought to establish this because a previous study observed an unexplained decline in the pattern electroretinogram (PERG) amplitude between repeated measurements within the same session,
33 although such intrasession changes have not been observed with conventional full-field ERG.
34 This finding suggests that the first measurement is comparable with all subsequent measurements, and a practice measurement is not required to obtain comparable subsequent responses. We also confirmed that no systematic changes occurred over the seven measurements performed in this study, which is important because clinicians may wish to take multiple measurements in both eyes for participants with noisy recordings in order to obtain a reliable response.
We also observed that there were no significant differences in the repeatability of the PhNR/B ratio within- and between-sessions, and between different electrode placements. We had hypothesized that the intersession variability would be significantly higher than the intrasession variability (as observed in previous studies using other electroretinography techniques
34–36) due to differences in electrode placement. We examined this by replacing the electrodes within the same session in a masked fashion by a different examiner. We sought to introduce further interexaminer variability by including a clinician and nonclinician examiner for electrode placements, but yet did not find significant differences in the measurement repeatability. It is possible that these findings are attributed to the self-adhering electrode array used in this study, which appeared to be more stable and easier to set up than the conjunctival fiber and skin gold cup electrodes that we have used previously from our anecdotal observations; further studies are required to determine if this is the case.
Intersession repeatability of the PhNR/B ratio also improved when including an increasing number of sweeps in this study, which is consistent with the findings from our recent study
28 and also the notion that the statistical estimate of a mean improves with more samples. We had hypothesized that no further improvements in the measurement repeatability would be observed when deriving a final trace from more than one recording (that consisted of obtaining 200 sweeps) because it would have fallen within the noise levels of the measurement. However, we observed that a final trace that included up to three recordings exhibited a significantly smaller magnitude of variability compared with those derived from just one recording. These findings are useful for both researchers and clinicians when deciding how many sweeps to include when using this testing protocol, depending on the level of measurement precision desired.
The ability to obtain PhNR measurements using a portable, handheld, full-field ERG system that is simple to implement clinically compared with the standard full-field ERG system overcomes previous impediments to its widespread and routine implementation. Interestingly, we also observed that the repeatability of the PhNR BT and PT/B ratio in this study was more than three times better than our previous study, where recordings were obtained using conjunctival electrodes and included only 10 sweeps.
23 The repeatability of the PhNR BT measure in this study was also approximately 2 and 1.5 times better than those obtained using skin and conjunctival electrodes, respectively, in another previous study
24 that included 40 sweeps (the limits of agreement were 88% and 63% of the mean value of the normal subjects for the skin and conjunctival electrodes, respectively). This improved repeatability is most likely attributed to the larger number of sweeps (through more recordings) used to derive the final trace in this study, and suggests that the protocol used in this study could have a comparable (if not better) level of repeatability than those often used in clinical or research studies.
11,17,20–24 However, it is likely that the repeatability of the PhNR may be even better when using corneal bipolar electrodes compared with the self-adhering skin electrodes used in this study, but the use of such electrodes are more invasive and requires experienced examiners. The finding that the PT/B ratio for measuring the PhNR was the most repeatable measure also highlights its use over standard measurements of the PhNR amplitude from the baseline when evaluating eyes with diseases that only affect this component of the full-field ERG, such as glaucoma. However, the finding that the PT/B ratio was the most repeatable measure in normal subjects should not diminish the importance of examining the BT amplitude in diseased eyes, because ocular diseases that affect other components of the full-field ERG response (including the a- and b-waves) will affect the PT/B ratio and thus obscure the ability to detect changes to the PhNR. In addition, improvements in the repeatability of the PhNR (by means of the protocol or measure used) should not be the sole way of judging clinical performance, and future studies are required to examine whether such improvements result in an improved ability to detect and monitor ocular diseases and evaluate treatment efficacy.
Some potential limitations need to be recognized when interpreting the results of this study, including the method by which the EDR was calculated. Although it is assumed that the floor amplitude of the PhNR in response to a brief flash is zero based on animal studies
1 and our observations of patients with advanced glaucoma, this has not yet been confirmed by comparing the amplitude of the PhNR with visual sensitivity on SAP.
27 In addition, we assumed that the a- and b-waves of the ERG recording would be unchanged when the PhNR response was at its floor, which is an assumption that is required when evaluating any PhNR measure that include these other components (such as the PT/B ratio). However, these assumptions were kept consistent for all methods used to measure the PhNR in this study, and we therefore believe the comparison is valid. The limited sample size and age range of the participants may also prohibit its generalizability for younger or older participants. However, we sought to include participants that were of a similar age to most participants with preperimetric glaucoma, because the PhNR may be most useful for this stage of the disease. Future studies are required to confirm these findings in other age groups and in a larger sample size. In addition, the self-adhering skin electrodes used in this study obtains smaller signal amplitude than corneal or conjunctival electrodes typically used in ERG recordings. For instance, the a-wave trough to b-wave peak amplitude of the self-adhering skin electrodes was 26% of the amplitude for conjunctival electrodes in our previous study,
28 smaller compared with skin gold cup electrodes using a full-field red-on-blue stimulation (between 36%
28 and 38%
24) and when measuring PERG amplitude (30%
33). However, these studies have demonstrated that the magnitude of test-retest variability relative to signal amplitude was similar between skin and conjunctival electrodes,
24,28,33 and one study showed that the capacity of both electrodes to diagnose glaucoma was similar.
33 Nevertheless, future studies are required to establish whether the self-adhering skin electrodes used in this study is comparably effective for detecting and monitoring changes in the PhNR in eyes with glaucoma.
In conclusion, this study showed that the PT/B ratio was the optimal measure of the PhNR and that its CoR fell within ± 30% of its EDR when obtaining measurements using a portable, handheld full-field ERG system that uses self-adhering skin electrodes. These findings highlight the potential clinical use of this technique as an objective measure of neuroretinal function in glaucoma and other ocular conditions.