Abstract
Purpose:
This article describes the Humphrey field analyzer (HFA) dataset from the Department of Ophthalmology at the University of Washington.
Methods:
Pointwise sensitivities were extracted from HFA 24-2, stimulus III visual fields (VF). Total deviation (TD), mean TD (MTD), pattern deviation, and pattern standard deviation (PSD) were calculated. Progression analysis was performed with simple linear regression on global, regional, and pointwise values for VF series with greater than four tests spanning at least four months. VF data were extracted independently of clinical information except for patient age, gender, and laterality
Results:
This dataset includes 28,943 VFs from 7248 eyes of 3871 patients. Progression was calculated for 2985 eyes from 1579 patients. Median [interquartile range] age was 64 years [54, 73], and follow-up was 2.49 years [1.11, 5.03]. Baseline MTD was −4.51 dB [−8.01, −2.65], and baseline PSD was 2.41 dB [1.7, 5.34].
Conclusion:
MTD was found to decrease by −0.10 dB/yr [−0.40, 0.11] in eyes for which progression analysis was able to be performed. VFs with deep localized defects, PSD > 12 dB and MTD −15 dB to −25 dB, were plotted, visually inspected, and found to be consistent with neurologic or glaucomatous VFs from patients. For a small number of tests, extracted sensitivity values were compared to corresponding printouts and confirmed to match.
Translational Relevance:
This open access pointwise VF dataset serves as a source of raw data for investigation such as VF behavior, clinical comparisons to trials, and development of new machine learning algorithms.
We present an open-sourced, observational VF dataset curated from a single academic institution with progression analysis performed on series with at least four VF tests over at least four months. The raw sensitivities and progression of pointwise, clusterwise and global values are included in the raw data files. To our knowledge, this is the first open access VF dataset of this magnitude to be made available for research.
Rates of change in this dataset are in line with those presented previously. Spry and Johnson
11 reported a progression rate of 0.64 dB per decade in normal eyes, which is in line with the difference we found between the MS and MTD progression rate. They also reported that the rate may be affected by age, eccentricity of the test location, and the hemifield.
11 In this dataset, the most damage at baseline occurred in the superior hemifield (
Fig. 2), in agreement with previous reports.
12,13 However, in our dataset, the inferior field was the fastest progressing region. Other studies in glaucoma patients report a wide range for mean deviation rate of progression from −0.05 to −0.57 dB/year.
14–20 The variability of MD rates could be influenced by factors such as different baseline glaucoma severity, surgical interventions during the follow-up period, and different follow-up times. Although MD is not available in this dataset because of proprietary location-specific variability estimates, the surrogate mean total deviation has been shown to be equvalent in interpretation.
7,8
Pointwise sensitivities were extracted from the binary header data in the VF file for this dataset. Pointwise sensitivities can also be obtained by exporting the data from Zeiss Forum or, more recently, by extracting values from the DICOM file or images of the VF report with third-party software.
21,22 Zeiss Forum was not available at our institution at the time of data extraction. Duplicate tests from the extraction process used for this dataset had to be identified and removed. These duplicate tests likely occurred due to a combination of correction of user input errors, relocations of devices across different sites, or transitions between data servers across the years.
This open access VF dataset is the first of its size to be published and aims to lower the barrier to entry for the scientific community.
4 We provide both summary statistics and technical validation of our dataset. An open-access VF dataset would have a number of applications. It could be used to explore localized rates of change and interactions of neighboring VF locations. It can serve as a clinically-derived point of comparison to other VF datasets from other studies or for future clinical trials. It could also be used to evaluate the effects of different criteria for progression analysis. The size of this dataset opens avenues for possible machine learning applications.
Limitations to this dataset exist. First, the VF data were extracted independently of clinic information other than patient age, gender, and laterality. Some clinical information was not included, such as initiation and timing of treatment and surgical interventions. The dataset represents all patients undergoing VF testing at an academic institution with or without glaucoma diagnosis, and may not reflect the general patient population. Further work will have to be performed to identify, categorize, and rectify relevant health records. Second, reliability indices were not extracted, but the effects of less-reliable tests would be somewhat mitigated by the number of eyes and tests in this dataset. Third, proprietary information in the HFA limited the information that could be extracted. Probability deviation maps were not available, along with the Glaucoma Hemifield Test classification and Statpac analysis such as the Guided Progression Analysis, which are used clinically to help determine glaucoma progression and diagnosis. Likewise, our TD and PD values were not directly extracted from the test but rather calculated from the sensitivity values. However, as explained in the Methods section, the TD maps were derived as deviations from the normative values extracted from the machine and are therefore likely to be accurate. We confirmed this by showing that the difference between the average rate of progression for MS and MTD matched the expected sensitivity decline because of aging alone (−0.06 dB/year).
11 PD maps were derived according to the Imaging and Perimetry Society standards from TD maps. Therefore, provided that the TD values are correct, they should be reflective of what would be obtained from the actual HFA printouts. It is also worth mentioning that other freely available software, such as the visualFields package for R,
23 provide independent normative datasets and tools to calculate all these metrics from any dataset, including ours. It should be finally noted that we used simple linear regression to quantify progression. Better and more precise methods exist.
24–28 However, the scope of our progression analysis was mainly to provide descriptive statistics of the sample for researchers making use of our database, for example to select specific patients based on their rate of progression. As such, we chose simple linear regression as a straightforward method, easily replicated by other researchers for validation of their results. On the other hand, because we are making our database fully available, researchers in different fields would be able to apply their preferred method for detection of progression for their specific applications.
The transition of health data to an electronic health record format opens the possibility of access to large datasets for research. Large, high-quality datasets have provided the foundation for innovative statistical and computational models but also present a barrier of entry. This VF dataset aims to help establish an open access repository so that the scientific community may use it to accelerate discoveries in this field.
The authors thank the patients and the clinicians in the glaucoma, neuro-ophthalmology, and comprehensive divisions at the University of Washington Department of Ophthalmology who contributed to the formation of this VF dataset.
Supported by NIH/NEI K23EY029246 (Bethesda, MD), NIH/NIA R01AG060942 (Bethesda, MD), Latham Vision Research Innovation Award (Seattle, WA), and an unrestricted grant from Research to Prevent Blindness (New York, NY).
Disclosure: G. Montesano, CenterVue SpA (C); A. Chen, None; R. Lu, None; C.S. Lee, None; A.Y. Lee, U.S. Food and Drug Administration (F), Santen (R), Carl Zeiss Meditec (R), and Novartis (R), personal fees from Genentech (I), Topcon(I), and Verana Health (I)