Findings from validity testing of the IVI show that we can combine disparate data sets of low patient responses from multiple studies to calibrate item measures and response category thresholds (
Table 2 and Supplementary Material) for the research and clinical community to use when employing the IVI as a PROM. Consistent with similar efforts in other fields standardizing the unit of measurement ensures invariant scales across studies that employ the same instrument, enabling users to administer the IVI to any size sample of visually impaired patients and directly compare results across studies.
9
Previous studies have divided the IVI items into three domains: reading, mobility, and emotional health. However, our analysis demonstrates that most of the variance in the observed person measures (75%) can be explained by a single vision-related principal axis that is highly correlated with reading and mobility. Given the high correlations observed and the majority of variance explained by vision-related items, a summary IVI score representing all items (in contrast to item subset analyses; e.g., domains), is likely adequate when measuring and reporting outcomes from medical interventions. Medical interventions are typically restorative in nature and the treatment effect is at the person level. This is in contrast to low vision rehabilitation where the effects are observed at the item level.
23 People's observations to the IVI reasonably fit the expectations of the model, however, like other VFQs, item fit analysis showed large amounts of variance.
24,25 DIF by center showed mostly good agreement between data sets in item measure estimates. Data sets (Finger et al.
13 and Gothwal et al.
17) with extreme demographic (younger) and vision characteristics (more profound levels of VA loss) were the greatest sources of DIF between centers. ANOVA by covariate (age, gender, disorder, VA) showed that certain items more commonly showed greater DIF magnitude. Item examples included eyesight interfering with reading ordinary size print (for example, newspapers), reading labels or instructions on medicine bottles, go carefully to avoid falling or tripping, felt sad or low because of your eyesight, worried about your eyesight getting worse, visiting family or friends, and opening packages. Some of the bias evident in responses to these items may represent true underlying differences between data sets, such as an excess of extremes in category responses. This observation does not preclude use of the IVI. Rather, were we designing a new instrument, we would consider modifying these items to minimize DIF for the relevant covariates differing between data sets.
The IVI person-item map shows that the strongest information in the estimated measures occurs in the middle of the visual ability distribution. At both tails of the distribution, there is an absence of items targeting patients with either very good or very poor visual ability. This is an important consideration when determining how appropriate the IVI questionnaire will be for measuring baseline visual ability and treatment outcomes in a given population. Detecting meaningful change (e.g., minimum clinically important difference) with a VFQ requires consideration of larger standard errors with extreme scores, and therefore a smaller, but meaningful effect may not be resolvable in people close to the floor or ceiling of the measure. To better target visual ability measurement in populations with more profound vision loss, the IVI –Very Low Vision and the Ultra-Low Vision Visual Functioning Questionnaire were developed.
13,26
VA was the strongest predictor of IVI person measure estimates with evidence of a monotonic decline in person measures with declining VA. This is consistent with all valid VFQs and supports the premise that VA (or resolution capacity) is one of the primary factors in assessing visual ability.
27 Gender was also predictive of IVI person measure estimates, with females showing relatively worse person measure estimates compared to males. Neither ocular diagnosis (AMD versus other) nor age were associated with person measure estimates.
The IVI-28 item calibration estimates provided in
Table 2 or in
Supplementary File S1 (MS excel with computations encoded or
Supplementary File S2 Winsteps CON file) may be used by researchers and clinicians to measure visual ability in patients with vision impairment. The strengths of this work include the large number of respondents to the IVI and the diverse patient population, both geographically and visually. Providing these IVI item calibrations enables researchers to more accurately and precisely compare treatment outcomes involving people with vision impairment, especially when the number of patients is small. Study limitations include an inability to assess responses to IVI items by more detailed diagnoses and that the calibrations do not reflect responses from individuals younger than 18 years old.