Vision restoration for the blind attempts to restore, at least to some extent, the visual perception of the type we know to be available for fully or partially sighted individuals. I define “vision” here, as a sensory experience that results in perception with spatial-temporal characteristics typical of the natural visual system, although it may be of lower resolution and lower temporal bandwidth. Such vision should support the ability to convert a 2D (sensory) image back to a 3D perception of the scene from which it was derived (the inverse problem
1). This may be achieved using monocular cues (e.g. perspective, occlusion, relative size, and motion parallax). Such vision will exhibit Gestalt properties,
2 such as good continuation, closure (reification), common fate, common region, element connectedness, proximity, etc. Vision enables us to recognize objects from novel viewpoints and to identify novel objects after seeing examples from the same category (generalization). Vision is used to control our self-actions, and discounts self-motion in the environment, enabling us to perceive a stable world as we move through it (even in virtual motion) and distinguish other motions within the sensory input. Such vision will also enable figure-ground segregation. Restored vision should exhibit as many of these properties as possible. It is necessary to demonstrate that prosthetic devices or other vision restoration modalities, such as gene therapy, optogenetics, or stem cell therapy, provide these capabilities.
I explicitly distinguish visual prostheses designed to restore vision from visual aids that are not designed or required to provide vision or visual experience. For example, text-to-speech systems provide reading capabilities but do not restore vision. The long cane, an amazingly effective mobility aid, enables blind people to commute without restoring vision. Visual sensory substitution devices (SSDs) using non-vision input (i.e. hearing
3 and tactile
4) may be considered to be visual prostheses. I reserve the term “visual SSD” for systems that are intended to provide vision through another sensory modality. This is distinguished from cross modal aids (e.g. Braille or text to speech applications). In an analogue way, I consider the cochlear implant to be an auditory restoration device, whereas sign language and captions for the hearing impaired to be merely auditory aids. Testing a visual prosthesis should first and foremost determine if the prosthesis provides vision. Secondary testing can then evaluate the performance parameters of the system. Unfortunately, the testing is frequently limited to the secondary parameter’s evaluation skipping the crucial proof of visual perception.
It is not known if SSDs, such as tactile stimulation of the skin,
5 electrical stimulation of the tongue (for example, the BrainPort
4; Wicab, Inc.; Middleton, WI), or visual-to-auditory sensory substitution devices
6–9 lead to visual perception. All these SSD systems lead to a sensory response, but that is not necessarily vision as it is defined above. It is established that electrical retinal stimulation systems like the Argus II
10 (Second Sight, Sylmar, CA), the Alpha IMS
11 (Retina Implant AG, Reutlingen, Germany), the IRIS II (Pixium Vision, Paris, France), or the PRIMA
12 (Pixium Vision) result in visual sensory responses (phosphenes). This, however, does not mean that these retinal prosthetics restore vision, as defined here. In fact, 16 patients implanted with either the Argus II or the Iris II system described their perception of the experience “as fundamentally, qualitatively different than natural vision,”
13 and those perceptions did not change over time resulting in the eventual abandonment of the use of the devices.
13
One could argue that if a visual aid provides any discrimination of light from dark, it is a vision restoration system (light perception). I would concede that such a definition may be acceptable in some general sense, but such a general definition is neither useful nor helpful in the context of visual prosthesis for the blind and the evaluation of vision restoration systems or treatments, even if that minimal capability enables the user to perform better on some visual tasks. A single light sensor system may be of some limited use, but it needs not be implanted and can be provided at extremely low cost and risks. However, no one considers such a simple visual aid to provide vision restoration. Such performance does not justify the cost and risk of implanted prosthetics or other invasive vision restoration systems. I will argue here that most current evaluation methods of prosthetic systems that use standard clinical or psychophysical vision test paradigms do not demonstrate that the systems actually restore vision, despite the ability of the users to perform above chance on the clinical tests. In fact, some testing explicitly showed that the performance is essentially equivalent to a single sensor system.
14,15
Standard clinical vision testing procedures are not designed to test the ability of prosthetic devices to restore vision, and thus they are inappropriate measurement tools. Yet, there is an effort to develop a consensus about the use of these techniques in evaluating prosthetics.
16 The vision tests used in clinics and psychophysics laboratories (i.e. perimetry, color vision testing, acuity, contrast sensitivity, and orientation discrimination tests) are designed to measure the limits of a working, functioning visual system that meets our perceptual needs. They are not designed to determine if the visual system sees. The fact that the patients (normally or partially sighted) see is taken for granted in the design and application of these tests.
The ability to perform well on the clinical tests is necessary but insufficient to prove vision restoration. For example, it is not possible to interpret perspective without being able to distinguish retinal line orientation. Yet, being able to discriminate line orientation on a multiple-alternative forced-choice (MAFC) test is not an indication of the ability to interpret perspective. Likewise, it is not possible to parse depth using self-motion (motion parallax) without being able to discriminate speed differences or direction. Yet, above chance performance on a motion discrimination task does not mean one can use motion parallax to determine the 3D structure of a scene. Apparently, such capabilities do not develop over time with the use of prosthetic vision systems.
13 Importantly, some MAFC tests used to determine motion detection or discrimination fail in demonstrating such capabilities because they can be passed without perceiving any motion.
14
There are two types of problems with the use of clinical tests in evaluating prosthetic vision; (1) With the extensive training needed and applied with prosthetic vision devices, the users may learn to defeat the MAFC test using unintended secondary, possibly non-visual cues. This is rarely a problem in a single clinical encounter or even in a properly designed laboratory procedure. (2) If a test is not designed to explicitly test for the presence of vision, passing it or demonstrating a certain performance level may be wrongly interpreted as an indication of vision. This problem is not specific to MAFC, it is just the paradigm used most often in the evaluation of prosthetic vision systems. MAFC testing can be used to determine the resolution limits of visual prostheses or other specific parameters. This is an appropriate use and interpretation of the testing paradigm. By itself, however, it does not tell us much about the perception of the users. Frequently, the first problem may defeat even the ability to accurately determine simple parameters, such as thresholds.
14