This study examined and demonstrated a correlation between real-life surgical experience and scores obtained on the Eyesi Surgical Simulator, which provides a set of simulated surgical maneuvers that closely resemble skills required in actual vitreoretinal surgery. A statistically significant difference between tasks was obtained for baseline scores when comparing residents and retina fellows (
P = 0.027), as well as residents and retina staff (
P = 0.04). Although outcomes for retina staff were higher than those for retina fellows, a statistically significant difference was not achieved (
P = 0.87). This finding might be explained because of the choice of difficulty level for the tasks (level five out of eight); a higher difficulty level might have helped reveal more subtle differences between in-training surgeons and experienced surgeons. Vergmann et al.,
4 compared similar Eyesi simulator tasks among medical students, ophthalmology residents, and trained vitreoretinal surgeons. All groups showed statistically significant differences, but no differentiation between experienced vitreoretinal surgeons and in-training vitreoretinal residents was assessed. Although a correlation between real-life surgical experience and virtual simulation exists, it seems crucial to include different modules or specific difficulty ranks to be able to distinguish between levels of expertise.
As seen in the
Table, a significant improvement was obtained between the first and second sessions in the overall group performance scores (
P = 0.029). Ophthalmology residents showed greater improvement (14.8%) than did retina fellows and staff (9.9%), likely because ophthalmology residents had a greater opportunity for improvement than retina fellows or staff. On the other hand, Cissé et al.,
9 published a similar study and found no improvement in ophthalmology residents on repetitive tasks; however, the outcomes might have been biased because all tasks were done in the same day, thus allowing fatigue and loss of concentration to play an important role in the results. Even though constant training improves VRS performance, the main goal should be to determine an objective score to determine when an in-training surgeon is adequately qualified to perform a surgery on a patient. Further evaluation should be addressed in order to determine acceptable performance test scores.
A statistically significant decrease of 15.7% in performance scores was observed when tasks were performed with the non-dominant hand (
P = 0.043), but, surprisingly, no difference was observed between any group. It could be expected that retina fellows and staff would have better bimanual dexterity, but this was not reflected in our scores. González-González et al.,
5 interestingly showed how the learning curve for use of the non-dominant hand for anterior segment surgery was steeper when compared to that of the dominant hand, which might be the reason for our outcomes. The sleep deprivation consisted of 17 continuous hours without sleeping, a time span intentionally selected by analyzing the times after completed night shifts when our retina fellows performed surgery. The outcomes revealed a decrease of 3.36% on performance scores, although no statistically significant difference was observed (
P = 0.6). Our study resembles the outcomes reported by Ellman et al.,
10 who demonstrated that acute sleep deprivation among thoracic surgical residents did not affect operative efficiency, morbidity, or mortality in cardiac surgical operations. Grantcharov,
11 however, found significantly more errors and longer surgical times with regard to VRS surgical skills on post-call mornings. Our study showed only a moderate decline in sleep-deprived surgeon performance (comparable to the level for simulated surgical skills when they were rested), but these outcomes may not necessarily translate into acceptable performance under actual surgical conditions, which might differ.
Surprisingly, when participants were tested 45 minutes after having a cup of coffee with 391 mg of caffeine (Health Canada recommends an intake of no more than 400 mg per day), an increase in performance scores of 6.1% was observed, and a trend toward statistical significance was achieved (
P = 0.06), Interestingly, only two of 22 participants showed a decrease in performance scores, neither of whom was a regular coffee drinker. Both of these participants described feeling more alert, but they reported feeling tremors, although no statistical difference was seen for their anti-tremor test scores. To our knowledge, the only study evaluating the effects of caffeine in ophthalmic surgeons was published by Humayun et al.,
12 who reported that after an intake of 200 mg of caffeine tremor increased, although no statistical significance was achieved; they did not specify if participants were or not regular coffee drinkers. Although our study showed no statistically significant decrease in performance after a small alcohol intake (5.2%;
P = 0.5), tests of other factors important to a safe surgery such as cognitive skills or surgical judgment were not tested. These factors are at least as important as performance skills, so outcomes might not necessarily be extrapolated to real surgery. When Kocher et al.,
13 evaluated VSR surgical performance after sleep deprivation and alcohol intake, simulating a night out with a 0.86% breath alcohol level, they observed a significant deterioration in performance. Evaluation of breath or blood alcohol levels was not performed in our study, and the intake of alcohol was smaller; it is possible that a significant decrease in performance could have been achieved with higher alcohol intake.
Limitations of our study include the small number of participants, as we were limited by the total numbers of trainees and attendings in our program. Besides proving correlation between real-life experience and VRS surgical skills, challenging scenarios evaluated by computer-simulated abilities may not accurately reflect true surgical performance, as other factors should be taken into consideration. Our study evaluated moderate sleep deprivation, not the chronic sleep deprivation that might be more common among residents and fellows. A small amount of alcohol intake was evaluated, and breath or blood alcohol levels should have been tested in order to determine objective systemic effects. A strength of our study is the comparison between real-life surgery and scores obtained in VRS, not only on regular scenarios but also challenging scenarios, such as after sleep deprivation, use of the non-dominant hand, and caffeine or alcohol intake.