A nonparametric bootstrap was used to resample with replacement sensitivity data from retrospective and prospective cohorts (
Fig. 1).
18 From the original cohort, we resampled a subset of the data (set size
x). Each resample could consist of the same subjects, as replacement was performed. This process was repeated
k (number of resamples) times.
To determine the effect of cohort size on “normative distribution” parameters, we systematically varied the number of resample sensitivities from the total cohort (we termed this the set size, x). For example, x = 6 indicates a set of six sensitivity values resampled from the original total cohort (i.e., either the retrospective or prospective cohorts). Using the resampled sensitivities, we determined the mean (i.e., the central tendency), 95th percentile and 5th percentile (i.e., the distribution limits), and the standard deviation (SD). We tested a range of values for x (for the retrospective cohort: x = 6, 12, 24, 36, 48, 60, 75, 100, 150, 200, 250, 300, 350, 400, 450, and 500; for the prospective cohort: x = 6, 12, 18, 24, 30, 36, 48, 60, 72, 85, and 100), which was capped at the total number of subjects in the cohort. To determine the confidence limits for the descriptive parameters from these set sizes, we tested two levels of k (number of resamples), the number of resamples from the total cohort, which were k = 100 and k = 200. We also tested whether or not the value of k affected the resultant descriptive statistics independently of the set size. Thus, for set sizes of x = 6, 30, 60, and 500, we tested different levels of k: 1, 4, 8, 12, 16, 20, 24, 28, 36, 48, 60, 72, 84, 96, 120, 150, 200, 250, 300, 400, 500, and 750. This bootstrap procedure was performed on a custom written macro program using Visual Basic Editor in Microsoft Excel 2010 (Microsoft Corporation, Redmond, WA).
The differences in the mean and distribution limits between the true sample values (i.e., parameters from the original cohort data, which we refer to as the ‘ground truth' parameters) and the bootstrapped values were determined for each combination of x and k. The difference (in dB) was plotted as a function of x to determine the limit at which there ceased to be a significant change (i.e., when the difference between ground truth and bootstrapped parameters was minimized). A positive value in the difference from the ground truth and bootstrapped parameters indicates that the ground truth was higher than the bootstrap, while the converse was true for a negative difference.