|
Individual-Level Score Estimation
|
|
|
| Number of Iterations before Using Results
|
| The estimation routine that MaxDiff employs (HB) uses many thousands of iterations to find stable scores prior to actually "using" (saving) the results for each respondent. 20,000 iterations is the default, and should give you a healthy margin of safety, in the sense that the scores should have every opportunity to stabilize (assuming appropriate design and sample size). You may change the number of iterations if you wish.
|
|
|
| Constraints
|
| Sometimes you know beforehand that some levels rationally should be preferred to others. For example, perhaps you included three items in your list related to end-of-year bonus:
|
|
|
| Receive a $500 end-of-year bonus
|
| Receive a $750 end-of-year bonus
|
| Receive a $1,000 end-of-year bonus
|
|
|
| Every rational respondent should prefer higher bonus to lower bonus, and you can inform the estimation routine of this preference. Click the Edit button under the Constraints section if you would like to specify known preference orders and constrain the estimated scores to conform to these expectations. The method that MaxDiff uses to constrain the scores is called "simultaneous tying" and it is described in the CBC/HB Technical Paper on our website.
|
| ** Data Summary **
|
| Total respondents = 300
|
| Total 'Best' choices = 4500
|
| Total 'Worst' choices = 4500
|
| Average sets per respondent = 15.0
|
| 'Best' responses by position:
|
| (1) 25.64%
|
| (2) 25.35%
|
| (3) 24.89%
|
| (4) 24.12%
|
|
|
| 'Worst' responses by position:
|
| (1) 24.34%
|
| (2) 24.68%
|
| (3) 25.25%
|
| (4) 25.73%
|
| Rescaled (Probability) Scores These individual-level item scores are positive values summing to 100 that reflect the likelihood of items being chosen within the questionnaire. Most researchers will probably use this scaling procedure, as it is easiest to interpret and present to others. This approach has the valuable property of ratio-scaling. That is to say, an item with a score of 20 is twice as important (or preferred) as an item with a score of 10. Click here for more details regarding the rescaling procedure.
|
|
|
| Raw Scores These are weights that directly follow from the MNL (multinomial logit) procedure employed within the HB engine. The items can have positive or negative weights and are zero-centered (the "average" item has a weight of 0). These weights are on an interval scale, which does not support ratio operations. In other words, you cannot state that an item with a score of 2.0 is twice as important (or preferred) as an item with a score of 1.0. Advanced analysts may choose to use these raw scores.
|
| · | Microsoft Excel format --an Excel Workbook .XLS file is saved with multiple sheets (one for each of the three sheets displayed in this dialog)
|
| · | .CSV (Comma Delimited) --only the current displayed sheet is saved to the file. To save all information from the three sheets to .CSV format, select each sheet separately and save each to a unique .CSV file.
|
|
|
|
|
|
|
|
| Number of Draws to Be Used for Each Respondent
|
| Each iteration that HB completes results in an estimate of each respondent's scores called a "draw." From one draw to the next, the scores can vary by a random perturbation. One could plot the distribution over many draws and we'd generally find that the draws are distributed normally. MaxDiff averages these draws for each respondent across as many draws as indicated in this field, and reports that average as the final raw score for each item. Theoretically, the more draws used per respondent, the more precise the estimates. However, you will find that 10,000 draws already provides a high degree of precision.
|
|
|
| Prior Degrees of Freedom
|
| This value is the additional degrees of freedom for the prior covariance matrix (not including the # parameters to be estimated), and can be set from 2 to 100000. The higher the value, the greater the influence of the prior variance and more data are needed to change that prior. The scaling for degrees of freedom is relative to the sample size. If you use 50 and you only have 100 subjects, then the prior will have a big impact on the results. If you have 1000 subjects, you will get about the same result if you use a prior of 5 or 50. As an example of an extreme case, with 100 respondents and a prior variance of 0.1 with prior degrees of freedom set to the number of parameters estimated plus 50, each respondent's resulting scores will vary relatively little from the population means. We urge users to be careful when setting the prior degrees of freedom, as large values (relative to sample size) can make the prior exert considerable influence on the results.
|
|
|
| Prior Variance
|
| The default is 1 for the prior variance for each raw score, but users can modify this value. You can specify any value from 0.1 to 100. Increasing the prior variance tends to place more weight on fitting each individual's data, and places less emphasis on "borrowing" information from the population parameters. The resulting posterior estimates are relatively insensitive to the prior variance, except 1) when there is very little information available within the unit of analysis relative to the number of estimated parameters, and 2) the prior degrees of freedom for the covariance matrix (described above) is relatively large.
|
|
|
| Random Starting Seed
|
| Any computer algorithm that uses random numbers needs a starting "seed" value. You may set this as an integer from 0 to 32000. When using different random seeds, the final scores will vary, but insignificantly, assuming convergence has been reached and many draws have been used. If you specify a "0," this uses the system's clock as the starting seed. If you want to be able to repeat results and achieve the same answer, then you should use a specific integer greater than 0.
|
|
|
| Sets to Include
|
| Clicking Filter under the Sets to Include area accesses a dialog in which you can specify which sets to include. Most users will utilize all sets within analysis. However, there may be instances in which advanced users wish to use only a subset of the full data.
|
|
|
| Respondents to Include
|
| Clicking Filter under the Respondents to Include area allows you to choose whether to include All Respondents, Qualified/Completes Only, or Disqualified and Incompletes Only in the analysis.
|
|
|
| Responses to Include
|
| If you asked MaxDiff questions that included both a selection of "best" and "worst," you can select which response types to use in analysis. Researchers have noted that there can be statistically significant differences in scores developed from bests versus worsts. This control lets you investigate these issues if you'd like, including the ability to use answers from bests only in computing item scores.
|