Choice Modeling - Diagnostic - Experimental Design - Balances and Overlaps

From Q
Jump to: navigation, search

Calculates the level balances and overlap between levels for a choice model experimental design. You must first select the output to use this QScript. The D-Error and various diagnostic statistics described below are also shown. This blog post describes the calculations underlying metrics such as mean.version.balance referred to below.

Example

The example below is from a very small Random design where 2 attributes (Color and Speed) each have 3 levels. The design has 5 questions, each with 3 alternatives per question and 2 versions.

The output consists of the following:

d-error D-error is a measure that quantifies how good or bad a design is at extracting information from respondents in an experiment. A lower D-error indicates a better design. It is usual to compare d-errors for designs created using different algorithms, rather than consider the number by itself.

overlaps A vector where each element is the percentage of questions with some overlap (repetition of a level) by attribute. The number of levels of each attribute are shown in brackets. For example 70% of the questions have at least one repeated Color level.

The following 4 values are stated as a score between 0 and 1 where 1 represents perfect balance and 0 is the worst possible design. Although they can be interpreted in isolation, it is generally useful to compare scores for designs created by different algorithms.

All values use the concept of balance. The balance of an attribute is defined as the sum across levels of the absolute differences between the counts and the average count. This value is then scaled to give a value of 1 if all absolute differences are zero (i.e. the count of every level is the same as its mean, which implies perfect balance). The worst design consists of repeating the same levels for all alternatives and has a balance of zero. By replacing the sum across levels with the sum across pairs of levels of different attributes we arrive at an analogous pairwise balance. These values are not calculated for Alternative specific and Partial profiles designs.

mean.version.balance The average balance of all versions, then averaged across all attributes.

mean.version.pairwise.balance The average pairwise balance of all versions, then averaged across all pairs of attributes.

across.version.balance The balance for the whole design across all versions, then averaged across all attributes.

across.version.pairwise.balance The pairwise balance for the whole design across all versions, then averaged across all pairs of attributes.

singles The counts of the occurrence of each level of each attribute across all versions of the design. For example, the level Blue occurs 12 times across the design whereas Yellow occurs only 10 times. For Partial profile designs, the number of times that the levels are constant are also shown.

pairs The counts of the co-occurrence of a pair of levels from different attributes within the same alternative. This is calculated across all versions of the design. For example, Red occurs 5 times in the same alternative as Medium but only once with Slow. In this example, the uneven distribution of singles and pairs balances are a consequence of the Random design. It is generally better for the levels to be evenly balanced.

References

Hoare, J. (2018, July 20). How Good is your Choice Model Experimental Design? [Blog post]. Accessed from https://www.displayr.com/how-good-is-your-choice-model-experimental-design/.

See also Choice Modeling - Experimental Design.

For details of the D-error calculation see

  1. Yap, J. (2018, August 20). What is D-Error? [Blog post]. Accessed from https://www.displayr.com/what-is-d-error/.
  2. Yap, J. (2018, August 21). How to Compute D-error for a Choice Experiment [Blog post]. Accessed from https://www.displayr.com/how-to-compute-d-error-for-a-choice-experiment/.
  3. Huber, J., & Zwerina, K. (1996). The importance of utility balance in efficient choice designs. Journal of Marketing research, 307-317. Accessed from https://faculty.fuqua.duke.edu/~jch8/bio/Papers/Huber%20Zwerina%201996%20Marketing%20Research.pdf.

Code

includeWeb("QScript R Output Functions");

main();

function main() {

    // The following 2 variables contain information specific to this diagnostic.
    var required_class = "ChoiceModelDesign";
    var output_name_suffix = "balances.and.overlaps";
    
    var item = checkSelectedItemClass(required_class);
    if (item == null)
        return false;
    var r_name = stringToRName(item.referenceName);

    // The following lines contain the R code to run
    var expression = "c(d.error = " + r_name + "$d.error, " + r_name + "$balances.and.overlaps)";

    return createROutput(item, expression, output_name_suffix);
}