I am new to Choice Design and using Ngene and I have some questions that I so far was not able to answer (I have read the Ngene user manual and searched in this forum).
I am planning to conduct an unlabelled choice experiment with two alternatives and one opt-out option per choice set.
First, I would like to conduct a pilot study with a zero priors design to obtain priors and then use Baysian priors based on the estimates from the pilot study, as suggested in the Ngene user manual.
The code for the pilot study I am using is currently the following:
Code: Select all
Design
;alts = alt1*, alt2*, alt3
? alt1 and alt2 are unlabeled alternatives, alt3 is a generic no choice option
;rows = 16
;block = 2
;eff = (mnl, d)
;model:
u(alt1) = b1[0]*x1[0.79,1.49,2.49,3.99] + b2.effects[0|0|0]*x2[0,1,2,3] + b3.effects[0]*x3[0,1] + b4.effects[0]*x4[0,1] + b5[0]*x5[0.2,0.4] /
u(alt2) = b1*x1 + b2*x2 + b3*x3 + b4*x4 + b5*x5 /
u(alt3) = b0[0]
$
Now I have a few questions, and I would appreciate your advice on those.
1. The designs resulting from this code have a d-error of around 0.28.
I feel that this is relatively high, and I am wondering whether I might have made any mistake in the design, or whether that is a realistic d-error value for this pilot study with zero priors.
2. If I replace the effects coded variables by dummy coded variables (ceteris paribus) the resulting d-error is much higher. I don't understand why this should be the case.
3. The resulting design seems to use 'complementary' levels for the two alternatives, which is particularly striking for the price attribute. If Alternative A has a price of 0.79, Alternative B always has the price of 3.99, or the other way round. In other choice sets the two medium price levels are combined. But there are no combinations with the lowest and second lowest price level, for instance.
I feel that this might be problematic because of the huge price differences (I derived prices from getting actual prices for products with the different attributes I use in the study.)
As participants are expected to be sensitive to prices, I expect that in choice sets with the lowest and the highest price, it is quite clear how they will decide, no matter how the other attributes are defined.
Designs with more similar alternatives seem to have a higher d-error.
I don't know how to resolve that issue - or whether I should just take the design as it is, although there might be 'dominant' alternatives (not in all attributes, but referring to the huge price difference).
Do you have any suggestion on that?
Furthermore, I have some questions that are not directly related to this pilot study design but more general.
4. I have read in the manual that for dummy/effects coded variables, the highest level is treated as the base level.
I am not sure about what that means for the design, i.e. if I had priors, whether I would need to insert them into the code in a certain order, or whether this does not matter?
5. I found some previous studies that used some of the attributes I use in my study as well, so I would have some coefficients I could use as priors, but I am not so sure on how exactly I could to that.
I asked a colleague and he suggested to stick with the zero priors pilot study to get the priors and then cross-check with the coefficients from the literature and maybe use some combinations.
Would you agree with that procedure, or would it be better to use some priors from the literature already for the pilot study design?
My problem would be that I would need to combine values from different studies, and that I don't have estimates for all of the attributes.
For instance another study provides estimates for two levels of my effects-coded variables b2, but not for the third level.
Also, the other study used dummy coding, while I would like to used effects coding.
I would expect that I would then need to adjust the coefficients, as they would be smaller for effects coding?
Also the study indicates standard errors for the coefficients, and I understood that if I would like to use distributions, I need to use standard deviations, not standard errors.
My colleage said that it does not matter, it would also be possible to use the standard errors. But how would Ngene now that it's standard errors and not standard deviations?
Would it be possible in any way to use priors from the literature if I include an additional level for my variable? Wouldn't this affect the coefficient values maybe?
I would appreciate any ideas and answers to my questions!

Thanks a lot in advance!