Sign up to take part
Registered users can ask their own questions, contribute to discussions, and be part of the Community!
Registered users can ask their own questions, contribute to discussions, and be part of the Community!
Dataiku's category handling = Dummy encoding with dropping dummy option seems to be using a level with the least exposure/volume as a dummy.
Q1. Is there a way to set this dummy manually instead of Dataiku's default method? Want to avoid using category handling = custom preprocessing option.
Q2. Using Variable type = Categorical with Drop one dummy option on input variable of double type seems to be dropping 2 levels. For example, there are only 3 regression coefficients from a variable with 5 levels). I would of expected there would be 4 regression coefficients since 1 is used as a dummy). Does anyone know the reason for this?
Many thanks in advance.