Is it okay to use a categorical variable directly, or is it better to use one-hot encoding?

1 Answer

It's OK to use a categorical variable directly. The model will automatically apply one-hot encoding to it; this is also called dummification.
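To make the term concrete, here is a minimal sketch of what one-hot encoding (dummification) does to a categorical column. It is plain Python for illustration only, not the tool's own code; the function name `one_hot_encode` and the sample data are made up.

```python
# Illustrative one-hot encoding (dummification) in plain Python.
# Hypothetical helper and data; not the tool's actual implementation.

def one_hot_encode(values):
    """Return (categories, rows): one 0/1 column per distinct category."""
    categories = sorted(set(values))
    rows = [
        [1 if value == category else 0 for category in categories]
        for value in values
    ]
    return categories, rows


colors = ["red", "green", "blue", "green"]
categories, encoded = one_hot_encode(colors)

print(categories)
for value, row in zip(colors, encoded):
    print(value, row)
```

Each original value becomes a row with exactly one 1, placed in the column of its category.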

You can choose between one-hot encoding and "impact encoding" in the project "Settings". For text variables, there are other options available: tf-idf, hashing, etc.
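For comparison, impact encoding is commonly understood as replacing each category with the average of the target observed for that category. The sketch below assumes that plain definition, without any smoothing; the tool's actual implementation may differ. The function name `impact_encode` and the sample data are hypothetical.

```python
# Illustrative impact (target-mean) encoding in plain Python.
# Assumption: each category is replaced by the mean target for that category.
# Hypothetical helper and data; the real implementation may add smoothing.
from collections import defaultdict


def impact_encode(categories, targets):
    """Return (encoded_values, category_means)."""
    sums = defaultdict(float)
    counts = defaultdict(int)
    for category, target in zip(categories, targets):
        sums[category] += target
        counts[category] += 1
    means = {category: sums[category] / counts[category] for category in sums}
    return [means[category] for category in categories], means


cities = ["paris", "lyon", "paris", "lyon", "nice"]
churned = [1, 0, 0, 0, 1]
encoded_cities, city_means = impact_encode(cities, churned)

print(city_means)
print(encoded_cities)
```

Unlike one-hot encoding, this yields a single numeric column regardless of how many distinct categories there are, which can help with high-cardinality variables.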

© Dataiku 2012-2018