Is it okay to use a categorical variable directly, or is it better to one-hot encode it first?

1 Answer

It's OK to use a categorical variable directly. The model will automatically apply one-hot encoding to it, which is also called dummification.
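
To make the term concrete, here is a minimal sketch of what one-hot encoding (dummification) does to a categorical column. It is plain Python for illustration only, not the code the tool runs internally; the function name `one_hot_encode` and the sample `colors` data are made up for this example.

```python
def one_hot_encode(values):
    """Return (categories, rows): one 0/1 indicator column per distinct category."""
    # Sort the distinct categories so the column order is deterministic.
    categories = sorted(set(values))
    # Each row gets a 1 in the column of its own category and 0 elsewhere.
    rows = [[1 if v == c else 0 for c in categories] for v in values]
    return categories, rows


# Hypothetical sample column.
colors = ["red", "green", "red", "blue"]
categories, encoded = one_hot_encode(colors)

print(categories)
for row in encoded:
    print(row)
```

Each distinct value becomes its own indicator column, so a column with many distinct values produces many new columns.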

You can choose between one-hot encoding and "impact encoding" in the project "Settings". For text variables, other options are available: TF-IDF, hashing, etc.
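
For comparison, impact encoding is generally understood as replacing each category with a statistic of the target for that category, typically its mean. The sketch below shows that basic idea in plain Python. It is an assumption-laden illustration, not the tool's actual implementation: real implementations usually add smoothing or cross-fitting to limit target leakage, and the function name `impact_encode` and the sample data are hypothetical.

```python
from collections import defaultdict


def impact_encode(categories, targets):
    """Replace each category with the mean target value observed for it."""
    sums = defaultdict(float)
    counts = defaultdict(int)
    for c, t in zip(categories, targets):
        sums[c] += t
        counts[c] += 1
    # Per-category mean of the target (no smoothing in this simplified sketch).
    means = {c: sums[c] / counts[c] for c in sums}
    return [means[c] for c in categories], means


# Hypothetical sample data: a categorical feature and a binary target.
cities = ["paris", "lyon", "paris", "lyon", "nice"]
churned = [1, 0, 0, 0, 1]
encoded, means = impact_encode(cities, churned)

print(means)
print(encoded)
```

Unlike one-hot encoding, this produces a single numeric column regardless of how many distinct categories there are, which can help with high-cardinality variables.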

© Dataiku 2012-2018