0 votes
Is it okay to use categorical variable directly or is it better to use one-hot encoding?
asked by anonymous

1 Answer

0 votes
It's ok to use categorical variable directly. The model will automatically do the one-hot encoding. This is also called dummification.

You can chose in project "Settings" between one-hot encoding and "impact encoding". For text variable, there other options available: tf-idf, hashing, etc.
answered by
923 questions
956 answers
1,781 users