Coming soon: We’re working on a brand new, revamped Community experience. Want to receive updates? Sign up now!

0 votes

I want to split the data table in the workflow into train and test set with split done randomly. How can we do that? I tried using "split" recipe but i didnt get the desired output.. Can any one help me with this ?


Hi Sam, can you provide more information on what you tried, what was the actual output, and what was the desired output?
I have uploaded a CSV table in DSS and now i want to split the table into 2 parts (Train test with random seeds) in the flow.  I thought that "split" recipe would do that but it did not have the option of splitting it randomly. Can you please let me know how we can split the table into 2 parts

I am also having problem with this link

Thank you

1 Answer

0 votes

These settings will let you achieve what you desire:

- define a custom variable "rand()"

- add two filters on that variable, one <0.8, the other >=0.8

1,339 questions
1,365 answers
11,916 users

©Dataiku 2012-2018 - Privacy Policy