1 Answer

+7 votes
Best answer

There are three kinds of datasets in Dataiku:

  • Managed datasets: datasets that are created by a recipe. Dataiku assumes it “owns” the data and schema of those datasets.
  • Uploaded datasets: raw files that are uploaded through the user interface. They are stored locally in raw form, in a specific folder under DATA_DIR, with one subfolder per dataset named PROJECTNAME.DATASETNAME.
    You can “modify” an uploaded dataset by re-uploading it. An uploaded dataset is actually a folder, meaning that it can contain several files.
  • Non-managed, non-uploaded datasets: usually datasets that point to existing data (a table in a SQL database, for instance) and that you can use as input for a recipe.
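
Since an uploaded dataset is just a folder of raw files, you can inspect it directly on the server. Here is a minimal sketch, assuming the PROJECTNAME.DATASETNAME subfolder naming described above. The function name and the `uploads_root` parameter are hypothetical: the answer does not say which folder under DATA_DIR holds uploads, so you have to pass that path in yourself.

```python
from pathlib import Path


def uploaded_dataset_files(uploads_root, project_name, dataset_name):
    """List the raw files of one uploaded dataset.

    uploads_root is an assumption: the folder under DATA_DIR that holds
    uploaded datasets. Its exact location is not given here, so look it
    up on your own instance.
    """
    # One subfolder per dataset, named PROJECTNAME.DATASETNAME.
    folder = Path(uploads_root) / f"{project_name}.{dataset_name}"
    if not folder.is_dir():
        return []
    # A dataset folder may contain several files; ignore nested folders.
    return sorted(p.name for p in folder.iterdir() if p.is_file())
```

Calling `uploaded_dataset_files(root, "MYPROJECT", "customers")` would return the names of the files in `root/MYPROJECT.customers`, or an empty list if that folder does not exist. This only reads the folder; to change the data, re-upload through the user interface as described above.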


Thank you!

©Dataiku 2012-2018 - Privacy Policy