0 votes

I have a semi-big dataset (9GB, ~25 Mio entries). I tried to get a first look at it by doing Dataset -> Clicking on any column -> Analyze -> on "whole data".

This starts the following process:

... which fails everytime. I tried storing the dataset on Hive and as a single local file. It always fails, in Hadoop it fails considerably faster. (~4h).

Is this a known problem? Any suggestions what to do about it? If there is a connection error, shouldn't the results be calculated on the server still? Do I have to maintain a connection from my PC to the DataIku server the whole time?

Also, a side note:The progress bar doesn't really do anything? It stays white until it fails.



1 Answer

+1 vote
Coming soon: We’re working on a brand new, revamped Community experience. Want to receive updates? Sign up now!
1,328 questions
1,350 answers
11,898 users

©Dataiku 2012-2018 - Privacy Policy