1 Answer

Best answer
Currently, visual preparation recipes work line by line. This lets them process very large datasets by streaming, potentially in parallel on Hadoop. The downside is that each line is processed independently, so an exponential moving average, where every value depends on the previous rows, cannot be computed reliably in this type of recipe.

If the dataset fits in memory, I would use a Python recipe and compute the moving average with the functions from pandas.
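As a rough sketch of the pandas approach, the snippet below sorts the rows and computes an exponential moving average with `ewm`. The column names (`day`, `price`), the helper `add_ema`, and the `span` value are made up for illustration; in an actual Python recipe the DataFrame would be read from the recipe's input dataset rather than built by hand.

```python
import pandas as pd


def add_ema(df, value_col, order_col, span=3):
    """Return a copy of df sorted by order_col, with an EMA column added.

    Hypothetical helper for illustration only.
    """
    # The EMA depends on row order, so sort explicitly before computing it.
    out = df.sort_values(order_col).reset_index(drop=True)
    # adjust=False uses the recursive form:
    #   ema[t] = alpha * x[t] + (1 - alpha) * ema[t-1], with alpha = 2 / (span + 1)
    out[value_col + "_ema"] = out[value_col].ewm(span=span, adjust=False).mean()
    return out


# Illustrative data, deliberately out of order.
df = pd.DataFrame({"day": [3, 1, 2, 4], "price": [13.0, 10.0, 12.0, 11.0]})
result = add_ema(df, "price", "day", span=3)
print(result)
```

Each output value reuses the previous row's result, which is why the whole ordered column has to be available at once and why a purely line-by-line step cannot produce it.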

© Dataiku 2012-2018