Hi,

I am trying to join two datasets and the job keeps failing. I get this error:

IO Exception: "java.io.IOException: Stream Closed"; "/home/dataiku/dss/tmp/dataset-to-h2/kXsmfIEs/dataset.h2.db" [90031-176], caused by: IOException: Stream Closed

Type: org.h2.jdbc.JdbcSQLException

Is it because my datasets are too big?

Thanks
asked by anonymous

1 Answer

Yes, this is quite possibly caused by a "disk full" situation. The "DSS" engine of the join recipe pulls all data from all input datasets onto the local disk to perform the join, so large inputs can exhaust the space available to the temporary H2 database file named in the error. Whenever possible, we strongly recommend that you perform joins on Spark, on Hadoop, or in a SQL database instead.
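To check the disk-full hypothesis, you can look at the free space on the filesystem that holds the DSS temporary directory. The snippet below is a minimal sketch using only the Python standard library. The function name `free_space_report` and the 10% threshold are illustrative assumptions, not part of DSS, and the path in the usage note is taken from the error message above and may differ on your installation.

```python
import shutil


def free_space_report(path, low_threshold_percent=10.0):
    """Return basic disk usage figures for the filesystem containing `path`.

    `low_threshold_percent` is an arbitrary illustrative cutoff, not a DSS setting.
    """
    usage = shutil.disk_usage(path)  # named tuple: total, used, free (bytes)
    percent_free = 100.0 * usage.free / usage.total if usage.total else 0.0
    gib = 1024 ** 3
    return {
        "total_gb": usage.total / gib,
        "free_gb": usage.free / gib,
        "percent_free": percent_free,
        # True when free space falls below the illustrative threshold
        "low": percent_free < low_threshold_percent,
    }
```

For example, `free_space_report("/home/dataiku/dss/tmp")` run on the DSS server reports how much room is left for the temporary join data. If `low` is true, or the free space is smaller than the combined size of the datasets being joined, the disk is a likely culprit.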