0 votes

I am trying to do a join a 2 dataset and the job keep failing. I get the error :

IO Exception: "java.io.IOException: Stream Closed"; "/home/dataiku/dss/tmp/dataset-to-h2/kXsmfIEs/dataset.h2.db" [90031-176], caused by: IOException: Stream Closed

Type: org.h2.jdbc.JdbcSQLException

Is it because my dataset are too big ?

asked by anonymous

1 Answer

0 votes
It's indeed very possibly caused by a "disk full" situation. The "DSS" engine of the join recipe pulls all data from all datasets to the local disk to perform the join. As much as possible, we strongly recommend that you perform joins either on Spark, Hadoop or a SQL database
answered by
528 questions
547 answers
309 users