Hello,

I have a filesystem organized this way:

/folder/YEAR/MONTH/DDHH

I tried to partition at the DDHH level, with one folder per partition. Since this is not a 'regular' structure (such as %Y/%M/%DD/.*), I defined the partitioning pattern as %Y/%M/%{dimension_2}/.*, which yields 718 partitions of one JSON file each.

After this operation, I get an error when reading the file from one specific partition:

Error in pull background thread, aborting push
org.codehaus.jackson.JsonParseException: Illegal character ((CTRL-CHAR, code 0)): only regular white space (\r, \n, \t) is allowed between tokens

On the other hand, when I load the suspect file on its own as a simple dataset, I have no problem reading it.

Any suggestion?

Many thanks in advance!

Just to be sure, you could run "jq" on all 718 files to check that every one of them is valid JSON.
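
If jq is not at hand, the same check can be sketched in Python. This is only a rough sketch under some assumptions: the function name check_json_files is made up for illustration, each file is assumed to hold a single UTF-8 JSON document (not one object per line), and the root path is whatever folder your dataset points to. It also looks for NUL bytes explicitly, since "CTRL-CHAR, code 0" in the Jackson message is a NUL character.

```python
import json
from pathlib import Path


def check_json_files(root):
    """Return (path, problem) pairs for files under root that fail the checks.

    Hypothetical helper: assumes one UTF-8 JSON document per file.
    """
    problems = []
    for path in sorted(Path(root).rglob("*")):
        if not path.is_file():
            continue
        raw = path.read_bytes()
        # A NUL byte is what Jackson reports as "CTRL-CHAR, code 0".
        if b"\x00" in raw:
            offset = raw.index(b"\x00")
            problems.append((str(path), "contains NUL byte at offset %d" % offset))
            continue
        try:
            json.loads(raw.decode("utf-8"))
        except (UnicodeDecodeError, ValueError) as exc:
            # json.JSONDecodeError is a subclass of ValueError.
            problems.append((str(path), "invalid JSON: %s" % exc))
    return problems
```

Calling check_json_files("/folder") and printing each returned pair should list any file that a strict parser would reject. An empty list would suggest the files themselves are fine and the problem lies elsewhere.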

1 Answer

The file is correct: I managed to read it after loading it as a new dataset.

The error only appears after partitioning.

©Dataiku 2012-2018 - Privacy Policy