
Hello,

I have a filesystem organized this way:

/folder/YEAR/MONTH/DDHH

I tried to partition at the DDHH level, with one folder per partition. Since this is not a 'regular' structure (such as %Y/%M/%DD/.*), I defined the partitioning pattern as %Y/%M/%{dimension_2}/.*, which yields 718 partitions of one JSON file each.

After this operation, I get an error when reading a file from a specific partition:

Error in pull background thread, aborting push
org.codehaus.jackson.JsonParseException: Illegal character ((CTRL-CHAR, code 0)): only regular white space (\r, \n, \t) is allowed between tokens

On the other hand, when I load the suspected file on its own as a simple dataset, I can read it without any problem.

Any suggestion?

Many thanks in advance!

asked by Cecile
Just to be sure, you could run "jq" on all 718 files to check that every one of them is valid JSON.
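
If jq is not at hand, a rough Python sketch of the same check is below. It also counts NUL bytes, since the exception complains about a character with code 0. The function names `check_json_file` and `scan_tree` are made up for this example, and it assumes that each file ends in `.json`, is UTF-8 encoded, and holds a single JSON document (a file with one JSON object per line would be reported as an error by this check even if it is fine).

```python
import json
from pathlib import Path


def check_json_file(path):
    """Return (nul_count, error) for one file; error is None if the file parses."""
    raw = Path(path).read_bytes()
    # Code 0 in the Jackson message is a NUL byte, so count those explicitly.
    nul_count = raw.count(b"\x00")
    try:
        # Assumption: UTF-8 text containing exactly one JSON document.
        json.loads(raw.decode("utf-8"))
        error = None
    except (UnicodeDecodeError, json.JSONDecodeError) as exc:
        error = str(exc)
    return nul_count, error


def scan_tree(root):
    """Check every *.json file under root and return only the problematic ones."""
    problems = {}
    for path in sorted(Path(root).rglob("*.json")):
        nul_count, error = check_json_file(path)
        if nul_count or error:
            problems[str(path)] = (nul_count, error)
    return problems
```

Calling `scan_tree("/folder")` returns a dict keyed by file path, with the number of NUL bytes and the parse error (if any) for each file that looks suspicious. An empty dict would suggest the files themselves are fine and the problem lies in how the partitioned dataset reads them.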

1 Answer

The file is correct. I managed to read it after loading it as a new dataset.

It's only after partitioning that I get the error.
answered by