0 votes

I have to deal with CSV file with UTF-16 encoding due to specific characters (asian chars).

DSS detects a wrong charset instead of the UTF-16 (iso-8859-15). The consequence is that the first name column is invalid, probably because the BOM of the CSV is interpreted as the first column name.

Fortunatelly, I can manually edit the charset and it works. But in several "automatic" cases I will be not there to edit it :)

Is-there a way to correct that ?

To reproduce the issue, you can download this file : http://www.filedropper.com/romain

NOTE : The file is correctly detected with "file" unix command as "Little-endian UTF-16 Unicode text, with CRLF line terminators"
asked by

Please log in or register to answer this question.

974 questions
1,002 answers
2,415 users

┬ęDataiku 2012-2018 - Privacy Policy