Sign up to take part
Registered users can ask their own questions, contribute to discussions, and be part of the Community!
Registered users can ask their own questions, contribute to discussions, and be part of the Community!
I am a new Dataiku user, and I'd like some advice on the best way to leverage the tool to flag records with invalid values.
Example: I have a column called "Fruit," and I expect the following values for that column: 'Apples',' 'Oranges', & 'Grapes'. I want to flag in a different column any records that don't contain one of these values.
Some options that I've considered are:
I'm sure this can be done in python too, but I am not fluent in python. Therefore, I am hoping to find an easy alternative.
Any recommendations based on how others have handled this use case?
Hi @AnalyticsAnton ,
What you're looking for is achievable using User-defined meanings.
Thanks to a custom meaning, you can define a values list to specify what are the valid values.
Then, you'll be able to use any preparation processor based on invalid rows/cells.
For more details about the difference between the storage types and the meanings, you can refer to the Definitions page of the reference documentation.
Have a great day!
Hi @AnalyticsAnton ,
What you're looking for is achievable using User-defined meanings.
Thanks to a custom meaning, you can define a values list to specify what are the valid values.
Then, you'll be able to use any preparation processor based on invalid rows/cells.
For more details about the difference between the storage types and the meanings, you can refer to the Definitions page of the reference documentation.
Have a great day!