Sign up to take part
Registered users can ask their own questions, contribute to discussions, and be part of the Community!
Registered users can ask their own questions, contribute to discussions, and be part of the Community!
Hello,
I write to you again.
There are an other part I don't understand what I have to do here
Feature engineering the referrer URLs
-Split URL in referer, extracting only the hostname
-Use the Find and replace processor on referer_host, replacing t.co with twitter.com and matching on the complete value of the string
-In the same column, replace www. with an empty expression (i.e. no value), matching on substring
-Once more for referer_host, replace \..* with an empty expression, matching on regular expression. This step allows us to later put all traffic from the local Google domains under a single group.
-Reduce clutter by removing eight more columns: server_ts, referer, type, visitor_params, session_params, event_params, br_lang, and tz_off
I don't know how I can do that. Is there anyone who has a course which
The Prepare recipe should be able to do what you want. Give it a try and let us know.
https://knowledge.dataiku.com/latest/data-preparation/prepare-recipe/concept-prepare-recipe.html
In fact there is a also a documented sample of using Web Logs: