"Google CSV export format issue - AdWords, WebmasterTools, Analytics..."
Antal_Sofalvy
Member Posts: 13 Contributor II
Hello,
Recently I've noticed some issues with Google-generated CSV files. When I read them into RapidMiner (v7.1) with Read CSV, the result is strange - I attached a file for
The same happens with any CSV file exported from Google's web tools, like AdWords, Keyword Planner, WebmasterTools, etc.
The CSVs are usually tab-separated and the encoding is win1250 (I think) - but the charset is not the issue
Could you help me see what I am overlooking?
Thanks,
Antal
PS: The Read File workaround works, but it is VERY time-consuming...
Best Answer
bhupendra_patil Employee-RapidMiner, Member Posts: 168 RM Data Scientist
It looks like some sort of encoding issue.
In your import wizard, try one of the UTF encodings, as in the screenshot below.
Answers
Thanks, it looks OK now - so the encoding setting is the solution.
The issue is that NOT every CSV is encoded the same way - so I have to find out one by one... but that is my challenge.
Thanks,
Antal
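For checking files one by one, a small script can help. This is only a rough sketch in Python, outside RapidMiner, and it assumes the export starts with a byte order mark (BOM), which is common for UTF-16 files but not guaranteed. The function names and the cp1250 fallback (the win1250 guess from the question) are my own choices for illustration.

```python
import codecs

# BOMs are checked longest-first, because the UTF-32 LE BOM
# begins with the same two bytes as the UTF-16 LE BOM.
_BOMS = [
    (codecs.BOM_UTF32_LE, "utf-32"),
    (codecs.BOM_UTF32_BE, "utf-32"),
    (codecs.BOM_UTF8, "utf-8-sig"),
    (codecs.BOM_UTF16_LE, "utf-16"),
    (codecs.BOM_UTF16_BE, "utf-16"),
]


def guess_encoding(raw: bytes, fallback: str = "cp1250") -> str:
    """Return a Python codec name based on the BOM, or the fallback if none is found."""
    for bom, name in _BOMS:
        if raw.startswith(bom):
            return name
    return fallback


def guess_file_encoding(path: str, fallback: str = "cp1250") -> str:
    """Read only the first four bytes of the file and guess its encoding."""
    with open(path, "rb") as f:
        return guess_encoding(f.read(4), fallback)
```

A file without a BOM simply gets the fallback, so the result is a hint for the import wizard's encoding setting rather than a reliable detection.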
Thank you!
So it is the encoding - hopefully all Google CSVs are UTF-16...
Regards,
Antal
BTW, if you want to set tab (tabulator) as the column separator, type \t
I have not found this in the documentation, so hopefully it is useful.
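To double-check a file outside RapidMiner, the same two settings (UTF-16 encoding and \t as the separator) can be tried in Python. This is just a minimal sketch: the helper names are made up for illustration, and UTF-16 is an assumption that happened to hold for my files, not something Google documents as far as I know.

```python
import csv


def read_tsv(stream):
    """Parse an already-decoded text stream as tab-separated rows."""
    return list(csv.reader(stream, delimiter="\t"))


def read_tsv_file(path, encoding="utf-16"):
    """Open a file with the given encoding and parse it as tab-separated rows."""
    # newline="" lets the csv module handle line endings itself.
    with open(path, newline="", encoding=encoding) as f:
        return read_tsv(f)
```

If the rows come back split into the expected columns, the same encoding and separator should also work in the Read CSV import wizard.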