Deal with Extract Macro datetime issue
Hi RapidMiner,
I got an issue when using Extract Macro with datetime values. The datetime value i want to extract is "Dec 03, 2016 9:04:51 AM", but after using Extract Macro, the value extracted is "03/12/2016 09:04:51 AM" (I used Log operator to see this value), which mean "Mar 12, 2016 9:04:51 AM" (day becomes month and month becomes day). And then when I used Filter Examples to keep only records after "Dec 03, 2016 9:04:51 AM", i got all records after "Mar 12, 2016 9:04:51 AM", which is not what i want. Do you have any idea to solve this? Thank you very much.
I attached my process here.
Best,
phivu
Best Answers
-
Edin_Klapic Employee-RapidMiner, RMResearcher, Member Posts: 299 RM Data Scientist
Hi phivu,
I recommend using Generate Attributes to create a numerical Attribute representing the date value in milliseconds since Jan, 1st 1970 (see screenshot).
You can then apply Extract Macro and Filter Examples on this attribute.
Best,
Edin
1
Answers
Thanks for the process but I can't see your data. Can you verify that the date-time you actually use in the process is set as date-time format by RapidMiner?
I'm not saying this isn't a problem, but it sounds like a date format issue--the difference described is between the American vs the European way of writing dates (3/12/16 is March 12th to an American and Dec 3rd to a European). The date format could be modified in RapidMiner using Date to Nominal and then Nominal to Date using a different format. There may be a more efficient way of solving this though.
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts
Hi Thomas,
The datetime i used is under RapidMiner format. I attached the input data in this message.
The problem here i think is the inconsistence between "Extract Macro" (EM) and "Filter Examples" (FE). FE uses the datetime value from EM but it reads the value in a different format (as what Brian explained above).
Thanks.
Best,
phivu
You can do a similar extraction directly with the "Date to Numeric" operator, and choose units other than milliseconds as well (if you'd rather think about this in a unit that is not as large in absolute terms or that is more meaningful to you like days or weeks).
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts