Log File Import
Hey Everyone,
I'm trying to use RapidMiner to analyze logged events that I am getting from a system. Unfortunately, the logs are not well structured and need cleaning up. I have tried the CSV import on my text-file logs, but even with regex matching I find it not customizable enough to cover all items in the log. Is there a similar extension or plugin with more flexible definitions that I could use to import logs into a table?
Typically, the lines look a bit like
<some Information> :Event :Timestamp : <some item containing "::" sometimes>: Action
but not all do; there are simple single-word lines as well.
Do I need to build external data preprocessing, or can this be done within RM?
I tried searching the forum and the extension marketplace as well with limited success. Any recommendations?
Best regards,
Lino
Answers
Could you share your log file, or at least a sample of it, so that we can
better understand the problem and find the most relevant solution?
Regards,
Lionel
I built a process which extracts the information like this:
The process:
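The process itself is not reproduced in this thread. As a rough illustration of the same kind of extraction done outside RapidMiner, here is a minimal Python sketch. It assumes the line shape given in the question, and further assumes that the first three fields (information, event, timestamp) contain no colons, so the item field may contain "::" freely. The names `parse_log_line` and `FIELDS` are hypothetical, chosen for this example only.

```python
# Hypothetical field names for the five parts of a structured log line.
FIELDS = ("info", "event", "timestamp", "item", "action")


def parse_log_line(line: str) -> dict:
    """Split one log line into named fields.

    Assumption: info, event and timestamp contain no ':' characters.
    Lines that do not have the full shape (e.g. single-word lines)
    are kept in 'raw' with all other fields set to None.
    """
    text = line.strip()
    record = dict.fromkeys(FIELDS)  # every field starts as None
    record["raw"] = text
    if text.count(":") < 4:
        return record  # not a structured line
    # First three fields come from the left ...
    info, event, timestamp, rest = text.split(":", 3)
    # ... and the action is whatever follows the last colon, so any
    # "::" inside the item stays untouched.
    item, action = rest.rsplit(":", 1)
    values = (info, event, timestamp, item, action)
    record.update({key: value.strip() for key, value in zip(FIELDS, values)})
    return record


if __name__ == "__main__":
    sample = "sensor A :CONNECT :1700000000 : dev::port::1: OPEN"
    print(parse_log_line(sample))
    print(parse_log_line("RESTART"))
```

Splitting from both ends rather than using a single regular expression keeps the "::" case simple. If the real timestamps contain colons (for example a clock time), this sketch would need a different rule for the third field.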
Regards,
Lionel
What is the extension / type of your simple text log file?
Regards,
Lionel
Your static example (with id1 / id3) is possible using the Lag Series operator of the Values Series extension (to be installed from the Marketplace).
But I understand you are looking for a more general solution: can you describe the rules for creating your new attribute and for subtracting the time values?
Regards,
Lionel
- Are you looking only at the "BLUETOOTH_CONTROL" action?
- If I understand correctly, the first BLUETOOTH_CONTROL is, for example, associated with the first "mouse pressed",
and you want to calculate the timestamp difference between these two events and write this value into a new attribute on the "BLUETOOTH_CONTROL" row?
- Is the "BLUETOOTH_SOURCE" action associated with the second "mouse pressed"?
Regards,
Lionel
OK, your explanations are clear: no need for a drawing.
Unfortunately, I didn't find a solution with RapidMiner's native operators, so I propose
a process using a Python script. In the results, you have a new column called "timestamp_diff":
To execute this process, you need to:
- install Python on your computer.
- install the Python Scripting extension (from the Marketplace).
The process:
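The process XML and its script are not shown in this thread. A minimal sketch of what a script inside the Execute Python operator might look like follows; it is not the original script. It assumes the example set has columns named `timestamp` (numeric or datetime values that can be subtracted) and `action`, and it assumes one particular pairing rule: every non-"mouse pressed" row is compared with the most recent preceding "mouse pressed" row. The helper name `timestamp_diffs` and the column names are hypothetical.

```python
def timestamp_diffs(timestamps, actions, trigger="mouse pressed"):
    """Return, per row, the time elapsed since the last preceding trigger row.

    Assumed rule (not confirmed by the thread): trigger rows themselves,
    and rows that occur before any trigger, get None.
    """
    diffs = []
    last_trigger = None
    for ts, action in zip(timestamps, actions):
        if action == trigger:
            last_trigger = ts      # remember the latest trigger time
            diffs.append(None)
        elif last_trigger is None:
            diffs.append(None)     # nothing to compare against yet
        else:
            diffs.append(ts - last_trigger)
    return diffs


def rm_main(data):
    # In the Execute Python operator, 'data' arrives as a pandas DataFrame.
    # Column names 'timestamp' and 'action' are assumptions for this sketch.
    data["timestamp_diff"] = timestamp_diffs(data["timestamp"], data["action"])
    return data
```

For instance, timestamps `[10, 12, 20, 25]` with actions `["mouse pressed", "BLUETOOTH_CONTROL", "mouse pressed", "BLUETOOTH_SOURCE"]` give `[None, 2, None, 5]`. If the pairing rule is different (for example, the n-th Bluetooth event matched to the n-th mouse press regardless of order), only `timestamp_diffs` needs to change.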
Hope it helps,
Regards,
Lionel
Don't hesitate to post your future questions here in the community.
Good luck with the rest of your project.
Best regards,
Lionel