The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
How to delete attributes or rows of the exampleset automatically
Hi guys,
i am a new member of the Rapidminer community and would like know, how can i just remove or delete automatically several attributes or rows, which contain certain kind values? For my apllication i dont need the time stamp and would like to delete them from my example set. Thank you
i am a new member of the Rapidminer community and would like know, how can i just remove or delete automatically several attributes or rows, which contain certain kind values? For my apllication i dont need the time stamp and would like to delete them from my example set. Thank you
0
Best Answer
-
lionelderkrikor RapidMiner Certified Analyst, Member Posts: 1,195 UnicornHi @sharki,
That was interesting to solve !
The idea here is to find and replace the timestamp values by the caracter "?" using a regex to "capture" the timestamp values for each attribute, so I'm using Generate Attributes operator with the following expression :if(finds(eval(concat("att_",%{iteration})),"(0[1-9]|[1-2][0-9]|3[0-1]).(0[1-9]|1[0-2]).[0-9]{4} (2[0-3]|[01][0-9]):[0-5][0-9]"),"?",eval(concat("att_",%{iteration})))
then I loop over the attributes to remove the example(s) which contains the caracter "?"...
Note that the attributes names have to be "att_1", "att_2", "att_3" etc. ... but according to the last screenshot of your first post it is already the case.
You have just to put the process in attached file at the end of your own process.
Hope this helps,
Regards,
Lionel
PS : In attached file, the .xls file I used to create a fictive exampleset representative of yours.
6
Answers
I have maybe an idea. Can you share your dataset ?
In addition, can you elaborate :
What do you want to do exactly ?
- if an attribute contains at least a date, you remove this attribute ?
- if a row contains at least a date, you remove the row ?
Regards,
Lionel
Here is my process so far.
<?xml version="1.0" encoding="UTF-8"?>
So after Combine Documents Operator i got a Dataset, which look like that
the example set looks like that before transpose
and at the end of the whole process like that
So if you take a look at my first picture, then maybe you would know my intention of my set up. If i could clear up the time stamp in the rows, then i will get exact the same dataset like the one in the first picture. And because i work with a dynamic data set, therefore i would like to know to delete the rows, colums or unwanted values in my exampleset automatically, so that i would'nt just have to delete the rows, colums , attributes or values by hand. Sorry for the long answer. i hope, i could express well, what i would like to do.
What is the complete pattern of your timestamp ? (ie DD.MM.YYYY ? or something else ..?)
In the screenshot you shared, the timestamp is truncated so, I can not determine it.
Regards,
Lionel
the pattern of the timestamp is DD.MM.YYYY HH:MM i guess. The application records every ten minutes different values of the parameters, which are measured by several sensores.
Good luck for your study !
Regards,
Lionel