The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
ETL standard text modification
Hello
What are the options for preparing a dataset within Rapidminer Studio when it comes to string attributes ?.
I know I can split and Combine, but is there a way to remove White space, transform into upper lower or capital case etc, with and without generating a new attribute. Is there a dedicated Operator for this or do I have to use the Script Operator ?
I there an other Operator showing pre build function or any kind of help when comming to scripting, maccros, or regular expressions ?
thanks ;D
What are the options for preparing a dataset within Rapidminer Studio when it comes to string attributes ?.
I know I can split and Combine, but is there a way to remove White space, transform into upper lower or capital case etc, with and without generating a new attribute. Is there a dedicated Operator for this or do I have to use the Script Operator ?
I there an other Operator showing pre build function or any kind of help when comming to scripting, maccros, or regular expressions ?
thanks ;D
Tagged:
0
Answers
Generate Attribute has quite some text function. Further there is of course the text mining extension.
~Martin
Dortmund, Germany
Generating Attribute is indeed the best way to do this.
What you can do if you don't want to create new attributes, is just to use the old attribute name and it will overrides the values.
For pre-build functions take a look at the tutorial process in the help to the Generate Attributes operator, it shows a lot of examples what you can do with macros and expressions.
Best,
David
I have a last question though regarding simple and more complex cross table, I saw the example with pivoting operator, and also check de pivot and transpose.
I am unable to create the same kind form on the left with the input data on the right of this picture
Any hint how to proceed ? ???
Thanks
OWC's stats extension (commerical) includes an operator which should do the trick. See: https://oldworldcomputing.com/products/statistics-extension-for-rapidminer
Best,\Martin
Edit: Apparently there is a demo process, see http://www.myexperiment.org/workflows/1488.html
Dortmund, Germany
I reviewed the second link and the problem is that he is doing an aggregation, and in my case I shouldn't have one.
I am looking for a free and quickest solution, the less operation needed, the better actually.
If you have any other suggestions please feel free to share
^(?!Type$|myID$).*$ -> this performs a negative lookahead for all atrribute names separated by | within the brackets (the $ tells it that the attribute name must end that way so "Type" is excluded from the depivot, but "Type2" is included. Any other attributes get included in the depivoted operation.