The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
"Transform CSV list into Dummy Variables"
I have a list variable (social media usage) that I would like to recode in a few easy steps from list to a collection of dummies (sm1, sm2, sm3, ...).
My process looks like this:
And my output looks like this:
I'm an intuitive learner and just started with RapidMiner 2 days ago, I understand that at this point I'd need to introduce a transformation/ recode variable operator in between normalisation and results, but I'm not quite sure how to do that. Didn't find any transformation/ recode operator in the operator pannel.
Thank you for taking your time with my question. I know I could have googled it, but I'm a little overwhelmed with the sheer amount of information.
My process looks like this:
And my output looks like this:
I'm an intuitive learner and just started with RapidMiner 2 days ago, I understand that at this point I'd need to introduce a transformation/ recode variable operator in between normalisation and results, but I'm not quite sure how to do that. Didn't find any transformation/ recode operator in the operator pannel.
Thank you for taking your time with my question. I know I could have googled it, but I'm a little overwhelmed with the sheer amount of information.
Tagged:
0
Best Answer
-
IngoRM Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,751 RM FounderHi,You can use the operator "Split" for this with the setting "unordered split". Below is the XML code of a process with a small example.Hope this helps,
Ingo<?xml version="1.0" encoding="UTF-8"?><process version="9.2.001"><br> <context><br> <input/><br> <output/><br> <macros/><br> </context><br> <operator activated="true" class="process" compatibility="9.2.001" expanded="true" name="Process"><br> <parameter key="logverbosity" value="init"/><br> <parameter key="random_seed" value="2001"/><br> <parameter key="send_mail" value="never"/><br> <parameter key="notification_email" value=""/><br> <parameter key="process_duration_for_mail" value="30"/><br> <parameter key="encoding" value="UTF-8"/><br> <process expanded="true"><br> <operator activated="true" class="utility:create_exampleset" compatibility="9.2.001" expanded="true" height="68" name="Create ExampleSet" width="90" x="45" y="34"><br> <parameter key="generator_type" value="comma separated text"/><br> <parameter key="number_of_examples" value="100"/><br> <parameter key="use_stepsize" value="false"/><br> <list key="function_descriptions"/><br> <parameter key="add_id_attribute" value="false"/><br> <list key="numeric_series_configuration"/><br> <list key="date_series_configuration"/><br> <list key="date_series_configuration (interval)"/><br> <parameter key="date_format" value="yyyy-MM-dd HH:mm:ss"/><br> <parameter key="time_zone" value="SYSTEM"/><br> <parameter key="input_csv_text" value="Social Media Facebook, Twitter, Instagram Facebook, LinkedIn, Twitter LinkedIn, Instagram, Whatsapp LinkedIn, Twitter"/><br> <parameter key="column_separator" value=":"/><br> <parameter key="parse_all_as_nominal" value="false"/><br> <parameter key="decimal_point_character" value="."/><br> <parameter key="trim_attribute_names" value="true"/><br> </operator><br> <operator activated="true" class="split" compatibility="9.2.001" expanded="true" height="82" name="Split" width="90" x="179" y="34"><br> <parameter key="attribute_filter_type" value="all"/><br> <parameter key="attribute" value=""/><br> <parameter key="attributes" value=""/><br> <parameter key="use_except_expression" value="false"/><br> <parameter key="value_type" value="nominal"/><br> <parameter key="use_value_type_exception" value="false"/><br> <parameter key="except_value_type" value="file_path"/><br> <parameter key="block_type" value="single_value"/><br> <parameter key="use_block_type_exception" value="false"/><br> <parameter key="except_block_type" value="single_value"/><br> <parameter key="invert_selection" value="false"/><br> <parameter key="include_special_attributes" value="false"/><br> <parameter key="split_pattern" value=", "/><br> <parameter key="split_mode" value="unordered_split"/><br> </operator><br> <connect from_op="Create ExampleSet" from_port="output" to_op="Split" to_port="example set input"/><br> <connect from_op="Split" from_port="example set output" to_port="result 1"/><br> <portSpacing port="source_input 1" spacing="0"/><br> <portSpacing port="sink_result 1" spacing="0"/><br> <portSpacing port="sink_result 2" spacing="0"/><br> </process><br> </operator><br></process>
2
Answers