Generate Aggregation and Group by
aruberutou
Hello,
I would like to count each unique item in a column, and create a new column with the corresponding count information for each item.
In Excel, I would simply use =countif(ColumnA, ColumnA(i)) and copy down.
I expected that to be equally simple in RapidMiner, perhaps using GenerateAttribute. However, there is no "group by" function there, so my count column always returns "1" for each value.
Am I overlooking something?
Answers
I feel silly. It seems a simple solution is to multiply the data, aggregate one branch (count, grouped by the item column), then rejoin the aggregated branch to the original using the item attribute as the key.
It's still messier than I would have hoped, but perfectly usable, I guess.
Nevertheless, please let me know if I am missing something.
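
For anyone who wants to see the same idea outside the RapidMiner GUI, here is a minimal sketch in plain Python of the "aggregate, then join back on the key" approach described above. It is not RapidMiner code; the column name "item", the sample rows, and the helper add_group_count are all made up for illustration.

```python
from collections import Counter

# Hypothetical example data: a single column called "item".
rows = [{"item": "a"}, {"item": "b"}, {"item": "a"}, {"item": "c"}, {"item": "a"}]


def add_group_count(rows, key, count_name="count"):
    """Return new rows with a per-group count column added (illustrative helper)."""
    # Aggregate step: count how often each distinct key value occurs.
    counts = Counter(row[key] for row in rows)
    # Join step: attach the matching count to every original row, keeping row order.
    return [{**row, count_name: counts[row[key]]} for row in rows]


result = add_group_count(rows, "item")
for row in result:
    print(row)
```

Each output row keeps its original value and gains the count for its group, which is the same effect as the COUNTIF column in Excel. The original rows are left unchanged because the join builds new dictionaries.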