[SOLVED] Rolling Up Data
Hi,
I have data that looks something like this:
id attr1 attr2 attr3
a 0 1 0
a 1 0 0
a 0 0 0
a 0 0 1
I'm using k-means to cluster this data, but I want to cluster the IDs, so that, for example, a, b and c would end up in the same cluster because their attributes are the same.
At the moment this isn't working, because k-means treats each row as a record rather than each ID as a record.
To get around this, I was hoping I could collapse the four records for a single ID onto one line.
So, for example:
id attr1 attr2 attr3
a 1 1 1
Is this possible within RapidMiner?
Thanks for your help
Answers
The trick to fixing this problem was to use the Aggregate operator, group by the attribute I wanted to roll up to (in this case the ID), and set the aggregation function to maximum for the other attributes.
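For anyone who wants to see the same roll-up outside RapidMiner, here is a minimal sketch in plain Python. It is not the RapidMiner process itself, only an illustration of "group by ID, take the max of each attribute". The function name `roll_up_max` and the rows for ID "b" are made up for the example.

```python
def roll_up_max(rows):
    """Collapse rows sharing an id into one row, keeping the max of each attribute."""
    rolled = {}
    for row_id, *attrs in rows:
        if row_id in rolled:
            # Element-wise max of the values seen so far and the new row.
            rolled[row_id] = [max(old, new) for old, new in zip(rolled[row_id], attrs)]
        else:
            rolled[row_id] = list(attrs)
    return rolled


rows = [
    ("a", 0, 1, 0),
    ("a", 1, 0, 0),
    ("a", 0, 0, 0),
    ("a", 0, 0, 1),
    ("b", 0, 0, 0),  # hypothetical second ID, not in the original post
    ("b", 1, 0, 0),
]

rolled_up = roll_up_max(rows)
print(rolled_up)  # {'a': [1, 1, 1], 'b': [1, 0, 0]}
```

With 0/1 attributes, the maximum works as a logical OR: an ID gets a 1 for an attribute if any of its rows had a 1. After the roll-up there is one record per ID, which is what k-means needs as input here.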
Thanks