Minor problem with the Extract Cluster Prototypes operator
hughesfleming68
If I cluster with K-medoids, the last value in the data set becomes a centroid, so I need to exclude it. I am trying to do this by finding the value, generating a macro, generating an attribute, and finally applying a Filter Examples operator with an expression, but it does not work. I can't seem to see the data after the Extract Cluster Prototypes operator in order to filter it. In the screenshot, it should be trivial to exclude cluster_11, but I am not making much progress. I am using RM 9.7, not the 9.8 beta.
Image: https://us.v-cdn.net/6030995/uploads/editor/nv/yj4a6dlf1cyn.jpg
Best Answer
MartinLiebig Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,533
RM Data Scientist
Hi Alex,
you could just do aggregate count(cluster) group_by cluster, join this on cluster and then filter for count(cluster) != 1?
Best,
Martin
- Sr. Director Data Solutions, Altair RapidMiner -
Dortmund, Germany
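The aggregate, join, and filter steps suggested above can be sketched outside RapidMiner in plain Python. This is only an illustration of the logic, not a RapidMiner process: the rows, the `id` and `cluster` field names, and the helper functions are all hypothetical, and it assumes the unwanted prototype belongs to a cluster that contains exactly one example.

```python
from collections import Counter

# Hypothetical clustered examples; in RapidMiner these would come from the
# clustered ExampleSet, with "cluster" holding the label assigned by k-medoids.
rows = [
    {"id": 1, "cluster": "cluster_0"},
    {"id": 2, "cluster": "cluster_0"},
    {"id": 3, "cluster": "cluster_1"},
    {"id": 4, "cluster": "cluster_1"},
    {"id": 5, "cluster": "cluster_1"},
    {"id": 6, "cluster": "cluster_11"},  # lone last row sitting in its own cluster
]


def count_per_cluster(rows):
    """Aggregate step: count(cluster) grouped by cluster."""
    return Counter(row["cluster"] for row in rows)


def drop_singleton_clusters(rows):
    """Join the counts back on cluster, then keep rows where the count != 1."""
    counts = count_per_cluster(rows)
    joined = [dict(row, cluster_count=counts[row["cluster"]]) for row in rows]
    return [row for row in joined if row["cluster_count"] != 1]


filtered = drop_singleton_clusters(rows)
print(sorted({row["cluster"] for row in filtered}))
```

Because the filter keys on the count rather than on a fixed name such as cluster_11, it should keep working when a different cluster ends up holding the lone example on the next run.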
Answers
I will have to think about that. I could do with a couple of days off, and I may be missing the simple solution to this problem.
At the moment, I can't isolate and filter the centroid of cluster_11 after the Extract Cluster Prototypes operator. On the next run, it might be the centroid of cluster_3 that is the last value in the data set. Here is a screenshot with a crude example. The last value in the data set is the second-to-last black line at the bottom, which is also the centroid of one of the clusters. I have taken the centroid values and generated attributes. It is a kind of false value when it is also an outlier: that value doesn't match anything in the data set, and if it does, it is by chance. A simple Filter Examples expression should have fixed it, but the composite value in the data set is invisible to me. I will upload a process that should make things clearer.
regards,
Alex
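The lookup described in this reply, where the cluster to exclude changes from run to run, can be sketched in plain Python as follows. This is a hedged illustration rather than a RapidMiner process: the data, the `cluster` and `value` field names, and the function `drop_prototype_of_last_row` are hypothetical, and it assumes the clustered examples keep their original order so that the final row is the one whose prototype should be dropped.

```python
def drop_prototype_of_last_row(clustered_rows, prototypes):
    """Remove the prototype whose cluster label matches the last example's label."""
    if not clustered_rows:
        return list(prototypes)
    # Read the label from the data on every run instead of hard-coding it.
    last_label = clustered_rows[-1]["cluster"]
    return [p for p in prototypes if p["cluster"] != last_label]


# Hypothetical clustered examples, in their original order.
clustered_rows = [
    {"value": 1.0, "cluster": "cluster_0"},
    {"value": 1.2, "cluster": "cluster_0"},
    {"value": 9.7, "cluster": "cluster_3"},  # last row in the data set
]

# Hypothetical prototype table, one row per cluster.
prototypes = [
    {"cluster": "cluster_0", "value": 1.0},
    {"cluster": "cluster_3", "value": 9.7},
]

kept = drop_prototype_of_last_row(clustered_rows, prototypes)
print([p["cluster"] for p in kept])
```

Note that this drops the prototype for the whole cluster of the last row, so it is only appropriate when that cluster contains nothing else of interest.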