The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
"Within Cluster Distances"
Hi,
I have run a kmeans procedure and got the cluster centroids. Now, i want to calculate the euclidean distance from each row to its respective cluster centroids. In other words, i need to calculate the within cluster distances. Any ideas?
What i can of think is:
For simplicity, assuming only one cluster,
1. I will merge(combine rows) the exampleset with the cluster centroid exampleset. And do the 'ExampleSet2Similarity' operator. And then have to delete many rows(unncessary calculations) in the resultant exampleset. This complicates when the number of clusters are many and for large data too.
2. Merge(combine columns) of the exampleset with the corresponding cluter centroids (this may take some time). And then have to actually compute the formula for euclidean distance.
But i find both of these methods as tedious and was wondering if there was a simpler solution for this problem.
Thanks for your time,
Shubha
I have run a kmeans procedure and got the cluster centroids. Now, i want to calculate the euclidean distance from each row to its respective cluster centroids. In other words, i need to calculate the within cluster distances. Any ideas?
What i can of think is:
For simplicity, assuming only one cluster,
1. I will merge(combine rows) the exampleset with the cluster centroid exampleset. And do the 'ExampleSet2Similarity' operator. And then have to delete many rows(unncessary calculations) in the resultant exampleset. This complicates when the number of clusters are many and for large data too.
2. Merge(combine columns) of the exampleset with the corresponding cluter centroids (this may take some time). And then have to actually compute the formula for euclidean distance.
But i find both of these methods as tedious and was wondering if there was a simpler solution for this problem.
Thanks for your time,
Shubha
Tagged:
0
Answers
yes there is a much simpler solution:
The operator ClusterCentroidEvaluator provides the possibility to calculate the average squared within distance automatically, given an exampleSet and a cluster model.
Greetings,
Sebastian
Thanks, Shubha