The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Cluster algortyms
what is the x-means and k- medoid ? And also what is the difference between k-means of this two algortym ?
Tagged:
0
Best Answer
-
rfuentealba RapidMiner Certified Analyst, Member, University Professor Posts: 568 UnicornHello,
Let's start with understanding k-Means. You set a number of clusters (k) and the algorithm determines what examples belong to that cluster by determining how far are they from that specific cluster. Then the centroids of each cluster are calculated by averaging the distances of all the examples that belong to that cluster to that cluster.
The k-Medoids algorithm is almost the same as the k-Means algorithm with one difference: the center of a cluster is moved to an example, rather than an imaginary number taken from the calculation specified above.
The x-Means algorithm is an improvement. You don't have to determine the number of clusters. Instead, someone said that there is a possibility of determining the correct number of clusters by running a quick heuristic (e.g. an algorithm that belongs to IA but not to Machine Learning). That heuristic determines how many K's are required for that specific example set, and then the algorithm is more or less the same as a k-Means.
There is a lot of n-dimensional geometry in explaining these algorithms. That is why you need to use these with numbers only.
Hope this helps.
All the best,
Rodrigo.
7
Answers
No, fuzzy clustering algorithms use a different type of function, called the "fuzzer" or "fuzzifier", to see if an algorithm belongs to certain cluster or not. While the idea of clustering remains the same, fuzzy clustering uses similarity, intensity and distance as the three stooges main points of analysis, and one example can potentially (though not commonly) belong to more than one cluster. That isn't possible with k-Means, k-Medoids and X-Means, because these are "hard labeled".
Fuzzy C means is available in the "Information Selection" plugin for RapidMiner. It's not part of the standard RapidMiner, BTW.
All the best,
Rodrigo.
kind regards
---------------------
Are you asking about taking the sum of volume column based on the cluster number? If so, you can use the Aggregate operator and group by based on cluster column.
If this is not the answer you are looking for. please explain a bit more about your requirement.
Thanks
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing
I have the same questions as Varun has. I will work on your problem tomorrow, I promise.
Rodrigo.
firtsly ı need say that ı am doing clustering with 4 attribute which include "volume"
and ı am doing this clustering in the warehouse(storage) so volume is very important for me so when ı cluster to items sum of each clusters has to be equal(volume) . if ı have to give an example
.in this data ı wanna that cluster 1 gonna be = 1-3-5 cluster 2 gonna be =2-4-6 because sum of volume of every cluster same that it is 60 .ı hope u got what ı mean .ıf u dont pls say it to me.ı am waiting for ur answer .
- item no volume
1 102 15
3 20
4 25
5 30
6 20
Kind Regards,