"Clustering data"

mskh Member Posts: 13 Learner I
edited May 2019 in Help
Hi,
I have calculated the standard deviation and the average of my data set. I want to group the data into 3 clusters: cluster_0 contains values between 0 and the average, cluster_1 contains values between the average and average + standard deviation, and cluster_3 contains values between 2*standard deviation and the maximum value. Which clustering technique should I use?
Thanks
Answers

  • Telcontar120 RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,635 Unicorn
    If you already have specific cluster boundaries in mind, then this really isn't an application of clustering. Clustering techniques are unsupervised ML algorithms that find groups in the data themselves rather than applying boundaries you define in advance, and they are generally non-deterministic.
    But you can easily code your "clusters" manually in RapidMiner using some if/then logic within Generate Attributes, or with Discretize by User Specification.
    Brian T.
    Lindon Ventures 
    Data Science Consulting from Certified RapidMiner Experts
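
For readers who want to see the if/then logic outside RapidMiner, here is a minimal Python sketch of the same idea. It is not the operator configuration from the answer above, only an illustration. The function name `label_by_thresholds` and the sample data are hypothetical. The boundaries are also an assumption: the question leaves the range between average + standard deviation and 2*standard deviation unassigned, so this sketch puts everything above average + standard deviation into the third bin (called cluster_2 here), and it uses the sample standard deviation.

```python
from statistics import mean, stdev


def label_by_thresholds(values):
    """Assign each value to a fixed bin based on the mean and standard deviation.

    Assumed boundaries (not confirmed by the original question):
      cluster_0: value <= mean
      cluster_1: mean < value <= mean + stdev
      cluster_2: value > mean + stdev
    """
    avg = mean(values)
    sd = stdev(values)  # sample standard deviation; needs at least two values
    upper = avg + sd

    labels = []
    for v in values:
        if v <= avg:
            labels.append("cluster_0")
        elif v <= upper:
            labels.append("cluster_1")
        else:
            labels.append("cluster_2")
    return labels


# Example with made-up data, for illustration only
data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
for value, label in zip(data, label_by_thresholds(data)):
    print(value, label)
```

Because the thresholds come straight from the summary statistics, the result is fully deterministic, which is why this is a binning (discretization) task rather than a clustering one. If the intended cutoffs differ, only the two comparisons need to change.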